Estimates the total number of record pairs generated by a dataset and specified blocking conditions.
getExpectedSize(object, ...) # S4 method for RLBigDataDedup
getExpectedSize(object)
# S4 method for RLBigDataLinkage
getExpectedSize(object)
# S4 method for data.frame
getExpectedSize(object, blockfld = list())
Either a record linkage object or a dataset.
A blocking definition, such as in compare.dedup
Placeholder for additional arguments.
The expected number of record pairs.
The "RLBigData*"
methods are only left for backward compability.
Since version 0.4, all record pairs for such objects are generated and stored
in a disk file. The methods return the true number of record pairs.
For the "data.frame"
method, estimation is based on the assumption
that agreement or disagreement of one attribute is independent of the other attributes.
blockfld
is a blocking definition such as for
RLBigDataDedup
.