Learn R Programming

RecordLinkage (version 0.4-1)

getExpectedSize: Estimate number of record pairs.

Description

Estimates the total number of record pairs generated by a dataset and specified blocking conditions.

Usage

getExpectedSize(object, ...)

  ## S3 method for class 'RLBigDataDedup':
getExpectedSize(object)

  ## S3 method for class 'RLBigDataLinkage':
getExpectedSize(object)

  ## S3 method for class 'data.frame':
getExpectedSize(object, blockfld = list())

Arguments

object
Either a record linkage object or a dataset.
blockfld
A blocking definition, such as in compare.dedup
...
Placeholder for additional arguments.

Value

  • The expected number of record pairs.

Details

The "RLBigData*" methods are only left for backward compability. Since version 0.4, all record pairs for such objects are generated and stored in a disk file. The methods return the true number of record pairs. For the "data.frame" method, estimation is based on the assumption that agreement or disagreement of one attribute is independent of the other attributes. blockfld is a blocking definition such as for RLBigDataDedup.