Learn R Programming

RecordLinkage (version 0.3-2)

getExpectedSize: Estimate number of record pairs.

Description

Estimates the total number of record pairs generated by a dataset and specified blocking conditions.

Usage

getExpectedSize(object, ...)

  ## S3 method for class 'RLBigDataDedup':
getExpectedSize(object)

  ## S3 method for class 'RLBigDataLinkage':
getExpectedSize(object)

  ## S3 method for class 'data.frame':
getExpectedSize(object, blockfld = list())

Arguments

object
Either a record linkage object or a dataset.
blockfld
A blocking definition, such as in compare.dedup
...
Placeholder for additional arguments.

Value

  • The expected number of record pairs.

Details

Estimation is based on the assumption that agreement or disagreement of one attribute is independent of the other attributes. For the "data.frame" method, the blocking definition has to be provided, for the other methods, it is taken from the object.