RecordLinkage (version 0.4-11)

getExpectedSize: Estimate number of record pairs.

Description

Estimates the total number of record pairs generated by a dataset and specified blocking conditions.

Usage

getExpectedSize(object, ...)

# S4 method for RLBigDataDedup
getExpectedSize(object)

# S4 method for RLBigDataLinkage
getExpectedSize(object)

# S4 method for data.frame
getExpectedSize(object, blockfld = list())

Arguments

object

Either a record linkage object or a dataset.

blockfld

A blocking definition, in the same format as for compare.dedup.

...

Placeholder for additional arguments.

Value

The expected number of record pairs.

Details

The "RLBigData*" methods are retained only for backward compatibility. Since version 0.4, all record pairs for such objects are generated and stored in a disk file, so these methods return the true number of record pairs rather than an estimate.

For the "data.frame" method, estimation is based on the assumption that agreement or disagreement of one attribute is independent of the other attributes.
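The independence assumption can be illustrated with a small base R sketch. This is not the package's implementation: expected_pairs_sketch and the toy data are hypothetical, and the sketch assumes that each list element of blockfld names columns that must all agree, that a pair is generated if it satisfies at least one element, and that missing values never agree.

```r
# Sketch only: expected_pairs_sketch is a hypothetical helper, not part of
# RecordLinkage. It estimates the number of distinct pairs produced by several
# blocking criteria, treating the criteria as independent of each other.
expected_pairs_sketch <- function(data, blockfld = list()) {
  n <- nrow(data)
  n_all <- n * (n - 1) / 2   # all unordered pairs when no blocking is applied
  if (length(blockfld) == 0) return(n_all)

  # Share of all pairs that agree on every column of one blocking criterion
  p_agree <- function(cols) {
    sub <- data[, cols, drop = FALSE]
    # Assumption: records with a missing blocking value form no pairs
    sub <- sub[stats::complete.cases(sub), , drop = FALSE]
    if (nrow(sub) == 0) return(0)
    key <- do.call(paste, c(unname(as.list(sub)), sep = "\r"))
    counts <- as.numeric(table(key))
    sum(counts * (counts - 1) / 2) / n_all
  }

  p <- vapply(blockfld, p_agree, numeric(1))
  # Under independence, a pair is missed only if it fails every criterion
  n_all * (1 - prod(1 - p))
}

toy <- data.frame(
  fname = c("ANNA", "ANNA", "BERT", "BERT", "CARL"),
  by    = c(1980, 1980, 1980, 1975, 1975)
)
expected_pairs_sketch(toy)                       # 10 pairs without blocking
expected_pairs_sketch(toy, list("fname"))        # 2 pairs share a first name
expected_pairs_sketch(toy, list("fname", "by"))  # about 5.2
```

In the toy data the two criteria overlap in exactly one pair, so the true number of distinct pairs is 5, while the independence-based estimate is about 5.2. The gap shows why the result is an estimate rather than an exact count.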

blockfld is a blocking definition in the same format as for RLBigDataDedup.
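A minimal usage sketch for the "data.frame" method follows. It assumes the RecordLinkage package is installed and uses the RLdata500 example dataset shipped with it. The column indices are chosen for illustration only: column 1 is the first component of the first name and columns 5 to 7 hold the date of birth.

```r
library(RecordLinkage)

data(RLdata500)  # example dataset bundled with the package

# No blocking conditions
getExpectedSize(RLdata500)

# Two blocking criteria: agreement on column 1, or agreement on all of
# columns 5 to 7
getExpectedSize(RLdata500, blockfld = list(1, 5:7))
```

Comparing the two results gives a rough idea of how much a blocking definition reduces the number of pairs before any comparison is actually run.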