Learn R Programming

RecordLinkage (version 0.4-12.6)

Record Linkage Functions for Linking and Deduplicating Data Sets

Description

Provides functions for linking and deduplicating data sets. Methods based on a stochastic approach are implemented as well as classification algorithms from the machine learning domain. For details, see our paper "The RecordLinkage Package: Detecting Errors in Data" Sariyar M / Borg A (2010) .

Copy Link

Version

Install

install.packages('RecordLinkage')

Monthly Downloads

4,147

Version

0.4-12.6

License

GPL (>= 2)

Maintainer

Murat Sariyar Developer

Last Published

January 25th, 2026

Functions in RecordLinkage (0.4-12.6)

clone

Serialization of record linkage object.
compare

Compare Records
genSamples

Generate Training Set
getExpectedSize

Estimate number of record pairs.
emWeights

Calculate weights
epiClassify

Classify record pairs with EpiLink weights
epiWeights

Calculate EpiLink weights
getPairs

Extract Record Pairs
getMinimalTrain

Create a minimal training set
ff_vector-class

Class "ff_vector"
ffdf-class

Class "ffdf"
isFALSE

Check for FALSE
internals

Internal functions and methods
getErrorMeasures-methods

Calculate Error Measures
mygllm

Generalized Log-Linear Fitting
optimalThreshold

Optimal Threshold for Record Linkage
getPairsBackend

Backend function for getPairs
strcmp

String Metrics
getParetoThreshold

Estimate Threshold from Pareto Distribution
stochastic

Stochastic record linkage.
getFrequencies-methods

Get attribute frequencies
resample

Safe Sampling
phonetics

Phonetic Code
summary.RLResult

Summary method for "RLResult" objects.
gpdEst

Estimate Threshold from Pareto Distribution
getTable-methods

Build contingency table
summary

Print Summary of Record Linkage Data
trainSupv

Train a Classifier
texSummary

LaTeX Summary of linkage results
show

Show a RLBigData object
mrl

Mean Residual Life Plot
makeBlockingPairs

Create record pairs from blocks of ids.
splitData

Split Data
subset

Subset operator for record linkage objects
summary.RLBigData

summary methods for "RLBigData" objects.
unorderedPairs

Create Unordered Pairs
RLResult-class

Class "RLResult"
classifySupv

Supervised Classification
%append%-methods

Concatenate comparison patterns or classification results
RecLinkClassif-class

Class "RecLinkClassif"
RLBigDataLinkage-class

Class "RLBigDataLinkage"
RecLinkResult.object

Record Linkage Result Object
RecLinkData.object

Record Linkage Data Object
RLdata

Test data for Record Linkage
RLBigDataDedup-class

Class "RLBigDataDedup"
emClassify

Weight-based Classification of Data Pairs
RLBigDataDedup

Constructors for big data objects.
editMatch

Edit Matching Status
deleteNULLs

Remove NULL Values
RecLinkData-class

Class "RecLinkData"
RLBigData-class

Class "RLBigData"
classifyUnsup

Unsupervised Classification
RecLinkResult-class

Class "RecLinkResult"