Learn R Programming

⚠️There's a newer version (0.4-12.4) of this package.Take me there.

RecordLinkage (version 0.3-2)

Record Linkage in R

Description

Provides functions for linking and deduplicating data sets. Methods based on a stochastical approach are implemented as well as classification algorithms from the machine learning domain.

Copy Link

Version

Install

install.packages('RecordLinkage')

Monthly Downloads

20,085

Version

0.3-2

License

GPL (>= 2)

Maintainer

Murat Sariyar

Last Published

May 11th, 2011

Functions in RecordLinkage (0.3-2)

RecLinkResult-class

Class "RecLinkResult"
epiWeights

Calculate EpiLink weights
epiClassify

Classify record pairs with EpiLink weights
gpdEst

Estimate Threshold from Pareto Distribution
RLBigData-class

Class "RLBigData"
clone

Serialization of record linkage object.
classifyUnsup

Unsupervised Classification
genSamples

Generate Training Set
RecLinkResult.object

Record Linkage Result Object
mrl

Mean Residual Life Plot
getExpectedSize

Estimate number of record pairs.
RecLinkData.object

Record Linkage Data Object
getPairs

Extract Record Pairs
getErrorMeasures-methods

Calculate Error Measures
RLBigDataLinkage-class

Class "RLBigDataLinkage"
RLdata

Test data for Record Linkage
classifySupv

Supervised Classification
getTable-methods

Build contingency table
internals

Internal functions and methods
RecLinkClassif-class

Class "RecLinkClassif"
isFALSE

Check for FALSE
emClassify

Weight-based Classification of Data Pairs
makeBlockingPairs

Create record pairs from blocks of ids.
getParetoThreshold

Estimate Threshold from Pareto Distribution
subset

Subset operator for record linkage objects
emWeights

Calculate weights
RLBigDataDedup-class

Class "RLBigDataDedup"
%append%-methods

Concatenate comparison patterns or classification results
phonetics

Phonetic Code
summary

Print Summary of Record Linkage Data
unorderedPairs

Create Unordered Pairs
getFrequencies-methods

Get attribute frequencies
getMinimalTrain

Create a minimal training set
resample

Safe Sampling
delete.NULLs

Remove NULL Values
show

Show a RLBigData object
compare

Compare Records
summary.RLBigData

summary methods for "RLBigData" objects.
trainSupv

Train a Classifier
splitData

Split Data
optimalThreshold

Optimal Threshold for Record Linkage
RecLinkData-class

Class "RecLinkData"
editMatch

Edit Matching Status
RLResult-class

Class "RLResult"
mygllm

Generalized Log-Linear Fitting
strcmp

String Metrics
texSummary

LaTeX Summary of linkage results
RLBigDataDedup

Constructors for big data objects.
getPairsBackend

Backend function for getPairs