There is a newer version (0.4-12.6) of this package.

RecordLinkage (version 0.2-0)

Record Linkage in R

Description

Provides functions for linking and deduplicating data sets. Methods based on a stochastic approach are implemented, as well as classification algorithms from the machine learning domain.

Install

install.packages('RecordLinkage')
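
The command above fetches the current CRAN release, not the 0.2-0 release described on this page. The following is a minimal sketch for pinning the older release. It assumes the remotes package as an installation helper (it is not part of RecordLinkage) and that the archived source still builds on the local R installation, which is not guaranteed for a release from 2010.

```r
# Assumption: 'remotes' is used only as an installation helper.
if (!requireNamespace("remotes", quietly = TRUE)) {
  install.packages("remotes")
}

# Install the archived 0.2-0 release instead of the current CRAN version.
remotes::install_version("RecordLinkage", version = "0.2-0")

# Report which version ended up in the library.
packageVersion("RecordLinkage")
```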

Monthly Downloads

2,587

Version

0.2-0

License

GPL (>= 2)

Maintainer

Andreas Borg

Last Published

April 12th, 2010

Functions in RecordLinkage (0.2-0)

delete.NULLs

Remove NULL Values

epiClassify

Classify record pairs with EpiLink weights

emWeights

Calculate weights

genSamples

Generate Training Set

emClassify

Weight-based Classification of Data Pairs

getPairs

Extract Record Pairs

gpdEst

Estimate Threshold from Pareto Distribution

RecLinkData.object

Record Linkage Data Object

editMatch

Edit Matching Status

getMinimalTrain

Create a minimal training set

resample

Safe Sampling

optimalThreshold

Optimal Threshold for Record Linkage

trainSupv

Train a Classifier

mrl

Mean Residual Life Plot

splitData

Split Data

classifySupv

Supervised Classification

classifyUnsup

Unsupervised Classification

RecLinkResult.object

Record Linkage Result Object

phonetics

Phonetic Code

texSummary

LaTeX Summary of linkage results

getParetoThreshold

Estimate Threshold from Pareto Distribution

errorMeasures

Calculate Error Measures

mygllm

Generalized Log-Linear Fitting

isFALSE

Check for FALSE

unorderedPairs

Create Unordered Pairs

epiWeights

Calculate EpiLink weights
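
Several of the functions listed above are typically chained into one deduplication run. The following is a minimal sketch, not the package's documented workflow for this release. It assumes that compare.dedup and the example data RLdata500 with identity.RLdata500 are available in the installed version (none of them appear in the list above), and the threshold of 0.7 is an arbitrary illustrative value.

```r
library(RecordLinkage)

# Assumption: RLdata500 and identity.RLdata500 are example data shipped
# with the package; they are not part of the function list on this page.
data(RLdata500)

# Assumption: compare.dedup() builds the comparison patterns (a RecLinkData
# object). Each element of blockfld is a separate blocking pass: pairs are
# kept if they agree on column 1 (first name) or on columns 5 to 7
# (year, month and day of birth).
rpairs <- compare.dedup(
  RLdata500,
  identity = identity.RLdata500,
  blockfld = list(1, 5:7),
  strcmp   = TRUE   # fuzzy string comparison instead of exact matching
)

# Compute EpiLink weights for every candidate pair.
rpairs <- epiWeights(rpairs)

# Classify pairs as links or non-links; 0.7 is an illustrative threshold.
result <- epiClassify(rpairs, threshold.upper = 0.7)

# Summarise the classification and compare it with the known identities.
summary(result)
errorMeasures(result)
```

The threshold is the main tuning knob in this sketch. optimalThreshold and getParetoThreshold from the list above are the listed candidates for choosing it from data rather than by hand.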