A newer version of this package (0.4-12.4) is available.

RecordLinkage (version 0.4-10)

Record Linkage in R

Description

Provides functions for linking and de-duplicating data sets. Methods based on a stochastic approach are implemented as well as classification algorithms from the machine learning domain.
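As a rough illustration of the de-duplication workflow described above, the sketch below builds comparison patterns, weights them, and classifies the pairs. It assumes the package is installed and uses the RLdata500 example data and the compare.dedup function, both of which ship with the package but are not among the entries listed further down. The blocking fields and the 0.5 threshold are arbitrary choices for illustration, not recommended settings.

```r
library(RecordLinkage)

# Example data bundled with the package: 500 person records plus a vector
# (identity.RLdata500) that marks which records are true duplicates.
data(RLdata500)

# Build comparison patterns for de-duplicating a single data set.
# Only pairs that agree on column 1, or on column 3, or on columns 5 to 7
# together are compared (an illustrative blocking choice).
rpairs <- compare.dedup(RLdata500,
                        identity = identity.RLdata500,
                        blockfld = list(1, 3, 5:7))

# Compute EpiLink weights for every record pair.
rpairs <- epiWeights(rpairs)

# Classify pairs as links or non-links; 0.5 is an assumed threshold
# and would normally be tuned for the data at hand.
result <- epiClassify(rpairs, 0.5)

# Count how many pairs fall into each predicted class.
print(table(result$prediction))
```

Calling summary(result) afterwards prints a comparison of the predictions against the known matching status, which is only possible here because the identity vector was supplied.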

Install

install.packages('RecordLinkage')

Monthly Downloads

20,085

Version

0.4-10

License

GPL (>= 2)

Maintainer

Andreas Borg

Last Published

July 27th, 2016

Functions in RecordLinkage (0.4-10)

emClassify

Weight-based Classification of Data Pairs
epiClassify

Classify record pairs with EpiLink weights
editMatch

Edit Matching Status
%append%-methods

Concatenate comparison patterns or classification results
emWeights

Calculate weights
epiWeights

Calculate EpiLink weights
deleteNULLs

Remove NULL Values
classifySupv

Supervised Classification
classifyUnsup

Unsupervised Classification
clone

Serialization of record linkage objects
ff_vector-class

Class "ff_vector"
internals

Internal functions and methods
getExpectedSize

Estimate the number of record pairs
isFALSE

Check for FALSE
getFrequencies-methods

Get attribute frequencies
getTable-methods

Build contingency table
getParetoThreshold

Estimate Threshold from Pareto Distribution
getPairsBackend

Backend function for getPairs
gpdEst

Estimate Threshold from Pareto Distribution
genSamples

Generate Training Set
RLBigData-class

Class "RLBigData"
makeBlockingPairs

Create record pairs from blocks of IDs
mygllm

Generalized Log-Linear Fitting
resample

Safe Sampling
RLBigDataDedup-class

Class "RLBigDataDedup"
phonetics

Phonetic Code
RecLinkClassif-class

Class "RecLinkClassif"
mrl

Mean Residual Life Plot
optimalThreshold

Optimal Threshold for Record Linkage
RecLinkResult-class

Class "RecLinkResult"
trainSupv

Train a Classifier
show

Show an RLBigData object
unorderedPairs

Create Unordered Pairs
texSummary

LaTeX Summary of linkage results
RLResult-class

Class "RLResult"
splitData

Split Data
subset

Subset operator for record linkage objects
RLBigDataLinkage-class

Class "RLBigDataLinkage"
getMinimalTrain

Create a minimal training set