Learn R Programming

⚠️There's a newer version (0.4-12.6) of this package.Take me there.

RecordLinkage (version 0.4-8)

Record Linkage in R

Description

Provides functions for linking and de-duplicating data sets. Methods based on a stochastic approach are implemented as well as classification algorithms from the machine learning domain.

Copy Link

Version

Install

install.packages('RecordLinkage')

Monthly Downloads

2,587

Version

0.4-8

License

GPL (>= 2)

Maintainer

Andreas Borg

Last Published

May 28th, 2015

Functions in RecordLinkage (0.4-8)

RLResult-class

Class "RLResult"
internals

Internal functions and methods
getPairsBackend

Backend function for getPairs
getExpectedSize

Estimate number of record pairs.
getParetoThreshold

Estimate Threshold from Pareto Distribution
splitData

Split Data
epiClassify

Classify record pairs with EpiLink weights
genSamples

Generate Training Set
gpdEst

Estimate Threshold from Pareto Distribution
trainSupv

Train a Classifier
deleteNULLs

Remove NULL Values
RLBigDataDedup-class

Class "RLBigDataDedup"
clone

Serialization of record linkage object.
RLBigData-class

Class "RLBigData"
unorderedPairs

Create Unordered Pairs
RecLinkResult-class

Class "RecLinkResult"
subset

Subset operator for record linkage objects
isFALSE

Check for FALSE
makeBlockingPairs

Create record pairs from blocks of ids.
classifyUnsup

Unsupervised Classification
show

Show a RLBigData object
RecLinkClassif-class

Class "RecLinkClassif"
ff_vector-class

Class "ff_vector"
resample

Safe Sampling
texSummary

LaTeX Summary of linkage results
emClassify

Weight-based Classification of Data Pairs
emWeights

Calculate weights
%append%-methods

Concatenate comparison patterns or classification results
getMinimalTrain

Create a minimal training set
mygllm

Generalized Log-Linear Fitting
mrl

Mean Residual Life Plot
phonetics

Phonetic Code
editMatch

Edit Matching Status
RLBigDataLinkage-class

Class "RLBigDataLinkage"
getTable-methods

Build contingency table
classifySupv

Supervised Classification
getFrequencies-methods

Get attribute frequencies
epiWeights

Calculate EpiLink weights
optimalThreshold

Optimal Threshold for Record Linkage