There's a newer version (0.4-12.6) of this package.

RecordLinkage (version 0.4-1)

Record Linkage in R

Description

Provides functions for linking and deduplicating data sets. Methods based on a stochastic approach are implemented, as well as classification algorithms from the machine learning domain.

Install

install.packages('RecordLinkage')

Monthly Downloads

2,587

Version

0.4-1

License

GPL (>= 2)

Maintainer

Murat Sariyar

Last Published

January 12th, 2012

Functions in RecordLinkage (0.4-1)

emClassify
Weight-based Classification of Data Pairs

RLBigDataDedup-class
Class "RLBigDataDedup"

getPairs
Extract Record Pairs

RLdata
Test data for Record Linkage

RLBigDataDedup
Constructors for big data objects.

RecLinkResult-class
Class "RecLinkResult"

RecLinkData.object
Record Linkage Data Object

getFrequencies-methods
Get attribute frequencies

ffdf-class
Class "ffdf"

epiClassify
Classify record pairs with EpiLink weights

classifySupv
Supervised Classification

RecLinkResult.object
Record Linkage Result Object

mrl
Mean Residual Life Plot

deleteNULLs
Remove NULL Values

getTable-methods
Build contingency table

%append%-methods
Concatenate comparison patterns or classification results

show
Show a RLBigData object

RecLinkClassif-class
Class "RecLinkClassif"

classifyUnsup
Unsupervised Classification

mygllm
Generalized Log-Linear Fitting

summary
Print Summary of Record Linkage Data

summary.RLBigData
summary methods for "RLBigData" objects.

resample
Safe Sampling

RLBigData-class
Class "RLBigData"

trainSupv
Train a Classifier

internals
Internal functions and methods

gpdEst
Estimate Threshold from Pareto Distribution

RecLinkData-class
Class "RecLinkData"

getPairsBackend
Backend function for getPairs

summary.RLResult
Summary method for "RLResult" objects.

splitData
Split Data

editMatch
Edit Matching Status

subset
Subset operator for record linkage objects

genSamples
Generate Training Set

getParetoThreshold
Estimate Threshold from Pareto Distribution

RLResult-class
Class "RLResult"

ff_vector-class
Class "ff_vector"

stochastic
Stochastic record linkage.

clone
Serialization of record linkage object.

epiWeights
Calculate EpiLink weights

getMinimalTrain
Create a minimal training set

makeBlockingPairs
Create record pairs from blocks of ids.

isFALSE
Check for FALSE

strcmp
String Metrics

unorderedPairs
Create Unordered Pairs

emWeights
Calculate weights

compare
Compare Records

getExpectedSize
Estimate number of record pairs.

RLBigDataLinkage-class
Class "RLBigDataLinkage"

getErrorMeasures-methods
Calculate Error Measures

optimalThreshold
Optimal Threshold for Record Linkage

texSummary
LaTeX Summary of linkage results

phonetics
Phonetic Code