50% off | Unlimited Data & AI Learning

Last chance! 50% off unlimited learning

Sale ends in


⚠️There's a newer version (0.4-12.4) of this package.Take me there.

RecordLinkage (version 0.3-4)

Record Linkage in R

Description

Provides functions for linking and deduplicating data sets. Methods based on a stochastical approach are implemented as well as classification algorithms from the machine learning domain.

Copy Link

Version

Install

install.packages('RecordLinkage')

Monthly Downloads

16,659

Version

0.3-4

License

GPL (>= 2)

Maintainer

Murat Sariyar

Last Published

August 10th, 2011

Functions in RecordLinkage (0.3-4)

genSamples

Generate Training Set
RLBigDataLinkage-class

Class "RLBigDataLinkage"
RLdata

Test data for Record Linkage
RLResult-class

Class "RLResult"
RLBigData-class

Class "RLBigData"
RecLinkData-class

Class "RecLinkData"
RLBigDataDedup-class

Class "RLBigDataDedup"
RecLinkClassif-class

Class "RecLinkClassif"
classifySupv

Supervised Classification
RecLinkResult-class

Class "RecLinkResult"
RecLinkData.object

Record Linkage Data Object
editMatch

Edit Matching Status
clone

Serialization of record linkage object.
%append%-methods

Concatenate comparison patterns or classification results
classifyUnsup

Unsupervised Classification
emClassify

Weight-based Classification of Data Pairs
getExpectedSize

Estimate number of record pairs.
getMinimalTrain

Create a minimal training set
getTable-methods

Build contingency table
compare

Compare Records
RLBigDataDedup

Constructors for big data objects.
epiClassify

Classify record pairs with EpiLink weights
delete.NULLs

Remove NULL Values
emWeights

Calculate weights
epiWeights

Calculate EpiLink weights
getParetoThreshold

Estimate Threshold from Pareto Distribution
resample

Safe Sampling
gpdEst

Estimate Threshold from Pareto Distribution
RecLinkResult.object

Record Linkage Result Object
isFALSE

Check for FALSE
phonetics

Phonetic Code
internals

Internal functions and methods
getFrequencies-methods

Get attribute frequencies
unorderedPairs

Create Unordered Pairs
show

Show a RLBigData object
mrl

Mean Residual Life Plot
texSummary

LaTeX Summary of linkage results
summary.RLResult

Summary method for "RLResult" objects.
summary.RLBigData

summary methods for "RLBigData" objects.
strcmp

String Metrics
stochastic

Stochastic record linkage.
mygllm

Generalized Log-Linear Fitting
getErrorMeasures-methods

Calculate Error Measures
summary

Print Summary of Record Linkage Data
trainSupv

Train a Classifier
subset

Subset operator for record linkage objects
optimalThreshold

Optimal Threshold for Record Linkage
splitData

Split Data
getPairs

Extract Record Pairs
makeBlockingPairs

Create record pairs from blocks of ids.
getPairsBackend

Backend function for getPairs