A newer version (0.6.1) of this package is available.

fastLink (version 0.6.0)

Fast Probabilistic Record Linkage with Missing Data

Description

Implements a Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functionalities to conduct a merge of two datasets under the Fellegi-Sunter model using the Expectation-Maximization algorithm. In addition, tools for preparing, adjusting, and summarizing data merges are included. The package implements methods described in Enamorado, Fifield, and Imai (2019) "Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records", American Political Science Review and is available at .
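To make the description concrete, here is a minimal base-R sketch of an Expectation-Maximization fit for a two-class Fellegi-Sunter model in which missing comparisons are skipped. It is an illustration under simplifying assumptions (binary agreement per field, conditional independence across fields, data missing at random), not the package's implementation. The function name fs_em, its arguments, and the simulated data are all hypothetical.

```r
# Minimal sketch (not fastLink's code): EM for a two-class Fellegi-Sunter
# model with binary agreement indicators. NA marks a comparison that could
# not be made because a field is missing; such fields are skipped.
fs_em <- function(gamma, lambda = 0.1, m = NULL, u = NULL,
                  max_iter = 500, tol = 1e-8) {
  stopifnot(is.matrix(gamma), is.numeric(gamma))
  K <- ncol(gamma)
  if (is.null(m)) m <- rep(0.9, K)   # start: matches mostly agree
  if (is.null(u)) u <- rep(0.1, K)   # start: non-matches mostly disagree
  obs <- !is.na(gamma)                      # TRUE where a comparison exists
  agree <- ifelse(obs, gamma, 0)            # 1 only for observed agreements
  disagree <- ifelse(obs, 1 - gamma, 0)     # 1 only for observed disagreements
  clamp <- function(p) pmin(pmax(p, 1e-6), 1 - 1e-6)  # keep logs finite

  for (iter in seq_len(max_iter)) {
    # E-step: posterior probability that each pair is a match
    ll_m <- drop(agree %*% log(m) + disagree %*% log(1 - m))
    ll_u <- drop(agree %*% log(u) + disagree %*% log(1 - u))
    xi <- 1 / (1 + exp(log(1 - lambda) + ll_u - log(lambda) - ll_m))

    # M-step: weighted agreement rates over observed comparisons only
    lambda_new <- clamp(mean(xi))
    m_new <- clamp(colSums(xi * agree) / colSums(xi * obs))
    u_new <- clamp(colSums((1 - xi) * agree) / colSums((1 - xi) * obs))

    delta <- max(abs(c(lambda_new - lambda, m_new - m, u_new - u)))
    lambda <- lambda_new; m <- m_new; u <- u_new
    if (delta < tol) break
  }
  list(lambda = lambda, m = m, u = u, posterior = xi, iterations = iter)
}

# Simulated agreement patterns for 5000 record pairs and 4 fields
set.seed(42)
n <- 5000
is_match <- rbinom(n, 1, 0.2)
m_true <- c(0.95, 0.90, 0.90, 0.85)
u_true <- c(0.05, 0.10, 0.10, 0.15)
p <- outer(is_match, m_true) + outer(1 - is_match, u_true)
gamma <- matrix(rbinom(length(p), 1, p), nrow = n)
gamma[runif(length(gamma)) < 0.1] <- NA   # about 10% of comparisons missing

fit <- fs_em(gamma)
print(round(c(lambda = fit$lambda, m = fit$m, u = fit$u), 3))
```

The sketch handles missing values by dropping the affected field from that pair's likelihood. It does not guard against a field that is missing for every pair.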

Install

install.packages('fastLink')

Monthly Downloads

632

Version

0.6.0

License

GPL (>= 3)

Maintainer

Ted Enamorado

Last Published

April 29th, 2020

Functions in fastLink (0.6.0)

gammaCKpar
emlinkRS
emlinkMARmov
gammaCK2par
fastLink
getPosterior
inspectEM
getMatches
statemove: In-state movers rates by state
getPatterns
plot.fastLink: Plot matching patterns of the EM object by posterior probability of match
gammaKpar
stateinflow: State-level inflow rates by state
preprocText
calcMoversPriors
stringSubset
emlinklog
gammaNUMCK2par
dfB: Sample dataset B
gammaNUMCKpar
matchesLink
nameReweight
print.inspectEM
dfA: Sample dataset A
summary.fastLink: Get summaries of fastLink() objects
tableCounts
stateoutflow: State-level outflow rates by state
statefips: State-level FIPS Codes
fastLink-package: Fast Probabilistic Record Linkage with Missing Data
blockData
clusterMatch
aggconfusion
countyinflow: County-level inflow rates by state
aggregateEM: Aggregate EM objects for use in `summary.fastLink()`
confusion: Get confusion table for fastLink objects
countyfips: County-level FIPS Codes
countyoutflow: County-level outflow rates by state
dedupeMatches
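
Several of the functions indexed above are meant to be used together: fastLink() fits the model, getMatches() extracts the linked records, and summary() reports match rates. The sketch below shows one plausible workflow on the sample datasets dfA and dfB. It assumes the package is installed, that the sample data load under the dataset name samplematch, and that the column names and arguments shown match this version. Check all of these against ?fastLink and ?getMatches before relying on them.

```r
# Sketch only: runs when fastLink is installed, otherwise leaves `matched` NULL.
matched <- NULL
if (requireNamespace("fastLink", quietly = TRUE)) {
  # Load the bundled sample data frames dfA and dfB (assumed dataset name).
  data("samplematch", package = "fastLink", envir = environment())

  # Fit the linkage model; the column names are assumed from the sample data.
  fit <- fastLink::fastLink(
    dfA = dfA, dfB = dfB,
    varnames = c("firstname", "middlename", "lastname",
                 "housenum", "streetname", "city", "birthyear"),
    stringdist.match = c("firstname", "middlename", "lastname",
                         "streetname", "city"),
    partial.match = c("firstname", "lastname", "streetname")
  )

  # Keep pairs whose posterior match probability is at least 0.85.
  matched <- fastLink::getMatches(dfA = dfA, dfB = dfB, fs.out = fit,
                                  threshold.match = 0.85)

  # Summarize match counts and estimated error rates.
  print(summary(fit))
}
```

The 0.85 threshold is an illustrative choice. A lower value admits more candidate matches at the cost of more false positives.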