Learn R Programming

⚠️There's a newer version (0.6.1) of this package.Take me there.

fastLink (version 0.1.1)

Fast Probabilistic Record Linkage with Missing Data

Description

Implements a Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functionalities to conduct a merge of two datasets under the Fellegi-Sunter model using the Expectation-Maximization algorithm. In addition, tools for preparing, adjusting, and summarizing data merges are included. The package implements methods described in Enamorado, Fifield, and Imai (2017) ''Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records'', available at .

Copy Link

Version

Install

install.packages('fastLink')

Monthly Downloads

681

Version

0.1.1

License

GPL (>= 3)

Maintainer

Ted Enamorado

Last Published

July 11th, 2017

Functions in fastLink (0.1.1)

aggregateEM

Aggregate EM objects for use in `summary.fastLink()`
calcMoversPriors

calcMoversPriors
dfA

Sample dataset A
dfB

Sample dataset B
fastLink-package

Fast Probabilistic Record Linkage with Missing Data
fastLink

fastLink
countyoutflow

County-level outflow rates by state
dedupeMatches

dedupeMatches
gammaKpar

gammaKpar
getMatches

getMatches
countyfips

County-level FIPS Codes
countyinflow

County-level inflow rates by state
matchesLink

matchesLink
nameReweight

nameReweight
statemove

In-state movers rates by state
stateoutflow

State-level outflow rates by state
emlinkMARmov

emlinkMARmov
emlinkRS

emlinkRS
statefips

State-level FIPS Codes
summary.fastLink

Get summaries of fastLink() objects
tableCounts

tableCounts
stateinflow

State-level inflow rates by state
cleanAddressUSPS

cleanAddressUSPS
clusterMatch

clusterMatch
gammaCK2par

gammaCK2par
gammaCKpar

gammaCKpar