These functions and methods are used internally and should usually not be called from outside the package.
begin(x, ...)
nextPairs(x, n = 10000, ...)
clear(x, ...)
blockfldfun(blockfld, phoneticFld, phoneticFun, coln)
getSQLStatement(data1, data2 = data1, con, type, blockFld, excludeFld,
strcmpFld, strcmpFun, phoneticFld, phoneticFun)
init_sqlite_extensions(db)
.allows_extensions(db)
.lib_path()
getPatternCounts(x, n=10000, cutoff=1, withProgressBar = (sink.number()==0))
getMatchCount(object)
getNonMatchCount(object)
getNACount(object)
getColumnNames(object, withExcluded = FALSE)
getThresholds(W, M, U, my, ny)
countpattern(x, matching = FALSE)
hasWeights(object)
x, object: Depends on the function, usually a "RecLinkData", "RLBigData" or "RLResult".
x: A matrix of binary observations for countpattern.
n: Maximum number of comparison patterns to fetch.
blockfld: Blocking definition as in RLBigData*.
phoneticFld: Indices of attributes for which phonetic code should be used.
phoneticFun: Name of phonetic function as in RLBigData*.
data1, data2: Data frame with records.
type: "linkage" or "deduplication", the type of linkage.
excludeFld: Indices or names of fields to exclude from comparison.
strcmpFld: Indices of attributes for which string comparison should be used.
strcmpFun: Name of string comparison function as in RLBigData*.
con: A SQLite database connection.
coln: Column names of records.
db: Database connection.
cutoff: Threshold from which string comparison values are rounded to 1.
withProgressBar: Whether to display a progress bar.
withExcluded: Logical. Whether to output names of excluded columns.
W, M, U: Linkage weights, m- and u-probabilities. See emWeights for details.
my, ny: Error bounds. See emClassify for details.
...: Optional arguments.
matching: If TRUE, an additional vector is returned which stores which row belongs to which pattern.
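The effect of the matching argument of countpattern can be illustrated with a small base R sketch. This is a simplified stand-in, not the package's implementation: the helper name count_patterns is hypothetical, and unlike the e1071 original it only tabulates patterns that actually occur in the data.

```r
# Hypothetical helper for illustration only; not part of the package.
count_patterns <- function(x, matching = FALSE) {
  x <- as.matrix(x)
  # Collapse each row of 0/1 values into a key such as "101"
  keys <- apply(x, 1, paste, collapse = "")
  # Count how often each observed pattern occurs
  counts <- table(keys)
  if (!matching) {
    return(counts)
  }
  # Additionally report, for every row, the index of its pattern in 'counts'
  list(pat = counts, matching = match(keys, names(counts)))
}

obs <- matrix(c(1, 0, 1,
                1, 0, 1,
                0, 1, 1,
                1, 0, 1), ncol = 3, byrow = TRUE)
res <- count_patterns(obs, matching = TRUE)
```

Here res$pat holds the pattern counts and res$matching maps each row of obs to its entry in res$pat, which is the kind of row-to-pattern bookkeeping the matching argument asks for.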
Andreas Borg, Murat Sariyar
begin: Sends a query to the database of an "RLBigData" object which creates comparison patterns.
nextPairs: Retrieves the next n comparison patterns.
clear: Closes the result set opened by begin.
blockfldfun: Constructs the part of the SQL statement which implements blocking.
getSQLStatement: Constructs the SQL statement for retrieving comparison patterns.
init_sqlite_extensions: Loads string comparison and phonetic functions into the database.
.allows_extensions: Tests whether the SQLite engine supports extension functions.
.lib_path: Gets the path of the package's shared library.
getPatternCounts: Counts binary patterns, implemented through countpattern.
getMatchCount: Gets the number of matches from a "RLBigData" object.
getNACount: Gets the number of pairs with unknown matching status from a "RLBigData" object.
getDbFile: Returns the path of the database file for a data or result object.
getColumnNames: Returns the column names of a "RLBigData" object as a character vector.
getThresholds: Calculates upper and lower classification thresholds based on error bounds.
countpattern: Modified version of countpattern in package e1071.
hasWeights: Determines whether a data object has weights.
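To make the role of blockfld, phoneticFld, phoneticFun and coln more concrete, the following base R sketch builds a blocking condition as a string. It is an assumption-laden illustration, not the output of blockfldfun: the helper name blocking_clause, the table aliases t1 and t2, the default phonetic function name, and the exact SQL layout are all hypothetical. It assumes that fields within one blocking definition must all agree (AND) and that several definitions are alternatives (OR).

```r
# Hypothetical helper for illustration only; the real SQL produced by the
# package may differ in aliases, quoting and structure.
blocking_clause <- function(blockfld, phoneticFld = integer(0),
                            phoneticFun = "pho_h", coln) {
  per_block <- vapply(blockfld, function(cols) {
    conds <- vapply(cols, function(i) {
      field <- coln[i]
      if (i %in% phoneticFld) {
        # Compare phonetic codes instead of raw values
        sprintf("%s(t1.%s) = %s(t2.%s)", phoneticFun, field, phoneticFun, field)
      } else {
        sprintf("t1.%s = t2.%s", field, field)
      }
    }, character(1))
    # All fields of one blocking definition must agree
    paste0("(", paste(conds, collapse = " AND "), ")")
  }, character(1))
  # Alternative blocking definitions are combined with OR
  paste(per_block, collapse = " OR ")
}

clause <- blocking_clause(list(1, c(3, 4)), phoneticFld = 1,
                          coln = c("fname", "lname", "birth_year", "birth_month"))
```

With these inputs, clause pairs records that either share a phonetic code for fname or agree on both birth_year and birth_month.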