The modified Statistics Canada name coding procedure
Usage
statcan(word, maxCodeLen = 4)
Arguments
word
string or vector of strings to encode
maxCodeLen
maximum length of the resulting encodings, in characters
Value
the Statistics Canada encoded character vector
Caveats
The statcan algorithm is only defined for inputs over the standard
French alphabet.
Details
The argument word is the name, or character vector of names, to be
encoded. The argument maxCodeLen caps the length, in characters, of
each returned name code. The default is 4.
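The following is a minimal sketch of how the two arguments interact, not the implementation behind statcan itself. The helper name statcan_sketch is hypothetical, and the coding rules it applies are an assumption based on how the procedure is commonly summarized: keep the first letter, drop vowels and Y from the rest, collapse runs of a repeated letter, then truncate to maxCodeLen. The documented function may differ in detail, for example in how it treats accented letters or non-alphabetic input.

```r
# Hypothetical illustration only; the coding rules here are assumed,
# not taken from the statcan() source.
statcan_sketch <- function(word, maxCodeLen = 4) {
  word <- toupper(word)
  # Assumption: anything outside the unaccented letters A-Z is dropped.
  word <- gsub("[^A-Z]", "", word, perl = TRUE)
  first <- substr(word, 1, 1)
  rest <- substr(word, 2, nchar(word))
  # Drop vowels and Y everywhere except the first letter.
  rest <- gsub("[AEIOUY]", "", rest, perl = TRUE)
  code <- paste0(first, rest)
  # Collapse runs of the same letter into a single letter.
  code <- gsub("([A-Z])\\1+", "\\1", code, perl = TRUE)
  # maxCodeLen limits the length of each code.
  substr(code, 1, maxCodeLen)
}

statcan_sketch(c("Lynch", "Arends"))             # default limit of 4
statcan_sketch(c("Lynch", "Arends"), maxCodeLen = 6)
```

Because every step is vectorized, a character vector of names yields one code per element, and raising maxCodeLen only changes codes that would otherwise be truncated.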
References
Billy T. Lynch and William L. Arends. "Selection of surname coding
procedure for the SRS record linkage system." United States
Department of Agriculture, Sample Survey Research Branch, Research
Division, Washington, 1977.