phonics (version 1.1.0)

nysiis: New York State Identification and Intelligence System

Description

Encodes strings using the NYSIIS phonetic algorithm.

Usage

nysiis(word, maxCodeLen = 6, modified = FALSE)

Arguments

word

string or vector of strings to encode

maxCodeLen

maximum length of the resulting encodings, in characters

modified

if TRUE, use the modified NYSIIS algorithm

Value

the NYSIIS encoded character vector

Caveats

The nysiis algorithm is only defined for inputs over the standard English alphabet, i.e., "A-Z". For inputs containing any other characters, the output is undefined.
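
Because the output is undefined outside this alphabet, it can help to screen inputs before encoding them. The following is a minimal sketch in base R, not part of the phonics package: the helper name is_ascii_alpha is hypothetical, and it assumes lowercase letters are acceptable because the examples on this page pass lowercase input.

```r
# Hypothetical helper (not part of phonics): TRUE where a string consists
# only of the letters A-Z or a-z, FALSE otherwise (including NA and "").
is_ascii_alpha <- function(x) {
  !is.na(x) & grepl("^[A-Za-z]+$", x)
}

words <- c("Robert", "O'Brien", "Li2", "rupert", NA)
ok <- is_ascii_alpha(words)
print(ok)

# Only the elements that pass the check would be handed to the encoder.
safe_words <- words[ok]
print(safe_words)
```

With such a filter in place, a call like nysiis(safe_words) would only see inputs for which the algorithm is defined. How to treat the rejected elements (drop, transliterate, or report) is left to the caller.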

Details

The nysiis function phonetically encodes the given string using the New York State Identification and Intelligence System (NYSIIS) algorithm. The algorithm is based on the implementation provided by Wikipedia and is implemented in pure R using regular expressions.

The argument maxCodeLen limits the length of the returned NYSIIS code, in characters. The default is 6.

The argument modified, when TRUE, directs nysiis to use the modified method instead of the original. The default is FALSE.
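
The effect of these two arguments can be explored with a short sketch like the one below. It assumes the phonics package is installed and is skipped otherwise. The variable names are illustrative, and no particular encodings are asserted here, only the documented length limit.

```r
# Assumes the phonics package is installed; the block is skipped otherwise.
if (requireNamespace("phonics", quietly = TRUE)) {
  name <- "mississippi"

  # Encode the same name under a tighter and the default length limit.
  short_code <- phonics::nysiis(name, maxCodeLen = 4)
  default_code <- phonics::nysiis(name)

  # Per the argument description, the code should not exceed maxCodeLen.
  print(nchar(short_code) <= 4)
  print(nchar(default_code) <= 6)

  # Compare the original and modified variants side by side.
  print(phonics::nysiis(name, modified = FALSE))
  print(phonics::nysiis(name, modified = TRUE))
}
```

Printing both variants for a sample of names is a simple way to judge which method groups a given data set more usefully.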

References

Robert L. Taft, Name search techniques, Bureau of Systems Development, Albany, New York, 1970.

See Also

Other phonics: caverphone, cologne, lein, metaphone, mra_encode, onca, phonex, rogerroot, soundex, statcan

Examples

nysiis("Robert")
nysiis("rupert")
nysiis(c("Alabama", "Alaska"), modified = TRUE)
nysiis("mississippi", 4)
