amatch(c("hello","g'day"),c("hi","hallo","ola"),maxDist=2)
returns c(2,NA) since "hello" matches closest with "hallo", and their
(optimal string alignment) distance of 1 is within the maximum. The second element, "g'day",
matches closest with "ola", but since that distance equals 4, which exceeds maxDist=2, no match is reported.
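The effect of maxDist can be sketched as follows. This is a minimal illustration, not part of the original documentation; the variable names (x, lookup, strict, loose, matched) are assumptions chosen for this example.

```r
library(stringdist)

x      <- c("hello", "g'day")
lookup <- c("hi", "hallo", "ola")

# With maxDist = 2 only "hello" finds a match ("hallo", at position 2).
strict <- amatch(x, lookup, maxDist = 2)

# Raising maxDist to 4 also admits "g'day" -> "ola" (distance 4, position 3).
loose <- amatch(x, lookup, maxDist = 4)

# Index the lookup table with the result to recover the matched strings.
matched <- lookup[loose]
```

Because amatch returns positions in the lookup table, in the same way as base R's match, the result can be used directly as an index.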
A second typical use is to compute string distances. For example
stringdist(c("g'day"),c("hi","hallo","ola"))
returns c(5,5,4) since these are the (optimal string alignment) distances between
"g'day" and "hi", "hallo", and "ola", respectively.
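The distance depends on the metric chosen through the method argument. The sketch below, with illustrative variable names that are assumptions of this example, contrasts the default optimal string alignment distance ("osa") with the Levenshtein distance ("lv") on a pair of strings that differ by one swap of adjacent characters.

```r
library(stringdist)

# "ab" -> "ba" is a single transposition of adjacent characters.
d_osa <- stringdist("ab", "ba", method = "osa")  # counts the swap as one edit
d_lv  <- stringdist("ab", "ba", method = "lv")   # no transpositions: two substitutions

# The shorter argument is recycled, so one string is compared with each candidate.
d_all <- stringdist("g'day", c("hi", "hallo", "ola"), method = "lv")
```

For "g'day" the two metrics agree, since none of the alignments involves a transposition.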
A third typical use is to compute a dist object that can be
used to cluster text strings.
stringdistmatrix(c("foo","bar","boo","baz"))
returns an object of class dist that can be used by clustering algorithms
such as hclust in the stats package.
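A minimal sketch of such a clustering workflow is given below. It assumes hclust and cutree from the stats package with their default settings (complete linkage); the choice of two clusters and the variable names are assumptions made for illustration only.

```r
library(stringdist)

words <- c("foo", "bar", "boo", "baz")

# Pairwise distances as a dist object; useNames = "strings" labels the
# entries with the strings themselves.
d <- stringdistmatrix(words, useNames = "strings")

# Hierarchical clustering on the string distances (stats::hclust).
hc <- hclust(d)

# Cut the tree into two groups; strings at small distance end up together.
groups <- cutree(hc, k = 2)
```

Here "foo" and "boo" are at distance 1, as are "bar" and "baz", so each pair is expected to end up in the same group.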
Besides documentation for each function, the main topics documented are:
stringdist-metrics -- string metrics supported by the package
stringdist-encoding -- how encoding is handled by the package
stringdist-parallelization -- on multithreading
stringdist package for approximate string matching. R Journal 6(1) pp 111-122
citation('stringdist')