A newer version (0.9.15) of this package is available.

stringdist (version 0.9.4.4)

Approximate String Matching and String Distance Functions

Description

Implements an approximate string matching version of R's native 'match' function. Calculates various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal string alignment), q-grams (q-gram, cosine, Jaccard distance), or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors, taking proper care of encoding, or between integer vectors representing generic sequences.
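
The distance families named in the description can be tried out with a single call each. The following is a minimal sketch, assuming the package is installed and that the `method`, `q`, and `p` arguments behave as in the package's reference manual; the variable names (`d_lv`, `d_osa`, and so on) are illustrative only.

```r
library(stringdist)

# Edit-based distances
d_lv  <- stringdist("kitten", "sitting", method = "lv")   # Levenshtein; 3 edits
d_osa <- stringdist("ab", "ba", method = "osa")           # optimal string alignment; one transposition
d_dl  <- stringdist("ab", "ba", method = "dl")            # full Damerau-Levenshtein
d_ham <- stringdist("abc", "abd", method = "hamming")     # Hamming; needs equal-length strings

# q-gram based distances (here with q = 2, i.e. character pairs)
d_qg  <- stringdist("kitten", "sitting", method = "qgram",   q = 2)
d_cos <- stringdist("kitten", "sitting", method = "cosine",  q = 2)
d_jac <- stringdist("kitten", "sitting", method = "jaccard", q = 2)

# Heuristic distances: "jw" with p = 0 is the Jaro distance,
# a positive p adds the Winkler prefix bonus
d_jaro <- stringdist("martha", "marhta", method = "jw", p = 0)
d_jw   <- stringdist("martha", "marhta", method = "jw", p = 0.1)

print(c(lv = d_lv, osa = d_osa, dl = d_dl, hamming = d_ham))
print(c(qgram = d_qg, cosine = d_cos, jaccard = d_jac))
print(c(jaro = d_jaro, jaro_winkler = d_jw))
```

All of these calls are vectorized, so character vectors can be passed in place of the single strings used here.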

Install

install.packages('stringdist')

Monthly Downloads

86,358

Version

0.9.4.4

License

GPL-3

Maintainer

Mark van der Loo

Last Published

December 16th, 2016

Functions in stringdist (0.9.4.4)

seq_dist

Compute distance metrics between integer sequences

qgrams

Get a table of qgram counts from one or more character vectors.

seq_amatch

Approximate matching for integer sequences.

seq_qgrams

Get a table of qgram counts for integer sequences

amatch

Approximate string matching

phonetic

Phonetic algorithms

printable_ascii

Detect the presence of non-printable or non-ascii characters

stringdist-encoding

String metrics in stringdist

stringdist-metrics

String metrics in stringdist

seq_sim

Compute similarity scores between sequences of integers

stringdist-package

A package for string distance calculation and approximate string matching.

stringdist-parallelization

Multithreading and parallelization in stringdist

stringsim

Compute similarity scores between strings

stringdist

Compute distance metrics between strings
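
To show how a few of the functions listed above fit together, here is a short sketch. It assumes the package is installed and that argument names such as `maxDist`, `method`, and `q` match the reference manual; the input strings and variable names are made up for illustration.

```r
library(stringdist)

# amatch: like base match(), but accepts the closest entry within maxDist
idx_loose  <- amatch("leia", c("uhura", "leela"), maxDist = 5)  # matches "leela"
idx_strict <- amatch("leia", c("uhura", "leela"), maxDist = 1)  # nothing close enough, gives NA

# stringsim: distance rescaled to a similarity score between 0 and 1
sim_same <- stringsim("kitten", "kitten", method = "lv")
sim_diff <- stringsim("kitten", "sitting", method = "lv")

# seq_dist: the same metrics applied to integer sequences
sd_lv  <- seq_dist(c(1L, 2L, 3L), c(1L, 3L, 2L), method = "lv")
sd_osa <- seq_dist(c(1L, 2L, 3L), c(1L, 3L, 2L), method = "osa")

# qgrams: table of q-gram counts, one row per input
qg <- qgrams("hello", "world", q = 2)

# phonetic: soundex codes; similar-sounding names share a code
ph <- phonetic(c("Robert", "Rupert"))

# printable_ascii: TRUE where a string has only printable ASCII characters
pa <- printable_ascii("hello")

print(idx_loose); print(idx_strict)
print(c(sim_same, sim_diff))
print(c(sd_lv, sd_osa))
print(qg)
print(ph)
print(pa)
```

Comparing `idx_loose` with `idx_strict` shows the role of `maxDist`: the candidate table is the same, but the stricter threshold rejects the nearest entry.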