Learn R Programming

⚠️There's a newer version (0.9.15) of this package.Take me there.

stringdist (version 0.9.4.1)

Approximate String Matching and String Distance Functions

Description

Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (damerau-levenshtein, hamming, levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (jaro, jaro-winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences.

Copy Link

Version

Install

install.packages('stringdist')

Monthly Downloads

66,896

Version

0.9.4.1

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Mark der Loo

Last Published

January 2nd, 2016

Functions in stringdist (0.9.4.1)

printable_ascii

Detect the presence of non-printable or non-ascii characters
seq_amatch

Approximate matching for integer sequences.
stringdist-package

A package for string distance calculation and approximate string matching.
stringdist-encoding

String metrics in stringdist
stringdist-metrics

String metrics in stringdist
phonetic

Phonetic algorithms
stringdist

Compute distance metrics between strings
stringdist-parallelization

Multithreading and parallelization in stringdist
seq_sim

Compute similarity scores between sequences of integers
seq_dist

Compute distance metrics between integer sequences
stringsim

Compute similarity scores between strings
seq_qgrams

Get a table of qgram counts for integer sequences
amatch

Approximate string matching
qgrams

Get a table of qgram counts from one or more character vectors.