Calculate Jaccard Similarity of two character vectors
jaccard_similarity(a, b, ngram_width = 2, nthread = NULL)
A vector of Jaccard similarities of the strings.
a: The first character vector.
b: The second character vector.
ngram_width: The length of the shingles / n-grams used in the similarity calculation.
nthread: Maximum number of threads to use. If NULL (default), Rayon's global thread pool is used, which typically uses all available logical CPU cores.
jaccard_similarity(
  c("the quick brown fox", "jumped over the lazy dog"),
  c("the quck bron fx", "jumped over hte lazy dog")
)
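To make the role of ngram_width concrete, the following base R sketch computes a Jaccard similarity over character n-grams by hand. It is an illustration only, not the package's implementation: the helper names char_ngrams and jaccard_sketch are hypothetical, and the sketch assumes that shingles are overlapping character n-grams, that duplicates are dropped, and that inputs are compared pairwise. The package may tokenize or handle edge cases differently, so results need not match jaccard_similarity exactly.

```r
# Hypothetical helper (not part of the package): the set of distinct
# character n-grams of width n in a single string.
char_ngrams <- function(s, n = 2) {
  len <- nchar(s)
  if (len < n) return(character(0))   # string too short to form any n-gram
  starts <- seq_len(len - n + 1)
  unique(substring(s, starts, starts + n - 1))
}

# Hypothetical reference sketch: for each pair (a[i], b[i]), the size of the
# intersection of the two n-gram sets divided by the size of their union.
jaccard_sketch <- function(a, b, ngram_width = 2) {
  stopifnot(length(a) == length(b))
  mapply(function(x, y) {
    gx <- char_ngrams(x, ngram_width)
    gy <- char_ngrams(y, ngram_width)
    union_size <- length(union(gx, gy))
    if (union_size == 0) return(NA_real_)  # assumption: undefined when both sets are empty
    length(intersect(gx, gy)) / union_size
  }, a, b, USE.NAMES = FALSE)
}

# "abcd" -> {ab, bc, cd}; "abce" -> {ab, bc, ce}; 2 shared out of 4 distinct
jaccard_sketch("abcd", "abce")
```

In this sketch a larger ngram_width produces longer shingles, so a single typo disturbs more of them and the score for near-duplicate strings tends to fall.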