⚠️ There's a newer version (0.1.5) of this package.

textreuse (version 0.1.1)

Detect Text Reuse and Document Similarity

Description

Tools for measuring similarity among documents and detecting passages which have been reused. Implements shingled n-gram, skip n-gram, and other tokenizers; similarity/dissimilarity functions; pairwise comparisons; minhash and locality sensitive hashing algorithms; and a version of the Smith-Waterman local alignment algorithm suitable for natural language.
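To make the first two items in that list concrete, here is a minimal base-R sketch of shingled word n-grams compared with Jaccard similarity. It is an illustration of the general technique only, not the package's implementation: the helper names `words`, `shingle_ngrams`, and `jaccard` are hypothetical, and the word-splitting rule is a simplifying assumption.

```r
# Hypothetical helper: split text into lowercase word tokens.
# Assumption: any run of characters other than a-z or 0-9 separates words.
words <- function(text) {
  tokens <- strsplit(tolower(text), "[^a-z0-9]+")[[1]]
  tokens[tokens != ""]
}

# Hypothetical helper: build shingled (overlapping) word n-grams.
shingle_ngrams <- function(text, n = 3) {
  w <- words(text)
  if (length(w) < n) return(character(0))
  starts <- seq_len(length(w) - n + 1)
  vapply(starts,
         function(i) paste(w[i:(i + n - 1)], collapse = " "),
         character(1))
}

# Hypothetical helper: Jaccard similarity, |A intersect B| / |A union B|,
# computed on the sets of distinct n-grams.
jaccard <- function(a, b) {
  a <- unique(a)
  b <- unique(b)
  all_grams <- union(a, b)
  if (length(all_grams) == 0) return(0)
  length(intersect(a, b)) / length(all_grams)
}

doc_a <- "How many roads must a man walk down"
doc_b <- "How many roads must a man travel along"

grams_a <- shingle_ngrams(doc_a, n = 3)
grams_b <- shingle_ngrams(doc_b, n = 3)

jaccard(grams_a, grams_b)  # 0.5: 4 shared trigrams out of 8 distinct ones
```

Comparing sets of n-grams rather than single words makes the score sensitive to word order, which is why shingling is a common first step when looking for reused passages.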

Install

install.packages('textreuse')

Monthly Downloads

559

Version

0.1.1

License

MIT + file LICENSE

Last Published

November 4th, 2015

Functions in textreuse (0.1.1)