⚠️There's a newer version (0.10.2) of this package.Take me there.
corpus (version 0.6.0)
Text Corpus Analysis
Description
Text corpus data analysis, with full support for Unicode. Functions for reading data from newline-delimited JSON files, for normalizing and tokenizing text, and for computing term occurrence frequencies (including n-grams).