Learn R Programming

⚠️There's a newer version (0.10.2) of this package.Take me there.

corpus (version 0.5.1)

Text Corpus Analysis

Description

Text corpus data analysis, with full support for Unicode. Functions for reading data from newline-delimited JSON files, for normalizing and tokenizing text, and for computing term occurrence frequencies.

Copy Link

Version

Install

install.packages('corpus')

Monthly Downloads

218

Version

0.5.1

License

Apache License (== 2.0) | file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Patrick Perry

Last Published

May 25th, 2017

Functions in corpus (0.5.1)

text

Text Vectors
tokens

Text Tokenization
segmentation

Segmenting Text
stopwords

Stop Words
term_counts

Term Frequencies
term_matrix

Term Frequency Matrix
corpus-package

The Corpus Package
read_ndjson

JSON Data Input