⚠️ There's a newer version (0.10.2) of this package.

corpus (version 0.3.1)

Text Corpus Analysis

Description

Text corpus data analysis, with full support for UTF-8 encoded Unicode text. The package can read and process text from large JSON files without holding all of the data in memory at once.

Install

install.packages('corpus')

Monthly Downloads

218

Version

0.3.1

License

Apache License (== 2.0) | file LICENSE

Maintainer

Patrick Perry

Last Published

May 5th, 2017

Functions in corpus (0.3.1)

read_ndjson

JSON Data Input

segmentation

Segmenting Text

text

Text Vectors

tokens

Text Tokenization