Note: a newer version (0.3.0) of this package is available.

tokenizers (version 0.1.0)

Tokenize Text

Description

Convert natural language text into tokens. The tokenizers have a consistent interface and are compatible with Unicode, thanks to being built on the 'stringi' package. Includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs, characters, lines, and regular expressions.
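As a minimal sketch of that consistent interface (assuming the package is installed): each tokenizer takes a character vector and returns a list with one element per input document, each element a character vector of tokens.

```r
library(tokenizers)

text <- "The quick brown fox jumps. It jumps over the lazy dog."

# Split into word tokens (lowercased by default)
tokenize_words(text)

# Split into sentence tokens
tokenize_sentences(text)
```

Because every tokenizer shares this list-of-character-vectors shape, the functions can be swapped for one another in a text-processing pipeline without changing the surrounding code.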

Install

install.packages('tokenizers')

Monthly Downloads

35,749

Version

0.1.0

License

MIT + file LICENSE


Maintainer

Lincoln Mullen

Last Published

April 2nd, 2016

Functions in tokenizers (0.1.0)

basic-tokenizers: Basic tokenizers
tokenizers: Tokenizers
tokenize_word_stems: Word stem tokenizer
ngram-tokenizers: N-gram tokenizers
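A hedged sketch of the n-gram tokenizers listed above: `tokenize_ngrams()` produces shingled n-grams (here, bigrams and trigrams via the `n` and `n_min` arguments), while `tokenize_skip_ngrams()` also allows gaps of up to `k` words between the tokens of each n-gram. Argument defaults may differ across package versions.

```r
library(tokenizers)

text <- "Tokenize text into overlapping n-grams"

# Shingled n-grams: all trigrams and bigrams from the text
tokenize_ngrams(text, n = 3, n_min = 2)

# Skip n-grams: bigrams whose words may be up to k = 1 positions apart
tokenize_skip_ngrams(text, n = 2, k = 1)
```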