ngrams-class

ngrams

ngrams,partition-method

ngrams,partitionBundle-method

object of class <code>partition</code>

.Object

the p-attribute to use (can be &gt; 1)

pAttribute

if NULL, tokens will be counted, else characters, keeping only those provided by a character vector

char

progress

logical, whether to use multicore, passed into call to <code>blapply</code> (see respective documentation)

Count n-grams, either of words, or of characters.

Library for corpus analysis using the Corpus Workbench as an
efficient back end for indexing and querying large corpora. The package offers
functionality to flexibly create partitions and to carry out basic statistical
operations (count, co-occurrences etc.). The original full text of documents
can be reconstructed and inspected at any time. Beyond that, the package is
intended to serve as an interface to packages implementing advanced statistical
procedures. Respective data structures (document term matrices, term co-
occurrence matrices etc.) can be created based on the indexed corpora.

ngrams-class: Get N-Grams

Description

Usage

Arguments

Examples