get_dtm

corpus

<code>HashCorpus</code> or <code>VocabCorpus</code> object. See
<a rd-options="" href="/link/create_corpus?package=text2vec&version=0.3.0" data-mini-rdoc="text2vec::create_corpus">create_corpus</a> for details.

type

character, one of <code>c("dgCMatrix", "dgTMatrix", "lda_c")</code>; selects the format of the returned document-term matrix.
<code>"lda_c"</code> is Blei's lda-c format (a list of 2 x doc_terms_size matrices); see
<a href="https://www.cs.princeton.edu/~blei/lda-c/readme.txt">https://www.cs.princeton.edu/~blei/lda-c/readme.txt</a>
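
To make the three <code>type</code> values concrete, here is a minimal sketch that builds the same toy document-term counts by hand in each layout. It uses only the Matrix package and does not call text2vec, so it shows the shape of each format, not the package's own output. The toy data, the variable names, and the choice of 0-based term indices in the lda-c-like list are assumptions made for illustration.

```r
library(Matrix)

# Hypothetical toy corpus: 3 documents over a 4-term vocabulary,
# given as (document, term, count) triplets with 1-based indices.
doc   <- c(1, 1, 2, 3, 3)
term  <- c(1, 3, 2, 1, 4)
count <- c(2, 1, 1, 1, 3)
vocab <- c("apple", "banana", "cherry", "date")

# "dgCMatrix": compressed sparse column storage (the Matrix default).
dtm_c <- sparseMatrix(i = doc, j = term, x = count,
                      dims = c(3, 4),
                      dimnames = list(paste0("doc", 1:3), vocab))

# "dgTMatrix": the same counts stored as explicit (i, j, x) triplets.
dtm_t <- as(dtm_c, "TsparseMatrix")

# lda-c-like layout: one 2-row integer matrix per document.
# Row 1 holds term indices (0-based here, an assumption), row 2 the counts,
# so each matrix is 2 x (number of distinct terms in that document).
lda_like <- lapply(seq_len(nrow(dtm_c)), function(d) {
  idx <- which(dtm_c[d, ] != 0)
  rbind(as.integer(idx - 1L), as.integer(dtm_c[d, idx]))
})

class(dtm_c)
class(dtm_t)
lda_like[[1]]
```

The two sparse matrix classes hold identical values and differ only in storage: column-compressed storage suits linear algebra, while triplet storage is convenient for incremental construction. The list layout is the one expected by topic-modelling code that follows the lda-c convention.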


This function extracts a document-term matrix from a <code>Corpus</code> object.


Very fast and memory-friendly tools for text vectorization and
state-of-the-art word embeddings (GloVe). This package provides a
source-agnostic streaming API, which allows researchers to perform analysis
of collections of documents which are much larger than available RAM. All
core functions are parallelized to benefit from multicore machines.

get_dtm: Extract document-term matrix
