cast_tdm

0th

Percentile

Casting a data frame to a DocumentTermMatrix, TermDocumentMatrix, or dfm

This turns a "tidy" one-term-per-document-per-row data frame into a DocumentTermMatrix or TermDocumentMatrix from the tm package, or a dfm from the quanteda package. These functions support non-standard evaluation through the tidyeval framework. Groups are ignored.

Usage
cast_tdm(data, term, document, value, weighting = tm::weightTf, ...)

cast_dtm(data, document, term, value, weighting = tm::weightTf, ...)

cast_dfm(data, document, term, value, ...)

Arguments
data

Table with one-term-per-document-per-row

term

Column containing terms as string or symbol

document

Column containing document IDs as string or symbol

value

Column containing values as string or symbol

weighting

The weighting function for the DTM/TDM (default is term-frequency, effectively unweighted)

...

Extra arguments passed on to sparseMatrix

Details

The arguments term, document, and value are passed by expression and support quasiquotation; you can unquote strings and symbols.

Aliases
  • cast_tdm
  • cast_dtm
  • cast_dfm
Documentation reproduced from package tidytext, version 0.2.2, License: MIT + file LICENSE

Community examples

Looks like there are no examples yet.