Casting a data frame to a DocumentTermMatrix, TermDocumentMatrix, or dfm

This turns a "tidy" one-term-per-document-per-row data frame into a DocumentTermMatrix or TermDocumentMatrix from the tm package, or a dfm from the quanteda package. Each caster can be called either with non-standard evaluation (bare column names) or character vectors (for cast_tdm_ and cast_dtm_). It ignores groups.

cast_tdm_(data, term_col, document_col, value_col, weighting = tm::weightTf,

cast_tdm(data, term, document, value, weighting = tm::weightTf, ...)

cast_dtm_(data, document_col, term_col, value_col, weighting = tm::weightTf, ...)

cast_dtm(data, document, term, value, weighting = tm::weightTf, ...)

cast_dfm_(data, document_col, term_col, value_col, ...)

cast_dfm(data, document, term, value, ...)


Table with one-term-per-document-per-row


The weighting function for the DTM/TDM (default is term-frequency, effectively unweighted)


Extra arguments passed on to sparseMatrix

term, term_col

(Bare) name of a column with terms

document, document_col

(Bare) name of a column with documents

value, value_col

(Bare) name of a column containing values

  • cast_tdm_
  • cast_tdm
  • cast_dtm_
  • cast_dtm
  • cast_dfm_
  • cast_dfm
Documentation reproduced from package tidytext, version 0.1.3, License: MIT + file LICENSE

Community examples

Looks like there are no examples yet.