dfm_tfidf

x

object for which idf or tf-idf will be computed (a document-feature matrix)

scheme_tf

scheme for <code><a rd-options="" href="/link/dfm_weight?package=quanteda&version=1.3.4" data-mini-rdoc="quanteda::dfm_weight">dfm_weight</a></code>; defaults to <code>"count"</code>

scheme_df

scheme for <code><a rd-options="" href="/link/docfreq?package=quanteda&version=1.3.4" data-mini-rdoc="quanteda::docfreq">docfreq</a></code>; defaults to
<code>"inverse"</code>. Other options to <code><a rd-options="" href="/link/docfreq?package=quanteda&version=1.3.4" data-mini-rdoc="quanteda::docfreq">docfreq</a></code> can be passed
through the ellipsis (<code>...</code>).

base

the base for the logarithms in the <code><a rd-options="" href="/link/tf?package=quanteda&version=1.3.4" data-mini-rdoc="quanteda::tf">tf</a></code> and
<code><a rd-options="" href="/link/docfreq?package=quanteda&version=1.3.4" data-mini-rdoc="quanteda::docfreq">docfreq</a></code> calls; default is 10

...

additional arguments passed to <code><a rd-options="" href="/link/docfreq?package=quanteda&version=1.3.4" data-mini-rdoc="quanteda::docfreq">docfreq</a></code>.
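
To illustrate how the default arguments combine, the following base R sketch multiplies raw counts (the "count" term-frequency scheme) by a base-10 logarithm of the number of documents divided by each feature's document frequency (the "inverse" scheme, assuming no smoothing). The helper `tfidf_sketch` and the toy matrix are hypothetical and are not part of quanteda; the sketch uses a dense matrix for readability, whereas the real function works on sparse objects.

```r
# Toy document-feature count matrix: 3 documents, 3 features (illustrative data).
counts <- matrix(
  c(2, 0, 1,
    1, 0, 1,
    0, 3, 1),
  nrow = 3, byrow = TRUE,
  dimnames = list(docs = c("d1", "d2", "d3"),
                  features = c("apple", "banana", "cherry"))
)

# Hypothetical helper (not a quanteda function): count tf times log inverse
# document frequency, with a configurable logarithm base.
tfidf_sketch <- function(counts, base = 10) {
  n_docs <- nrow(counts)                      # number of documents
  doc_freq <- colSums(counts > 0)             # documents containing each feature
  idf <- log(n_docs / doc_freq, base = base)  # inverse document frequency
  sweep(counts, 2, idf, `*`)                  # scale each feature column by its idf
}

weighted <- tfidf_sketch(counts, base = 10)
print(round(weighted, 3))
```

A feature that occurs in every document (here "cherry") receives an inverse document frequency of zero under this scheme, so its weighted values vanish regardless of its counts.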

Weight a dfm by term frequency-inverse document frequency (tf-idf), 
with full control over options. Uses fully sparse methods for efficiency.
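
A minimal usage sketch follows, assuming the quanteda package is installed and that the argument names match those documented on this page. The toy texts and object names are made up for illustration; `tokens()` is called before `dfm()` so that the example does not rely on `dfm()` accepting a character vector directly.

```r
library(quanteda)

# Illustrative toy texts; document names are arbitrary.
txt <- c(d1 = "apple apple cherry",
         d2 = "apple cherry",
         d3 = "banana banana banana cherry")

# Build a document-feature matrix from tokenized texts.
x <- dfm(tokens(txt))

# Default weighting: count term frequency, inverse document frequency, base 10.
w_default <- dfm_tfidf(x)

# Alternative schemes: proportional term frequency and base-2 logarithms.
w_prop2 <- dfm_tfidf(x, scheme_tf = "prop", scheme_df = "inverse", base = 2)

print(w_default)
```

The result keeps the dimensions of the input, so it can be passed on to other dfm functions in the same way as the unweighted matrix.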

weighting

A fast, flexible, and comprehensive framework for
quantitative text analysis in R.  Provides functionality for corpus management,
creating and manipulating tokens and ngrams, exploring keywords in context,
forming and manipulating sparse matrices
of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and
distances, applying content dictionaries, applying supervised and unsupervised machine learning,
visually representing text and text analyses, and more.

Kenneth Benoit

quanteda

Quantitative Analysis of Textual Data

Kohei Watanabe

Haiyan Wang

Paul Nulty

Adam Obeng

Stefan Müller

Akitaka Matsuo

Patrick O. Perry

Jouni Kuha

Benjamin Lauderdale

William Lowe

Christian Müller

Lori Young

Stuart Soroka

Ian Fellows

European Research Council 

dfm_tfidf function

dfm_tfidf: Weight a dfm by tf-idf
