textstat_keyness

a <a rd-options="" href="/link/dfm?package=quanteda&version=1.3.13" data-mini-rdoc="quanteda::dfm">dfm</a> containing the features to be examined for keyness

x

the document index (numeric, character or logical) identifying 
the document forming the "target" for computing keyness; all other 
documents' feature frequencies will be combined for use as a reference

target
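The target/reference split described above can be sketched in base R. This is only an illustration on a hypothetical count matrix (the names `counts`, `is_target`, `n_target` and `n_reference` are assumptions made for this sketch), not quanteda's internal code.

```r
# Hypothetical toy counts: rows are documents, columns are features.
counts <- matrix(c(3, 2, 6, 1,
                   2, 0, 2, 4,
                   1, 1, 0, 3),
                 nrow = 3, byrow = TRUE,
                 dimnames = list(c("d1", "d2", "d3"), c("a", "b", "c", "d")))

target <- "d1"  # a character index here; numeric or logical would also work

# Feature frequencies in the target, and in all other documents combined.
is_target <- rownames(counts) %in% target
n_target <- colSums(counts[is_target, , drop = FALSE])
n_reference <- colSums(counts[!is_target, , drop = FALSE])

rbind(n_target, n_reference)
```

Each column of the result gives the two feature counts from which a 2x2 contingency table for that feature can be built.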

(signed) association measure to be used for computing keyness.
Currently available: <code>"chi2"</code>; <code>"exact"</code> (Fisher's exact test); 
<code>"lr"</code> for the likelihood ratio; <code>"pmi"</code> for pointwise mutual 
information.

measure
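To make the idea of a signed measure concrete, the following base R sketch computes an uncorrected chi-squared score and a pointwise mutual information score for a single feature. The counts and variable names are hypothetical, and the formulas are the standard textbook ones for a 2x2 table; quanteda's own implementation may differ in detail.

```r
# 2x2 table for one feature (hypothetical counts):
#                 target   reference
#   feature        n11       n12
#   all others     n21       n22
n11 <- 6; n12 <- 2; n21 <- 6; n22 <- 10
N <- n11 + n12 + n21 + n22

# Expected count of the feature in the target under independence.
E11 <- (n11 + n12) * (n11 + n21) / N

# Uncorrected Pearson chi-squared statistic for the 2x2 table.
chi2 <- N * (n11 * n22 - n12 * n21)^2 /
  ((n11 + n12) * (n21 + n22) * (n11 + n21) * (n12 + n22))

# Sign the statistic: positive if the feature is over-represented
# in the target, negative if it is under-represented.
signed_chi2 <- sign(n11 - E11) * chi2

# Pointwise mutual information between the feature and the target.
pmi <- log(n11 / E11)
```

A positive score marks a feature that is more frequent in the target than expected, and a negative score marks one that is more frequent in the reference group.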

logical; if <code>TRUE</code>, sort the scored features in descending order 
of the measure; otherwise leave them in their original feature order

sort

if <code>"default"</code>, the Yates correction is applied to 
<code>"chi2"</code>; the Williams correction is applied to <code>"lr"</code>; and no 
correction is applied for the <code>"exact"</code> and <code>"pmi"</code> measures. 
Specifying any other value overrides these defaults, for instance to 
apply the Williams correction to the <code>"chi2"</code> measure. 
Specifying a correction for the <code>"exact"</code> and <code>"pmi"</code> 
measures has no effect and produces a warning.

correction
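The effect of the Yates correction on a chi-squared score can be seen in a short base R sketch. The 2x2 counts below are hypothetical, and the example uses base R's `chisq.test()` alongside the textbook formula rather than quanteda itself.

```r
# Hypothetical 2x2 table for one feature (matrix() fills column by column).
tab <- matrix(c(60, 60, 20, 100), nrow = 2,
              dimnames = list(c("feature", "others"),
                              c("target", "reference")))
N <- sum(tab)

# Yates-corrected chi-squared by the textbook formula:
# shrink |n11 * n22 - n12 * n21| by N / 2 before squaring.
cross_diff <- abs(tab[1, 1] * tab[2, 2] - tab[1, 2] * tab[2, 1])
yates_manual <- N * (cross_diff - N / 2)^2 /
  (prod(rowSums(tab)) * prod(colSums(tab)))

# The same statistics from base R, without and with the correction.
uncorrected <- unname(chisq.test(tab, correct = FALSE)$statistic)
yates <- unname(chisq.test(tab, correct = TRUE)$statistic)

c(uncorrected = uncorrected, yates = yates, yates_manual = yates_manual)
```

The corrected statistic is always somewhat smaller than the uncorrected one, which makes the test more conservative for small counts.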

Calculate "keyness", a score for features that occur differentially across 
different categories. Here, the categories are defined by reference to a
"target" document index in the <a rd-options="" href="/link/dfm?package=quanteda&version=1.3.13" data-mini-rdoc="quanteda::dfm">dfm</a>, with the reference group
consisting of all other documents.
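A minimal usage sketch with a two-document toy corpus follows, using only the arguments documented on this page. It assumes quanteda is installed; the object names (`txt`, `toy_dfm`, `key_d1`, `key_d2`) are made up for the illustration, and in quanteda 3.0 and later the function is provided by the separate quanteda.textstats package.

```r
library(quanteda)
# In quanteda >= 3.0, also load the package that now hosts the function:
# library(quanteda.textstats)

# Two tiny toy documents, named d1 and d2.
txt <- c(d1 = "a a a b b c c c c c c d e f g h h",
         d2 = "a a b c c d d d d e f h")
toy_dfm <- dfm(tokens(txt))

# First document as the target, chi-squared measure, sorted by score.
key_d1 <- textstat_keyness(toy_dfm, target = 1L, measure = "chi2")

# Second document as the target, likelihood-ratio measure,
# features left in their original order.
key_d2 <- textstat_keyness(toy_dfm, target = "d2", measure = "lr",
                           sort = FALSE)

head(key_d1)
```

Selecting the target by name rather than by position keeps the call readable when the dfm has many documents.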

textstat

A fast, flexible, and comprehensive framework for
quantitative text analysis in R.  Provides functionality for corpus management,
creating and manipulating tokens and ngrams, exploring keywords in context,
forming and manipulating sparse matrices
of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and
distances, applying content dictionaries, applying supervised and unsupervised machine learning,
visually representing text and text analyses, and more.

Kenneth Benoit

quanteda

Quantitative Analysis of Textual Data

Kohei Watanabe

Haiyan Wang

Paul Nulty

Adam Obeng

Stefan Müller

Akitaka Matsuo

Patrick O. Perry

Jouni Kuha

Benjamin Lauderdale

William Lowe

Christian Müller

Lori Young

Stuart Soroka

Ian Fellows

European Research Council 

textstat_keyness: Calculate keyness statistics
