textmodel_svm

the <a rd-options="" href="/link/dfm?package=quanteda.textmodels&version=0.9.1" data-mini-rdoc="quanteda.textmodels::dfm">dfm</a> on which the model will be fit. Does not need to
contain only the training documents.

vector of training labels associated with each document identified
in <code>train</code>. (These will be converted to factors if not already
factors.)

weights for different classes for imbalanced training sets,
passed to <code>wi</code> in <code><a rd-options="LiblineaR" href="/link/LiblineaR?package=quanteda.textmodels&version=0.9.1&to=LiblineaR" data-mini-rdoc="LiblineaR::LiblineaR">LiblineaR</a></code>. <code>"uniform"</code>
uses default; <code>"docfreq"</code> weights by the number of training examples,
and <code>"termfreq"</code> by the relative sizes of the training classes in
terms of their total lengths in tokens.

weight

additional arguments passed to <code><a rd-options="LiblineaR" href="/link/LiblineaR?package=quanteda.textmodels&version=0.9.1&to=LiblineaR" data-mini-rdoc="LiblineaR::LiblineaR">LiblineaR</a></code>

Fit a fast linear SVM classifier for texts, using the
LiblineaR package.

Scaling models and classifiers for sparse matrix objects representing
textual data in the form of a document-feature matrix. Includes original
implementations of 'Laver', 'Benoit', and Garry's (2003) <doi:10.1017/S0003055403000698>,
'Wordscores' model, Perry and 'Benoit's' (2017) <arXiv:1710.08963> class affinity scaling model,
and 'Slapin' and 'Proksch's' (2008) <doi:10.1111/j.1540-5907.2008.00338.x> 'wordfish'
model, as well as methods for correspondence analysis, latent semantic analysis,
and fast Naive Bayes and linear 'SVMs' specially designed for sparse textual data.

Kenneth Benoit

quanteda.textmodels

Scaling Models and Classifiers for Textual Data

Kohei Watanabe

Haiyan Wang

Stefan M<c3><bc>ller

Patrick O. Perry

Benjamin Lauderdale

William Lowe

European Research Council 

textmodel_svm function

the <a rd-options='' href='dfm'>dfm</a> on which the model will be fit. Does not need to
contain only the training documents.

weights for different classes for imbalanced training sets,
passed to <code>wi</code> in <code><a rd-options='LiblineaR' href='LiblineaR'>LiblineaR</a></code>. <code>"uniform"</code>
uses default; <code>"docfreq"</code> weights by the number of training examples,
and <code>"termfreq"</code> by the relative sizes of the training classes in
terms of their total lengths in tokens.

additional arguments passed to <code><a rd-options='LiblineaR' href='LiblineaR'>LiblineaR</a></code>

textmodel_svm: Linear SVM classifier for texts

Description

Usage

Arguments

References

See Also

Examples