textFeatures

text

An object of class <code><a rd-options="koRpus" href="/link/kRp.tagged-class?package=koRpus&version=0.06-5&to=koRpus" data-mini-rdoc="koRpus::kRp.tagged-class">kRp.tagged-class</a></code>,
<code><a rd-options="koRpus" href="/link/kRp.txt.freq-class?package=koRpus&version=0.06-5&to=koRpus" data-mini-rdoc="koRpus::kRp.txt.freq-class">kRp.txt.freq-class</a></code> or <code><a rd-options="koRpus" href="/link/kRp.analysis-class?package=koRpus&version=0.06-5&to=koRpus" data-mini-rdoc="koRpus::kRp.analysis-class">kRp.analysis-class</a></code>. Can
also be a list of these objects, if you want to analyze more than one text at once.

hyphen

An object of class <code><a rd-options="koRpus" href="/link/kRp.hyphen-class?package=koRpus&version=0.06-5&to=koRpus" data-mini-rdoc="koRpus::kRp.hyphen-class">kRp.hyphen-class</a></code>, if <code>text</code> has
already been hyphenated. If <code>text</code> is a list and <code>hyphen</code> is not <code>NULL</code>, it must
also be a list with one object for each text, in the same order.


This function combines several of <code>koRpus</code>' methods to extract the 9-Feature Set for
authorship detection (Brennan, Afroz & Greenstadt, 2011; Brennan & Greenstadt, 2009).
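As a rough illustration of how the <code>text</code> and <code>hyphen</code> arguments fit together, the sketch below tokenizes a short character vector, hyphenates it, and passes both objects to <code>textFeatures()</code>, first for a single text and then for a list of texts. It assumes that <code>koRpus</code> is installed with English support (newer releases ship languages in separate packages such as <code>koRpus.lang.en</code>) and that <code>tokenize()</code> output is an acceptable stand-in for TreeTagger output. The sample sentences and variable names are made up for the example.

```r
# Hypothetical sample data; any character vector would do.
sample_text <- c(
  "This is a short example text.",
  "It only has a few brief sentences, so the numbers mean little."
)

# Only run the analysis if koRpus is actually installed.
if (requireNamespace("koRpus", quietly = TRUE)) {
  library(koRpus)
  # Assumption: newer koRpus releases need a separate language package.
  if (requireNamespace("koRpus.lang.en", quietly = TRUE)) {
    library(koRpus.lang.en)
  }

  # Single text: tokenize() returns a kRp.tagged-class object,
  # hyphen() returns a kRp.hyphen-class object.
  tagged <- tokenize(sample_text, format = "obj", lang = "en")
  hyph <- hyphen(tagged, quiet = TRUE)
  print(textFeatures(tagged, hyphen = hyph))

  # Several texts: both arguments become lists in the same order.
  other <- tokenize("Another tiny text. It has two sentences.",
                    format = "obj", lang = "en")
  texts <- list(tagged, other)
  hyphs <- lapply(texts, hyphen, quiet = TRUE)
  print(textFeatures(texts, hyphen = hyphs))
}
```

If <code>hyphen</code> is left at <code>NULL</code>, only <code>text</code> needs to be supplied. Passing objects that have already been hyphenated is mainly useful when the same hyphenation is reused across several analyses.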


A set of tools to analyze texts. Includes, amongst others,
functions for automatic language detection, hyphenation, several indices of
lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability
(e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language
corpora are also provided, to enable frequency analyses (supports Celex and
Leipzig Corpora Collection file formats) and measures like tf-idf. Support for
additional languages can be added on-the-fly or by plugin packages. Note: For
full functionality a local installation of TreeTagger is recommended. 'koRpus'
also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs
for its basic features. The respective R package 'rkward' cannot be installed
directly from a repository, as it is a part of RKWard. To make full use of this
feature, please install RKWard from https://rkward.kde.org (plugins are detected
automatically). Due to some restrictions on CRAN, the full package sources are
only available from the project homepage. To ask for help, report bugs, suggest
feature improvements, or discuss the global development of the package, please
subscribe to the koRpus-dev mailing list
(https://ml06.ispgateway.de/mailman/listinfo/korpus-dev_r.reaktanz.de).

Meik Michalke

koRpus

An R Package for Text Analysis

textFeatures: Extract text features for authorship analysis
