textstat_proxy

This is the underlying function for <code>textstat_dist</code> and
<code>textstat_simil</code>, but it returns a <code>TsparseMatrix</code>.

internal

Textual statistics functions formerly in the 'quanteda' package.
Textual statistics for characterizing and comparing textual data. Includes
functions for measuring term and document frequency, the co-occurrence of
words, similarity and distance between features and documents, feature entropy,
keyword occurrence, readability, and lexical diversity.  These functions
extend the 'quanteda' package and are specially designed for sparse textual data.

Kenneth Benoit

quanteda.textstats

Textual Statistics for the Quantitative Analysis of Textual Data

Kohei Watanabe

Haiyan Wang

Jiong Wei Lua

Jouni Kuha

European Research Council 

textstat_proxy function

<dl><dt>y</dt>
<dd>if a dfm object is provided, proximity between
documents or features in <code>x</code> and <code>y</code> is computed.</dd>
<dt>margin</dt>
<dd>identifies the margin of the dfm on which similarity or
distance will be computed: <code>"documents"</code> for documents or
<code>"features"</code> for word/term features.</dd>
<dt>method</dt>
<dd>character; the method identifying the similarity or distance
measure to be used; see Details.</dd>
<dt>p</dt>
<dd>the power of the Minkowski distance.</dd>
<dt>min_proxy</dt>
<dd>the minimum proximity value to be recorded.</dd>
<dt>rank</dt>
<dd>an integer value specifying the top-n highest proximity values to be
recorded.</dd>
<dt>use_na</dt>
<dd>if <code>TRUE</code>, return <code>NA</code> for proximity to empty
vectors. Note that use of <code>NA</code> makes the proximity matrices denser.</dd></dl>
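
The arguments above can be illustrated with a minimal sketch. It assumes that the 'quanteda' and 'quanteda.textstats' packages are installed and that the first argument, <code>x</code>, is a dfm; the toy documents and the object names (<code>txt</code>, <code>dfmat</code>, <code>sim</code>, and so on) are invented for this illustration and do not come from the package documentation.

```r
# Assumption: both packages are installed and attached.
library(quanteda)
library(quanteda.textstats)

# Three toy documents, invented for this example.
txt <- c(d1 = "a b c d", d2 = "a b c e", d3 = "x y z")
dfmat <- dfm(tokens(txt))

# Cosine similarity between documents; the result is a sparse
# TsparseMatrix rather than a textstat_simil object.
sim <- textstat_proxy(dfmat, margin = "documents", method = "cosine")

# Record only proximity values of at least 0.5 (min_proxy).
sim_min <- textstat_proxy(dfmat, method = "cosine", min_proxy = 0.5)

# Record only the top-ranked proximity values (rank).
sim_top <- textstat_proxy(dfmat, method = "cosine", rank = 1L)

# Minkowski distance between features, with power p = 3.
dist_feat <- textstat_proxy(dfmat, margin = "features",
                            method = "minkowski", p = 3)

# Convert to a dense matrix for inspection (small inputs only).
as.matrix(sim)
```

Both <code>min_proxy</code> and <code>rank</code> limit how many values are stored, which keeps the result sparse for large dfm objects; converting with <code>as.matrix()</code> is only sensible for small inputs such as this one.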

Arguments

[Experimental] Compute document/feature proximity — textstat_proxy


