dtm_resampler

Takes any DTM and randomly resamples from each row, creating a new DTM

This is a collection of functions optimized for working with
with various kinds of text matrices. Focusing on
the text matrix as the primary object - represented
either as a base R dense matrix or a 'Matrix' package sparse
matrix - allows for a consistent and intuitive interface
that stays close to the underlying mathematical foundation
of computational text analysis. In particular, the package
includes functions for working with word embeddings,
text networks, and document-term matrices. Methods developed in
Stoltz and Taylor (2019) <doi:10.1007/s42001-019-00048-6>,
Taylor and Stoltz (2020) <doi:10.1007/s42001-020-00075-8>,
Taylor and Stoltz (2020) <doi:10.15195/v7.a23>, and
Stoltz and Taylor (2021) <doi:10.1016/j.poetic.2021.101567>.

Dustin Stoltz

text2map

R Tools for Text Matrices, Embeddings, and Networks

Marshall Taylor

dtm_resampler function

<dl><dt>dtm</dt>
<dd>Document-term matrix with terms as columns. Works with DTMs
produced by any popular text analysis package, or you can use the
<code>dtm_builder()</code> function.</dd>
<dt>alpha</dt>
<dd>Number indicating proportion of document lengths, e.g.,
<code>alpha = 1</code> returns resampled rows that are the same lengths
as the original DTM.</dd>
<dt>n</dt>
<dd>Integer indicating the length of documents to be returned, e.g.,
<code>n = 100L</code> will bring documents shorter than 100 tokens up to 100,
while bringing documents longer than 100 tokens down to 100.</dd></dl>

Arguments

Resamples an input DTM to generate new DTMs — dtm_resampler

<dl>

<dt>dtm</dt>
<dd>Document-term matrix with terms as columns. Works with DTMs
produced by any popular text analysis package, or you can use the
<code>dtm_builder()</code> function.</dd>


<dt>alpha</dt>
<dd>Number indicating proportion of document lengths, e.g.,
<code>alpha = 1</code> returns resampled rows that are the same lengths
as the original DTM.</dd>


<dt>n</dt>
<dd>Integer indicating the length of documents to be returned, e.g.,
<code>n = 100L</code> will bring documents shorter than 100 tokens up to 100,
while bringing documents longer than 100 tokens down to 100.</dd>

</dl>

dtm_resampler: Resamples an input DTM to generate new DTMs

Description

Usage

Value

Arguments

Details