matrix_via_r

Should punctuation and digits be stripped
 from the text before constructing the document term matrix? If <code>TRUE</code>,
 the default:<ul>
<li>The corporaexplorer object will be lighter and most searches in
 the corpus exploration app will be faster.</li>
<li>Searches including punctuation and digits will be carried out in
 the full text documents.</li>
<li>The only "risk" with this strategy is that the corpus exploration
 app can in some cases produce false positives. For example, searching for the
 term "donkey" will also find the term "don%key", because the two terms are identical once the "%" has been stripped.
This should not be a problem for the vast majority of use cases, but if
one so desires, there are three different solutions: set this parameter to
<code>FALSE</code>, create a corporaexplorerobject without a matrix by setting
the <code>use_matrix</code> parameter to <code>FALSE</code>, or run
<code><a rd-options="corporaexplorer" href="/link/run_corpus_explorer?package=corporaexplorer&version=0.6.2&to=corporaexplorer" data-mini-rdoc="corporaexplorer::run_corpus_explorer">run_corpus_explorer</a></code> with the
<code>use_matrix</code> parameter set to <code>FALSE</code>.</li>
</ul>If <code>FALSE</code>, the corporaexplorer object will be larger, and most
 simple searches will be slower.
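
The false-positive case described above can be reproduced with a few lines of base R. This is only a sketch of the general idea: the helper strip_punct_digits() is hypothetical and written for illustration, and it is not the code corporaexplorer uses internally to build its document term matrix.

```r
# Hypothetical helper (not part of corporaexplorer): remove punctuation and
# digits from each element of a character vector using POSIX character classes.
strip_punct_digits <- function(x) {
  gsub("[[:punct:][:digit:]]", "", x)
}

terms <- c("donkey", "don%key")
stripped <- strip_punct_digits(terms)

# Both terms collapse to the same string, so an index built from the
# stripped text cannot tell them apart.
print(stripped)
print(stripped[1] == stripped[2])
```

A search that itself contains punctuation or digits does not go through the stripped index; as stated above, it is carried out in the full text documents.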

matrix_without_punctuation

internal

Facilitates dynamic exploration of text collections through
an intuitive graphical user interface.
The package contains 1) a helper function to convert a data
frame to a 'corporaexplorerobject', 2) a 'Shiny' app for fast and flexible
exploration of a 'corporaexplorerobject', and 3) a 'Shiny' app for simple
retrieval/extraction of documents from a 'corporaexplorerobject' in a
reading-friendly format. The intended primary audience
is qualitatively oriented researchers who rely on close reading of textual
documents as part of their academic activity. Auto-scrolling is enabled by the 'jquery.scrollTo' plugin,
see <https://github.com/flesler/jquery.scrollTo>. For 'shinytest' to work, 'PhantomJS' (<http://phantomjs.org/>) is required.

Kristian Lundby Gjerde

corporaexplorer

A 'Shiny' App for Exploration of Text Collections

matrix_via_r: Create document term matrix for fast search of single words
