subset_query

A convenience function that searches for contexts (documents, sentences), and uses the results to <a href="/link/subset?package=corpustools&version=0.5.1" data-mini-rdoc="corpustools::subset">subset</a> the tCorpus token data.

Provides text analysis in R, focusing on the use of a tokenized text format. In this format, the positions of tokens are maintained, and each token can be annotated (e.g., part-of-speech tags, dependency relations).
Prominent features include advanced Lucene-like querying for specific tokens or contexts (e.g., documents, sentences),
similarity statistics for words and documents, exporting to DTM for compatibility with many text analysis packages,
and the possibility to reconstruct original text from tokens to facilitate interpretation.

Kasper Welbers

corpustools

Managing, Querying and Analyzing Tokenized Text

subset_query function

<dl><dt>tc</dt>
<dd>A <code>tCorpus</code></dd>
<dt>query</dt>
<dd>A character string that is a query. See search_contexts for query syntax.</dd>
<dt>feature</dt>
<dd>The name of the feature columns on which the query is used.</dd>
<dt>context_level</dt>
<dd>Select whether the query and subset are performed at the document or sentence level.</dd>
<dt>not</dt>
<dd>If TRUE, perform a NOT search. Return the articles/sentences for which the query is not found.</dd>
<dt>as_ascii</dt>
<dd>if TRUE, perform search in ascii.</dd>
<dt>window</dt>
<dd>If used, uses a word distance as the context (overrides context_level)</dd></dl>

Arguments

A convenience function that searches for contexts (documents, sentences), and uses the results to <a href='https://rdrr.io/r/base/subset.html'>subset</a> the tCorpus token data.

Subset tCorpus token data using a query — subset_query

<dl>

<dt>tc</dt>
<dd>A <code>tCorpus</code></dd>


<dt>query</dt>
<dd>A character string that is a query. See search_contexts for query syntax.</dd>


<dt>feature</dt>
<dd>The name of the feature columns on which the query is used.</dd>


<dt>context_level</dt>
<dd>Select whether the query and subset are performed at the document or sentence level.</dd>


<dt>not</dt>
<dd>If TRUE, perform a NOT search. Return the articles/sentences for which the query is not found.</dd>


<dt>as_ascii</dt>
<dd>if TRUE, perform search in ascii.</dd>


<dt>window</dt>
<dd>If used, uses a word distance as the context (overrides context_level)</dd>

</dl>

subset_query: Subset tCorpus token data using a query

Description

Usage

Arguments

Details

Examples