stanza_pipeline

An interface to the 'Python' package 'stanza' <https://stanfordnlp.github.io/stanza/index.html>.
'stanza' is a 'Python' 'NLP' library for many human languages.
It contains support for running various accurate natural language processing tools on 60+ languages.

Florian Schwendinger

stanza

'Stanza' - A 'R' NLP Package for Many Human Languages

Kurt Hornik

Julian Amon

stanza_pipeline function

<dl><dt>language</dt>
<dd>a character string giving the language (default is <code>"en"</code>).</dd>
<dt>model_dir</dt>
<dd>path to the directory in which the <code>Stanza</code> models are stored
(default is <code>"~/stanza_resources"</code>).</dd>
<dt>package</dt>
<dd>a character string giving the model package to use (default is <code>"default"</code>).</dd>
<dt>processors</dt>
<dd>the processors to run. FIXME: we should define whether to use a comma-separated string or a character vector.</dd>
<dt>logging_level</dt>
<dd>a character string giving the logging level (default is <code>"INFO"</code>).
Available levels are <code>c('DEBUG', 'INFO', 'WARNING', 'WARN', 'ERROR', 'CRITICAL', 'FATAL')</code>.</dd>
<dt>use_gpu</dt>
<dd>a logical indicating whether the <code>GPU</code> should be used instead of the <code>CPU</code> (default is <code>FALSE</code>).</dd>
<dt>download_method</dt>
<dd>an integer or character string giving the download method code.
If a character string is provided, it is passed to <code>stanza_download_method_code</code>
to obtain the integer code.
Call <code>stanza_download_method_code</code> directly to look up a code and to list all
available download methods.</dd>
<dt>...</dt>
<dd>additional named arguments passed to the stanza pipeline.</dd></dl>
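
A minimal sketch of a call that uses only the arguments documented above. It assumes that the 'R' package 'stanza', its 'Python' backend, and the English models are already installed and set up; any required initialization or download steps are not shown here. The arguments <code>processors</code> and <code>download_method</code> are left at their defaults because their accepted values are not specified on this page.

```r
library(stanza)

# Assumption: the Python 'stanza' backend is available and the English
# models already exist in model_dir; otherwise this call is expected to fail.
p <- stanza_pipeline(
  language      = "en",                   # default language
  model_dir     = "~/stanza_resources",   # default model directory
  package       = "default",              # default model package
  logging_level = "WARN",                 # one of the documented levels
  use_gpu       = FALSE                   # run on the CPU
)
```

Setting <code>logging_level = "WARN"</code> is only meant to show a non-default value; all other values repeat the documented defaults and could be omitted.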

NLP Pipeline — stanza_pipeline
