tokenize

tokenize_tbl

tokenize_tidytext

tokenize_tidy

Simple version of tokenizer function.

This is the R wrapper package for Kiwi (Korean Intelligent Word Identifier),
a blazing-fast morphological analyzer for Korean.
It supports user dictionary configuration and frequency-based detection of
unregistered nouns.

mrchypark

elbird

Blazing Fast Morphological Analyzer Based on Kiwi(Korean
Intelligent Word Identifier)

Chanyub Park

tokenize function


Arguments


<dl>

<dt>text</dt>
<dd>Target text to tokenize.</dd>


<dt>match_option</dt>
<dd>A <code>Match</code> option value. Default is <code>Match$ALL</code>.</dd>


<dt>stopwords</dt>
<dd>Stopwords option. Default is <code>TRUE</code>, which uses the embedded stopwords dictionary.
If <code>FALSE</code>, the embedded stopwords dictionary is not used.
If a character string, it is treated as the path to a dictionary txt file, and that file is used.
If a <code>Stopwords</code> class object, that object is used.
Any other (invalid) value behaves the same as <code>FALSE</code>.
See <code>analyze()</code> for how to use the <code>stopwords</code> parameter.</dd>

</dl>
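
The arguments above can be illustrated with a minimal sketch. It assumes the elbird package is installed and that its Kiwi model files are available locally; the sample sentence and the variable names are made up for illustration. The shape of the returned object is not described in this section, so the sketch only inspects it with `str()`.

```r
# Sketch only: assumes elbird is installed and its Kiwi model files are available.
# The sample sentence and variable names are illustrative, not from the package docs.
text <- "안녕하세요. 형태소 분석기 테스트입니다."

if (requireNamespace("elbird", quietly = TRUE)) {
  library(elbird)

  # Documented defaults: match_option = Match$ALL, stopwords = TRUE
  # (the embedded stopwords dictionary is used).
  tokens_default <- tokenize(text)

  # Same match option spelled out, but without the embedded stopwords dictionary.
  tokens_no_stop <- tokenize(text, match_option = Match$ALL, stopwords = FALSE)

  # Inspect the structure of both results without assuming their exact shape.
  str(tokens_default)
  str(tokens_no_stop)
}
```

Comparing the two results is a quick way to see which tokens the embedded stopwords dictionary filters out for a given input.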
