write_nametagger

Save a tokenised dataset as nametagger train data

Wraps the 'nametag' library <https://github.com/ufal/nametag>, allowing users to find and extract entities (names, persons, locations, addresses, ...) in raw text and build your own entity recognition models.
Based on a maximum entropy Markov model which is described in Strakova J., Straka M. and Hajic J. (2013) <https://ufal.mff.cuni.cz/~straka/papers/2013-tsd_ner.pdf>.

Jan Wijffels

nametagger

Named Entity Recognition in Texts using 'NameTag'

BNOSAC 

Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic 

Milan Straka 

Jana Straková 

write_nametagger function

<dl><dt>x</dt>
<dd>a tokenised data.frame with columns doc_id, sentence_id, token containing 1 row per token. 
In addition it can have columns lemma and pos representing the lemma and the parts-of-speech tag of the token</dd>
<dt>file</dt>
<dd>the path to the file where the training data will be saved</dd></dl>

Arguments

Save a tokenised dataset as nametagger train data — write_nametagger

<dl>

<dt>x</dt>
<dd>a tokenised data.frame with columns doc_id, sentence_id, token containing 1 row per token. 
In addition it can have columns lemma and pos representing the lemma and the parts-of-speech tag of the token</dd>


<dt>file</dt>
<dd>the path to the file where the training data will be saved</dd>

</dl>

write_nametagger: Save a tokenised dataset as nametagger train data

Description

Usage

Value

Arguments

Examples