txt_feature

a character string, which can be one of 'is_capitalised', 'is_url', 'is_email', 'is_number', 'prefix', 'suffix'

type

for type 'prefix' or 'suffix', the number of characters of the prefix/suffix

Extract basic text features which are useful for entity recognition

Wraps the 'CRFsuite' library <https://github.com/chokkan/crfsuite> allowing users
to fit a Conditional Random Field model and to apply it on existing data.
The focus of the implementation is in the area of Natural Language Processing where this R package allows you to easily build and apply models
for named entity recognition, text chunking, part of speech tagging, intent recognition or classification of any category you have in mind. Next to training, a small web application
is included in the package to allow you to easily construct training data.

Jan Wijffels

crfsuite

Conditional Random Fields for Labelling Sequential Data in
Natural Language Processing

txt_feature: Extract basic text features which are useful for entity recognition

Description

Usage

Arguments

Value

Examples