get_document

annotation

<p>Access document meta data from an annotation object</p>

Provides a set of fast tools for converting a textual corpus into a set of normalized
tables. Users may make use of a Python back end with 'spaCy' <https://spacy.io>
or the Java back end 'CoreNLP' <http://stanfordnlp.github.io/CoreNLP/>. A minimal back
end with no external dependencies is also provided. Exposed annotation tasks include
tokenization, part of speech tagging, named entity recognition, entity linking, sentiment
analysis, dependency parsing, coreference resolution, and word embeddings. Summary
statistics regarding token unigram, part of speech tag, and dependency type frequencies
are also included to assist with analyses.

Taylor Arnold

cleanNLP

get_document: Access document meta data from an annotation object

Description

Usage

Arguments

Value

References

Examples