Encode CWB Corpus.
encode(.Object, ...)# S4 method for data.frame
encode(.Object, name, pAttributes = "word",
sAttributes = NULL, registry = Sys.getenv("CORPUS_REGISTRY"),
indexedCorpusDir = NULL, verbose = TRUE)
# S4 method for data.table
encode(.Object, corpus, sAttribute)
a data.frame to encode
further parameters
name of the (new) CWB corpus
columns of .Object with tokens (such as word/pos/lemma)
columns of .Object that will be encoded as structural attributes
path to the corpus registry
directory where to create directory for indexed corpus files
logical, whether to be verbose
the name of the CWB corpus
a single s-attribute