Turn regions of a corpus defined by corpus positions into the original text.
getTokenStream(.Object, ...)# S4 method for numeric
getTokenStream(.Object, corpus, pAttribute,
encoding = NULL, collapse = NULL, beautify = TRUE, cpos = FALSE,
cutoff = NULL)
# S4 method for matrix
getTokenStream(.Object, ...)
# S4 method for character
getTokenStream(.Object, left = NULL, right = NULL,
...)
# S4 method for partition
getTokenStream(.Object, pAttribute, collapse = NULL,
cpos = FALSE, ...)
# S4 method for Regions
getTokenStream(.Object, pAttribute = "word", ...)
an object of class matrix
or partition
further arguments
the CWB corpus
the pAttribute to decode
encoding to use
character string length 1
logical, whether to adjust whitespace before and after interpunctation
logical, whether to return cpos as names of the tokens
maximum number of tokens to be reconstructed
left corpus position
right corpus position