- x
a (quanteda) tokens-class
object
- pattern
a character vector, list of character vectors, dictionary,
or collocations object. See pattern for details.
- window
the number of context words to be displayed around the keyword
- valuetype
the type of pattern matching: "glob"
for "glob"-style
wildcard expressions; "regex"
for regular expressions; or "fixed"
for
exact matching. See valuetype for details.
- case_insensitive
logical; if TRUE
, ignore case when matching a
pattern
or dictionary values
- hard_cut
(logical) - if TRUE then a context must have window
x 2 tokens,
if FALSE it can have window
x 2 or fewer (e.g. if a doc begins with a target word,
then context will have window
tokens rather than window
x 2)
- rm_keyword
(logical) if FALSE, keyword matching pattern is included in the tokenized contexts
- verbose
(logical) if TRUE, report the total number of instances per pattern found