- x
a (quanteda) tokens-class object
- pattern
a character vector, list of character vectors, dictionary,
or collocations object. See pattern for details.
- window
the number of context words to be displayed on each side of the keyword
- valuetype
the type of pattern matching: "glob" for "glob"-style
wildcard expressions; "regex" for regular expressions; or "fixed" for
exact matching. See valuetype for details.
- case_insensitive
logical; if TRUE, ignore case when matching a
pattern or dictionary values
- hard_cut
(logical) if TRUE, a context must have exactly window x 2 tokens;
if FALSE, it can have window x 2 or fewer (e.g. if a document begins with a target word,
the context will have window tokens rather than window x 2)
- rm_keyword
(logical) if FALSE, the keyword that matched pattern is kept in the tokenized contexts; if TRUE, it is removed
- verbose
(logical) if TRUE, report the total number of instances found per pattern
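
The interaction between window, hard_cut and rm_keyword can be easier to see on a toy example. The sketch below is base R only and is not the package's implementation: context_windows() is a hypothetical helper written for illustration, it works on a plain character vector rather than a tokens object, it matches the keyword exactly and case-sensitively, and it assumes that hard_cut = TRUE drops any context shorter than window x 2 tokens.

```r
# Hypothetical helper, for illustration only (not part of quanteda or any package).
# toks: character vector of tokens; keyword: a single exact, case-sensitive target.
context_windows <- function(toks, keyword, window = 2L,
                            hard_cut = FALSE, rm_keyword = TRUE) {
  hits <- which(toks == keyword)
  contexts <- lapply(hits, function(i) {
    # up to `window` tokens before and after the match
    left  <- tail(toks[seq_len(i - 1L)], window)
    right <- head(toks[-seq_len(i)], window)
    # assumed reading of hard_cut: discard contexts that are not full length
    if (hard_cut && length(left) + length(right) < 2L * window) return(NULL)
    if (rm_keyword) c(left, right) else c(left, keyword, right)
  })
  Filter(Negate(is.null), contexts)
}

toks <- c("immigration", "policy", "is", "debated", "while",
          "immigration", "reform", "stalls", "again")

# hard_cut = FALSE: the document-initial match keeps its shorter context
soft <- context_windows(toks, "immigration", window = 2L)
# soft[[1]] is c("policy", "is"); soft[[2]] is c("debated", "while", "reform", "stalls")

# hard_cut = TRUE: only the full window x 2 context survives
hard <- context_windows(toks, "immigration", window = 2L, hard_cut = TRUE)

# rm_keyword = FALSE: the matched keyword stays inside each context
kept <- context_windows(toks, "immigration", window = 2L, rm_keyword = FALSE)
```

Here the first occurrence of the keyword opens the document, so its context has only window tokens; with hard_cut = TRUE that context is excluded in this sketch.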