Recombine documents tokens by groups
tokens_group(x, groups = NULL)
tokens object
either: a character vector containing the names of document variables to be used for grouping; or a factor or object that can be coerced into a factor equal in length or rows to the number of documents. See groups for details.
# NOT RUN {
# dfm_group examples
corp <- corpus(c("a a b", "a b c c", "a c d d", "a c c d"),
docvars = data.frame(grp = c("grp1", "grp1", "grp2", "grp2")))
toks <- tokens(corp)
quanteda:::tokens_group(toks, groups = "grp")
quanteda:::tokens_group(toks, groups = c(1, 1, 2, 2))
quanteda:::tokens_group(toks, groups = factor(c(1, 1, 2, 2), levels = 1:3))
# }
Run the code above in your browser using DataLab