quanteda (version 1.5.2)

tokens_group: Recombine document tokens by group

Description

Combines the tokens of all documents that share a value of the grouping variable, so that each group becomes a single document.

Usage

tokens_group(x, groups = NULL)

Arguments

x

a tokens object

groups

either: a character vector containing the names of document variables to be used for grouping; or a factor or object that can be coerced into a factor equal in length or rows to the number of documents. See groups for details.

Examples

# NOT RUN {
# tokens_group examples (internal function, hence the ::: operator)
corp <- corpus(c("a a b", "a b c c", "a c d d", "a c c d"),
               docvars = data.frame(grp = c("grp1", "grp1", "grp2", "grp2")))
toks <- tokens(corp)
quanteda:::tokens_group(toks, groups = "grp")
quanteda:::tokens_group(toks, groups = c(1, 1, 2, 2))
quanteda:::tokens_group(toks, groups = factor(c(1, 1, 2, 2), levels = 1:3))
# }
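
To make the grouping idea concrete without relying on the internal function, the following base R sketch mimics the concatenation on a plain named list. It is an illustration only, not quanteda's implementation: the list `toks`, the factor `grp`, and the result `grouped` are stand-ins assumed for this example, and a real tokens object carries additional attributes that a plain list does not.

```r
# Conceptual sketch only: a plain named list stands in for a tokens object.
toks <- list(
  text1 = c("a", "a", "b"),
  text2 = c("a", "b", "c", "c"),
  text3 = c("a", "c", "d", "d"),
  text4 = c("a", "c", "c", "d")
)

# Hypothetical grouping factor, one value per document.
grp <- factor(c("grp1", "grp1", "grp2", "grp2"))

# split() partitions the documents by group; unlist() then concatenates
# the token vectors within each group, preserving document order.
grouped <- lapply(split(toks, grp), unlist, use.names = FALSE)

lengths(grouped)
```

In this sketch `grouped$grp1` holds the seven tokens of the first two documents and `grouped$grp2` the eight tokens of the last two.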
