quanteda (version 1.5.2)

tokens_tolower: Convert the case of tokens

Description

tokens_tolower and tokens_toupper convert the tokens of a tokens object to lower case or upper case, respectively, and re-index the types.

Usage

tokens_tolower(x, keep_acronyms = FALSE)

tokens_toupper(x)

Arguments

x

the input object whose character/tokens/feature elements will be case-converted

keep_acronyms

logical; if TRUE, do not lowercase any all-uppercase words (applies only to *_tolower functions)
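
As a brief illustration of keep_acronyms, the sketch below lowercases a short text with and without the option. The input sentence and the object names (toks_acr, lower_all, lower_keep) are made up for illustration and are not part of the package documentation. The sketch assumes that a multi-letter, all-uppercase token such as "NATO" counts as an all-uppercase word.

```r
library(quanteda)

# Illustrative input (hypothetical, not from the package documentation)
toks_acr <- tokens("The NATO summit")

# Default (keep_acronyms = FALSE): every token is lowercased
lower_all <- tokens_tolower(toks_acr)

# keep_acronyms = TRUE: all-uppercase tokens are left unchanged,
# while mixed-case tokens such as "The" are still lowercased
lower_keep <- tokens_tolower(toks_acr, keep_acronyms = TRUE)

# Inspect both results as plain character vectors
as.character(lower_all)
as.character(lower_keep)
```

Comparing the two vectors shows which tokens the option protects from conversion.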

Examples

# for a tokens object
toks <- tokens(c(txt1 = "b A A", txt2 = "C C a b B"))
tokens_tolower(toks)
tokens_toupper(toks)
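
The re-indexing of types mentioned in the Description can be seen by comparing the types of a tokens object before and after conversion. This is a minimal sketch reusing the example input above; the names types_before and types_after are introduced here for illustration only.

```r
library(quanteda)

toks <- tokens(c(txt1 = "b A A", txt2 = "C C a b B"))

# Before conversion, upper- and lowercase forms are distinct types
types_before <- types(toks)

# After lowercasing, forms that differ only in case share a single type
types_after <- types(tokens_tolower(toks))

# Compare the number of distinct types
length(types_before)
length(types_after)
```

Because the types are re-indexed, the converted object holds fewer distinct types than the original whenever the input mixes cases of the same word.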
