Learn R Programming

textclean (version 0.3.0)

strip: Strip Text

Description

Strip text of unwanted characters.

strip.character - factor method for strip.

strip.factor - factor method for strip.

strip.default - factor method for strip.

strip.list - factor method for strip.

Usage

strip(x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = FALSE, lower.case = TRUE)
"strip"(x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = FALSE, lower.case = TRUE)
"strip"(x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = TRUE, lower.case = TRUE)
"strip"(x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = TRUE, lower.case = TRUE)
"strip"(x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = TRUE, lower.case = TRUE)

Arguments

x
The text variable.
char.keep
A character vector of symbols (i.e., punctuation) that strip should keep. The default is to strip every symbol except apostrophes and a double tilde "~~". The double tilde "~~" is included for a convenient means of keeping word groups together in functions that split text apart based on spaces. To remove double tildes "~~" set char.keep to NULL.
digit.remove
logical. If TRUE strips digits from the text.
apostrophe.remove
logical. If TRUE removes apostrophes from the output.
lower.case
logical. If TRUE forces all alpha characters to lower case.

Value

Returns a vector of text that has been stripped of unwanted characters.

Examples

Run this code
## Not run: 
# DATA$state #no strip applied
# strip(DATA$state)
# strip(DATA$state, apostrophe.remove=TRUE)
# strip(DATA$state, char.keep = c("?", "."))
# ## End(Not run)

Run the code above in your browser using DataLab