Remove/replace/extract 'all caps' words containing 2 or more consecutive upper case letters from a string.
rm_caps(
text.var,
trim = !extract,
clean = TRUE,
pattern = "@rm_caps",
replacement = "",
extract = FALSE,
dictionary = getOption("regex.library"),
...
)

ex_caps(
text.var,
trim = !extract,
clean = TRUE,
pattern = "@rm_caps",
replacement = "",
extract = TRUE,
dictionary = getOption("regex.library"),
...
)
Returns a character string with "all caps" words removed. If extract = TRUE, the all caps strings are returned as a list of vectors instead.
text.var: The text variable.
trim: logical. If TRUE, removes leading and trailing white space.
clean: logical. If TRUE, extra white space and escaped characters will be removed.
pattern: A character string containing a regular expression (or a character string for fixed = TRUE) to be matched in the given character vector. By default, "@rm_caps" uses the rm_caps regex from the regular expression dictionary supplied in the dictionary argument.
replacement: Replacement for the matched pattern.
extract: logical. If TRUE, the all caps strings are extracted into a list of vectors.
dictionary: A dictionary of canned regular expressions to search within if pattern begins with "@rm_".
...: Other arguments passed to gsub.
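To make the remove/replace/extract behavior concrete, the following is a minimal base R sketch of the same three operations. The pattern caps_pattern is an assumption: a stand-in for "words of 2 or more consecutive upper case letters", not necessarily the regex stored under "@rm_caps" in the dictionary. The sketch also skips the trim and clean steps.

```r
# Hypothetical stand-in pattern (an assumption, not taken from the package):
# a whole word made of two or more upper case ASCII letters.
caps_pattern <- "\\b[A-Z]{2,}\\b"

x <- "UGGG! When I use caps I am YELLING!"

# Remove: replace every match with the empty string.
removed <- gsub(caps_pattern, "", x, perl = TRUE)

# Extract: collect the matches into a list with one vector per input string.
extracted <- regmatches(x, gregexpr(caps_pattern, x, perl = TRUE))

# Replace: wrap the pattern in a capture group and lowercase it with \\L,
# which gsub() only honors when perl = TRUE.
lowered <- gsub(paste0("(", caps_pattern, ")"), "\\L\\1", x, perl = TRUE)

print(removed)
print(extracted)
print(lowered)
```

Single capital letters such as "I" are left alone because the pattern requires at least two consecutive upper case letters.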
Other rm_ functions:
rm_abbreviation(),
rm_between(),
rm_bracket(),
rm_caps_phrase(),
rm_citation(),
rm_citation_tex(),
rm_city_state(),
rm_city_state_zip(),
rm_date(),
rm_default(),
rm_dollar(),
rm_email(),
rm_emoticon(),
rm_endmark(),
rm_hash(),
rm_nchar_words(),
rm_non_ascii(),
rm_non_words(),
rm_number(),
rm_percent(),
rm_phone(),
rm_postal_code(),
rm_repeated_characters(),
rm_repeated_phrases(),
rm_repeated_words(),
rm_tag(),
rm_time(),
rm_title_name(),
rm_url(),
rm_white(),
rm_zip()
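x <- c("UGGG! When I use caps I am YELLING!")
rm_caps(x)                          # remove the all caps words
rm_caps(x, replacement = "\\L\\1")  # replace each match with its lower case form
ex_caps(x)                          # extract the all caps words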