stopwords

Collection of stopwords in multiple languages

This function returns a character vector of stopwords for a given language, identified by its ISO 639-1 code, and lets you choose which source the stopwords are drawn from.

The default source is the Snowball stopwords collection, but other sources are also available.

Usage
stopwords(language = "en", source = "snowball")
Arguments
language

the language of the stopwords, given as a two-letter ISO 639-1 code (default "en")

source

the stopwords source to use (default "snowball"). To list the currently available options, call stopwords_getsources().

Details

Each stopword list is identified by its two-letter ISO 639-1 language code; see https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes. For backwards compatibility, the full English language names used for stopwords in the quanteda package are also accepted, although these are deprecated.
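
Because not every source covers every language, it can help to check what is available before requesting a list. The sketch below assumes the installed version of the package also exports stopwords_getlanguages(), which is not described on this page; the variable names are illustrative only.

```r
library(stopwords)

# List the stopword sources bundled with the installed package version.
sources <- stopwords_getsources()
print(sources)

# List the language codes offered by the default source.
# stopwords_getlanguages() is assumed to be available (not covered on this page).
snowball_langs <- stopwords_getlanguages("snowball")
print(snowball_langs)

# Request a list only if the chosen source provides that language.
lang <- "de"
if (lang %in% snowball_langs) {
  print(head(stopwords(lang, source = "snowball")))
}
```

Checking the code against the source first avoids an error when a language is missing from a particular collection.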

Value

a character vector containing the stopwords
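
Since the return value is a plain character vector, it can be used directly with base R set operations. The following is a minimal sketch of filtering stopwords out of a tokenized sentence; the whitespace tokenizer and the sample text are illustrative assumptions, not part of the package.

```r
library(stopwords)

# Illustrative input, split on whitespace and lower-cased.
text <- "The quick brown fox jumps over the lazy dog"
tokens <- tolower(strsplit(text, "\\s+")[[1]])

# English stopwords from the default Snowball source.
sw <- stopwords("en", source = "snowball")

# Keep only the tokens that are not in the stopword list.
kept <- tokens[!(tokens %in% sw)]
print(kept)
```

Lower-casing the tokens first matters here because matching with %in% is case-sensitive.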

Aliases
  • stopwords
Examples
# NOT RUN {
stopwords("en")
stopwords("de")
# }
Documentation reproduced from package stopwords, version 1.0, License: MIT + file LICENSE
