Return various kinds of stopwords with support for different languages.
stopwords(kind = "en")
- A character string identifying the desired stopword list.
Available stopword lists are:
- Catalan stopwords (obtained from http://latel.upf.edu/morgana/altres/pub/ca_stop.htm),
- Romanian stopwords (extracted from http://snowball.tartarus.org/otherapps/romanian/romanian1.tgz),
- English stopwords from the SMART information retrieval system (obtained from http://jmlr.csail.mit.edu/papers/volume5/lewis04a/a11-smart-stop-list/english.stop) (which coincides with the stopword list used by the MC toolkit (http://www.cs.utexas.edu/users/dml/software/mc/)),
and a set of stopword lists from the Snowball stemmer project in different
languages (obtained from
Supported languages are
swedish. Language names are case sensitive. Alternatively, their
IETF language tags may be used.
is raised if no stopwords are available for the requested
stopwords("en") stopwords("SMART") stopwords("german")