This function retrieves stopwords from the type specified in the kind
argument and returns the stopword list as a character vector The default is
English.
Usage
stopwords(kind = "english", verbose = FALSE)
Arguments
kind
The pre-set kind of stopwords (as a character string). Allowed
values are english, SMART, danish, french,
hungarian, norwegian, russian, swedish,
verbose
if FALSE, suppress the annoying warning note
Value
a character vector of stopwords
A note of caution
Stop words are an arbitrary choice imposed by the
user, and accessing a pre-defined list of words to ignore does not mean
that it will perfectly fit your needs. You are strongly encourged to
inspect the list and to make sure it fits your particular requirements.
Details
The stopword list are SMART English stopwords from the SMART information
retrieval system (obtained from
http://jmlr.csail.mit.edu/papers/volume5/lewis04a/a11-smart-stop-list/english.stop)
and a set of stopword lists from the Snowball stemmer project in different
languages (obtained from
http://svn.tartarus.org/snowball/trunk/website/algorithms/*/stop.txt).
Supported languages are arabic, danish, dutch, english, finnish, french,
german, hungarian, italian, norwegian, portuguese, russian, spanish, and
swedish. Language names are case sensitive.