Note: a newer version (2.3) of this package is available.

stopwords

Overview

Exposes the full Stopwords ISO collection as an easy-to-use R data structure.

Installation

# Install the released version from CRAN:
install.packages("stopwords")

# Or get the development version from GitHub:
# install.packages("devtools")
devtools::install_github("davnn/stopwords")

Usage

The package exports a single name, stopwords: a named list of character vectors, one vector of stopwords per language. Each list element is referenced by its ISO 639-1 language code.

library(stopwords)

stopwords$de
# [1] "a"     "ab"    "aber"  "ach"   "acht"  "achte" ...

stopwords$en
# [1] "'ll"   "'tis"  "'twas" "'ve"   "10"    "39" ...
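Because each element is a plain character vector, removing stopwords from a set of tokens is a matter of set membership. The following is a minimal sketch, not part of the package: `remove_stopwords` is a hypothetical helper, and `en_sample` is a small stand-in for `stopwords$en` so that the example runs without the package installed.

```r
# Hypothetical helper (not exported by the package):
# drop every token that appears in the stopword vector `words`.
remove_stopwords <- function(tokens, words) {
  # Compare in lower case, since the stopword lists are lower case
  tokens[!(tolower(tokens) %in% words)]
}

# Stand-in for stopwords$en (assumption: same shape, far fewer entries)
en_sample <- c("the", "a", "is", "on")

tokens <- c("The", "cat", "is", "on", "a", "mat")
filtered <- remove_stopwords(tokens, en_sample)
print(filtered)
# [1] "cat" "mat"
```

With the package attached, the same call would take the real vector, for example `remove_stopwords(tokens, stopwords$en)`.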

The following languages are currently available:

  • af, Afrikaans
  • ar, Arabic
  • hy, Armenian
  • eu, Basque
  • bn, Bengali
  • br, Breton
  • bg, Bulgarian
  • ca, Catalan
  • zh, Chinese
  • hr, Croatian
  • cs, Czech
  • da, Danish
  • nl, Dutch
  • en, English
  • eo, Esperanto
  • et, Estonian
  • fi, Finnish
  • fr, French
  • gl, Galician
  • de, German
  • el, Greek
  • ha, Hausa
  • he, Hebrew
  • hi, Hindi
  • hu, Hungarian
  • id, Indonesian
  • ga, Irish
  • it, Italian
  • ja, Japanese
  • ko, Korean
  • ku, Kurdish
  • la, Latin
  • lt, Lithuanian
  • lv, Latvian
  • ms, Malay
  • mr, Marathi
  • no, Norwegian
  • fa, Persian
  • pl, Polish
  • pt, Portuguese
  • ro, Romanian
  • ru, Russian
  • sk, Slovak
  • sl, Slovenian
  • so, Somali
  • st, Southern Sotho
  • es, Spanish
  • sw, Swahili
  • sv, Swedish
  • th, Thai
  • tl, Tagalog
  • tr, Turkish
  • uk, Ukrainian
  • ur, Urdu
  • vi, Vietnamese
  • yo, Yoruba
  • zu, Zulu
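Since the language codes above are the element names of the list, they can also be inspected programmatically. A minimal sketch follows; `has_language` is a hypothetical helper and `sw_sample` is a two-language stand-in with the same shape as the package's `stopwords` list, used here so that the example runs on its own.

```r
# Hypothetical helper: TRUE if `code` is an element name of the list `sw`.
has_language <- function(sw, code) {
  code %in% names(sw)
}

# Stand-in for the package's `stopwords` list (assumption: same structure)
sw_sample <- list(
  de = c("aber", "ach"),
  en = c("the", "a")
)

names(sw_sample)               # language codes present in the list
has_language(sw_sample, "de")  # TRUE
has_language(sw_sample, "xx")  # FALSE
lengths(sw_sample)             # number of stopwords per language
```

Checking the names first is worthwhile because `$` on a list returns `NULL` for a name that does not exist, rather than raising an error.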

Contributing

If stopwords are missing, first check that you have the latest version of this package installed and that it includes the latest version of Stopwords ISO.
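One way to see which version is installed is base R's `utils::packageVersion()`. The sketch below wraps it in a hypothetical helper, `installed_version`, that returns `NULL` instead of failing when the package is absent.

```r
# Hypothetical helper: installed version of `pkg`, or NULL if not installed.
installed_version <- function(pkg) {
  tryCatch(utils::packageVersion(pkg), error = function(e) NULL)
}

v <- installed_version("stopwords")
if (is.null(v)) {
  message("stopwords is not installed")
} else {
  message("installed stopwords version: ", as.character(v))
}
```

To compare against the repository, `utils::old.packages()` lists installed packages for which a newer version is available; it needs network access and a configured repository.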

If there is a newer version of Stopwords ISO available, please submit a pull request with the updated version.

If you would like to contribute stopwords, please do so in the official Stopwords ISO repositories.

License

This package, as well as the source repositories, is licensed under the MIT license.

Version

0.1.0

License

MIT + file LICENSE

Last Published

November 12th, 2017
