Unlimited learning, half price | 50% off

Last chance! 50% off unlimited learning

Sale ends in


stopwords (version 0.1.0)

stopwords: Stopwords ISO Data

Description

The Stopwords ISO Dataset is the most comprehensive collection of stopwords for multiple languages. The collection follows the ISO 639-1 language code.

Usage

stopwords

Arguments

Format

A list of character vectors that represent stopwords:

af

Afrikaans

ar

Arabic

hy

Armenian

eu

Basque

bn

Bengali

br

Breton

bg

Bulgarian

ca

Catalan

zh

Chinese

hr

Croatian

cs

Czech

da

Danish

nl

Dutch

en

English

eo

Esperanto

et

Estonian

fi

Finnish

fr

French

gl

Galician

de

German

el

Greek

ha

Hausa

he

Hebrew

hi

Hindi

hu

Hungarian

id

Indonesian

ga

Irish

it

Italian

ja

Japanese

ko

Korean

ku

Kurdish

la

Latin

lt

Lithuanian

lv

Latvian

ms

Malay

mr

Marathi

no

Norwegian

fa

Persian

pl

Polish

pt

Portuguese

ro

Romanian

ru

Russian

sk

Slovak

sl

Slovenian

so

Somali

st

Southern Sotho

es

Spanish

sw

Swahili

sv

Swedish

th

Thai

tl

Tagalog

tr

Turkish

uk

Ukrainian

ur

Urdu

vi

Vietnamese

yo

Yoruba

zu

Zulu

Examples

Run this code
# NOT RUN {
stopwords$en
# [1] "'ll" "'tis" "'twas" ...
stopwords$de
# [1] "a" "ab" "aber" "ach" ...

# }

Run the code above in your browser using DataLab