Learn R Programming

arabicStemR (version 1.3)

Arabic Stemmer for Text Analysis

Description

Allows users to stem Arabic texts for text analysis.

Copy Link

Version

Install

install.packages('arabicStemR')

Monthly Downloads

326

Version

1.3

License

GPL (>= 2)

Maintainer

Rich Nielsen

Last Published

July 18th, 2022

Functions in arabicStemR (1.3)

cleanChars

Clean all characters that are not Latin or Arabic
transliterate

Transliterate Arabic unicode characters into latin characters
removeStopWords

Remove Arabic stopwords.
removePunctuation

Remove punctuation.
stem

Arabic Stemmer for Text Analysis
stemArabic

Arabic Stemmer for Text Analysis
removePrefixes

Remove Arabic prefixes
removeNumbers

Remove English, Arabic, and Farsi numerals.
removeSuffixes

Remove Arabic suffixes
reverse.transliterate

Transliterate latin characters into Arabic unicode characters
removeFarsiNumbers

Remove Farsi numbers
removeArabicNumbers

Remove Arabic numbers
cleanLatinChars

Clean Latin characters
removeNewlineChars

Remove new line characters
removeDiacritics

Remove Arabic diacritics
doStemming

Removes Arabic prefixes and suffixes
fixAlifs

Standardize different hamzas on alif seats
arabicStemR-package

A package for stemming Arabic for text analysis.
removeEnglishNumbers

Remove English numbers