lexicon v1.2.1

Lexicons for Text Analysis

A collection of lexical hash tables, dictionaries, and word lists.

Readme

lexicon

Project Status: Active - The project has reached a stable, usable state and is being actively developed.

Description

lexicon is a collection of lexical hash tables, dictionaries, and word lists. The data prefixes help to categorize the data types:

| Prefix | Meaning |
|--------|---------|
| `key_` | A data.frame with a lookup and return value |
| `hash_` | A keyed data.table hash table |
| `freq_` | A data.table of terms with frequencies |
| `profanity_` | A profane words vector |
| `pos_` | A part of speech vector |
| `pos_df_` | A part of speech data.frame |
| `sw_` | A stopword vector |
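
As a rough illustration of how two of these data types might be used, the sketch below filters tokens against a stopword vector and looks terms up in a `key_` data.frame. The helpers `remove_stopwords()` and `lookup_key()` are hypothetical and not part of lexicon. The small stand-in data lets the sketch run without the package, and the column order of the `key_` data.frame (lookup first, return value second) is an assumption based on the prefix description above.

```r
# Hypothetical helper (not part of lexicon): drop tokens found in a stopword vector.
remove_stopwords <- function(tokens, stopwords) {
  tokens[!tolower(tokens) %in% tolower(stopwords)]
}

# Hypothetical helper (not part of lexicon): look terms up in a key_ data.frame.
# Assumes column 1 holds the lookup value and column 2 the return value.
lookup_key <- function(key, terms) {
  key[[2]][match(terms, key[[1]])]
}

# Small stand-ins so the sketch runs without lexicon installed.
stopwords <- c("the", "of", "and")
key <- data.frame(
  lookup = c("can't", "won't"),
  value  = c("cannot", "will not"),
  stringsAsFactors = FALSE
)

# Swap in the packaged data when lexicon is available.
if (requireNamespace("lexicon", quietly = TRUE)) {
  stopwords <- lexicon::sw_fry_25
  key <- lexicon::key_contractions
}

remove_stopwords(c("The", "lexicon", "of", "words"), stopwords)
lookup_key(key, c("can't", "won't"))
```

Terms that are missing from the key come back as `NA`, so callers can decide how to handle unmatched input.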

Data

| Data | Description |
|------|-------------|
| `cliches` | Common Cliches |
| `common_names` | First Names (U.S.) |
| `constraining_loughran_mcdonald` | Loughran-McDonald Constraining Words |
| `emojis_sentiment` | Emoji Sentiment Data |
| `freq_first_names` | Frequent U.S. First Names |
| `freq_last_names` | Frequent U.S. Last Names |
| `function_words` | Function Words |
| `grady_augmented` | Augmented List of Grady Ward’s English Words and Mark Kantrowitz’s Names List |
| `hash_emojis` | Emoji Description Lookup Table |
| `hash_emojis_identifier` | Emoji Identifier Lookup Table |
| `hash_emoticons` | Emoticons |
| `hash_grady_pos` | Grady Ward’s Moby Parts of Speech |
| `hash_internet_slang` | List of Internet Slang and Corresponding Meanings |
| `hash_lemmas` | Lemmatization List |
| `hash_nrc_emotions` | NRC Emotion Table |
| `hash_sentiment_emojis` | Emoji Sentiment Polarity Lookup Table |
| `hash_sentiment_huliu` | Hu Liu Polarity Lookup Table |
| `hash_sentiment_jockers` | Jockers Sentiment Polarity Table |
| `hash_sentiment_jockers_rinker` | Combined Jockers & Rinker Polarity Lookup Table |
| `hash_sentiment_loughran_mcdonald` | Loughran-McDonald Polarity Table |
| `hash_sentiment_nrc` | NRC Sentiment Polarity Table |
| `hash_sentiment_senticnet` | Augmented SenticNet Polarity Table |
| `hash_sentiment_sentiword` | Augmented Sentiword Polarity Table |
| `hash_sentiment_slangsd` | SlangSD Sentiment Polarity Table |
| `hash_sentiment_socal_google` | SO-CAL Google Polarity Table |
| `hash_valence_shifters` | Valence Shifters |
| `key_contractions` | Contraction Conversions |
| `key_corporate_social_responsibility` | Nadra Pencle and Irina Malaescu’s Corporate Social Responsibility Dictionary |
| `key_grade` | Grades Data Set |
| `key_rating` | Ratings Data Set |
| `key_regressive_imagery` | Colin Martindale’s English Regressive Imagery Dictionary |
| `key_sentiment_jockers` | Jockers Sentiment Data Set |
| `modal_loughran_mcdonald` | Loughran-McDonald Modal List |
| `nrc_emotions` | NRC Emotions |
| `pos_action_verb` | Action Word List |
| `pos_df_irregular_nouns` | Irregular Nouns Word Dataframe |
| `pos_df_pronouns` | Pronouns |
| `pos_interjections` | Interjections |
| `pos_preposition` | Preposition Words |
| `profanity_alvarez` | Alejandro U. Alvarez’s List of Profane Words |
| `profanity_arr_bad` | Stackoverflow user2592414’s List of Profane Words |
| `profanity_banned` | bannedwordlist.com’s List of Profane Words |
| `profanity_racist` | Titus Wormer’s List of Racist Words |
| `profanity_zac_anger` | Zac Anger’s List of Profane Words |
| `sw_dolch` | Leveled Dolch List of 220 Common Words |
| `sw_fry_100` | Fry’s 100 Most Commonly Used English Words |
| `sw_fry_1000` | Fry’s 1000 Most Commonly Used English Words |
| `sw_fry_200` | Fry’s 200 Most Commonly Used English Words |
| `sw_fry_25` | Fry’s 25 Most Commonly Used English Words |
| `sw_jockers` | Matthew Jockers’s Expanded Topic Modeling Stopword List |
| `sw_loughran_mcdonald_long` | Loughran-McDonald Long Stopword List |
| `sw_loughran_mcdonald_short` | Loughran-McDonald Short Stopword List |
| `sw_lucene` | Lucene Stopword List |
| `sw_mallet` | MALLET Stopword List |
| `sw_python` | Python Stopword List |

Installation

To install the development version of lexicon, download the zip ball or tar ball from the GitHub repository (https://github.com/trinker/lexicon), decompress it, and run R CMD INSTALL on it. Alternatively, use the pacman package to install the development version:

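```r
# Install pacman first if it is not already available
if (!require("pacman")) install.packages("pacman")
# Install (if needed) and load lexicon from GitHub
pacman::p_load_gh("trinker/lexicon")
```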

Contact

You are welcome to submit suggestions and bug reports at https://github.com/trinker/lexicon/issues?state=open.

Functions in lexicon

| Name | Description |
|------|-------------|
| `hash_sentiment_senticnet` | Augmented SenticNet Polarity Table |
| `constraining_loughran_mcdonald` | Loughran-McDonald Constraining Words |
| `hash_nrc_emotions` | NRC Emotion Table |
| `hash_sentiment_socal_google` | SO-CAL Google Polarity Table |
| `sw_dolch` | Leveled Dolch List of 220 Common Words |
| `sw_fry_100` | Fry's 100 Most Commonly Used English Words |
| `hash_sentiment_slangsd` | SlangSD Sentiment Polarity Table |
| `key_grade` | Grades Data Set |
| `hash_lemmas` | Lemmatization List |
| `hash_sentiment_sentiword` | Augmented Sentiword Polarity Table |
| `key_sentiment_jockers` | Jockers Sentiment Key |
| `lexicon` | Lexicons for Text Analysis |
| `hash_sentiment_emojis` | Emoji Sentiment Polarity Lookup Table |
| `cliches` | Common Cliches |
| `hash_sentiment_huliu` | Hu Liu Polarity Lookup Table |
| `available_data` | Get Available lexicon Data |
| `pos_df_pronouns` | Pronouns |
| `pos_interjections` | Interjections |
| `profanity_racist` | Titus Wormer's List of Racist Words |
| `pos_action_verb` | Action Word List |
| `pos_df_irregular_nouns` | Irregular Nouns Word Dataframe |
| `pos_preposition` | Preposition Words |
| `hash_internet_slang` | List of Internet Slang and Corresponding Meanings |
| `profanity_alvarez` | Alejandro U. Alvarez's List of Profane Words |
| `sw_lucene` | Lucene Stopword List |
| `profanity_arr_bad` | Stackoverflow user2592414's List of Profane Words |
| `sw_mallet` | MALLET Stopword List |
| `profanity_banned` | bannedwordlist.com's List of Profane Words |
| `freq_last_names` | Frequent U.S. Last Names |
| `sw_loughran_mcdonald_long` | Loughran-McDonald Long Stopword List |
| `sw_loughran_mcdonald_short` | Loughran-McDonald Short Stopword List |
| `profanity_zac_anger` | Zac Anger's List of Profane Words |
| `hash_valence_shifters` | Valence Shifters |
| `key_contractions` | Contraction Conversions |
| `sw_python` | Python Stopword List |
| `function_words` | Function Words |
| `modal_loughran_mcdonald` | Loughran-McDonald Modal List |
| `hash_emojis_identifier` | Emoji Identifier Lookup Table |
| `nrc_emotions` | NRC Emotions |
| `hash_sentiment_loughran_mcdonald` | Loughran-McDonald Polarity Table |
| `hash_emoticons` | Emoticons |
| `hash_sentiment_nrc` | NRC Sentiment Polarity Table |
| `key_rating` | Ratings Data Set |
| `key_regressive_imagery` | Colin Martindale's English Regressive Imagery Dictionary |
| `sw_fry_1000` | Fry's 1000 Most Commonly Used English Words |
| `sw_fry_200` | Fry's 200 Most Commonly Used English Words |
| `sw_fry_25` | Fry's 25 Most Commonly Used English Words |
| `sw_jockers` | Matthew Jockers's Expanded Topic Modeling Stopword List |
| `common_names` | First Names (U.S.) |
| `hash_grady_pos` | Grady Ward's Moby Parts of Speech |
| `grady_augmented` | Augmented List of Grady Ward's English Words and Mark Kantrowitz's Names List |
| `hash_emojis` | Emoji Description Lookup Table |
| `key_corporate_social_responsibility` | Nadra Pencle and Irina Mălăescu's Corporate Social Responsibility Dictionary |
| `emojis_sentiment` | Emoji Sentiment Data |
| `freq_first_names` | Frequent U.S. First Names |
| `hash_sentiment_jockers` | Jockers Polarity Lookup Table |
| `hash_sentiment_jockers_rinker` | Combined Jockers & Rinker Polarity Lookup Table |
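
Since the listing includes `available_data`, one way to browse the data sets by prefix might look like the sketch below. The helper `filter_by_prefix()` is hypothetical and not part of lexicon. The sketch also assumes that the first column returned by `available_data()` holds the data set names. It falls back to a few names from the table above when the package is not installed.

```r
# Hypothetical helper (not part of lexicon): keep names that start with a prefix.
filter_by_prefix <- function(data_names, prefix) {
  data_names[startsWith(data_names, prefix)]
}

# Stand-in names taken from the table above, used when lexicon is not installed.
data_names <- c("sw_dolch", "hash_lemmas", "sw_mallet", "key_grade")

if (requireNamespace("lexicon", quietly = TRUE)) {
  # Assumption: the first column of available_data() holds the data set names.
  data_names <- as.character(lexicon::available_data()[[1]])
}

# List every stopword vector by its sw_ prefix
filter_by_prefix(data_names, "sw_")
```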

Details

License GPL-3
LazyData TRUE
Encoding UTF-8
RoxygenNote 6.1.1
BugReports https://github.com/trinker/lexicon/issues?state=open
URL https://github.com/trinker/lexicon
Collate 'available_data.R' 'cliches.R' 'common_names.R' 'constraining_loughran_mcdonald.R' 'freq_first_names.R' 'freq_last_names.R' 'function_words.R' 'grady_augmented.R' 'hash_emoticons.R' 'hash_grady_pos.R' 'hash_internet_slang.R' 'hash_lemmas.R' 'hash_nrc_emotion.R' 'hash_sentiment_emojis.R' 'hash_sentiment_huliu.R' 'utils.R' 'hash_sentiment_jockers.R' 'hash_sentiment_jockers_rinker.R' 'hash_sentiment_loughran_mcdonald.R' 'hash_sentiment_nrc.R' 'hash_sentiment_senticnet.R' 'hash_sentiment_sentiword.R' 'hash_sentiment_slangsd.R' 'hash_sentiment_socal_google.R' 'hash_valence_shifters.R' 'key_contractions.R' 'key_corporate_social_responsibility.R' 'key_grade.R' 'key_ratings.R' 'key_regressive_imagery.R' 'lexicon-package.R' 'modal_loughran_mcdonald.R' 'nrc_emotions.R' 'pos_action_verb.R' 'pos_df_irregular_nouns.R' 'pos_df_pronouns.R' 'pos_interjections.R' 'pos_preposition.R' 'profanity_alvarez.R' 'profanity_arr_bad.R' 'profanity_banned.R' 'profanity_racist.R' 'profanity_zac_anger.R' 'sw_dolch.R' 'sw_fry_100.R' 'sw_fry_1000.R' 'sw_fry_200.R' 'sw_fry_25.R' 'sw_jockers.R' 'sw_loughran_mcdonald.R' 'sw_lucene.R' 'sw_mallet.R' 'sw_python.R'
NeedsCompilation no
Packaged 2019-03-20 16:40:42 UTC; trinker
Repository CRAN
Date/Publication 2019-03-21 10:40:03 UTC
