tidytext (version 0.1.2)

sentiments: Sentiment lexicons from three sources

Description

Three lexicons for sentiment analysis are combined here in a single tidy data frame. The lexicons are the NRC Emotion Lexicon from Saif Mohammad and Peter Turney, the Bing sentiment lexicon from Bing Liu and collaborators, and the AFINN lexicon of Finn Årup Nielsen. Words containing non-ASCII characters were removed from the lexicons.

Usage

sentiments

Format

A data frame with 23,165 rows and 4 variables:

word

An English word

sentiment

One of positive, negative, anger, anticipation, disgust, fear, joy, sadness, surprise, trust, or NA. The Bing lexicon uses only positive and negative, the NRC lexicon uses every value except NA, and the AFINN lexicon has only NA.

lexicon

The source of the sentiment for the word. One of "nrc", "bing", or "AFINN".

score

A numerical score for the sentiment. This value is NA for the Bing and NRC lexicons, and ranges from -5 to 5 for the AFINN lexicon.
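
Examples

Because the three lexicons share one data frame, a typical first step is to filter on the lexicon column. The following is a minimal sketch in base R, not part of the package: toy_sentiments is a hypothetical stand-in with the same four columns, and its rows are made up for illustration rather than copied from the real lexicons. With tidytext loaded, the same subsetting should apply to sentiments itself.

```r
# Hypothetical stand-in for `sentiments`: same four columns, made-up rows.
toy_sentiments <- data.frame(
  word      = c("abandon", "abandon", "abandon", "happy", "happy", "happy"),
  sentiment = c("fear", "negative", NA, "joy", "positive", NA),
  lexicon   = c("nrc", "bing", "AFINN", "nrc", "bing", "AFINN"),
  score     = c(NA, NA, -2, NA, NA, 3),
  stringsAsFactors = FALSE
)

# Pick one lexicon at a time and keep only the column that carries its information.
bing  <- toy_sentiments[toy_sentiments$lexicon == "bing",  c("word", "sentiment")]
afinn <- toy_sentiments[toy_sentiments$lexicon == "AFINN", c("word", "score")]

# The categorical lexicons have no score, and AFINN has no sentiment label.
no_score_outside_afinn <- all(is.na(
  toy_sentiments$score[toy_sentiments$lexicon != "AFINN"]
))
no_label_in_afinn <- all(is.na(
  toy_sentiments$sentiment[toy_sentiments$lexicon == "AFINN"]
))

print(bing)
print(afinn)
```

Dropping the all-NA column after filtering keeps each subset tidy: Bing and NRC rows need only word and sentiment, while AFINN rows need only word and score.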