The raw data behind the story "Al Gore's New Movie Exposes The Big Flaw In Online Movie Ratings" https://fivethirtyeight.com/features/al-gores-new-movie-exposes-the-big-flaw-in-online-movie-ratings/.
ratings
A data frame with 80053 rows representing movie ratings and 27 variables:
The date at which the rating was recorded.
The number of respondents in a category associated with a given timestamp.
The subgroups of respondents differentiated by demographics like gender, age, and nationality.
The website associated with a given category's responses.
The average rating reported by a given category.
The mean rating reported by a given category.
The median rating reported by a given category.
The count of votes denoting a rating of one that respondents gave.
The count of votes denoting a rating of two that respondents gave.
The count of votes denoting a rating of three that respondents gave.
The count of votes denoting a rating of four that respondents gave.
The count of votes denoting a rating of five that respondents gave.
The count of votes denoting a rating of six that respondents gave.
The count of votes denoting a rating of seven that respondents gave.
The count of votes denoting a rating of eight that respondents gave.
The count of votes denoting a rating of nine that respondents gave.
The count of votes denoting a rating of ten that respondents gave.
The percentage of votes denoting a rating of one that respondents gave.
The percentage of votes denoting a rating of two that respondents gave.
The percentage of votes denoting a rating of three that respondents gave.
The percentage of votes denoting a rating of four that respondents gave.
The percentage of votes denoting a rating of five that respondents gave.
The percentage of votes denoting a rating of six that respondents gave.
The percentage of votes denoting a rating of seven that respondents gave.
The percentage of votes denoting a rating of eight that respondents gave.
The percentage of votes denoting a rating of nine that respondents gave.
The percentage of votes denoting a rating of ten that respondents gave.
# NOT RUN {
# To convert data frame to tidy data (long) format, run:
library(tidyverse)
library(stringr)
ratings_tidy <- ratings %>%
gather(votes, count, -c(timestamp, respondents, category, link, average, mean, median)) %>%
arrange(timestamp)
# }
Run the code above in your browser using DataLab