Jester: Jester Dataset

Description

The data set contains the anonymous ratings data from the Jester Online Joke Recommender System collected between April 1999 and May 2003.

Usage

data(Jester)

Arguments

Format

The format is: Formal class 'realRatingMatrix' [package "recommenderlab"]

Details

24983 x 100 rating matrix (24983 users and 100 jokes) with 1,810,455 ratings between -10.00 and +10.00. All selected users have rated 15 or more jokes.

The text for the jokes are also available as a character vector of length 100 via data{Jester_text}.

References

Ken Goldberg, Theresa Roeder, Dhruv Gupta, and Chris Perkins. "Eigentaste: A Constant Time Collaborative Filtering Algorithm." Information Retrieval, 4(2), 133-151. July 2001.

Examples

Run this code

# NOT RUN {
data(Jester)
Jester

hist(getRatings(Jester), main="Distribution of ratings")

# what is the best joke? (highest average rating)
data(Jester_text)
best <- which.max(colMeans(Jester))
cat(Jester_text[best])

# }

Run the code above in your browser using DataLab