A random sample (n = 5000 of 1,621,956) of the hotel reviews dataset of Wang, Lu, & Zhai (2011), which the authors scraped from http://www.tripadvisor.com.
data(hotel_reviews)
A data frame with 5000 rows and 2 variables:
sentiment. The overall rating for the experience
text. The text review of the hotel
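A minimal sketch of how a data frame with this layout might be inspected. The helper `describe_reviews()` and the `toy` data frame are hypothetical and exist only for illustration. The toy values are invented, not taken from the data, and the numeric ratings are an assumption, since the type of `sentiment` is not stated here.

```r
# Hypothetical helper: checks that a data frame has the documented columns
# (`sentiment` and `text`) and returns a few basic summaries.
describe_reviews <- function(df) {
  stopifnot(is.data.frame(df),
            all(c("sentiment", "text") %in% names(df)))
  list(
    n_reviews    = nrow(df),                           # number of rows
    n_variables  = ncol(df),                           # number of columns
    rating_table = table(df$sentiment),                # count per rating value
    mean_chars   = mean(nchar(as.character(df$text)))  # average review length
  )
}

# Invented stand-in with the same two columns as hotel_reviews.
toy <- data.frame(
  sentiment = c(5, 2, 5),
  text = c("Great stay", "Noisy room", "Loved it"),
  stringsAsFactors = FALSE
)

toy_summary <- describe_reviews(toy)
print(toy_summary$n_reviews)
```

With the package that provides the dataset attached, the same helper could be applied to the real data by running data(hotel_reviews) and then describe_reviews(hotel_reviews).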
Wang, H., Lu, Y., and Zhai, C. (2011). Latent aspect rating analysis without aspect keyword supervision. In Proceedings of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'2011), 618-626. Original URL: 'http://sifaka.cs.uiuc.edu/~wang296/Data/index.html'