Recommender Systems

Showing 1-4 of 4 results

Behance Community Art Data

Publication Date: 2016
Creators: He, Ruining; Fang, Chen; Wang, Zhaowen; McAuley, Julian

Likes and image data from the community art website Behance. This is a small, anonymized, version of a larger proprietary dataset.

The dataset is about 3.5 GB large and includes:

  • Users: 63,497
  • Items: 178,788
  • Appreciates (“likes”): 1,000,000

Multi-aspect Reviews

Publication Date: 2013
Creators: Julian McAuley; Jure Leskovec; Dan Jurafsky
These datasets include reviews with multiple rated dimensions. The most comprehensive of these are beer review datasets from Ratebeer and Beeradvocate, which include sensory aspects such as taste, look, feel, and smell. The data set is about 1 GB large.Ratebeer:

  • Number of users: 40,213
  • Number of items: 110,419
  • Number of ratings/reviews: 2,855,232
  • Timespan: April, 2000 – November, 2011

BeerAdvocate:

  • Number of users: 33,387
  • Number of items: 66,051
  • Number of ratings/reviews: 1,586,259
  • Timespan: January, 1998 – November, 2011

 

Popular Movies of TMDb

Publication Date: 2020
Creators: Mondal, Sankha Subhra

This dataset of the 10,000 most popular movies across the world has been fetched through the read API.
TMDB’s free API provides for developers and their team to programmatically fetch and use TMDb’s data.
Their API is to use as long as you attribute TMDb as the source of the data and/or images. Also, they update their API from time to time. The data set is 3.2 MB large.

Web data: Amazon movie reviews

Publication Date: 2012
Creators: McAuley, Julian; Leskovec, Jure

This dataset consists of movie reviews from amazon. The data span a period of more than 10 years, including all ~8 million reviews up to October 2012. Reviews include product and user information, ratings, and a plaintext review.

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.