MovieTweetings

Creators:
Dooms, Simon; De Pessemier, Toon; Martens, Luc
Publication Date:
2013
Data Category:
Dataset Description:
MovieTweetings is a dataset consisting of ratings on movies that were contained in well-structured tweets on Twitter. The goal of this dataset is to provide the RecSys community with a live, natural and always up-to-date movie ratings dataset. The dataset has been actively collecting ratings since February 28, 2013, and will be updated as much as possible to incorporate rating data from the newest tweets available. The dataset includes 921,398 ratings from 71,707 unique users. The ratings contained in the tweets are scaled from 0 to 10, as is the norm on the IMDb platform. In total, the dataset has a size of 26,2 MB and consists of two main files:
  • ratings.dat contains extracted ratings, structured as:
    user_id::movie_id::rating::rating_timestamp

    • user_id: Unique identifier for the user.
    • movie_id: IMDb identifier for the movie.
    • rating: User’s score on a 10-star scale.
    • rating_timestamp: Unix timestamp when the rating was extracted.
  • items.dat includes metadata about the rated movies, structured as:
    movie_id::movie_title (movie_year)::genre|genre|genre

    • movie_id: IMDb identifier for the movie.
    • movie_title: Name of the movie along with the release year.
    • genre: Pipe-separated list of genres.
Variables:
Details:

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.