Rotten Tomatoes movies and critic reviews dataset
Creators:
(Leone, Stefano)
Publication Date:
2020
Data Category:
Dataset Description:
The Rotten Tomatoes Movies and Critic Reviews dataset is a collection of information scraped from the Rotten Tomatoes website as of October 31, 2020. It covers over 17,000 movies, with details such as movie titles, descriptions, genres, durations, directors, and actors, as well as user and critic ratings. A distinctive feature of this dataset is that it supports comparisons between audience scores (ratings from regular users) and tomatometer scores (ratings from certified critics), which shows where the two groups differ in their view of a film. In the movies dataset, each record represents a movie available on Rotten Tomatoes, with the URL used for the scraping, movie title, description, genres, duration, director, actors, users' ratings, and critics' ratings.
In the critics dataset, each record represents a critic review published on Rotten Tomatoes, with the URL used for the scraping, critic name, publication, review date, score, and content.
Rotten Tomatoes makes it possible to compare the ratings given by regular users (audience score) with the ratings and reviews provided by critics (tomatometer), who are certified members of various writing guilds or film critic associations. The dataset is 0.23 GB in size.
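The comparison between audience and tomatometer scores can be sketched as follows. This is a minimal illustration using only the Python standard library, not part of the dataset itself. The column names tomatometer_rating and audience_rating and the sample rows are assumptions made for the example; check the actual file headers before relying on them.

```python
import csv
import io

# Hypothetical sample rows. The column names tomatometer_rating and
# audience_rating are assumptions, not confirmed by this description.
SAMPLE = """movie_title,tomatometer_rating,audience_rating
Movie A,90,60
Movie B,40,85
Movie C,75,
"""


def score_gaps(csv_file):
    """Return (title, audience minus tomatometer) for rows with both scores."""
    gaps = []
    for row in csv.DictReader(csv_file):
        critic = row["tomatometer_rating"]
        audience = row["audience_rating"]
        if not critic or not audience:
            continue  # skip movies missing either score
        gaps.append((row["movie_title"], float(audience) - float(critic)))
    # Largest disagreement (in absolute terms) first
    return sorted(gaps, key=lambda g: abs(g[1]), reverse=True)


gaps = score_gaps(io.StringIO(SAMPLE))
for title, gap in gaps:
    print(f"{title}: {gap:+.0f}")
```

A positive gap means the audience rated the movie higher than the critics did. To run this on the real data, pass an open file handle for the movies file instead of the in-memory sample.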
The dataset is structured into two main components:
- Movies Dataset: Each record represents a movie available on Rotten Tomatoes, containing fields such as:
  - rotten_tomatoes_link: The specific URL from which the movie data was scraped.
  - movie_title: The title of the movie as displayed on the Rotten Tomatoes website.
  - movie_info: A brief description of the movie.
  - genres: The genres associated with the movie, separated by commas if multiple.
  - original_release_date: The date on which the movie was originally released.
  - content_rating: The category indicating the movie's suitability for different audiences.
  - critics_consensus: Comments from Rotten Tomatoes summarizing critics' opinions.
- Critics Dataset: Each record represents a critic's review published on Rotten Tomatoes, including details such as:
  - critic_name: The name of the critic who reviewed the movie.
  - top_critic: A boolean value indicating whether the critic is classified as a top critic.
  - publisher_name: The name of the publication for which the critic works.
  - review_type: Specifies whether the review was labeled as 'fresh' or 'rotten'.
  - review_score: The score provided by the critic for the movie.
  - review_date: The date when the review was published.
  - review_content: The content of the review.
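The two components can be combined through the scraping URL. The sketch below, using only the Python standard library, splits the comma-separated genres field and computes the share of 'fresh' reviews per movie. Several details are assumptions for illustration: that the critics records carry the URL under the same rotten_tomatoes_link name, that top_critic is stored as the text True/False, and the sample rows themselves.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample data standing in for the two files.
MOVIES_CSV = """rotten_tomatoes_link,movie_title,genres
m/example_one,Example One,"Comedy, Drama"
m/example_two,Example Two,Horror
"""

# Assumption: critics records share the rotten_tomatoes_link column name.
REVIEWS_CSV = """rotten_tomatoes_link,critic_name,top_critic,review_type
m/example_one,Critic A,True,Fresh
m/example_one,Critic B,False,Rotten
m/example_one,Critic C,False,Fresh
m/example_two,Critic A,True,Rotten
"""


def load_movies(csv_file):
    """Index movies by link and split the genres field into a list."""
    movies = {}
    for row in csv.DictReader(csv_file):
        row["genres"] = [g.strip() for g in row["genres"].split(",") if g.strip()]
        movies[row["rotten_tomatoes_link"]] = row
    return movies


def fresh_share(csv_file, top_only=False):
    """Return {link: fraction of reviews labeled fresh}."""
    counts = defaultdict(lambda: [0, 0])  # link -> [fresh count, total count]
    for row in csv.DictReader(csv_file):
        # Assumption: top_critic is serialized as the text "True" or "False".
        is_top = row["top_critic"].strip().lower() == "true"
        if top_only and not is_top:
            continue
        tally = counts[row["rotten_tomatoes_link"]]
        if row["review_type"].strip().lower() == "fresh":
            tally[0] += 1
        tally[1] += 1
    return {link: fresh / total for link, (fresh, total) in counts.items()}


movies = load_movies(io.StringIO(MOVIES_CSV))
shares = fresh_share(io.StringIO(REVIEWS_CSV))
for link, share in shares.items():
    print(f"{movies[link]['movie_title']}: {share:.0%} fresh")
```

Comparing labels in lowercase keeps the count robust to 'Fresh' versus 'fresh' capitalization. Passing top_only=True restricts the share to reviews by top critics.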
Variables:
Details: