Yelp Open Dataset

Creators:
Yelp, Inc.
Publication Date:
2015
Data Category:
Dataset Description:
The Yelp dataset offers a collection of real-world data from Yelp, intended for educational and academic purposes. It encompasses information about businesses, user reviews, photos, and check-ins, providing valuable insights into local commerce and consumer behavior. In total, this dataset contains 6.9M online reviews for 150k businesses and covers 11 metropolitan areas. It also includes more than 200,000 images related to the reviews. The data is distributed as JSON files with a compressed size of 4.9 GB (10.9 GB uncompressed). It consists of multiple sub-datasets:
  1. Yelp Business data: Contains business data including location data, attributes, and categories.
  2. Yelp Review data: Contains full review text data including the user_id that wrote the review and the business_id the review is written for.
  3. Yelp User data: User data including the user's friend mapping and all the metadata associated with the user.
  4. Yelp Checkin data: Check-ins at a business.
  5. Yelp Tip data: Tips written by a user on a business. Tips are shorter than reviews and tend to convey quick suggestions.
  6. Yelp Photo data: Contains photo data including the caption and classification (one of "food", "drink", "menu", "inside" or "outside").
Because the data is available as JSON files, you can use it to teach students about databases, to learn NLP, or as sample production data while you learn how to make mobile apps.
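As a starting point for working with the review data, the sketch below counts reviews per business. It assumes each file stores one JSON object per line, which this description does not state, so check the files you download. The field names user_id and business_id come from the description above; the helper names iter_records and reviews_per_business are made up for illustration.

```python
import json
from collections import Counter
from typing import Iterable, Iterator


def iter_records(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one dict per non-empty line (assumes one JSON object per line)."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def reviews_per_business(review_lines: Iterable[str]) -> Counter:
    """Count reviews per business_id, reading records one at a time."""
    counts = Counter()
    for record in iter_records(review_lines):
        counts[record["business_id"]] += 1
    return counts


# Tiny in-memory sample standing in for the real review file.
sample = [
    '{"user_id": "u1", "business_id": "b1"}',
    '{"user_id": "u2", "business_id": "b1"}',
    '{"user_id": "u1", "business_id": "b2"}',
]
counts = reviews_per_business(sample)
print(counts.most_common(1))  # [('b1', 2)]
```

To run this against a downloaded file, pass an open file handle instead of the sample list, for example by opening the review file with encoding="utf-8" and handing it to reviews_per_business. Reading line by line keeps memory use low, which matters given the multi-gigabyte uncompressed size.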
Variables:
Details:
