food

Showing 1-2 of 2 results

TripAdvisor European restaurants

Creators: (Leone, Stefano)
Publication Date: 2021
Creators: (Leone, Stefano)

TripAdvisor is the most popular travel website and it stores data for almost all restaurants, showing locations (even latitude and longitude coordinates), restaurant descriptions, user ratings and reviews, and many more aspects. The dataset is 0.68 GB large.

The TripAdvisor dataset includes 1,083,397 restaurants with attributes such as location data, average rating, number of reviews, open hours, cuisine types, awards, etc.

The dataset combines the restaurants from the main European countries, the data has been scraped in early May 2021.

The dataset is structured with various variables for each restaurant, such as:

  • restaurant_link: Unique TripAdvisor restaurant link.
  • restaurant_name: Name of the restaurant on TripAdvisor.
  • original_location: Original location displayed on TripAdvisor.
  • country: Country name retrieved from original_location.
  • region: Region name retrieved from original_location.
  • province: Province name retrieved from original_location.
  • city: City name retrieved from original_location.
  • address: Address displayed on TripAdvisor.
  • latitude: Latitude coordinate.
  • longitude: Longitude coordinate.
  • claimed: Indicates if the restaurant business is claimed on TripAdvisor.
  • awards: Award names.
  • popularity_detailed: Detailed popularity ranking.
  • popularity_generic: Generic popularity ranking (among all places to eat in the area).

Food.com Recipe & Review Data

Creators: Majumder, Bodhisattwa P.; Li, Shuyang; Ni, Jianmo; McAuley, Julian
Publication Date: 2019
Creators: Majumder, Bodhisattwa P.; Li, Shuyang; Ni, Jianmo; McAuley, Julian
This dataset consists of 180K+ recipes and 700K+ recipe reviews covering 18 years of user interactions and uploads on Food.com (formerly GeniusKitchen), an online recipe aggregator. This extensive collection allows for in-depth analysis of culinary trends, user preferences, and recipe characteristics over nearly two decades.The dataset is 0,85 GB in size and contains three sets of data from Food.com:Interaction splits

  • interactions_test.csv
  • interactions_validation.csv
  • interactions_train.csv

Preprocessed data for result reproduction

In this format, the recipe text metadata is tokenized via the GPT subword tokenizer with start-of-step, etc. tokens.

  • PP_recipes.csv
  • PP_users.csv

To convert these files into the pickle format required to run our code off-the-shelf, you may use pandas.read_csv and pandas.to_pickle to convert the CSV’s into the proper pickle format.

 

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.