
Raw Bay Area Craigslist Rental Housing Posts

Publication Date: 2018
Creators: Pennington, Kate

Like many cities, San Francisco doesn’t track rents. I created a panel of historic Craigslist rents by scraping posts archived by the Wayback Machine. Please feel free to use the data, and of course, please cite.

Amazon Brand and Exclusives

Publication Date: 2021
Creators: Jeffries, Adrianne; Yin, Leon

My co-author Adrianne Jeffries and I found Amazon gave its own branded products an advantage over better-rated competitors in search results.

Google Restaurants

Publication Date: 2022
Creators: Yan, An; He, Zhankui; Li, Jiacheng; Zhang, Tianyang; McAuley, Julian

This is a multi-modal dataset of restaurants from Google Local (Google Maps). It includes images and reviews posted by users, as well as other metadata for each restaurant.

TripAdvisor European restaurants

Publication Date: 2021
Creators: Leone, Stefano

The TripAdvisor dataset includes 1,083,397 restaurants with attributes such as location data, average rating, number of reviews, open hours, cuisine types, awards, etc.

The dataset combines the restaurants from the main European countries.

Amazon product co-purchasing network metadata

Publication Date: 2006
Creators: Leskovec, Jure

The data was collected by crawling the Amazon website and contains product metadata and review information about 548,552 different products (books, music CDs, DVDs, and VHS video tapes). For each product the following information is available:

Title
Salesrank
List of similar products (that get co-purchased with the current product)
Detailed product categorization
Product reviews: time, customer, rating, number of votes, number of people that found the review helpful

The data was collected in summer 2006.

Web data: Amazon Fine Foods reviews

Publication Date: 2012
Creators: McAuley, Julian; Leskovec, Jure

This dataset consists of reviews of fine foods from Amazon. The data span a period of more than 10 years, including all ~500,000 reviews up to October 2012. Reviews include product and user information, ratings, and a plaintext review.

Web data: Amazon movie reviews

Publication Date: 2012
Creators: McAuley, Julian; Leskovec, Jure

This dataset consists of movie reviews from Amazon. The data span a period of more than 10 years, including all ~8 million reviews up to October 2012. Reviews include product and user information, ratings, and a plaintext review.

Amazon product co-purchasing network and ground-truth communities

Publication Date: 2012
Creators: Yang, Jaewon; Leskovec, Jure

The network was collected by crawling the Amazon website. It is based on the "Customers Who Bought This Item Also Bought" feature of the Amazon website. If a product i is frequently co-purchased with product j, the graph contains an undirected edge between i and j. Each product category provided by Amazon defines a ground-truth community, and we regard each connected component within a product category as a separate ground-truth community. We remove the ground-truth communities that have fewer than 3 nodes. We also provide the top 5,000 communities with the highest quality, which are described in our paper. As for the network, we provide the largest connected component.
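The construction described above can be illustrated with a minimal sketch on a toy graph. This is not the authors' code: the node IDs, the category names ("books", "music"), and the helper names are made up for illustration, and the quality ranking used to pick the top 5,000 communities is not reproduced here.

```python
from collections import defaultdict


def connected_components(nodes, edges):
    """Connected components (as sets) of the undirected graph induced on `nodes`."""
    nodes = set(nodes)
    adj = defaultdict(set)
    for u, v in edges:
        # Keep only edges whose endpoints both lie inside the node set.
        if u in nodes and v in nodes:
            adj[u].add(v)
            adj[v].add(u)
    seen, components = set(), []
    for start in sorted(nodes):
        if start in seen:
            continue
        component, stack = set(), [start]
        seen.add(start)
        while stack:  # iterative depth-first search
            u = stack.pop()
            component.add(u)
            for w in adj[u]:
                if w not in seen:
                    seen.add(w)
                    stack.append(w)
        components.append(component)
    return components


def ground_truth_communities(edges, categories, min_size=3):
    """One community per connected component within each category,
    dropping components with fewer than `min_size` nodes."""
    communities = []
    for name in sorted(categories):
        for component in connected_components(categories[name], edges):
            if len(component) >= min_size:
                communities.append(component)
    return communities


# Hypothetical toy data: undirected co-purchase edges and category membership.
edges = [(1, 2), (2, 3), (3, 4), (4, 5), (6, 7)]
categories = {"books": [1, 2, 3, 6], "music": [4, 5, 7]}

communities = ground_truth_communities(edges, categories)

# The released network is the largest connected component of the full graph.
all_nodes = {n for edge in edges for n in edge}
largest_component = max(connected_components(all_nodes, edges), key=len)
```

In this toy example only the "books" component containing products 1, 2, and 3 survives the size filter, while the largest connected component of the whole graph spans products 1 through 5.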
