Showing 9-14 of 14 results

Google web graph

Publication Date: 2002
Creators: Leskovec, Jure; Lang, Kevin J.; Dasgupta, Anirban; Mahoney, Michael W.

Nodes represent web pages and directed edges represent hyperlinks between them. The data was released in 2002 by Google as a part of Google Programming Contest.

AmazonQA

Publication Date: 2019
Creators: Gupta, Mansi; Kulkarni, Nitish; Chanda, Raghuveer; Rayasam, Anirudha; Lipton, Zachary C.

We introduce a new dataset and propose a method that combines information retrieval techniques for selecting relevant reviews (given a question) and “reading comprehension” models for synthesizing an answer (given a question and review). Our dataset consists of 923k questions, 3.6M answers and 14M reviews across 156k products. Building on the well-known Amazon dataset, we collect additional annotations, marking each question as either answerable or unanswerable based on the available reviews.

Amazon Reviews: Unlocked Mobile Phones

Publication Date: 2019
Creators: PromptCloud, Inc.
We analyzed more than 400,000 reviews of close to 4,400 unlocked mobile phones sold on Amazon.com to find out insights with respect to reviews, ratings, price and their relationships. The author found that on Amazon’s product review platform most of the reviewers have given 4-star and 3-star ratings. The average length of the reviews comes close to 230 characters. They also uncovered that lengthier reviews tend to be more helpful and there is a positive correlation between price & rating. 

Amazon question/answer data

Publication Date: 2016
Creators: McAuley, Julian; Yang, Alex
This dataset contains Question and Answer data from Amazon, totaling around 1.4 million answered questions and around 4 million answers. This dataset can be combined with Amazon product review data (available here) by matching ASINs in the Q/A dataset with ASINs in the review data. 

Amazon Product Reviews

Publication Date: 2018
Creators: Ni, Jianmo; Li, Jiacheng; McAuley, Julian

This dataset includes reviews (ratings, text, helpfulness votes), product metadata (descriptions, category information, price, brand, and image features), and links (also viewed/also bought graphs). The total number of reviews is 233.1 million.

Marketing Bias data

Publication Date: 2020
Creators: Wan, Mengting; Ni, Jianmo; Misra, Rishabh; McAuley, Julian

These datasets contain attributes about products sold on ModCloth and Amazon which may be sources of bias in recommendations (in particular, attributes about how the products are marketed). Data also includes user/item interactions for recommendation. The dataset includes 99,893 reviews for ModCloth and 1,292,954 reviews for the Electronics category of Amazon.

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.