Amazon question/answer data

Creators:
McAuley, Julian; Yang, Alex
Publication Date:
2016
Data Category:
Dataset Description:
This dataset contains Question and Answer data from Amazon, totaling around 1.4 million answered questions and around 4 million answers. This dataset offers valuable insights into consumer inquiries and the corresponding responses, facilitating research in natural language processing, question-answering systems, and e-commerce analytics. It can be combined with Amazon product review data (available here) by matching ASINs in the Q/A dataset with ASINs in the review data. The dataset is approximately 766 kB in size and is available in JSON format.

Structurally, each entry in the dataset includes the following variables:

  • asin: The Amazon Standard Identification Number (ASIN) of the product, e.g., "B000050B6Z".

  • questionType: The type of question, either 'yes/no' or 'open-ended'.

  • answerType: For yes/no questions, this indicates the type of answer: 'Y' for yes, 'N' for no, or '?' if the polarity of the answer could not be determined.

  • answerTime: The raw timestamp of when the answer was provided.

  • unixTime: The answer timestamp converted to Unix time.

  • question: The text of the question asked by the consumer.

  • answer: The text of the answer provided.

Variables:
Details:

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.