AmazonQA
The dataset uses the following variables:
- questionText: The text of the question posed by the consumer.
- questionType: Indicates whether the question is 'yes/no' for boolean questions or 'descriptive' for open-ended questions.
- review_snippets: A list of extracted review snippets relevant to the question (up to ten).
- answerText: The text of the answer provided.
- answerType: Specifies the type of answer.
- helpful: A list containing two integers; the first indicates the number of users who found the answer helpful, and the second indicates the total number of responses.
- asin: The unique Amazon Standard Identification Number (ASIN) for the product the question pertains to.
- qid: A unique question identifier within the dataset.
- category: The product category.
- top_review_wilson: The review with the highest Wilson score.
- top_review_helpful: The review voted as most helpful by users.
- is_answerable: A boolean indicating whether the question is answerable using the review snippets, based on an answerability classifier.
- top_sentences_IR: A list of top sentences (up to ten) based on Information Retrieval (IR) score with the question.
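
The fields listed above can be read from a record roughly as follows. This is a minimal sketch, assuming each record is available as a plain Python dict keyed by the field names in the list. The sample record, the helper names (`helpful_ratio`, `wilson_lower_bound`, `usable_snippets`), and the choice of the Wilson lower bound at z = 1.96 are illustrative assumptions; the dataset description does not say exactly how the Wilson score behind top_review_wilson was computed.

```python
import math
from typing import Any, Dict, List, Optional


def helpful_ratio(record: Dict[str, Any]) -> Optional[float]:
    """Fraction of responses that marked the answer helpful, or None if there were none."""
    helpful_votes, total_votes = record["helpful"]
    if total_votes == 0:
        return None
    return helpful_votes / total_votes


def wilson_lower_bound(positive: int, total: int, z: float = 1.96) -> float:
    """Lower bound of the Wilson score interval for a proportion.

    Assumption: shown only to illustrate what a Wilson score is; the
    dataset's own scoring procedure may differ.
    """
    if total == 0:
        return 0.0
    p = positive / total
    z2 = z * z
    centre = p + z2 / (2 * total)
    margin = z * math.sqrt((p * (1 - p) + z2 / (4 * total)) / total)
    return (centre - margin) / (1 + z2 / total)


def usable_snippets(record: Dict[str, Any]) -> List[str]:
    """Return the review snippets only when the record is flagged as answerable."""
    if not record.get("is_answerable", False):
        return []
    return list(record.get("review_snippets", []))


# Hypothetical record, invented for illustration (not taken from the dataset).
sample = {
    "questionText": "Does this fit a 15-inch laptop?",
    "questionType": "yes/no",
    "review_snippets": ["Fits my 15-inch laptop with room to spare."],
    "answerText": "Yes, it does.",
    "helpful": [3, 4],
    "is_answerable": True,
}

print(helpful_ratio(sample))
print(wilson_lower_bound(*sample["helpful"]))
print(usable_snippets(sample))
```

Returning `None` for a zero vote count keeps answers with no feedback distinguishable from answers that were voted unhelpful.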