This dataset comes originally from kaggle . It was originally split into three tables (CSV files) (Questions, Answers, and Tags) now merged into a single table. Each row corresponds to a pair (question-answer) and their associated tags.
The dataset contains all questions asked between August 2, 2008 and Ocotober 19, 2016.
This might be useful for open-domain question-answering tasks.
All Stack Overflow user contributions are licensed under CC-BY-SA 3.0 with attribution required.