Dataset: sciq
Tasks: question-answering
Sub-tasks: closed-domain-qa
Languages: en
Multilinguality: monolingual
Size: 10K<n<100K
Language creators: crowdsourced
Annotation creators: no-annotation
Source datasets: original
License: cc-by-nc-3.0

The SciQ dataset contains 13,679 crowdsourced science exam questions on physics, chemistry, biology, and other subjects. The questions are in multiple-choice format with 4 answer options each. For the majority of the questions, an additional paragraph with supporting evidence for the correct answer is provided.
An example from the 'train' split looks as follows.
This example was too long and was cropped:
{
  "correct_answer": "coriolis effect",
  "distractor1": "muon effect",
  "distractor2": "centrifugal effect",
  "distractor3": "tropical effect",
  "question": "What phenomenon makes global winds blow northeast to southwest or the reverse in the northern hemisphere and northwest to southeast or the reverse in the southern hemisphere?",
  "support": "\"Without Coriolis Effect the global winds would blow north to south or south to north. But Coriolis makes them blow northeast to..."
}
The data fields are the same among all splits.
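Because each record stores the correct answer and the three distractors in separate fields, a consumer usually has to assemble and shuffle the four options itself. The following is a minimal sketch of one way to do that, not part of the dataset's tooling; the helper name `to_multiple_choice` and the `seed` parameter are illustrative assumptions, and the `support` field is left out of the sample record because it is cropped above.

```python
import random


def to_multiple_choice(record, seed=0):
    """Return (options, answer_index) for one SciQ-style record.

    Hypothetical helper: gathers the correct answer and the three
    distractors, shuffles them reproducibly, and reports where the
    correct answer ended up.
    """
    options = [
        record["correct_answer"],
        record["distractor1"],
        record["distractor2"],
        record["distractor3"],
    ]
    rng = random.Random(seed)  # local RNG so global random state is untouched
    rng.shuffle(options)
    return options, options.index(record["correct_answer"])


# Sample record copied from the 'train' example (without the cropped "support").
example = {
    "correct_answer": "coriolis effect",
    "distractor1": "muon effect",
    "distractor2": "centrifugal effect",
    "distractor3": "tropical effect",
    "question": (
        "What phenomenon makes global winds blow northeast to southwest or "
        "the reverse in the northern hemisphere and northwest to southeast "
        "or the reverse in the southern hemisphere?"
    ),
}

options, answer_index = to_multiple_choice(example)
print(example["question"])
for label, option in zip("ABCD", options):
    print(f"{label}. {option}")
print("Answer:", "ABCD"[answer_index])
```

Shuffling with a seeded `random.Random` keeps the option order reproducible across runs, which helps when comparing model outputs.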
name | train | validation | test |
---|---|---|---|
default | 11679 | 1000 | 1000 |
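The split sizes in the table add up to the 13,679 questions mentioned in the description. The sketch below checks that arithmetic and shows one possible way to read the sizes from a loaded copy. It assumes the Hugging Face `datasets` library is installed and that the identifier `"sciq"` resolves on the Hub, which may differ in your environment; the function names are illustrative.

```python
# Split sizes as listed in the table above.
EXPECTED_SPLIT_SIZES = {"train": 11679, "validation": 1000, "test": 1000}


def total_examples(split_sizes):
    """Sum the number of examples over all splits."""
    return sum(split_sizes.values())


def load_split_sizes(dataset_id="sciq"):
    """Load the dataset and return {split_name: row_count}.

    Assumption: `dataset_id` resolves on the Hugging Face Hub.
    The import is done lazily so the rest of this sketch runs
    even when the `datasets` library is not installed.
    """
    from datasets import load_dataset

    dataset = load_dataset(dataset_id)
    return {name: split.num_rows for name, split in dataset.items()}


print(total_examples(EXPECTED_SPLIT_SIZES))  # 13679
```

Calling `load_split_sizes()` and comparing the result with `EXPECTED_SPLIT_SIZES` is a quick sanity check after downloading.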
The dataset is licensed under the Creative Commons Attribution-NonCommercial 3.0 Unported License.
@inproceedings{SciQ,
  title={Crowdsourcing Multiple Choice Science Questions},
  author={Johannes Welbl and Nelson F. Liu and Matt Gardner},
  year={2017},
  journal={arXiv:1707.06209v1}
}
Thanks to @patrickvonplaten, @lewtun, @thomwolf for adding this dataset.