Dataset:
multi_re_qa
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data, while the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data. (The distribution also includes DuoRC, although this is not specified in the official documentation.)
Sentence boundary annotation for SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, TextbookQA and DuoRC
The general format is:
```
{ "candidate_id": <candidate_id>, "response_start": <response_start>, "response_end": <response_end> } ...
```
An example from SearchQA:
```
{'candidate_id': 'SearchQA_000077f3912049dfb4511db271697bad/_0_1', 'response_end': 306, 'response_start': 243}
```
The data fields are:
```
{ "candidate_id": <STRING>, "response_start": <INT>, "response_end": <INT> } ...
```
Train and dev splits are available only for the following datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD.
Only test splits are available for the following datasets: BioASQ, RelationExtraction, and TextbookQA.
The number of candidate sentences for each dataset is shown in the table below.
| Dataset | train | test |
|---|---|---|
| SearchQA | 629,160 | 454,836 |
| TriviaQA | 335,659 | 238,339 |
| HotpotQA | 104,973 | 52,191 |
| SQuAD | 87,133 | 10,642 |
| NaturalQuestions | 106,521 | 22,118 |
| BioASQ | - | 14,158 |
| RelationExtraction | - | 3,301 |
| TextbookQA | - | 3,701 |
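A minimal loading sketch is shown below, assuming the data is available on the Hugging Face Hub under the identifier `multi_re_qa` with one configuration per source dataset and the split names from the table above; the exact identifier, configuration names, and splits should be checked against the official dataset card.

```python
from datasets import load_dataset

# Assumed identifiers: dataset name "multi_re_qa", one config per source
# dataset (e.g. "SearchQA", "BioASQ"), split names as in the table above.
search_qa = load_dataset("multi_re_qa", "SearchQA")  # assumed: train and test splits
bioasq = load_dataset("multi_re_qa", "BioASQ")       # assumed: test split only

print(search_qa)          # DatasetDict listing the available splits
print(bioasq["test"][0])  # e.g. {'candidate_id': ..., 'response_start': ..., 'response_end': ...}
```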
MultiReQA is a new multi-domain ReQA evaluation suite composed of eight retrieval QA tasks drawn from publicly available QA datasets from the MRQA shared task. The dataset was curated by converting existing QA datasets from the MRQA shared task to the format of the MultiReQA benchmark.
The initial data collection was performed by converting existing QA datasets from the MRQA shared task to the format of the MultiReQA benchmark.
Who are the source language producers? [More Information Needed]
Who are the annotators? The annotators/curators of the dataset are mandyguo-xyguo and mwurts4google, the contributors to the official MultiReQA GitHub repository.
```bibtex
@misc{m2020multireqa,
  title={MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models},
  author={Mandy Guo and Yinfei Yang and Daniel Cer and Qinlan Shen and Noah Constant},
  year={2020},
  eprint={2005.02507},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
```
Thanks to @Karthik-Bhaskar for adding this dataset.