数据集:
GEM/ART
语言:
en计算机处理:
unknown语言创建人:
unknown批注创建人:
automatically-created源数据集:
original其他:
reasoning许可:
apache-2.0You can find the main data card on the GEM Website .
Abductive reasoning is inference to the most plausible explanation. For example, if Jenny finds her house in a mess when she returns from work, and remembers that she left a window open, she can hypothesize that a thief broke into her house and caused the mess, as the most plausible explanation. This data loader focuses on abductive NLG: a conditional English generation task for explaining given observations in natural language.
You can load the dataset via:
import datasets data = datasets.load_dataset('GEM/ART')
The data loader can be found here .
website paper authorsChandra Bhagavatula (AI2), Ronan Le Bras (AI2), Chaitanya Malaviya (AI2), Keisuke Sakaguchi (AI2), Ari Holtzman (AI2, UW), Hannah Rashkin (AI2, UW), Doug Downey (AI2), Wen-tau Yih (AI2), Yejin Choi (AI2, UW)
@inproceedings{ Bhagavatula2020Abductive, title={Abductive Commonsense Reasoning}, author={Chandra Bhagavatula and Ronan Le Bras and Chaitanya Malaviya and Keisuke Sakaguchi and Ari Holtzman and Hannah Rashkin and Doug Downey and Wen-tau Yih and Yejin Choi}, booktitle={International Conference on Learning Representations}, year={2020}, url={https://openreview.net/forum?id=Byg1v1HKDB} }Contact Name
Chandra Bhagavatulla
Contact Emailchandrab@allenai.org
Has a Leaderboard?no
no
Covered LanguagesEnglish
Whose Language?Crowdworkers on the Amazon Mechanical Turk platform based in the U.S, Canada, U.K and Australia.
Licenseapache-2.0: Apache License 2.0
Intended UseTo study the viability of language-based abductive reasoning. Training and evaluating models to generate a plausible hypothesis to explain two given observations.
Primary TaskReasoning
industry
Curation Organization(s)Allen Institute for AI
Dataset CreatorsChandra Bhagavatula (AI2), Ronan Le Bras (AI2), Chaitanya Malaviya (AI2), Keisuke Sakaguchi (AI2), Ari Holtzman (AI2, UW), Hannah Rashkin (AI2, UW), Doug Downey (AI2), Wen-tau Yih (AI2), Yejin Choi (AI2, UW)
FundingAllen Institute for AI
Who added the Dataset to GEM?Chandra Bhagavatula (AI2), Ronan LeBras (AI2), Aman Madaan (CMU), Nico Daheim (RWTH Aachen University)
Explanations were authored by crowdworkers on the Amazon Mechanical Turk platform using a custom template designed by the creators of the dataset.
Example Instance{ 'gem_id': 'GEM-ART-validation-0', 'observation_1': 'Stephen was at a party.', 'observation_2': 'He checked it but it was completely broken.', 'label': 'Stephen knocked over a vase while drunk.' }Data Splits
Abductive reasoning is a crucial capability of humans and ART is the first dataset curated to study language-based abductive reasoning.
Similar Datasetsno
Ability that the Dataset measuresWhether models can reason abductively about a given pair of observations.
no
Additional Splits?no
Whether models can reason abductively about a given pair of observations.
MetricsBLEU , BERT-Score , ROUGE
Previous results available?no
no
Crowdsourced
Where was it crowdsourced?Amazon Mechanical Turk
Language ProducersLanguage producers were English speakers in U.S., Canada, U.K and Australia.
Topics CoveredNo
Data Validationvalidated by crowdworker
Was Data Filtered?algorithmically
Filter CriteriaAdversarial filtering algorithm as described in the paper
automatically created
Annotation Service?no
Annotation ValuesEach observation is associated with a list of COMET ( https://arxiv.org/abs/1906.05317 ) inferences.
Any Quality Control?none
no
no PII
Justification for no PIIThe dataset contains day-to-day events. It does not contain names, emails, addresses etc.
no
no
no
no
None
public domain
Copyright Restrictions on the Language Datapublic domain