数据集:
strombergnlp/rumoureval_2019
任务:
文本分类子任务:
fact-checking语言:
en计算机处理:
monolingual大小:
10K<n<100K语言创建人:
found批注创建人:
crowdsourced预印本库:
arxiv:1809.06683其他:
stance-detection许可:
cc-by-4.0Stance prediction task in English. The goal is to predict whether a given reply to a claim either supports, denies, questions, or simply comments on the claim. Ran as a SemEval task in 2019.
English of various origins, bcp47: en
An example of 'train' looks as follows.
{ 'id': '0', 'source_text': 'Appalled by the attack on Charlie Hebdo in Paris, 10 - probably journalists - now confirmed dead. An attack on free speech everywhere.', 'reply_text': '@m33ryg @tnewtondunn @mehdirhasan Of course it is free speech, that\'s the definition of "free speech" to openly make comments or draw a pic!', 'label': 3 }
0: "support", 1: "deny", 2: "query", 3: "comment"
name | instances |
---|---|
train | 7 005 |
dev | 2 425 |
test | 2 945 |
Twitter users
Detailed in Analysing How People Orient to and Spread Rumours in Social Media by Looking at Conversational Threads
Who are the annotators?The dataset is curated by the paper's authors.
The authors distribute this data under Creative Commons attribution license, CC-BY 4.0.
@inproceedings{gorrell-etal-2019-semeval, title = "{S}em{E}val-2019 Task 7: {R}umour{E}val, Determining Rumour Veracity and Support for Rumours", author = "Gorrell, Genevieve and Kochkina, Elena and Liakata, Maria and Aker, Ahmet and Zubiaga, Arkaitz and Bontcheva, Kalina and Derczynski, Leon", booktitle = "Proceedings of the 13th International Workshop on Semantic Evaluation", month = jun, year = "2019", address = "Minneapolis, Minnesota, USA", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/S19-2147", doi = "10.18653/v1/S19-2147", pages = "845--854", }
Author-added dataset @leondz