数据集:

McGill-NLP/TopiOCQA

语言:

en

计算机处理:

monolingual

大小:

10K<n<100K

批注创建人:

crowdsourced

预印本库:

arxiv:2110.00768
中文

Dataset Card for TopiOCQA

Dataset Summary

TopiOCQA is an information-seeking conversational dataset with challenging topic switching phenomena.

Languages

The language in the dataset is English as spoken by the crowdworkers. The BCP-47 code for English is en.

Additional Information

Licensing Information

TopiOCQA is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License .

Citation Information

@inproceedings{adlakha2022topiocqa,
  title={Topi{OCQA}: Open-domain Conversational Question Answering with Topic Switching},
  author={Adlakha, Vaibhav and Dhuliawala, Shehzaad and Suleman, Kaheer and de Vries, Harm and Reddy, Siva},
  journal={Transactions of the Association for Computational Linguistics},
  volume = {10},
  pages = {468-483},
  year = {2022},
  month = {04},
  year={2022},
  issn = {2307-387X},
  doi = {10.1162/tacl_a_00471},
  url = {https://doi.org/10.1162/tacl\_a\_00471},
  eprint = {https://direct.mit.edu/tacl/article-pdf/doi/10.1162/tacl\_a\_00471/2008126/tacl\_a\_00471.pdf},
}