Dataset:

potsawee/wiki_bio_gpt3_hallucination

Language:

en

Size:

n<1K

Preprint:

arxiv:2303.08896

Dataset Card for WikiBio GPT-3 Hallucination Dataset

Dataset Summary

  • We generate Wikipedia-like passages with GPT-3 (text-davinci-003) using the prompt "This is a Wikipedia passage about {concept}", where {concept} is an individual from the WikiBio dataset.
  • We split the generated passages into sentences and annotate each sentence with one of three labels: (1) accurate, (2) minor_inaccurate, (3) major_inaccurate.
  • We report the data statistics, annotation process, and inter-annotator agreement in our paper.
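The prompt template and the three-way label set described above can be sketched in a few lines of Python. This is only an illustration: the helper names `build_prompt` and `is_inaccurate` are hypothetical, the concept name is a placeholder rather than an entry taken from WikiBio, and collapsing the two inaccurate labels into a single "inaccurate" class is an assumption made here, not something this card prescribes.

```python
# Prompt template quoted in the Dataset Summary.
PROMPT_TEMPLATE = "This is a Wikipedia passage about {concept}"

# The three sentence-level labels listed in the Dataset Summary.
LABELS = ("accurate", "minor_inaccurate", "major_inaccurate")


def build_prompt(concept: str) -> str:
    """Fill the template with the name of an individual (hypothetical helper)."""
    return PROMPT_TEMPLATE.format(concept=concept)


def is_inaccurate(label: str) -> bool:
    """Collapse the three labels into a binary flag.

    Assumption: minor and major inaccuracies are both treated as inaccurate.
    """
    if label not in LABELS:
        raise ValueError(f"unknown label: {label!r}")
    return label != "accurate"


# Placeholder concept, used only to show the shape of the prompt.
example_prompt = build_prompt("Ada Lovelace")
```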

Update

  • v3 (5 May 2023): 238 test IDs have been annotated in total.
  • v2 (6 April 2023): 142 test IDs have been annotated; GPT-3 sampled passages are now included in this dataset.
  • v1 (15 March 2023): 65 test IDs were annotated. The wiki_bio_test_idx values of the v1 documents are listed here: [Link]

Dataset Structure

Each instance consists of:

  • gpt3_text : GPT-3 generated passage
  • wiki_bio_text : Actual Wikipedia passage (first paragraph)
  • gpt3_sentences : gpt3_text split into sentences using spaCy
  • annotation : human annotation at the sentence level
  • wiki_bio_test_idx : ID of the concept/individual from the original WikiBio dataset (test set)
  • gpt3_text_samples : list of 20 sampled passages (do_sample = True & temperature = 1.0)
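As a rough illustration of how these fields fit together, the sketch below loads the dataset with the Hugging Face `datasets` library and tallies the labels of one instance. It assumes that `annotation` holds exactly one label string per entry of `gpt3_sentences`, in the same order; the card only says the annotation is at the sentence level, so treat that alignment as an assumption. The helper names `load_records` and `label_counts` and the toy record are hypothetical, and the split is picked as whatever the hub exposes first rather than by a hard-coded name.

```python
from collections import Counter

LABELS = ("accurate", "minor_inaccurate", "major_inaccurate")


def load_records():
    """Download the dataset (requires the `datasets` package and network access)."""
    from datasets import load_dataset

    dataset_dict = load_dataset("potsawee/wiki_bio_gpt3_hallucination")
    # Assumption: no split name is hard-coded; take the first split available.
    first_split = next(iter(dataset_dict))
    return dataset_dict[first_split]


def label_counts(instance) -> Counter:
    """Count sentence-level labels for one instance.

    Assumption: annotation[i] is the label of gpt3_sentences[i].
    """
    sentences = instance["gpt3_sentences"]
    annotation = instance["annotation"]
    if len(sentences) != len(annotation):
        raise ValueError("expected one label per sentence")
    unknown = set(annotation) - set(LABELS)
    if unknown:
        raise ValueError(f"unexpected labels: {sorted(unknown)}")
    return Counter(annotation)


# Toy record with made-up content, shaped like the fields listed above.
toy_instance = {
    "gpt3_sentences": ["First generated sentence.", "Second generated sentence."],
    "annotation": ["accurate", "major_inaccurate"],
}
toy_counts = label_counts(toy_instance)
```

On the real data, `label_counts(load_records()[0])` would give the same kind of tally for the first annotated passage, provided the alignment assumption holds.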

Citation Information

@misc{manakul2023selfcheckgpt,
      title={SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models}, 
      author={Potsawee Manakul and Adian Liusie and Mark J. F. Gales},
      year={2023},
      eprint={2303.08896},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}