Datasets:

PORTULAN/glue-ptpt

Languages:

pt

Size:

10K<n<100K

Language creators:

machine-generated

Source datasets:

glue

ArXiv:

arxiv:2305.06721

GLUE-PTPT -- The General Language Understanding Evaluation benchmark translated to European Portuguese

This dataset was created to evaluate the Albertina PT-* models.

If you use this dataset, please cite:

@misc{rodrigues2023advancing,
  title={Advancing Neural Encoding of Portuguese with Transformer Albertina PT-*}, 
  author={João Rodrigues and Luís Gomes and João Silva and António Branco and Rodrigo Santos and Henrique Lopes Cardoso and Tomás Osório},
  year={2023},
  eprint={2305.06721},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}

So far, only four tasks have been translated into European Portuguese:

  • MRPC
  • RTE
  • STS-B
  • WNLI

The remaining tasks will be added in the future.
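
As a rough illustration, one of the translated tasks could be loaded with the Hugging Face `datasets` library along the lines below. This is a minimal sketch, not an official loader: the helper names `config_name` and `load_task` are hypothetical, and the configuration names (`mrpc`, `rte`, `stsb`, `wnli`) are assumed to follow the naming of the original GLUE dataset on the Hugging Face Hub. Check the dataset repository for the actual configuration and split names.

```python
# Tasks this card lists as translated to European Portuguese.
TRANSLATED_TASKS = ("MRPC", "RTE", "STS-B", "WNLI")


def config_name(task: str) -> str:
    """Map a task label to a configuration name.

    ASSUMPTION: configurations follow the original GLUE naming
    (lowercase, no hyphen), e.g. "STS-B" -> "stsb".
    """
    if task not in TRANSLATED_TASKS:
        raise ValueError(f"{task!r} is not listed as translated: {TRANSLATED_TASKS}")
    return task.lower().replace("-", "")


def load_task(task: str, split: str = "validation"):
    """Load one task split from the Hub (requires `pip install datasets`).

    ASSUMPTION: the split names match those of the original GLUE dataset.
    """
    from datasets import load_dataset  # imported lazily; needs network access

    return load_dataset("PORTULAN/glue-ptpt", config_name(task), split=split)


if __name__ == "__main__":
    rte_validation = load_task("RTE")
    print(rte_validation)
```

Validating the task label before calling the Hub means that a request for a task that has not been translated yet fails with a clear message instead of a download error.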

See gluebenchmark.com for information about the General Language Understanding Evaluation (GLUE) dataset.