Dataset:
msr_zhen_translation_parity
Achieving Human Parity on Automatic Chinese to English News Translation
Human evaluation results and translation output for the Translator Human Parity Data release, as described in https://blogs.microsoft.com/ai/machine-translation-news-test-set-human-parity/
The Translator Human Parity Data release contains all human evaluation results and translations related to our paper "Achieving Human Parity on Automatic Chinese to English News Translation", published on March 14, 2018. We have released this data to
The dataset includes:
two new references for the Chinese-English language pair of WMT17, one based on human translation from scratch (Reference-HT) and the other based on human post-editing (Reference-PE);
human parity translations generated by our research systems Combo-4, Combo-5, and Combo-6, as well as translation output from the online machine translation service Online-A-1710, collected on October 16, 2017.
The data package provided with the study also includes all data points collected in the human evaluation campaigns. These data points are not parsed or exposed as features of this dataset.
[More Information Needed]
This dataset contains 6 extra English translations for the Chinese-English language pair of WMT17.
[More Information Needed]
As mentioned in the summary, this dataset provides 6 extra English translations for the Chinese-English language pair of WMT17.
Data fields are named exactly as in the associated paper for easier cross-referencing.
All data fields of a record are translations for the same Chinese source sentence.
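The record layout described above can be sketched in Python as follows. This is a minimal illustration, not the dataset's loading code: the six field names are assumed to be the system names listed in this card (Reference-HT, Reference-PE, Combo-4, Combo-5, Combo-6, Online-A-1710), the record contents are placeholders rather than real translations, and the helpers `missing_fields` and `side_by_side` are hypothetical names introduced only for this example.

```python
# Assumption: the dataset's columns use the system names from the paper,
# as listed in this card. Verify against the actual dataset before relying on it.
TRANSLATION_FIELDS = [
    "Reference-HT",   # human translation from scratch
    "Reference-PE",   # human post-editing
    "Combo-4",        # research system output
    "Combo-5",        # research system output
    "Combo-6",        # research system output
    "Online-A-1710",  # online MT service output, collected October 16, 2017
]


def missing_fields(record):
    """Return the expected translation fields that are absent from a record."""
    return [name for name in TRANSLATION_FIELDS if name not in record]


def side_by_side(record):
    """Format all translations of one source sentence, one system per line."""
    return "\n".join(f"{name}: {record[name]}" for name in TRANSLATION_FIELDS)


# Hypothetical record: every value is a placeholder, not real dataset content.
# In practice a record would come from the dataset itself, for example via the
# Hugging Face `datasets` library (an assumption; not executed here).
example_record = {
    name: f"<English translation from {name}>" for name in TRANSLATION_FIELDS
}

if __name__ == "__main__":
    assert missing_fields(example_record) == []
    print(side_by_side(example_record))
```

Because every field in a record translates the same Chinese source sentence, printing the fields side by side is a convenient way to compare systems against the two human references.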
[More Information Needed]
Who are the source language producers? [More Information Needed]
[More Information Needed]
Who are the annotators? [More Information Needed]
[More Information Needed]
Citation information is available in the paper "Achieving Human Parity on Automatic Chinese to English News Translation".
Thanks to @leoxzhao for adding this dataset.