Dataset: persiannlp/parsinlu_translation_en_fa
Tasks: translation
Languages: fa
Size: 1K<n<10K
Language creators: expert-generated
Annotation creators: expert-generated
Source datasets: extended
arXiv: 2012.06154
License: cc-by-nc-sa-4.0

A Persian translation dataset (English -> Persian).
[More Information Needed]
The text dataset is in Persian (fa) and English (en).
Here is an example from the dataset:
{ "source": "how toil to raise funds, propagate reforms, initiate institutions!", "targets": ["چه زحمتها که بکشد تا منابع مالی را تامین کند اصطلاحات را ترویج کند نهادهایی به راه اندازد."], "category": "mizan_dev_en_fa" }
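The record layout can be illustrated with a short sketch. It parses the example record shown above and expands it into (English, Persian) pairs, one per entry in `targets`. The helper name `to_pairs` is hypothetical and not part of the dataset or any library; the field names `source`, `targets`, and `category` are taken from the example itself.

```python
import json

# Example record copied verbatim from the dataset card.
RAW_RECORD = (
    '{ "source": "how toil to raise funds, propagate reforms, initiate institutions!", '
    '"targets": ["چه زحمتها که بکشد تا منابع مالی را تامین کند اصطلاحات را ترویج کند نهادهایی به راه اندازد."], '
    '"category": "mizan_dev_en_fa" }'
)


def to_pairs(record):
    """Hypothetical helper: expand one record into (english, persian) pairs.

    `targets` is a list, so a record may carry more than one reference
    translation; each one becomes its own pair.
    """
    return [(record["source"], target) for target in record["targets"]]


record = json.loads(RAW_RECORD)
pairs = to_pairs(record)

for english, persian in pairs:
    print(record["category"], "|", english, "|", persian)
```

Keeping `targets` as a list, rather than flattening it at load time, leaves room for multi-reference evaluation if other records carry several translations.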
The train/dev/test splits contain 1,621,666/2,138/48,360 samples, respectively.
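To put the split sizes in proportion, the following minimal sketch computes the total and each split's share from the counts stated above. The dictionary keys are labels chosen for illustration and are not necessarily the split names used when loading the dataset.

```python
# Split sizes as stated in the dataset card; keys are illustrative labels.
SPLIT_SIZES = {"train": 1_621_666, "dev": 2_138, "test": 48_360}

total = sum(SPLIT_SIZES.values())

# Fraction of all samples that falls in each split.
fractions = {name: size / total for name, size in SPLIT_SIZES.items()}

print(f"total samples: {total:,}")
for name, share in fractions.items():
    print(f"{name}: {share:.2%}")
```

The training split accounts for roughly 97% of the samples, so the dev and test sets are small relative to the training data.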
For details, see the corresponding draft (arXiv:2012.06154).
[More Information Needed]
Who are the source language producers? [More Information Needed]
[More Information Needed]
Who are the annotators? [More Information Needed]
[More Information Needed]
CC BY-NC-SA 4.0 License
@article{huggingface:dataset,
  title = {ParsiNLU: A Suite of Language Understanding Challenges for Persian},
  author = {Khashabi, Daniel and Cohan, Arman and Shakeri, Siamak and Hosseini, Pedram and Pezeshkpour, Pouya and Alikhani, Malihe and Aminnaseri, Moin and Bitaab, Marzieh and Brahman, Faeze and Ghazarian, Sarik and others},
  year = {2020},
  journal = {arXiv e-prints},
  eprint = {2012.06154},
}
Thanks to @danyaljj for adding this dataset.