数据集:
udhr
任务:
翻译计算机处理:
multilingual大小:
n<1K语言创建人:
found批注创建人:
no-annotation源数据集:
original许可:
license:unknownThe Universal Declaration of Human Rights (UDHR) is a milestone document in the history of human rights. Drafted by representatives with different legal and cultural backgrounds from all regions of the world, it set out, for the first time, fundamental human rights to be universally protected. The Declaration was adopted by the UN General Assembly in Paris on 10 December 1948 during its 183rd plenary meeting.
© 1996 – 2009 The Office of the High Commissioner for Human Rights
This plain text version prepared by the "UDHR in Unicode" project, https://www.unicode.org/udhr .
[More Information Needed]
The dataset includes translations of the document in over 400 languages and dialects. The list of languages can be found here .
Each instance corresponds to a different language and includes information about the language and the full document text.
Only a train split included which includes the full document in all languages.
train | |
---|---|
Number of examples | 488 |
In addition to its social significance, the document set a world record in 1999 for being the most translated document in the world and as such can be useful for settings requiring paired text between many languages.
[More Information Needed]
Who are the source language producers?[More Information Needed]
[More Information Needed]
Who are the annotators?[More Information Needed]
[More Information Needed]
In addition to the social and political significance of the United Nations' Universal Declaration of Human Rights, the document set a world record in 1999 for being the most translated document in the world and as such can be useful for settings requiring paired text between many languages including those that are low resource and significantly underrepresented in NLP research.
[More Information Needed]
Although the document is translated into a very large number of languages, the text is very short and therefore may have limited usefulness for most types of modeling and evaluation.
The txt/xml data files used here were compiled by The Unicode Consortium, which can be found here . The original texts can be found on the United Nations website .
Source text © 1996 – 2022 The Office of the High Commissioner for Human Rights
The Unicode license applies to these translations.
United Nations. (1998). The Universal Declaration of Human Rights, 1948-1998. New York: United Nations Dept. of Public Information.
Thanks to @joeddav for adding this dataset. Updated May 2022 @leondz .