数据集:

ml6team/xsum_nl

语言:

计算机处理:

monolingual

大小:

size_categories:unknown

语言创建人:

machine-generated

批注创建人:

machine-generated

源数据集:

extended|xsum

许可:

license:unknown

数据集介绍文件清单

中文

Dataset Card for XSum NL

Dataset Summary

This dataset is a machine translated dataset. It's the XSum dataset translated with this model from English to Dutch.

See the Hugginface page of the original dataset for more information on the format of this dataset.

Use with:

from datasets import load_dataset
load_dataset("csv", "ml6team/xsum_nl")

Languages

Dutch

Dataset Structure

Data Instances

[More Information Needed]

Data Fields

id : BBC ID of the article.
document : a string containing the body of the news article
summary : a string containing a one sentence summary of the article.

Data Splits

train
test
validation

Dataset Creation

Curation Rationale

[More Information Needed]

Source Data

Initial Data Collection and Normalization

[More Information Needed]

Who are the source language producers?

[More Information Needed]

Annotations

Annotation process

[More Information Needed]

Who are the annotators?

[More Information Needed]

Personal and Sensitive Information

[More Information Needed]

Considerations for Using the Data

Social Impact of Dataset

[More Information Needed]

Discussion of Biases

[More Information Needed]

Other Known Limitations

[More Information Needed]

Additional Information

Dataset Curators

[More Information Needed]

Licensing Information

[More Information Needed]

Citation Information

[More Information Needed]

Contributions

Thanks to @github-username for adding this dataset.

作者:

ml6team

数据集大小:

246.26 MB