数据集:

ajders/machine_translated_cnn_dailymail_da_small

批注创建人:

machine-generated

许可:

apache-2.0

语言创建人:

machine-generated

大小:

1K<n<10K

计算机处理:

translation

语言:

da
中文

Dataset Card for machine_translated_cnn_dailymail_da_small

Dataset Summary

This dataset is a machine translated subset of the CNN Dailymail Dataset into Danish. The dataset is translated using the Helsinki-NLP/opus-mt-en-da -model. The dataset consists of 2872 articles with summaries with intended usage for Danish text summarisation.

Dataset Structure

Machine translated articles ( article ) with corresponding summaries ( highlights ).

{
  'article': Value(dtype='string', id=None),
  'highlights': Value(dtype='string', id=None),
  'id': Value(dtype='string', id=None)
}

Licensing Information

The dataset is released under the Apache-2.0 License .