数据集:

daekeun-ml/naver-news-summarization-ko

语言:

ko

大小:

10K<n<100K

许可:

apache-2.0
中文

This dataset is a custom dataset created by the author by crawling Naver News ( https://news.naver.com ) for the Korean NLP model hands-on.

  • Period: July 1, 2022 - July 10, 2022
  • Subject: IT, economics
DatasetDict({
    train: Dataset({
        features: ['date', 'category', 'press', 'title', 'document', 'link', 'summary'],
        num_rows: 22194
    })
    test: Dataset({
        features: ['date', 'category', 'press', 'title', 'document', 'link', 'summary'],
        num_rows: 2740
    })
    validation: Dataset({
        features: ['date', 'category', 'press', 'title', 'document', 'link', 'summary'],
        num_rows: 2466
    })
})

license: apache-2.0