数据集:
daekeun-ml/naver-news-summarization-ko
This dataset is a custom dataset created by the author by crawling Naver News ( https://news.naver.com ) for the Korean NLP model hands-on.
DatasetDict({ train: Dataset({ features: ['date', 'category', 'press', 'title', 'document', 'link', 'summary'], num_rows: 22194 }) test: Dataset({ features: ['date', 'category', 'press', 'title', 'document', 'link', 'summary'], num_rows: 2740 }) validation: Dataset({ features: ['date', 'category', 'press', 'title', 'document', 'link', 'summary'], num_rows: 2466 }) })