数据集:
Gabriel/citesum_swe
The Swedish citesum dataset has only been machine-translated to improve downstream fine-tuning on Swedish summarization tasks.
Read about the full details at original English version: https://huggingface.co/datasets/citesum
https://arxiv.org/abs/2205.06207
Yuning Mao, Ming Zhong, Jiawei Han University of Illinois Urbana-Champaign {yuningm2, mingz5, hanj}@illinois.edu
The Swedish xsum dataset follows the same splits as the original English version and has 3 splits: train , validation , and test .
Dataset Split | Number of Instances in Split |
---|---|
Train | 83,304 |
Validation | 4,721 |
Test | 4,921 |