数据集:

singletongue/wikipedia-utils

中文

Wikipedia-Utils: Preprocessed Wikipedia Texts for NLP

Preprocessed Wikipedia texts generated with the scripts in singletongue/wikipedia-utils repo.

For detailed information on how the texts are processed, please refer to the repo.