数据集:

AhmedSSabir/Japanese-wiki-dump-sentence-dataset

中文

Dataset

5M (5121625) clean Japanese full sentence with the context. This dataset can be used to learn unsupervised semantic similarity, etc.