Datasets:

mteb/reddit-clustering-p2p

Languages:

en

10 sets with the following stats:

  • 91 labels & 15592 samples
  • 64 labels & 79172 samples
  • 38 labels & 1942 samples
  • 11 labels & 13224 samples
  • 64 labels & 92303 samples
  • 87 labels & 28607 samples
  • 10 labels & 69146 samples
  • 48 labels & 67469 samples
  • 64 labels & 29683 samples
  • 31 labels & 62261 samples

The sets were selected at random using the script available in the mteb GitHub repository.
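
As a quick sanity check on the statistics listed above, the sketch below tallies the ten sets. The `(labels, samples)` pairs are copied from the list; the variable names are illustrative only and are not part of any mteb API.

```python
# (labels, samples) for each of the 10 sets, copied from the list above.
SETS = [
    (91, 15592),
    (64, 79172),
    (38, 1942),
    (11, 13224),
    (64, 92303),
    (87, 28607),
    (10, 69146),
    (48, 67469),
    (64, 29683),
    (31, 62261),
]

# Aggregate figures derived purely from the numbers above.
num_sets = len(SETS)
total_samples = sum(samples for _, samples in SETS)
smallest = min(SETS, key=lambda s: s[1])  # set with the fewest samples
largest = max(SETS, key=lambda s: s[1])   # set with the most samples

print(f"{num_sets} sets, {total_samples} samples in total")
print(f"smallest set: {smallest[1]} samples across {smallest[0]} labels")
print(f"largest set: {largest[1]} samples across {largest[0]} labels")
```

The set sizes differ by well over an order of magnitude, so per-set scores may be easier to interpret when reported alongside an average over all ten sets.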