中文

ruRoberta-large

Model was trained by SberDevices team.

  • Task: mask filling
  • Type: encoder
  • Tokenizer: bbpe
  • Dict size: 50 257
  • Num Parameters: 355 M
  • Training Data Volume 250 GB

Authors