模型:

phob0s/bert-tiny

中文

Testclone of https://huggingface.co/prajjwal1/bert-tiny

Mentioned in

  • Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics(Bhargava, Drozd and Rogers)
  • Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation (Turc et al.)