neulab/codebert-python

Model:

neulab/codebert-python

Task:

Fill-Mask

Libraries:

PyTorch Transformers

Other:

roberta AutoTrain Compatible

Preprint:

arxiv:2302.05527

This is a microsoft/codebert-base-mlm model, trained for 1,000,000 steps (with batch_size=32) on Python code from the codeparrot/github-code-clean dataset, on the masked-language-modeling task.
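Because the model is trained for masked language modeling, it can be queried directly for the most likely token at a masked position. The following is a minimal sketch, assuming the `transformers` library and a PyTorch backend are installed; the helper names `load_fill_mask`, `top_predictions`, and `main`, as well as the example snippet, are illustrative and not part of the model card.

```python
def load_fill_mask(model_name="neulab/codebert-python"):
    """Build a Hugging Face fill-mask pipeline (downloads the weights on first use)."""
    # Imported lazily so the helpers below can be used without transformers installed.
    from transformers import pipeline
    return pipeline("fill-mask", model=model_name)


def top_predictions(fill_mask, code, k=3):
    """Return the k best (token, score) guesses for the single masked position in `code`."""
    results = fill_mask(code, top_k=k)
    return [(r["token_str"].strip(), round(r["score"], 4)) for r in results]


def main():
    fill_mask = load_fill_mask()
    # Use the tokenizer's own mask token rather than hard-coding it.
    mask = fill_mask.tokenizer.mask_token
    snippet = f"for i in {mask}(10):\n    print(i)"  # hypothetical example input
    for token, score in top_predictions(fill_mask, snippet):
        print(f"{token!r}: {score}")
```

Calling `main()` downloads the model and prints the top candidates for the masked token. The input should contain exactly one mask token, since the pipeline returns a nested list when there are several.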

It is intended to be used in CodeBERTScore: https://github.com/neulab/code-bert-score , but it can also be used for any other model or task.

For more information, see: https://github.com/neulab/code-bert-score
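As a rough illustration of the intended use, the sketch below scores generated Python snippets against references with the `code_bert_score` package from the repository above. It assumes the package is installed (`pip install code-bert-score`) and that `code_bert_score.score(cands=..., refs=..., lang=...)` returns a 4-tuple of precision, recall, F1, and F3, as described in that repository's README; the wrapper `f1_scores` and its `score_fn` parameter are hypothetical names added here for illustration.

```python
def f1_scores(cands, refs, score_fn=None):
    """Return one CodeBERTScore F1 value per (candidate, reference) pair."""
    if score_fn is None:
        # Imported lazily: requires the code-bert-score package and downloads
        # the Python scoring model on first use.
        import code_bert_score
        score_fn = code_bert_score.score
    # Assumed return shape: (precision, recall, F1, F3), one entry per pair.
    precision, recall, f1, f3 = score_fn(cands=cands, refs=refs, lang="python")
    return [float(x) for x in f1]
```

For example, `f1_scores(["return a + b"], ["return b + a"])` would return a one-element list with the F1 score for that pair.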

Citation

If you use this model for research, please cite:

@article{zhou2023codebertscore,
  url = {https://arxiv.org/abs/2302.05527},
  author = {Zhou, Shuyan and Alon, Uri and Agarwal, Sumit and Neubig, Graham},
  title = {CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code},  
  publisher = {arXiv},
  year = {2023},
}

Authors:

NeuLab @ LTI/CMU

Dataset size:

478.96 MB