moussaKam/mbarthez | ATYUN.COM 官网-人工智能教程资讯全方位服务平台

模型:

moussaKam/mbarthez

任务:

填充掩码

类库:

PyTorch Transformers

语言:

其他:

mbart 文生文摘要生成 AutoTrain Compatible

预印本库:

arxiv:2010.12321

许可:

apache-2.0

模型介绍文件清单

中文

A french sequence to sequence pretrained model based on BART . BARThez is pretrained by learning to reconstruct a corrupted input sentence. A corpus of 66GB of french raw text is used to carry out the pretraining. Unlike already existing BERT-based French language models such as CamemBERT and FlauBERT, BARThez is particularly well-suited for generative tasks (such as abstractive summarization), since not only its encoder but also its decoder is pretrained.

In addition to BARThez that is pretrained from scratch, we continue the pretraining of a multilingual BART mBART which boosted its performance in both discriminative and generative tasks. We call the french adapted version mBARThez .

Model	Architecture	#layers	#params
BARThez	BASE	12	165M
mBARThez	LARGE	24	458M

paper: https://arxiv.org/abs/2010.12321 github: https://github.com/moussaKam/BARThez

@article{eddine2020barthez,
  title={BARThez: a Skilled Pretrained French Sequence-to-Sequence Model},
  author={Eddine, Moussa Kamal and Tixier, Antoine J-P and Vazirgiannis, Michalis},
  journal={arXiv preprint arXiv:2010.12321},
  year={2020}
}

作者:

Moussa Kamal Eddine

数据集大小:

1.71 GB