数据集:
pasinit/xlwic
任务:
文本分类计算机处理:
multilingual大小:
10K<n<100K语言创建人:
found批注创建人:
expert-generated源数据集:
original许可:
cc-by-nc-4.0Huggingface dataset for the XL-WiC paper https://www.aclweb.org/anthology/2020.emnlp-main.584.pdf . Please refer to the official website for more information.
When loading one of the XL-WSD datasets one has to specify the training language and the target language (on which dev and test will be performed). Please refer to Languages section to see in which languages training data is available. For example, we can load the dataset having English as training language and Italian as target language as follows:
from datasets import load_dataset dataset = load_dataset('pasinit/xlwic', 'en_it')
Training data
Dev & Test data