数据集:
cdsc
许可:
cc-by-nc-sa-4.0语言创建人:
other批注创建人:
expert-generated源数据集:
original语言:
pl计算机处理:
monolingual大小:
10K<n<100KPolish CDSCorpus consists of 10K Polish sentence pairs which are human-annotated for semantic relatedness and entailment. The dataset may be used for the evaluation of compositional distributional semantics models of Polish. The dataset was presented at ACL 2017. Please refer to the Wróblewska and Krasnowska-Kieraś (2017) for a detailed description of the resource.
[More Information Needed]
Polish
[More Information Needed]
for cdsc-e domain:
for cdsc-r domain:
Data is splitted in train/dev/test split.
[More Information Needed]
[More Information Needed]
Who are the source language producers?[More Information Needed]
[More Information Needed]
Who are the annotators?[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
Dataset provided for research purposes only. Please check dataset license for additional information.
[More Information Needed]
CC BY-NC-SA 4.0
[More Information Needed]
Thanks to @abecadel for adding this dataset.