数据集:
alexandrainst/ddisco
任务:
文本分类语言:
da计算机处理:
monolingual大小:
1K<n<10K语言创建人:
expert-generated批注创建人:
expert-generated许可:
afl-3.0The DDisco dataset is a dataset which can be used to train models to classify levels of coherence in danish discourse. Each entry in the dataset is annotated with a discourse coherence label (rating from 1 to 3):
1: low coherence (difficult to understand, unorganized, contained unnecessary details and can not be summarized briefly and easily) 2: medium coherence 3: high coherence (easy to understand, well organized, only contain details that support the main point and can be summarized briefly and easily). Grammatical and typing errors are ignored (i.e. they do not affect the coherency score) and the coherence of a text is considered within its own domain.
DDisCo: A Discourse Coherence Dataset for Danish