数据集:

liyucheng/chinese_metaphor_dataset

中文

Chinese Metaphor Corpus (CMC)

Dataset Summary

The first Chinese metaphor corpus serving both metaphor identification and generation. We construct a big metaphor resoruce in Chinese with around 9000 metaphorical sentences with tenor and vehicle annotated. Check out more details in the github repo and our paper presenting at COLING 2022.

首个中文比喻数据集,可以用于中文比喻识别与中文比喻生成。在 知乎 查看更多细节。

Languages

Chinese

Citation Information

@inproceedings{li-etal-2022-cm,
    title = "{CM}-Gen: A Neural Framework for {C}hinese Metaphor Generation with Explicit Context Modelling",
    author = "Li, Yucheng  and
      Lin, Chenghua  and
      Guerin, Frank",
    booktitle = "Proceedings of the 29th International Conference on Computational Linguistics",
    month = oct,
    year = "2022",
    address = "Gyeongju, Republic of Korea",
    publisher = "International Committee on Computational Linguistics",
    url = "https://aclanthology.org/2022.coling-1.563",
    pages = "6468--6479",
}