Dataset:

bigbio/an_em

Multilinguality:

monolingual

Language:

en

Dataset Card for AnEM

The AnEM corpus is a domain- and species-independent resource manually annotated for anatomical entity mentions using a fine-grained classification system. It consists of 500 documents (over 90,000 words) selected randomly from citation abstracts and full-text papers, with the aim of making the corpus representative of the entire available biomedical scientific literature. The annotation covers mentions of both healthy and pathological anatomical entities and contains over 3,000 annotated mentions.
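As a rough illustration of how the mention annotations might be tallied by entity type, here is a minimal sketch. The record layout (a dict with an `entities` list whose items carry a `type` string), the helper names `count_mentions` and `load_anem`, the `train` split name, and the toy records are all assumptions made for this example; none of them is confirmed by this card. Depending on the installed `datasets` version, loading may also require a configuration name or further arguments.

```python
from collections import Counter


def count_mentions(documents):
    """Count annotated mentions per entity type.

    Assumes each document is a dict with an "entities" list whose items
    have a "type" string. This layout is hypothetical, not taken from the card.
    """
    counts = Counter()
    for doc in documents:
        for entity in doc.get("entities", []):
            counts[entity["type"]] += 1
    return counts


def load_anem(split="train"):
    """Hypothetical loader; the split name is an assumption."""
    # Imported lazily so count_mentions works without the library installed.
    from datasets import load_dataset

    # "bigbio/an_em" is the identifier shown on this card.
    return load_dataset("bigbio/an_em", split=split)


# Toy records for illustration only; they are not drawn from the corpus.
toy_docs = [
    {"entities": [{"type": "Organ"}, {"type": "Cell"}]},
    {"entities": [{"type": "Organ"}]},
]
toy_counts = count_mentions(toy_docs)
```

If the loaded records do follow this layout, `count_mentions(load_anem())` would give a per-type breakdown of the annotated mentions.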

Citation Information

@inproceedings{ohta-etal-2012-open,
  author    = {Ohta, Tomoko and Pyysalo, Sampo and Tsujii, Jun{'}ichi and Ananiadou, Sophia},
  title     = {Open-domain Anatomical Entity Mention Detection},
  volume    = {W12-43},
  year      = {2012},
  url       = {https://aclanthology.org/W12-4304},
  publisher = {Association for Computational Linguistics}
}