数据集:
bigbio/anat_em
The extended Anatomical Entity Mention corpus (AnatEM) consists of 1212 documents (approx. 250,000 words) manually annotated to identify over 13,000 mentions of anatomical entities. Each annotation is assigned one of 12 granularity-based types such as Cellular component, Tissue and Organ, defined with reference to the Common Anatomy Reference Ontology.
@article{pyysalo2014anatomical, title={Anatomical entity mention recognition at literature scale}, author={Pyysalo, Sampo and Ananiadou, Sophia}, journal={Bioinformatics}, volume={30}, number={6}, pages={868--875}, year={2014}, publisher={Oxford University Press} }