Dataset:
bigbio/cpi
The compound-protein relationship (CPI) dataset consists of 2,613 sentences from abstracts containing annotations of proteins, small molecules, and their relationships.
@article{doring2020automated,
  title={Automated recognition of functional compound-protein relationships in literature},
  author={D{\"o}ring, Kersten and Qaseem, Ammar and Becer, Michael and Li, Jianyu and Mishra, Pankaj and Gao, Mingjie and Kirchner, Pascal and Sauter, Florian and Telukunta, Kiran K and Moumbock, Aur{\'e}lien FA and others},
  journal={{PLOS ONE}},
  volume={15},
  number={3},
  pages={e0220925},
  year={2020},
  publisher={Public Library of Science},
  address={San Francisco, CA, USA}
}