Dataset:
bigbio/pubtator_central
PubTator Central (PTC, https://www.ncbi.nlm.nih.gov/research/pubtator/) is a web service for exploring and retrieving bioconcept annotations in full-text biomedical articles. PTC provides automated annotations from state-of-the-art text mining systems for genes/proteins, genetic variants, diseases, chemicals, species and cell lines, all available for immediate download. PTC annotates PubMed (30 million abstracts), the PMC Open Access Subset and the Author Manuscript Collection (3 million full-text articles). Updated entity identification methods and a disambiguation module based on cutting-edge deep learning techniques provide increased accuracy.
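To make the idea of a bioconcept annotation concrete, here is a minimal sketch of reading records in the plain-text PubTator format, where title and abstract lines look like `PMID|t|...` and `PMID|a|...` and each annotation is a tab-separated line of PMID, start offset, end offset, mention, entity type, and identifier. It is an illustration only, not the loader this dataset uses: the names `Annotation`, `parse_pubtator`, and `SAMPLE` are hypothetical, and the sample record (including its PMID) is made up for the example.

```python
from dataclasses import dataclass


@dataclass
class Annotation:
    # One entity mention; field names are illustrative, not an official schema.
    pmid: str
    start: int
    end: int
    mention: str
    entity_type: str
    identifier: str


def parse_pubtator(text):
    """Parse PubTator-style text into (passages, annotations).

    passages maps (pmid, "t" or "a") to the title or abstract text.
    """
    passages = {}
    annotations = []
    for line in text.splitlines():
        if not line.strip():
            continue  # blank lines separate documents
        header = line.split("|", 2)
        if len(header) == 3 and header[0].isdigit() and header[1] in ("t", "a"):
            passages[(header[0], header[1])] = header[2]
            continue
        fields = line.split("\t")
        if len(fields) >= 5:
            pmid, start, end, mention, entity_type = fields[:5]
            # The identifier column can be missing for unnormalized mentions.
            identifier = fields[5] if len(fields) > 5 else ""
            annotations.append(
                Annotation(pmid, int(start), int(end), mention, entity_type, identifier)
            )
    return passages, annotations


# Made-up sample record for illustration only.
SAMPLE = (
    "12345|t|BRCA1 mutations in breast cancer\n"
    "12345|a|We study BRCA1 in human patients.\n"
    "12345\t0\t5\tBRCA1\tGene\t672\n"
    "12345\t19\t32\tbreast cancer\tDisease\tMESH:D001943\n"
)

if __name__ == "__main__":
    passages, annotations = parse_pubtator(SAMPLE)
    for ann in annotations:
        print(ann.entity_type, ann.mention, ann.identifier)
```

Offsets in this sketch are treated as character positions counted from the start of the title, which is why the second annotation can be recovered by slicing the title text with its start and end values.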
@article{10.1093/nar/gkz389,
  title   = {{PubTator central: automated concept annotation for biomedical full text articles}},
  author  = {Wei, Chih-Hsuan and Allot, Alexis and Leaman, Robert and Lu, Zhiyong},
  year    = 2019,
  month   = {05},
  journal = {Nucleic Acids Research},
  volume  = 47,
  number  = {W1},
  pages   = {W587--W593},
  doi     = {10.1093/nar/gkz389},
  issn    = {0305-1048},
  url     = {https://doi.org/10.1093/nar/gkz389},
  eprint  = {https://academic.oup.com/nar/article-pdf/47/W1/W587/28880193/gkz389.pdf}
}