Dataset:
bigbio/osiris
The OSIRIS corpus is a set of MEDLINE abstracts manually annotated with human variation mentions. The corpus is distributed under the terms of the Creative Commons Attribution 3.0 Unported License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (Furlong et al., BMC Bioinformatics 2008, 9:84).
@ARTICLE{Furlong2008,
  author    = {Laura I Furlong and Holger Dach and Martin Hofmann-Apitius and Ferran Sanz},
  title     = {OSIRISv1.2: a named entity recognition system for sequence variants of genes in biomedical literature.},
  journal   = {BMC Bioinformatics},
  year      = {2008},
  volume    = {9},
  pages     = {84},
  doi       = {10.1186/1471-2105-9-84},
  pii       = {1471-2105-9-84},
  pmid      = {18251998},
  timestamp = {2013.01.15},
  url       = {http://dx.doi.org/10.1186/1471-2105-9-84}
}