数据集:
bigbio/bioasq_2021_mesinesp
The main aim of MESINESP2 is to promote the development of practically relevant semantic indexing tools for biomedical content in non-English language. We have generated a manually annotated corpus, where domain experts have labeled a set of scientific literature, clinical trials, and patent abstracts. All the documents were labeled with DeCS descriptors, which is a structured controlled vocabulary created by BIREME to index scientific publications on BvSalud, the largest database of scientific documents in Spanish, which hosts records from the databases LILACS, MEDLINE, IBECS, among others.
MESINESP track at BioASQ9 explores the efficiency of systems for assigning DeCS to different types of biomedical documents. To that purpose, we have divided the task into three subtracks depending on the document type. Then, for each one we generated an annotated corpus which was provided to participating teams:
@conference {396,
title = {Overview of BioASQ 2021-MESINESP track. Evaluation of
advance hierarchical classification techniques for scientific
literature, patents and clinical trials.},
booktitle = {Proceedings of the 9th BioASQ Workshop
A challenge on large-scale biomedical semantic indexing
and question answering},
year = {2021},
url = {http://ceur-ws.org/Vol-2936/paper-11.pdf},
author = {Gasco, Luis and Nentidis, Anastasios and Krithara, Anastasia
and Estrada-Zavala, Darryl and Toshiyuki Murasaki, Renato and Primo-Pe{\~n}a,
Elena and Bojo-Canales, Cristina and Paliouras, Georgios and Krallinger, Martin}
}