数据集:
bigbio/paramed
NEJM is a Chinese-English parallel corpus crawled from the New England Journal of Medicine website. English articles are distributed through https://www.nejm.org/ and Chinese articles are distributed through http://nejmqianyan.cn/ . The corpus contains all article pairs (around 2000 pairs) since 2011.
@article{liu2021paramed, author = {Liu, Boxiang and Huang, Liang}, title = {ParaMed: a parallel corpus for English–Chinese translation in the biomedical domain}, journal = {BMC Medical Informatics and Decision Making}, volume = {21}, year = {2021}, url = {https://bmcmedinformdecismak.biomedcentral.com/articles/10.1186/s12911-021-01621-8}, doi = {10.1186/s12911-021-01621-8} }