Next Article in Journal
Manufacturing, Processing, and Characterization of Self-Expanding Metallic Stents: A Comprehensive Review
Previous Article in Journal
The Application of Deep Learning to Accurately Identify the Dimensions of Spinal Canal and Intervertebral Foramen as Evaluated by the IoU Index
Previous Article in Special Issue
Application of Machine Learning in the Diagnosis of Early Gastric Cancer Using the Kyoto Classification Score and Clinical Features Collected from Medical Consultations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Comparative Analysis of Large Language Models in Chinese Medical Named Entity Recognition

1
College of Computer Science, Beijing University of Technology, Beijing 100124, China
2
Department of Anesthesiology, Beijing Anzhen Hospital, Capital Medical University, Beijing 100013, China
3
Department of Automation, Tsinghua University, Beijing 100084, China
*
Authors to whom correspondence should be addressed.
Bioengineering 2024, 11(10), 982; https://doi.org/10.3390/bioengineering11100982 (registering DOI)
Submission received: 9 August 2024 / Revised: 13 September 2024 / Accepted: 23 September 2024 / Published: 29 September 2024

Abstract

The emergence of large language models (LLMs) has provided robust support for application tasks across various domains, such as name entity recognition (NER) in the general domain. However, due to the particularity of the medical domain, the research on understanding and improving the effectiveness of LLMs on biomedical named entity recognition (BNER) tasks remains relatively limited, especially in the context of Chinese text. In this study, we extensively evaluate several typical LLMs, including ChatGLM2-6B, GLM-130B, GPT-3.5, and GPT-4, on the Chinese BNER task by leveraging a real-world Chinese electronic medical record (EMR) dataset and a public dataset. The experimental results demonstrate the promising yet limited performance of LLMs with zero-shot and few-shot prompt designs for Chinese BNER tasks. More importantly, instruction fine-tuning significantly enhances the performance of LLMs. The fine-tuned offline ChatGLM2-6B surpassed the performance of the task-specific model BiLSTM+CRF (BC) on the real-world dataset. The best fine-tuned model, GPT-3.5, outperforms all other LLMs on the publicly available CCKS2017 dataset, even surpassing half of the baselines; however, it still remains challenging for it to surpass the state-of-the-art task-specific models, i.e., Dictionary-guided Attention Network (DGAN). To our knowledge, this study is the first attempt to evaluate the performance of LLMs on Chinese BNER tasks, which emphasizes the prospective and transformative implications of utilizing LLMs on Chinese BNER tasks. Furthermore, we summarize our findings into a set of actionable guidelines for future researchers on how to effectively leverage LLMs to become experts in specific tasks.
Keywords: large language model; biomedical named entity recognition; electronic medical record large language model; biomedical named entity recognition; electronic medical record

Share and Cite

MDPI and ACS Style

Zhu, Z.; Zhao, Q.; Li, J.; Ge, Y.; Ding, X.; Gu, T.; Zou, J.; Lv, S.; Wang, S.; Yang, J.-J. Comparative Analysis of Large Language Models in Chinese Medical Named Entity Recognition. Bioengineering 2024, 11, 982. https://doi.org/10.3390/bioengineering11100982

AMA Style

Zhu Z, Zhao Q, Li J, Ge Y, Ding X, Gu T, Zou J, Lv S, Wang S, Yang J-J. Comparative Analysis of Large Language Models in Chinese Medical Named Entity Recognition. Bioengineering. 2024; 11(10):982. https://doi.org/10.3390/bioengineering11100982

Chicago/Turabian Style

Zhu, Zhichao, Qing Zhao, Jianjiang Li, Yanhu Ge, Xingjian Ding, Tao Gu, Jingchen Zou, Sirui Lv, Sheng Wang, and Ji-Jiang Yang. 2024. "Comparative Analysis of Large Language Models in Chinese Medical Named Entity Recognition" Bioengineering 11, no. 10: 982. https://doi.org/10.3390/bioengineering11100982

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop