A Knowledge-Enhanced Hierarchical Reinforcement Learning-Based Dialogue System for Automatic Disease Diagnosis
Abstract
:1. Introduction
- We incorporate the learned medical knowledge into the low-level strategy of an HRL model, which can further improve the symptom matching rate and the diagnosis success rate.
- Inspired by the process of the doctor’s consultation in real life, we leverage a classifier to feed the user’s disease probabilities into system states, and propose a new decision-making method by considering the medical knowledge graph and the learned disease–symptom relation matrix.
- The proposed KNHRL model outperforms strong baseline methods on a public available medical dialogue dataset for automatic disease diagnosis.
2. Related Work
3. Model Overview
4. Knowledge Construction
5. Knowledge-Enhanced Hierarchical Reinforcement Learning Model
5.1. Deep Reinforcement Learning Model for Disease Diagnosis
5.2. High-Level Strategy for HRL
5.3. Low-Level Strategy for Knowledge Enhanced Decision-Making
5.4. User Simulator
6. Experiments and Analysis
6.1. Experimental Data and Settings
6.2. Baseline Models
6.3. Experimental Results and Analysis
6.4. Further Analysis
7. Conclusions
7.1. Limitations
7.2. Ethics Statement
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Moro, G.; Ragazzi, L.; Valgimigli, L.; Freddi, D. Discriminative marginalized probabilistic neural method for multi-document summarization of medical literature. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Dublin, Ireland, 22–27 May 2022; pp. 180–189. [Google Scholar]
- Yan, S. Memory-aligned knowledge graph for clinically accurate radiology image report generation. In Proceedings of the 21st Workshop on Biomedical Language Processing, Dublin, Ireland, 26 May 2022; pp. 116–122. [Google Scholar]
- Soleimani, A.; Nikoulina, V.; Favre, B.; Ait-Mokhtar, S. Zero-Shot Aspect-Based Scientific Document Summarization using Self-Supervised Pre-training. In Proceedings of the 21st Workshop on Biomedical Language Processing, Dublin, Ireland, 26 May 2022; pp. 49–62. [Google Scholar]
- Boissonnet, A.; Saeidi, M.; Plachouras, V.; Vlachos, A. Explainable assessment of healthcare articles with QA. In Proceedings of the 21st Workshop on Biomedical Language Processing, Dublin, Ireland, 26 May 2022; pp. 1–9. [Google Scholar]
- Pappas, D.; Malakasiotis, P.; Androutsopoulos, I. Data Augmentation for Biomedical Factoid Question Answering. In Proceedings of the 21st Workshop on Biomedical Language Processing, Dublin, Ireland, 26 May 2022; p. 63. [Google Scholar]
- Gupta, D.; Demner-Fushman, D. Overview of the MedVidQA 2022 shared task on medical video question-answering. In Proceedings of the 21st Workshop on Biomedical Language Processing, Dublin, Ireland, 26 May 2022; pp. 264–274. [Google Scholar]
- Giorgi, J.; Bader, G.D.; Wang, B. A sequence-to-sequence approach for document-level relation extraction. In Proceedings of the 21st Workshop on Biomedical Language Processing, Dublin, Ireland, 26 May 2022; p. 10. [Google Scholar]
- Papanikolaou, Y.; Staib, M.; Grace, J.; Bennett, F. Slot Filling for Biomedical Information Extraction. In Proceedings of the 21st Workshop on Biomedical Language Processing, Dublin, Ireland, 26 May 2022; p. 82. [Google Scholar]
- Phan, U.; Nguyen, N. Simple Semantic-based Data Augmentation for Named Entity Recognition in Biomedical Texts. In Proceedings of the 21st Workshop on Biomedical Language Processing, Dublin, Ireland, 26 May 2022; pp. 123–129. [Google Scholar]
- Jonnalagadda, S.R.; Adupa, A.K.; Garg, R.P.; Corona-Cox, J.; Shah, S.J. Text mining of the electronic health record: An information extraction approach for automated identification and subphenotyping of HFpEF patients for clinical trials. J. Cardiovasc. Transl. Res. 2017, 10, 313–321. [Google Scholar] [CrossRef] [PubMed]
- Doshi-Velez, F.; Ge, Y.; Kohane, I. Comorbidity clusters in autism spectrum disorders: An electronic health record time-series analysis. Pediatrics 2014, 133, e54–e63. [Google Scholar] [CrossRef] [PubMed]
- Wen, T.H.; Vandyke, D.; Mrksic, N.; Gasic, M.; Rojas-Barahona, L.M.; Su, P.H.; Ultes, S.; Young, S. A network-based end-to-end trainable task-oriented dialogue system. arXiv 2016, arXiv:1604.04562. [Google Scholar]
- Lipton, Z.; Li, X.; Gao, J.; Li, L.; Ahmed, F.; Deng, L. Bbq-networks: Efficient exploration in deep reinforcement learning for task-oriented dialogue systems. In Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; Volume 32. [Google Scholar]
- Yan, Z.; Duan, N.; Chen, P.; Zhou, M.; Zhou, J.; Li, Z. Building task-oriented dialogue systems for online shopping. In Proceedings of the AAAI Conference on Artificial Intelligence, San Francisco, CA, USA, 4–9 February 2017; Volume 31. [Google Scholar]
- Huang, Y.; Feng, J.; Hu, M.; Wu, X.; Du, X.; Ma, S. Meta-reinforced multi-domain state generator for dialogue systems. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, 5–10 July 2020; pp. 7109–7118. [Google Scholar]
- Wang, S.; Zhou, K.; Lai, K.; Shen, J. Task-completion dialogue policy learning via Monte Carlo tree search with dueling network. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Online, 16–20 November 2020; pp. 3461–3471. [Google Scholar]
- Rehman, U.U.; Chang, D.J.; Jung, Y.; Akhtar, U.; Razzaq, M.A.; Lee, S. Medical instructed real-time assistant for patient with glaucoma and diabetic conditions. Appl. Sci. 2020, 10, 2216. [Google Scholar] [CrossRef]
- Wei, Z.; Liu, Q.; Peng, B.; Tou, H.; Chen, T.; Huang, X.J.; Wong, K.F.; Dai, X. Task-oriented dialogue system for automatic diagnosis. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Melbourne, Australia, 15–20 July 2018; pp. 201–207. [Google Scholar]
- Liao, K.; Liu, Q.; Wei, Z.; Peng, B.; Chen, Q.; Sun, W.; Huang, X. Task-oriented dialogue system for automatic disease diagnosis via hierarchical reinforcement learning. arXiv 2020, arXiv:2004.14254. [Google Scholar]
- Fang, M.; Li, Y.; Cohn, T. Learning how to active learn: A deep reinforcement learning approach. arXiv 2017, arXiv:1708.02383. [Google Scholar]
- Chen, J.; Wang, Z.; Tomizuka, M. Deep hierarchical reinforcement learning for autonomous driving with distinct behaviors. In Proceedings of the 2018 IEEE Intelligent Vehicles Symposium (IV), Changshu, China, 26–30 June 2018; IEEE: Piscataway Township, NJ, USA, 2018; pp. 1239–1244. [Google Scholar]
- Liu, J.; Pan, F.; Luo, L. Gochat: Goal-oriented chatbots with hierarchical reinforcement learning. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual Event, China, 25–30 July 2020; pp. 1793–1796. [Google Scholar]
- Zhao, D.; Zhang, L.; Zhang, B.; Zheng, L.; Bao, Y.; Yan, W. Mahrl: Multi-goals abstraction based deep hierarchical reinforcement learning for recommendations. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual Event, China, 25–30 July 2020; pp. 871–880. [Google Scholar]
- Wang, X.; Chen, W.; Wu, J.; Wang, Y.F.; Wang, W.Y. Video captioning via hierarchical reinforcement learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 4213–4222. [Google Scholar]
- Zhou, X.; Bai, T.; Gao, Y.; Han, Y. Vision-based robot navigation through combining unsupervised learning and hierarchical reinforcement learning. Sensors 2019, 19, 1576. [Google Scholar] [CrossRef] [PubMed]
- Xie, R.; Zhang, S.; Wang, R.; Xia, F.; Lin, L. Hierarchical reinforcement learning for integrated recommendation. In Proceedings of the AAAI Conference on Artificial Intelligence, Online, 2–9 February 2021; Volume 35, pp. 4521–4528. [Google Scholar]
- Huang, Q.; Gan, Z.; Celikyilmaz, A.; Wu, D.; Wang, J.; He, X. Hierarchically structured reinforcement learning for topically coherent visual story generation. In Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; Volume 33, pp. 8465–8472. [Google Scholar]
- Chen, Y.; Tao, L.; Wang, X.; Yamasaki, T. Weakly supervised video summarization by hierarchical reinforcement learning. In Proceedings of the ACM Multimedia Asia, Beijing, China, 15–18 December 2019; pp. 1–6. [Google Scholar]
- Jain, D.; Iscen, A.; Caluwaerts, K. Hierarchical reinforcement learning for quadruped locomotion. In Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 3–8 November 2019; IEEE: Piscataway Township, NJ, USA, 2019; pp. 7551–7557. [Google Scholar]
- Li, T.; Lambert, N.; Calandra, R.; Meier, F.; Rai, A. Learning generalizable locomotion skills with hierarchical reinforcement learning. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; IEEE: Piscataway Township, NJ, USA, 2020; pp. 413–419. [Google Scholar]
- Budzianowski, P.; Ultes, S.; Su, P.H.; Mrkšić, N.; Wen, T.H.; Casanueva, I.; Rojas-Barahona, L.; Gašić, M. Sub-domain modelling for dialogue management with hierarchical reinforcement learning. arXiv 2017, arXiv:1706.06210. [Google Scholar]
- Saha, T.; Gupta, D.; Saha, S.; Bhattacharyya, P. Towards integrated dialogue policy learning for multiple domains and intents using hierarchical deep reinforcement learning. Expert Syst. Appl. 2020, 162, 113650. [Google Scholar] [CrossRef]
- Saha, T.; Saha, S.; Bhattacharyya, P. Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning. PLoS ONE 2020, 15, e0235367. [Google Scholar] [CrossRef] [PubMed]
- Ghandeharioun, A.; Shen, J.H.; Jaques, N.; Ferguson, C.; Jones, N.; Lapedriza, A.; Picard, R. Approximating interactive human evaluation with self-play for open-domain dialog systems. In Proceedings of the 33rd International Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, BC, Canada, 8–14 December 2019; Volume 32. [Google Scholar]
- Peng, Y.S.; Tang, K.F.; Lin, H.T.; Chang, E. Refuel: Exploring sparse features in deep reinforcement learning for fast disease diagnosis. In Proceedings of the 33rd International Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, BC, Canada, 8–14 December 2019; Volume 31. [Google Scholar]
- Hou, Z.; Liu, B.; Zhao, R.; Ou, Z.; Liu, Y.; Chen, X.; Zheng, Y. Imperfect also deserves reward: Multi-level and sequential reward modeling for better dialog management. arXiv 2021, arXiv:2104.04748. [Google Scholar]
- Teixeira, M.S.; Maran, V.; Dragoni, M. The interplay of a conversational ontology and AI planning for health dialogue management. In Proceedings of the 36th Annual ACM Symposium on Applied Computing, Virtual Event, Republic of Korea, 22–26 March 2021; pp. 611–619. [Google Scholar]
- Xu, L.; Zhou, Q.; Gong, K.; Liang, X.; Tang, J.; Lin, L. End-to-end knowledge-routed relational dialogue system for automatic diagnosis. In Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; Volume 33, pp. 7346–7353. [Google Scholar]
- Liu, S.; Chen, H.; Ren, Z.; Feng, Y.; Liu, Q.; Yin, D. Knowledge diffusion for neural dialogue generation. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Melbourne, Australia, 15–20 July 2018; pp. 1489–1498. [Google Scholar]
- Xia, Y.; Zhou, J.; Shi, Z.; Lu, C.; Huang, H. Generative adversarial regularized mutual information policy gradient framework for automatic diagnosis. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 1062–1069. [Google Scholar]
- Kao, H.C.; Tang, K.F.; Chang, E. Context-aware symptom checking for disease diagnosis using hierarchical reinforcement learning. In Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; Volume 32. [Google Scholar]
Success Rate | Average Dialogue Turns | Match Rate | |
---|---|---|---|
SVM-ex | 0.321 | / | / |
DQN | 0.343 | 2.455 | 0.045 |
KR-DQN | 0.395 | 7.120 | 0.067 |
REFUEL | 0.416 | 8.551 | 0.089 |
GAMP | 0.409 | 3.535 | 0.057 |
HRL-pretrained | 0.452 | 6.838 | / |
HRL | 0.504 | 12.959 | 0.105 |
KNHRL | 0.558 | 20.984 | 0.333 |
SVM-ex-im | 0.732 | / | / |
Success Rate | Average Number of Dialogue Turns | Match Rate | |
---|---|---|---|
KNHRL | 0.558 | 20.984 | 0.333 |
-dl | 0.545 | 20.102 | 0.315 |
-rel | 0.522 | 18.334 | 0.179 |
-kg | 0.526 | 19.568 | 0.186 |
-hrl | 0.426 | 6.105 | 0.082 |
-all | 0.343 | 2.455 | 0.045 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhu, Y.; Li, Y.; Cui, Y.; Zhang, T.; Wang, D.; Zhang, Y.; Feng, S. A Knowledge-Enhanced Hierarchical Reinforcement Learning-Based Dialogue System for Automatic Disease Diagnosis. Electronics 2023, 12, 4896. https://doi.org/10.3390/electronics12244896
Zhu Y, Li Y, Cui Y, Zhang T, Wang D, Zhang Y, Feng S. A Knowledge-Enhanced Hierarchical Reinforcement Learning-Based Dialogue System for Automatic Disease Diagnosis. Electronics. 2023; 12(24):4896. https://doi.org/10.3390/electronics12244896
Chicago/Turabian StyleZhu, Ying, Yameng Li, Yuan Cui, Tianbao Zhang, Daling Wang, Yifei Zhang, and Shi Feng. 2023. "A Knowledge-Enhanced Hierarchical Reinforcement Learning-Based Dialogue System for Automatic Disease Diagnosis" Electronics 12, no. 24: 4896. https://doi.org/10.3390/electronics12244896
APA StyleZhu, Y., Li, Y., Cui, Y., Zhang, T., Wang, D., Zhang, Y., & Feng, S. (2023). A Knowledge-Enhanced Hierarchical Reinforcement Learning-Based Dialogue System for Automatic Disease Diagnosis. Electronics, 12(24), 4896. https://doi.org/10.3390/electronics12244896