Evaluating Patients’ Experiences with Healthcare Services: Extracting Domain and Language-Specific Information from Free-Text Narratives
Abstract
:1. Introduction
2. Materials and Methods
2.1. Participants and Materials
2.2. Ethics Approval
2.3. Characteristics of Texts and Preprocessing of Textual Data
2.4. Research Questions about Patients’ Experiences
- Language-communication difficulties experienced by patients during interaction with healthcare unit personnel;
- Type of visit to the healthcare facility—seeking medical checkup vs. treatment for illness or injury;
- Type of medical personnel involved in the provision of health services to patients;
- The number of healthcare facilities patients visit on one occasion of seeking treatment.
2.5. Development of the Lexicon and Lexical–Syntactic Frames for the Automatic Query of Texts
2.6. Development of Algorithmic Rules for Automatic Queries of Texts
2.7. Formulation of Natural Language Questions for Human Readers’ Query of Texts
- Q1. Does the text mention any difficulties in language communication?
- Q2. Does the text mention illness or injury as a reason for contact with a healthcare facility?
- Q3a. Does the text mention any physicians or physicians involved in providing health services?
- Q3b. Does the text mention personnel other than physicians involved in providing health services?
- Q4. Does the text mention seeking medical help in more than one healthcare facility?
3. Results
3.1. Statistics for the Text Collection
3.2. Results of the Python Script Classifications
3.3. Results of Human Readers’ Classifications
3.4. Results for Correlations of the Automatic and Human Reader Classifications
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Fette, G.; Ertl, M.; Wörner, A.; Kluegl, P.; Störk, S.; Puppe, F. Information Extraction from Unstructured Electronic Health Records and Integration into a Data Warehouse. In Lecture Notes in Informatics (LNI), Proceedings—Series of the Gesellschaft fur Informatik (GI); Gesellschaft für Informatik: Bonn, Germany, 2012; Volume 208. [Google Scholar]
- Härkänen, M.; Paananen, J.; Murrells, T.; Rafferty, A.M.; Franklin, B.D. Identifying Risks Areas Related to Medication Administrations—Text Mining Analysis Using Free-Text Descriptions of Incident Reports. BMC Health Serv. Res. 2019, 19, 791. [Google Scholar] [CrossRef] [PubMed]
- Gupta, A.; Banerjee, I.; Rubin, D.L. Automatic Information Extraction from Unstructured Mammography Reports Using Distributed Semantics. J. Biomed. Inform. 2018, 78, 78–86. [Google Scholar] [CrossRef] [PubMed]
- Toyabe, S.I. Detecting Inpatient Falls by Using Natural Language Processing of Electronic Medical Records. BMC Health Serv. Res. 2012, 12, 448. [Google Scholar] [CrossRef] [PubMed]
- Wang, Y.; Wang, L.; Rastegar-Mojarad, M.; Moon, S.; Shen, F.; Afzal, N.; Liu, S.; Zeng, Y.; Mehrabi, S.; Sohn, S.; et al. Clinical Information Extraction Applications: A Literature Review. J. Biomed. Inform. 2018, 77, 34–49. [Google Scholar] [CrossRef] [PubMed]
- Mykowiecka, A.; Marciniak, M.; Kupść, A. Rule-Based Information Extraction from Patients’ Clinical Data. J. Biomed. Inform. 2009, 42, 923–936. [Google Scholar] [CrossRef]
- Lorenzoni, G.; Rampazzo, R.; Buratin, A.; Berchialla, P.; Gregori, D. Does the Integration of Pre-Coded Information with Narratives Improve in-Hospital Falls’ Surveillance? Appl. Sci. 2021, 11, 4406. [Google Scholar] [CrossRef]
- De Silva, D. Evidence Scan: Measuring the Patient Experience; The Health Foundation: London, UK, 2013; Volume 18. [Google Scholar]
- Sowada, C.; Sagan, A.; Kowalska-Bobko, I.; Badora-Musial, K.; Bochenek, T.; Domagala, A.; Dubas-Jakobczyk, K.; Kocot, E.; Mrozek-Gasiorowska, M.; Sitko, S.; et al. Poland: Health System Review. Health Syst. Transit. 2019, 21, 1–234. [Google Scholar]
- Fiorio, C.V.; Gorli, M.; Verzillo, S. Evaluating Organizational Change in Health Care: The Patient-Centered Hospital Model. BMC Health Serv. Res. 2018, 18, 95. [Google Scholar] [CrossRef]
- Zaharias, G. What Is Narrative-Based Medicine? Narrative-Based Medicine 1. Can. Fam. Physician 2018, 64, 176–180. [Google Scholar]
- Fioretti, C.; Mazzocco, K.; Riva, S.; Oliveri, S.; Masiero, M.; Pravettoni, G. Research Studies on Patients’ Illness Experience Using the Narrative Medicine Approach: A Systematic Review. BMJ Open 2016, 6, e011220. [Google Scholar] [CrossRef]
- Remein, C.D.F.; Childs, E.; Pasco, J.C.; Trinquart, L.; Flynn, D.B.; Wingerter, S.L.; Bhasin, R.M.; Demers, L.B.; Benjamin, E.J. Content and Outcomes of Narrative Medicine Programmes: A Systematic Review of the Literature through 2019. BMJ Open 2020, 10, e031568. [Google Scholar] [CrossRef] [PubMed]
- East Lancashire Hospitals NHS, UK. Patient, Family and Carer’s Experience Strategy. Available online: https://www.elht.nhs.uk/application/files/6115/2086/9299/PatientFamily_and_Carers_Experience_Strategy0_8.pdf (accessed on 31 December 2020).
- NSW Agency for Clinical Innovation, Australia. Collecting Patient & Carer Stories. Available online: https://aci.health.nsw.gov.au/resources/patient-experience/collect-stories/collect_stories/Guide_to_collect_patient_carer_stories.pdf (accessed on 31 December 2020).
- Shropshire Community Health NHS, UK. Using Stories to Improve Patient, Carer and Staff Experiences and Outcomes. Available online: https://www.england.nhs.uk/6cs/wp-content/uploads/sites/25/2015/09/scht-storytelling-toolkit.pdf (accessed on 31 December 2020).
- Jacennik, B. On Digital Health Research Priorities: From Telemedicine to Telehealth. Int. Healthc. Rev. 2022, 1. [Google Scholar] [CrossRef]
- Singh, N.; Hu, C.; Roehl, W.S. Text Mining a Decade of Progress in Hospitality Human Resource Management Research: Identifying Emerging Thematic Development. Int. J. Hosp. Manag. 2007, 26, 131–147. [Google Scholar] [CrossRef]
- Vankka, J.; Myllykoski, H.; Peltonen, T.; Riippa, K. Sentiment Analysis of Finnish Customer Reviews. In Proceedings of the 2019 6th International Conference on Social Networks Analysis, Management and Security, SNAMS 2019, Granada, Spain, 22–25 October 2019. [Google Scholar] [CrossRef]
- Buenano-Fernandez, D.; Gonzalez, M.; Gil, D.; Lujan-Mora, S. Text Mining of Open-Ended Questions in Self-Assessment of University Teachers: An LDA Topic Modeling Approach. IEEE Access 2020, 8, 35318–35330. [Google Scholar] [CrossRef]
- Meystre, S.M.; Kim, Y.; Gobbel, G.T.; Matheny, M.E.; Redd, A.; Bray, B.E.; Garvin, J.H. Congestive Heart Failure Information Extraction Framework for Automated Treatment Performance Measures Assessment. J. Am. Med. Inform. Assoc. 2017, 24, e40–e46. [Google Scholar] [CrossRef]
- Achilonu, O.J.; Olago, V.; Singh, E.; Eijkemans, R.M.J.C.; Nimako, G.; Musenge, E. A Text Mining Approach in the Classification of Free-Text Cancer Pathology Reports from the South African National Health Laboratory Services. Information 2021, 12, 451. [Google Scholar] [CrossRef]
- Adnan, K.; Akbar, R. An Analytical Study of Information Extraction from Unstructured and Multidimensional Big Data. J. Big Data 2019, 6, 91. [Google Scholar] [CrossRef]
- Yamanishi, K.; Li, H. Mining Open Answers in Questionnaire Data. IEEE Intell. Syst. 2002, 17, 58–63. [Google Scholar] [CrossRef]
- Chen, Y.L.; Weng, C.H. Mining Fuzzy Association Rules from Questionnaire Data. Knowl.-Based Syst. 2009, 22, 46–56. [Google Scholar] [CrossRef]
- Hämäläinen, W.; Joy, M.; Berger, F.; Huttunen, S. Clustering Students’ Open-Ended Questionnaire Answers. arXiv 2018, arXiv:1809.07306. [Google Scholar]
- Hirasawa, S.; Shih, F.-Y.; Yang, W.-T. Student Questionnaire Analyses for Class Management by Text Mining Both in Japanese and in Chinese. In Proceedings of the 2007 IEEE International Conference on Systems, Man and Cybernetics, Montreal, Que, 7–10 October 2007; pp. 398–405. [Google Scholar] [CrossRef]
- Cohen, S. Bayesian analysis in natural language processing. Synth. Lect. Hum. Lang. Technol. 2016, 9, 1–274. [Google Scholar]
- Spasić, I.; Owen, D.; Smith, A.; Button, K. KLOSURE: Closing in on Open–Ended Patient Questionnaires with Text Mining. J. Biomed. Semant. 2019, 10, 24. [Google Scholar] [CrossRef] [PubMed]
- Menendez, M.E.; Shaker, J.; Lawler, S.M.; Ring, D.; Jawa, A. Negative Patient-Experience Comments After Total Shoulder Arthroplasty. JBJS 2019, 101, 330–337. [Google Scholar] [CrossRef] [PubMed]
- Xiong, W.; Wu, L.; Alleva, F.; Droppo, J.; Huang, X.; Stolcke, A. The Microsoft 2017 Conversational Speech Recognition System. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, 15–20 April 2017. [Google Scholar]
- Jacennik, B.; Zwierzchowski, T. Text Analysis (AT). Program for Quantitative Content Analysis of Text Collections, Version 0.6.8.0; University of Finance and Management in Warsaw: Warsaw, Poland, 2011. [Google Scholar]
- Jacennik, B. AT program applications to content analysis of Internet texts on health, disease and treatment. Presented at Health Communication Conference, Kraków, Poland, 17 January 2014. [Google Scholar]
- Cambria, E.; White, B. Jumping NLP curves: A review of natural language processing research. IEEE Comput. Intell. Mag. 2014, 9, 48–57. [Google Scholar] [CrossRef]
- Young, T.; Hazarika, D.; Poria, S.; Cambria, E. Recent trends in deep learning based natural language processing. IEEE Comput. Intell. Mag. 2018, 13, 55–75. [Google Scholar] [CrossRef]
- Hahn, U.; Oleynik, M. Medical Information Extraction in the Age of Deep Learning. Yearb. Med. Inform. 2020, 29, 208–220. [Google Scholar] [CrossRef]
- Marquez, L.; Salgado, J.G. Machine Learning and Natural Language Processing. 2000. Available online: https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.31.3498 (accessed on 1 January 2022).
- Nagarhalli, T.P.; Vaze, V.; Rana, N.K. Impact of machine learning in natural language processing: A review. In Proceedings of the 2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), Tirunelveli, India, 4–6 February 2021; pp. 1529–1534. [Google Scholar]
- Choi, J.D.; Tetreault, J.; Stent, A. It Depends: Dependency Parser Comparison Using A Web-Based Evaluation Tool. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Beijing, China, 26–31 July 2015; pp. 387–396. [Google Scholar] [CrossRef]
- McKinney, W. Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2012. [Google Scholar]
- Srinivasa-Desikan, B. Natural Language Processing and Computational Linguistics: A Practical Guide to Text Analysis with Python, Gensim, spaCy, and Keras; Packt Publishing Ltd.: Birmingham, UK, 2018. [Google Scholar]
- Final Code. COLAB. Available online: http://bit.do/fMfBe (accessed on 1 January 2022).
- Gauthier, T.D. Detecting Trends Using Spearman’s Rank Correlation Coefficient. Environ. Forensics 2001, 2, 359–362. [Google Scholar] [CrossRef]
- Field, A. Discovering Statistics Using IBM SPSS Statistics; SAGE Publications Ltd.: Thousand Oaks, CA, USA, 2013; Volume 58. [Google Scholar]
- Gómez-Rodríguez, C.; Alonso-Alonso, I.; Vilares, D. How Important Is Syntactic Parsing Accuracy? Empir. Eval. Rule-Based Sentim. Anal. 2017, 52, 2081–2097. [Google Scholar] [CrossRef]
- Radford, A.; Wu, J.; Child, R.; Luan, D.; Amodei, D.; Sutskever, I. Language Models Are Unsupervised Multitask Learners. OpenAI Blog 2019, 1, 9. [Google Scholar]
- Devlin, J.; Chang, M.-W.; Lee, K.; Google, K.T.; Language, A.I. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar]
- Dong, E.; Du, H.; Gardner, L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 2020, 20, 533–534. [Google Scholar] [CrossRef]
Unit of Analysis | N (Total) | Mean (in Text) | Min (in Text) | Max (in Text) |
---|---|---|---|---|
Texts | 104 | |||
Sentences | 958 | 9.21 | 3 | 23 |
Wordform tokens | 13,632 | 131.08 | 29 | 279 |
Wordform types | 1445 | 13.89 |
Questions | Reader 1 | % | Reader 2 | % | Readers’ Mean | % | Python Script | % |
---|---|---|---|---|---|---|---|---|
Q1. Communication | 41 | 39.4 | 41 | 39.4 | 41 | 39.4 | 43 | 41.3 |
Q2. Type of Service | 87 | 83.7 | 81 | 77.9 | 84 | 80.8 | 79 | 76.0 |
Q3a. Physicians | 80 | 76.9 | 80 | 76.9 | 80 | 76.9 | 80 | 76.9 |
Q3b. Other Personnel | 55 | 52.9 | 65 | 62.5 | 60 | 57.7 | 37 | 35.6 |
Q4. Change of HCU | 13 | 12.5 | 11 | 10.6 | 12 | 11.5 | 9 | 8.7 |
No. | Questions on Patients’ Experience | Rho Spearman Rank Coefficient |
---|---|---|
Q1. | Does the text mention any difficulties in language communication? | 0.881 ** |
Q2. | Does the text mention illness or injury as a reason for contact with a healthcare facility? | 0.921 ** |
Q3a. | Does the text mention any physician or physicians involved in providing health services? | 0.941 ** |
Q3b. | Does the text mention personnel other than a doctor who is involved in providing health services? | 0.511 ** |
Q4. | Does the text mention seeking medical help in more than one healthcare facility? | 0.831 ** |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Jacennik, B.; Zawadzka-Gosk, E.; Moreira, J.P.; Glinkowski, W.M. Evaluating Patients’ Experiences with Healthcare Services: Extracting Domain and Language-Specific Information from Free-Text Narratives. Int. J. Environ. Res. Public Health 2022, 19, 10182. https://doi.org/10.3390/ijerph191610182
Jacennik B, Zawadzka-Gosk E, Moreira JP, Glinkowski WM. Evaluating Patients’ Experiences with Healthcare Services: Extracting Domain and Language-Specific Information from Free-Text Narratives. International Journal of Environmental Research and Public Health. 2022; 19(16):10182. https://doi.org/10.3390/ijerph191610182
Chicago/Turabian StyleJacennik, Barbara, Emilia Zawadzka-Gosk, Joaquim Paulo Moreira, and Wojciech Michał Glinkowski. 2022. "Evaluating Patients’ Experiences with Healthcare Services: Extracting Domain and Language-Specific Information from Free-Text Narratives" International Journal of Environmental Research and Public Health 19, no. 16: 10182. https://doi.org/10.3390/ijerph191610182
APA StyleJacennik, B., Zawadzka-Gosk, E., Moreira, J. P., & Glinkowski, W. M. (2022). Evaluating Patients’ Experiences with Healthcare Services: Extracting Domain and Language-Specific Information from Free-Text Narratives. International Journal of Environmental Research and Public Health, 19(16), 10182. https://doi.org/10.3390/ijerph191610182