Next Article in Journal
Exergames in Older Adult Community Centers and Nursing Homes to Improve Balance and Minimize the Risk of Falls in Older Adults: A Systematic Review and Meta-Analysis
Next Article in Special Issue
Long-Term Impacts of COVID-19 Pneumonia on Quality of Life: A Single Institutional Pilot Study
Previous Article in Journal
The Moderating Effects of Self-Care on the Relationships between Perceived Stress, Job Burnout and Retention Intention in Clinical Nurses
Previous Article in Special Issue
Public Health Challenges in Saudi Arabia during the COVID-19 Pandemic: A Literature Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Topic Modeling Analysis of Diabetes-Related Health Information during the Coronavirus Disease Pandemic

College of Nursing Science, Kyung Hee University, 26, Kyunghee-daero, Dongdaemun-gu, Seoul 02447, Republic of Korea
*
Author to whom correspondence should be addressed.
Healthcare 2023, 11(13), 1871; https://doi.org/10.3390/healthcare11131871
Submission received: 19 May 2023 / Revised: 15 June 2023 / Accepted: 22 June 2023 / Published: 27 June 2023
(This article belongs to the Collection COVID-19: Impact on Public Health and Healthcare)

Abstract

:
This study aimed to provide diabetes-related health information by analyzing queries posted in the diabetes-related online community required during the COVID-19 pandemic. A total of 9156 queries from the diabetes-related online community, dated between 1 December 2019 and 3 May 2022, were used in the study. The collected data were preprocessed for bidirectional encoder representation from transformer topic modeling analysis. Topics were extracted using the class-based term frequency–inverse document frequency for nouns and verbs. From the extracted verbs, words with common definitions were subject to substitution and unification processes, which enabled the identification of multifrequent verb categories by noun topics. The following nine noun topics were extracted, in this order: dietary management, drug management, gestational and childhood diabetes, management of diabetic complications, use and cost of medical treatment, blood glucose management, exercise treatment, COVID-19 vaccine and complications, and diabetes in older adults. The top three verb categories by noun topics were permission, method, and possibility. This study provided baseline data that can be used by clinical nurses to deliver diabetes-related education and management based on information sought by patients.

1. Introduction

Coronavirus disease 2019 (COVID-19) is a viral disease that was first identified on 1 December 2019 in Wuhan, a city in Hubei Province, China. Since then, there have been over 92 million confirmed cases and over 1.98 million deaths around the world [1]. The World Health Organization declared it a global pandemic [2], and despite the national emergency responses implemented by each country for infectious disease control, COVID-19 has continued to affect numerous people globally, and there is heightened concern about the emergence of new variants with greater infectivity around the world [3]. The clinical features of COVID-19 vary from the absence of symptoms to fever, coughing, breathing difficulty, and diarrhea. Although the initial symptoms may be mild, progression to a severe form of the disease could lead to death, particularly in patients with pre-existing conditions, such as diabetes [4].
During the COVID-19 pandemic, public health centers and public hospitals responsible for providing healthcare to patients with diabetes were designated as COVID-19 hub hospitals. Consequently, patients with diabetes faced difficulties with adherence to medication and participation in consultation and diabetes education sessions, due to the financial burden of having to use private hospitals [5]. Moreover, delayed diagnosis of diabetes caused a delay in active management and was conducive to the onset of complications. The diagnosis rate of type 1 diabetes mellitus (T1DM) decreased by 23% in 2020, relative to that in 2019, while 44.3% of patients with a first-time diagnosis of T1DM also had diabetic ketoacidosis [6]. In particular, 5.8% of patients with diabetes in Republic of Korea did not receive or delay their outpatient care during the COVID-19 pandemic [7]. Patients with diabetes represent a group vulnerable to infectious diseases, such as influenza and pneumonia, and when infected with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), patients with diabetes have a 2.12-fold higher risk of COVID-19-related death and a 2.45-fold higher risk of progression to severe COVID-19 than healthy individuals [8]. Despite the need for strict healthcare and health information for self-care in daily life during the pandemic, the COVID-19-related measures restricted the necessary access [9].
Diabetes self-care refers to taking medications for the disease, controlling the diet, monitoring blood glucose, performing foot care and exercise, and refraining from smoking [10]. Diabetes self-care is related to many aspects of activities of daily living. Consequently, patients with diabetes seek information to resolve various questions they have about their own healthcare [11], as having access to additional disease-related information can increase treatment adherence and enhance self-care ability, which can be an important factor in preventing complications in patients with diabetes [12]. In addition, when patients with diabetes are unable to understand the information provided by healthcare professionals fully, they tend to seek additional information from other sources, such as family members and acquaintances, while they also spend much time in online searches, which are less time-consuming and have fewer spatial constraints [13].
Social lockdown in response to COVID-19 resulted in less time spent on walking and physical activities, while increasing sedentary time and the consumption of foods and snacks containing sugar [14]. Patients with diabetes have reported that they experienced a lack of physical activities and dietary management since the start of the COVID-19 pandemic [15]. While patients with diabetes developed elevated glycated hemoglobin (HbA1c), blood glucose, low-density lipoprotein cholesterol, and triglyceride levels during social lockdown, they also faced difficulties in receiving professional care for these elevated levels as well as in obtaining education or information for self-care in daily life [16]. The worsening of diabetes during this pandemic due to a decline in patients’ ability to self-manage their disease during the COVID-19-related social lockdown [15] highlights the need for providing more specific and diverse information to patients with diabetes during such an epidemic. In this respect, it is also necessary to identify what information patients with diabetes require to practice self-care. A review of studies on patients with diabetes during the COVID-19 pandemic showed the impact of social lockdown policies on glycemic control in patients with diabetes [17]; the immunity and response to COVID-19 of patients with diabetes [18]; COVID-19-related mortality rate among patients with diabetes [19]; and depression and fear related to COVID-19 [20]. In Republic of Korea, a study has reported on the unmet needs of patients with diabetes according to income [21], but studies related to health information for self-care among patients with diabetes in Republic of Korea are limited.
For healthcare professionals, identifying the information needs of patients and providing appropriate and accurate information to patients are important aspects of patient care [22]. Because interactions between patients and healthcare professionals during an epidemic are limited, it is necessary for healthcare professionals to provide reliable healthcare information to patients with diabetes to help them maintain self-care during daily life [17]. To this end, the health information that patients need for self-care during an epidemic must be identified. Analyzing questions asked by patients is one method that can enable the timely identification of the health information needs of these patients [23]. The bidirectional encoder representation from transformers (BERT) topic model is an analytical method with outstanding performance for natural language inference and analysis of Q&A records [24]. The BERT topic model can identify topics with latent meaning within text by means of text clustering [25]. It can extract topics more effectively than latent Dirichlet allocation [26], and thus, it is effective for analyzing the questions of patients with diabetes.
Accordingly, this study aimed to collect diabetes-related questions posted in an online community website during the COVID-19 pandemic to analyze the major keywords for diabetes-related health information, for the purpose of identifying the types of diabetes-related health information patients required and for providing basic data reflecting such health information.

2. Materials and Methods

2.1. Research Design

This text analysis study aimed to collect and analyze diabetes-related questions posted on an online diabetes-related community website during the COVID-19 pandemic to identify diabetes-related health information for the purpose of providing basic data to clinical nurses who care for patients with diabetes for devising self-care education plans.

2.2. Data Collection

The data used in this study consisted of diabetes self-care-related questions posted on a diabetes-related online community website. The online community website was selected based on the traffic volume, number of members, and number of posts. The online community website that was ultimately selected had 200,000 members and an average of approximately 120,000 posts per month. The data were used with permission from the administrator of the website. The data collection period was from 1 December 2019, which was the stage of concern regarding COVID-19 in Republic of Korea, to 3 May 2022, when the requirement for indoor mask wearing was officially lifted. The data were collected based on the title of the question, contents of the question, and posted date appearing in the bulletin board for “comprehensive counseling” within the diabetes consultation section of that website. Among a total of 9316 posts, a total of 9155 posts were used in the final analysis after excluding posts that were not related to diabetes.

2.3. Data Preprocessing

The present study used BERT topic modeling and the data analysis procedures involved data preprocessing, document embedding, document clustering, and topic extraction. The specific details are described below. The natural language processing of the collected data consisted of pruning, tokenization, normalization, and stop-word removal, which refers to a series of processes, such as the removal of stop words, including special characters, symbols, and meaningless words in the posts, as well as the correction of spacing and grammatical revision. Subsequently, a unification of duplicate and blending words among words extracted using frequency analysis was required. In this study, PyKoSpacing, a package for automatic Korean word spacing, was used to correct spacing errors and the Python “replace” command was used to revise typos and misspelled English expressions and keywords, after which Python regular expressions were used to remove special characters, including emojis. Moreover, the Python module Mecab, a Korean morpheme analyzer, was used to add a user dictionary of compound words and words with a common meaning were substituted and unified based on the Python dictionary, followed by the removal of stop words to complete the data preprocessing. In this study, the Ko-sentence-transformers model was used for sentence embedding. The Ko-sentence-transformers model (Figure 1), a type of KoBERT for processing the Korean language, is a semisupervised learning model that can be applied to various natural language processing tasks. BERT limits the number of characters that can be included in one segment to a maximum of 512 characters. Table 1 shows a KoBERT-based solution. Thus, embedding was performed after dividing the length of the sentence into 512 characters to create a list. For document clustering, the uniform manifold approximation and projection (UMAP) algorithm was used to reduce the high dimensionality of local structure to a lower dimensionality after sentence embedding. N_neighbors is a hyperparameter that affects the performance of UMAP. Generally, a range of 5–50 is applied for n_neighbors, while a range of 10–15 is recommended as the most reasonable range [27]. In this study, the n_neighbors parameter was initially set to 10–15 in consideration of the data size, and after comparison of document clustering, n_neighbors was set to 15 and the metric was set to cosine. After structural dimensionality reduction using UMAP, a hierarchical density-based spatial clustering of applications with noise (HDBSCAN) was used for the final clustering. HDBSCAN includes the hyperparameter min_cluster_size [28], and in this study, a scatter plot of the clusters with min_cluster_size varying between 15 and 30 was checked, and the final analysis was performed by setting the min_cluster_size to 30. To check the specific details of the types of health information, noun and verb topics were extracted. English words and numbers were removed using Python regular expression, and noun topics were extracted via class-based term frequency–inverse document frequency (c-TF-IDF). c-TF-IDF is a method in which the sentences in each class are combined into a single document and the TF-IDF of the corresponding class is derived. This offers the advantage of demonstrating the topic of each class more accurately [29]. The overall analysis procedure is shown in Figure 2. In topic model analysis, qualitative analysis can be used to compensate for the limitations of quantitative analysis [30]. Thus, this study applied quantitative analysis using a scatter plot confirmed after HDBSCAN to select and interpret the appropriate number of topics, and a group of three experts (one nursing college professor, one doctor of nursing, and one nursing PhD student) was formed as a qualitative evaluation method to compensate for the limitations of quantitative analysis. For the interpretation of the topics derived through topic modeling, the experts repeatedly traced the data analysis results back to the original sentences and compared and contrasted them with the original data to exclude subjectivity in the meaning and interpretation of the topics as much as possible and to ensure the rigor of the topic names and interpretations. For the extraction of verb topics, high-frequency verbs were identified with c-TF-IDF. Among the extracted verbs, words with a common meaning were substituted and unified by the experts above, to check the categories of high-frequency verbs for each noun topic. To determine the categories of verbs, this study used the wh-questions theory proposed by Park [31].

2.4. Ethical Considerations

The present study received institutional review board review exemption (KHSIRB-22-455 [EA]).

3. Results

3.1. Noun Topic Modeling

When the UMAP algorithm and HDBSCAN were applied for document clustering, document clusters were formed as shown in Figure 3. Ultimately, nine clusters were selected in consideration of the meaning and ease of clustering. After checking c-TF-IDF of the derived clusters to identify the top 20 terms, the terms of the first topic were related to the COVID-19 vaccine and complications, including “COVID-19 vaccine”, “vaccination”, “thrombosis”, “antibody”, and “death”. The terms of the second topic were related to diabetes in older individuals, including “mother”, “father”, “insulin”, “management”, “hospital”, “surgery”, “age”, and “health”. The terms of the third topic were related to glycemic control, including “blood glucose monitor”, “use”, “blood glucose check”, “difference”, “lancet”, and “code”. The terms of the fourth topic were related to dietary management, including “after meal”, “diet”, “salad”, “carbohydrates”, “vegetables”, “eggs”, “side dishes”, “brown rice”, and “protein”. The terms of the fifth topic were related to exercise therapy, including “exercise”, “squat”, “knee”, “bicycle”, “aerobic exercise”, and “treadmill”. The terms of the sixth topic were related to pregnancy and pediatric diabetes, including “pregnancy”, “childbirth”, “children”, “babies”, “management”, “examination”, “HbA1c”, “lactation”, “breast milk”, and “impaired glucose tolerance”. The terms of the seventh topic were related to medication management, including “oral hypoglycemic agents”, “dosage”, “prescription”, and “nutritional supplements” and “side effects”. The terms of the eighth topic were related to the management of diabetic complications, including “pain”, “toenails”, “toes”, “neuropathy”, “sensation”, and “inflammation”. The terms of the ninth topic were related to healthcare utilization and cost, including “insurance”, “examination”, “hospital”, “ophthalmology”, “dispute”, “surgery”, “enrollment”, and “treatment” (Table 2). Based on this, the title of the topics for each cluster were assigned as COVID-19 vaccine and complications; diabetes in older individuals; glycemic control; dietary management; exercise therapy; pregnancy and pediatric diabetes; medication management; the management of diabetic complications; and healthcare utilization and cost. The frequency of document for each topic was as follows: dietary management, 2243 cases (26.5%); medication management, 1327 cases (15.7%); pregnancy and pediatric diabetes, 1140 cases (13.5%); management of diabetic complications, 996 cases (11.8%); healthcare utilization and cost, 925 cases (10.9%); glycemic control, 803 cases (9.5%); exercise therapy, 458 cases (5.4%); COVID-19 vaccine and complications, 303 cases (3.6%); and diabetes in older individuals, 270 cases (3.2%) (Table 3).

3.2. Verb Categories by Noun Topics

Major verb categories by noun topic are shown in Table 4. The frequency of questions for each topic appeared in the following order: permission (n = 99, 32.1%), method (n = 92, 29.9%), and possibility (n = 47, 15.3%) for the topic of COVID-19 vaccine and complications; method (n = 118, 43.7%), possibility (n = 39, 14.4%), and definition (n = 33, 12.2%) for the topic of diabetes in older individuals; method (n = 293, 36.5%), possibility (n = 127, 15.8%), and permission (n = 109, 13.6%) for the topic of glycemic control; permission (n = 906, 40.4%), method (n = 695, 29.4%), and possibility (n = 220, 9.8%) for the topic of dietary management; method (n = 222, 48.5%), permission (n = 87, 19.0%), and possibility (n = 68, 14.8%) for the topic of exercise therapy; method (n = 494, 43.3%), permission (n = 276, 24.2%), and possibility (n = 160, 14.0%) for the topic of pregnancy and pediatric diabetes; permission (n = 488, 36.8%), method (n = 459, 34.6%), and possibility (n = 155, 11.7%) for the topic of medication management; method (n = 396, 39.8%), permission (n = 264, 26.5%), and possibility (n = 132, 13.3) for the topic of management of diabetic complications; and method (n = 437, 47.2%), possibility (n = 157, 17.0%), and permission (n = 136, 14.7%) for the topic of healthcare utilization and cost.

4. Discussion

The present study aimed to discuss the findings derived through BERT topic modeling and cTF-IDF performed on diabetes-related questions posted in an online community website during the COVID-19 pandemic. BERT is bidirectional, unlike existing NLP models that were “unidirectional”, and uses MLM techniques to predict a specific word based on other words that exist in the sequence, so that the model learns the context which is said to understand. However, it can be difficult to understand the context, which is the main purpose of MLM, because randomly assigned masked words can select meaningless words rather than main words. In an effort to compensate for these shortcomings, this study applied a qualitative evaluation method using experts. Nine topics were finally extracted via noun topic modeling. The topics were similar to some items in the information provision categories of the Korean Diabetes Association (KDA), including “nutritional management”, “drug therapy”, and “diabetic complications”. Zhou and Ni [22] identified “diagnosis”, “treatment”, “lifestyle”, “complications”, “maternity”, “disease course”, “healthcare professionals”, “prevention”, and others via clustering diabetes-related questions posted on a Chinese health information website (http://www.39.net, accessed on 30 May 2022), among which the topic of “complications” was consistent with the topics in the present study. A study by Xusheng and Likeng [32] identified “body mass index”, “HbA1c”, “hypertension”, and “creatinine” by analyzing diabetes-related questions posted in Channel 39, a website operated by experts, which were different from the topics identified in the present study. Such differences could be attributed to the fact that the data in the present study were collected during the COVID-19 pandemic, and as a result, there may be some differences from data that were generated prior to the COVID-19 pandemic. Moreover, questions posted in the counseling section of the KDA website were rearranged and categorized by different fields, whereas the study by Xusheng and Likeng [32] collected questions from a diabetes-related expert website instead of a website for the general public. The present study confirmed that the classification of questions may vary depending on the questions frequently asked in specific situations and during particular periods, and that it may also vary according to the characteristics of the website users. Accordingly, nurses should also consider the environment and characteristics of patients with diabetes and changes in society when providing care to patients with diabetes as a clinical practice.
With respect to the specific details of each noun topic, the terms belonging to the topic of the COVID-19 vaccine and complications were “COVID-19 vaccine”, “vaccination”, “appointment”, “complications”, “thrombosis”, “pain”, and “medication”, indicating a high demand for information regarding COVID-19 vaccination and adverse events. While direct comparison is difficult due to a lack of existing studies on this topic, SARS-CoV-2 is known to increase the severity of COVID-19 in patients with chronic underlying diseases, and in particular, patients with diabetes face a higher risk of developing serious complications, being hospitalized, and progressing to severe COVID-19 than patients without diabetes. Additionally, a new onset of diabetes as a complication of COVID-19 has also been reported [8,9]. Therefore, patients with diabetes may have had high demand for information regarding COVID-19 prevention and infection control methods during the COVID-19 pandemic. The American Diabetes Association and the Johns Hopkins Patient Guide to Diabetes provide COVID-19-related information to patients with diabetes through their own websites [33,34] in efforts to reduce anxiety about emerging infectious disease among patients with diabetes [31]. However, patients with diabetes in Republic of Korea faced many limitations in obtaining relevant information from healthcare professionals during this pandemic [5], and as a result, they may have attempted to obtain COVID-19-related information through easily accessible online communities [35,36].
The terms included in the topic of diabetes in older individuals consisted of “mother”, “father”, “hypoglycemia”, “insulin”, “shock”, “complications”, and “treatment”, which indicated that guardians faced many difficulties in their care for a family member who uses insulin. Such findings were reconfirmed in studies by Dikkers and Dunning [37] and Holtz [38], which reported that the scope of providing diabetes-related health information needs to be expanded to include not only the patient but also the family members, due to the recent increase in the number of families caring for older patients with diabetes as well as the number of patients with diabetes. In particular, with the increased management of older patients with diabetes by family members during the social lockdown period caused by the COVID-19 pandemic, the demand for diabetes self-care information by means of websites also increased. In particular, in the unique situation created by an outbreak of an infectious disease, when access to healthcare professionals may not be easy, obtaining unreliable information through websites could cause other complications or progression to more severe disease. Considering these factors, it is necessary for certified institutions to provide information to patients with diabetes and their family to promote effective and efficient healthcare, while also providing information that family members can use for responding to any related emergency at home.
In the topic of glycemic control of patients with diabetes, the identified terms consisted of “blood glucose monitor”, “use of continuous glucose monitoring system”, “blood glucose check”, “finger”, “difference”, and “accuracy”, which indicated that patients with diabetes had many questions about the accuracy of and differences in blood glucose levels measured according to the type of blood glucose monitor and the site of blood glucose measurement. Previous studies have reported that blood glucose levels measured by a blood glucose monitor may vary depending on the type of blood glucose monitor used and the method and site of blood glucose measurement [39,40,41]. Such differences in blood glucose levels measured by means of a blood glucose monitor may be due to the performance of the device, but they can also occur due to personal factors (measurement method used and environmental factors) [39]. Patients can regularly visit hospitals for a correction of problems related to using a blood glucose monitor and for blood glucose measurements by healthcare professionals [42,43]. However, because of limitations in education through healthcare professionals during the COVID-19 pandemic, there was high demand for information related to blood glucose measurement.
The terms included in the topic of dietary management of diabetes consisted of “diet”, “salad”, “vegetables”, “eggs”, “side dishes”, “brown rice”, “protein”, “multigrain rice”, “tofu”, “breast meat”, “white rice”, and “soy milk”, with most of the questions being about specific types of food and consumption methods. In addition, the terms belonging to the topic of dietary management identified in the present study were after meal, fasting, before meal, blood glucose check, and highest blood glucose level, which indicated that patients with diabetes were paying close attention to changes in their blood glucose levels after meals. However, patients with diabetes faced difficulties due to a lack of knowledge about dietary management and planning, and, therefore, had a high demand for information and help from experts [44,45]. Patients with diabetes practice dietary control in daily life through relevant education received during hospital visits. However, diabetes education could not be conducted smoothly during the COVID-19 pandemic due to restrictions on hospital visits and because patients avoided hospital visits, which is likely why patients with diabetes attempted to acquire dietary management-related information through online communities. Moreover, changes in blood glucose levels of patients with diabetes are determined not only by the food consumed, but also by various factors, including climate change, insulin secretion, insulin sensitivity, the secretion of gastrointestinal hormones in response to ingested nutrients, stress, and mood changes [46,47]. Because of these reasons and due to inconsistent blood glucose measurements according to the type of food consumed, patients with diabetes experience difficulties in managing their diet [48,49]. Therefore, nurses and other healthcare professionals who provide education to patients with diabetes in clinical practice need to prepare and provide specific information regarding changes in blood glucose levels according to the type of food consumed by patients with diabetes, as well as about factors other than food that may affect changes in blood glucose levels.
For the topic of exercise therapy, the terms identified consisted of “exercise”, “squat”, “bicycle”, “indoor”, “aerobic exercise”, “stepper”, and “treadmill”, indicating a high demand for information regarding indoor exercise. Such findings were different from a study by Ramsingh and Bhuvaneswari [50] that analyzed website posts of patients with diabetes in India, which reported that “yoga” and “cycling” were the major terms related to exercise among patients with diabetes. It is believed that, because the data collection period in the present study was during a period when outdoor activities were restricted by social lockdown and distancing in response to COVID-19 [14], patients with diabetes may have shown a high demand for information regarding indoor exercise to control their blood glucose levels. In particular, in Republic of Korea, individuals experienced a lack of physical activities during the COVID-19 pandemic [15], which could have caused anxiety with respect to self-care among patients with diabetes. Therefore, particularly after the experience with this major pandemic, healthcare professionals should consider the need for providing information regarding indoor exercise to patients with diabetes. In particular, they should provide information about various indoor exercise programs that can be safely applied at home and should introduce such programs through the websites of applicable medical institutions.
The terms included in the topic of pregnancy and pediatric diabetes were mostly “pregnancy”, “childbirth”, “baby”, “breast milk”, and “lactation”. Such findings were slightly different from the keywords “exposure”, “fetus”, “hypoglycemia”, “marker”, medication”, “metabolic disease”, “preeclampsia”, “prevention”, “program”, and “smoking” identified in a text analysis study on gestational diabetes [50]. Unlike the present study, which analyzed texts regarding pregnancy and childbirth posted by individuals who are not healthcare professionals, based on a collection of questions posted in a diabetes-related online community website, the study by Lee and Kim [51] used data from previous studies on gestational diabetes, and as a result, many of the keywords were medical terms. Such findings confirmed that there are differences in gestational-diabetes-related information between patients and experts and that patients with diabetes have a broad range of information needs, including parenting-related information, such as breastfeeding while taking diabetes medication after giving birth. In particular, “children”, “diet”, “impaired glucose tolerance”, “hospital”, and “blood glucose check” appeared as the top terms of this topic, which reflected the fact that guardians who have children with diabetes focused on accepting and managing diabetes through understanding of the effects of diabetes on children and family, as well as the pathogenesis of diabetes. Therefore, sufficient information on this topic should also be provided.
For the diabetes medication management topic, the identified terms reflected the demand for information regarding medication taken by patients with diabetes. Metformin, which was one of the terms identified in the present study, is an oral drug for diabetes. This may have caused an increase in the frequency of questions regarding medication on diabetes-related health information websites. Therefore, health information for patients with diabetes should include information regarding drugs that have recently become social issues and health expert counseling methods related to drugs [52,53]. In particular, the present study focused on the period during the COVID-19 pandemic, and thus, opportunities for face-to-face encounters between patients and healthcare professionals were limited5. Consequently, questions regarding medication on websites operated by non-professionals, accounting for a significant portion of the questions regarding symptom management, were about drug misuse and abuse. Therefore, it is necessary to promote Q&A actively to enable patients to receive accurate information through websites operated by healthcare professionals or medical institutions. Allocating dedicated personnel for such tasks is also necessary.
The terms included in the topic of diabetic complications consisted of “pain”, “toenails”, “toes”, “soles”, “ankles”, “legs”, “big toe”, “orthopedics”, “callosity”, and “wounds”, indicating a high demand for information regarding diabetic complications, particularly diabetes-related foot diseases, among patients with diabetes. In a previous study [3], among diabetic complications, patients with diabetes showed the highest level of interest in ocular complications. This difference may be due to the difference in the methods by which the questions were analyzed. Wang and Zhou [3] extracted the frequency of terms that appeared most often for each topic in health-related websites, whereas the present study used the Ko-sentence-transformers model to extract keywords based on context. Therefore, the study by Wang et al. may have classified the topic terms by focusing simply on the terms, rather than considering the context of these terms, which may have resulted in keywords for patients with diabetes being reflected less commonly in their findings. Therefore, future studies analyzing diabetes-related questions should use various data sources and multiple methodologies for clustering collected questions and then compare the results. Based on our study and that of Wang and Zhou [3], there is a high demand for information regarding diabetic foot diseases and ocular complications among patients with diabetes
In the topic of healthcare utilization and cost, the terms identified consisted of “hospital”, “ophthalmology”, “retina”, “surgery”, “complications”, “diagnosis”, “treatment”, “appointment”, and “doctor”, and thus, it can be surmised that patients with diabetes requested information regarding hospitals and clinics. In particular, information regarding hospitals for ophthalmic care was identified as important, considering the contextual dimension. While direct comparison is difficult due to a lack of existing studies on this topic, it is suspected that patients with diabetes sought information about hospitals and clinics through websites because of restrictions on visits to these institutions due to social distancing implemented during the COVID-19 pandemic [54]. Moreover, considering that other keywords, such as “insurance”, “examination”, “actual expenses”, and “enrollment” were included in the topic of healthcare utilization and cost, it was determined that patients with diabetes have a high demand for information regarding healthcare costs. Patients who have been diagnosed with diabetes have increased healthcare utilization due to regular visits to hospitals/clinics, frequent examinations, and drug purchases [55]. As a result, they may face financial hardship [39] and patients with diabetes with diminished socioeconomic activities due to the COVID-19 pandemic may have shown even higher demand than normal for information regarding costs and support for their own healthcare.
For the noun topics identified in the present study, the questions were classified into categories of pediatric diabetes, gestational diabetes, and diabetes in older individuals, depending on the stage of life to which the question pertained. Such findings were consistent with a study on a needs survey for providing health information conducted on an Internet panel group, which reported that different population groups of respondents showed different preferences for health information [56]. This may be due to the characteristics of people who use online health information. According to a previous study, people who mostly use online health information are people who play the role of caregiver at home, such as married women [57,58]. In particular, patients with pediatric diabetes, gestational diabetes, or diabetes in older individuals represent a group of patients who require assistance and care from guardians, and thus, most of the questions on the website were posted by guardians. Thus, information provided by websites should include contents that can be helpful for not only patients but also guardians, as mentioned in the studies by Dikkers and Dunning [37] and Holtz [38]. Moreover, it is also necessary to plan more systematic education and operate programs in clinical practice not only for the patients with diabetes but also for their guardians. In addition, it is also necessary to prepare plans for providing health information to patients and guardians who may have a poor ability to use online information.
The findings in the present study showed that the top-4 verb categories of the noun topics were “permission”, “method”, “possibility”, and “definition”. Such findings were similar to “definition”, “method”, and “effect” that were identified as verb categories in previous studies that analyzed the types of product-related questions from purchasers of general electronic products [59,60]. The health information sought by patients with diabetes is thought to be reflected by questions seeking expert opinions on glycemic control, medication, and dietary management for self-care. General electronic products are related to diabetes in that purchasers would ask many questions about how to use the product on their own, as a non-expert. However, a prominent characteristic of health-related questions is the desire to acquire information that they currently lack, in order to increase their body of knowledge and to reduce the information gap with others [61]. Although healthcare professionals are making various efforts to provide high-quality health information to patients, patients still feel a deficiency in the information they need during their hospital/clinic visits or treatment process [62]. In particular, the verb category of “method” was extracted the most among the noun topics related to diabetes in older individuals, glycemic control, exercise therapy, pregnancy and pediatric diabetes, the management of diabetic complications, and healthcare utilization and cost. While a direct comparison is difficult due to the limited number of previous studies on the types of diabetes-related questions posted on online community websites, the findings in the present study were similar to the findings in a study by Chiu and Wu [63] on the analysis of 440 cases of health-related questions in Yahoo! Answers, which reported that questions related to specific methods (n = 235, 52%) accounted for the greatest proportion of questions. Among questions regarding health information, the type of questions requiring professional knowledge to which users cannot find the answers by simple searching accounted for a high percentage of questions [60]. Patients with diabetes face difficulties in coping with various situations in daily life in the absence of professional help, but they have a relative lack of specialized knowledge and information. Consequently, they have an increased tendency to make decisions based on the experiences of others. Therefore, health information for patients with diabetes should be more specific and practical to allow them to maintain management of their own health, even during various unexpected situations, such as an epidemic, when it is difficult to visit medical institutions or to perform in-person examination by healthcare professionals. In addition, there is a need to establish a system to provide easy access to such information, while the scope and amount of information provided by medical institutions, societies, and associations also need to improve.
In addition, the verb category for confirming “permission” was mostly found in the topics of COVID-19 vaccine and complications, dietary management, and medication management. This type of question seeks to obtain permission as an assurance or consent for their own actions under situations that require medical judgement. Exchange of such information by non-healthcare professionals could lead to drug misuse and abuse [53]. Online information users with a strong tendency to resolve health problems on their own have a high risk of self-assurance, such as making a diagnosis and seeking unnecessary prescription on their own, based on online health information searches, as well as not adhering to prescribed medication, with no communication with healthcare professionals [60]. In particular, during situations in which resolving problems through consultation with a healthcare professional is difficult due to the COVID-19 pandemic [64], the tendency for self-determination regarding health problems could be unintentionally further reinforced. However, online health information is generated and distributed by both healthcare professionals and non-healthcare professionals, and thus, a problem with the reliability of information exists [65,66]. Accordingly, it is necessary to strike a balance between self-determination of health information users, further reinforced by COVID-19, and disease management through healthcare professionals. An educational system could also be contemplated based on the present study. In particular, healthcare professionals need to shift from the conventional paradigm of thinking of themselves as a provider of treatment and information to patients with diabetes to a concept of being partners in the patients’ disease process [67]. Moreover, healthcare professionals need to pay attention to information generated by patients with diabetes through community websites and should provide sufficient information to enable patients with diabetes to make their own decision regarding problems that do not require medical judgement. They also need to design an effective information architecture to support user access to various types of information. Furthermore, a decision-making protocol that can help identify problems that require medical judgement, among various problems faced by patients with diabetes during the course of their disease, is also needed. Through such methods, the ability to cope with diabetes and the self-efficacy of health information users could be enhanced [68]. The unmet health information need of patients with diabetes can be satisfied by providing information and stable counseling via healthcare professionals.

5. Limitation

The present study analyzed posts from a single website, and thus, the study was limited in identifying questions according to the characteristics of websites or medical institutions. Therefore, future studies using a text analysis of diabetes-related questions extracted from posts in various types of websites are needed. In the case of topic analysis, since the analysis data and algorithms used by each researcher are diverse, we suggest a future study to consistently compare the meaningful results between the results of this study and previous studies.

6. Conclusions

The present study collected and analyzed diabetes-related questions posted on a website to identify the health information needs of patients with diabetes during the COVID-19 pandemic for the purpose of providing basic data for self-care by patients with diabetes. The present study was significant in that it analyzed diabetes-related health information requested by online community users to derive the topics for diabetes-related health information and provided basic data regarding health information needs of patients with diabetes. The present study was also significant in that it used topics related to diabetes-related health information derived in the study to provide basic data for developing self-care education programs for patients with diabetes. However, in order for the model developed in this study to have more sophistication and generalization, it should be fine-tuned and tested. In future studies, it is also expected to develop diabetes-related educational contents based on the results of this study.

Author Contributions

Conceptualization, J.H. and S.M.; methodology, J.H. and S.M.; software, J.H. and S.M.; validation, J.H. and S.M.; formal analysis, J.H. and S.M.; investigation, J.H. and S.M.; writing—review and editing, J.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of Kyunghee University.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kim, J.; You, M.; Shon, C. Impact of the COVID-19 pandemic on unmet healthcare needs in Seoul, South Korea: A cross-sectional study. BMJ Open 2021, 11, e045845. [Google Scholar] [CrossRef] [PubMed]
  2. World Health organization. Coronavirus Disease 2019 (COVID-19): Situation Report, 51; World Health Organization: Geneva, Switzerland, 2020; pp. 1–9. Available online: https://apps.who.int/iris/handle/10665/331475 (accessed on 21 September 2022).
  3. Wang, T.-H.; Zhou, X.-F.; Ni, Y.; Pan, Z.-G. Health information needs regarding diabetes mellitus in China: An internet-based analysis. BMC Public Health 2020, 20, 990. [Google Scholar] [CrossRef] [PubMed]
  4. Heo, J.Y. Clinical and Epidemiological Characteristics of Coronavirus Disease 2019 in the Early Stage of Outbreak. Korean J. Med. 2020, 95, 67–73. [Google Scholar] [CrossRef] [Green Version]
  5. Yoo, B. Social Determinants of Health in Diabetes Patient Care. J. Korean Diabetes 2022, 23, 138–143. (In Korean) [Google Scholar] [CrossRef]
  6. Rabbone, I.; Schiaffini, R.; Cherubini, V.; Maffeis, C.; Scaramuzza, A.; Bertelli, E.; Ferlito, L.; Bobbio, A.; Schieven, E.; Delvecchio, M.; et al. Has COVID-19 Delayed the Diagnosis and Worsened the Presentation of Type 1 Diabetes in Children? Diabetes Care 2020, 43, 2870–2872. [Google Scholar] [CrossRef]
  7. Park, E.J.; Song, E.S.; Choi, S.K. Medical Service Use and Medication Use during COVID-19 Pandemic: Focused on Unmet Need; Korea Institute for Health and Socail Affairs: Sejong-si, Republic of Korea, 2021. (In Korean) [Google Scholar]
  8. Gupta, R.; Ghosh, A.; Singh, A.K.; Misra, A. Clinical considerations for patients with diabetes in times of COVID-19 epidemic. Diabetes Metab. Syndr. Clin. Res. Rev. 2020, 14, 211–212. [Google Scholar] [CrossRef]
  9. Huang, I.; Lim, M.A.; Pranata, R. Diabetes mellitus is associated with increased mortality and severity of disease in COVID-19 pneumonia—A systematic review, meta-analysis, and meta-regression. Diabetes Metab. Syndr. Clin. Res. Rev. 2020, 14, 395–403. [Google Scholar] [CrossRef]
  10. McCleary-Jones, V. Health literacy and its association with diabetes knowledge, self-efficacy and disease self-management among African Americans with diabetes mellitus. ABNF J. Off. J. Assoc. Black Nurs. Fac. High. Educ. Inc 2011, 22, 25–32. [Google Scholar]
  11. Longo, D.R.; Schubert, S.L.; Wright, B.A.; LeMaster, J.; Williams, C.D.; Clore, J.N. Health Information Seeking, Receipt, and Use in Diabetes Self-Management. Ann. Fam. Med. 2010, 8, 334–340. [Google Scholar] [CrossRef] [Green Version]
  12. Choi, K.I.; Jung, D.Y. The relationships between knowledge of diabetes, barrier and compliance with sick role behavior of diabetic patients. Health Nurs. 2010, 22, 13–26. [Google Scholar]
  13. Sandelowsky, H.; Krakau, I.; Modin, S.; Ställberg, B.; Nager, A. COPD patients need more information about self-management: A cross-sectional study in Swedish primary care. Scand. J. Prim. Health Care 2019, 37, 459–467. [Google Scholar] [CrossRef] [Green Version]
  14. Ruiz-Roso, M.B.; Knott-Torcal, C.; Matilla-Escalante, D.C.; Garcimartín, A.; Sampedro-Nuñez, M.A.; Dávalos, A.; Marazuela, M. COVID-19 Lockdown and Changes of the Dietary Pattern and Physical Activity Habits in a Cohort of Patients with Type 2 Diabetes Mellitus. Nutrients 2020, 12, 2327. [Google Scholar] [CrossRef]
  15. Park, E.J. Disease management and unmet medical conditions of hypertensive and diabetic patients during the COVID-19 pandem. Health Welf. ISSUE FOCUS 2022, 422, 1–8. (In Korean) [Google Scholar] [CrossRef]
  16. Karatas, S.; Yesim, T.; Beysel, S. Impact of lockdown COVID-19 on metabolic control in type 2 diabetes mellitus and healthy people. Prim. Care Diabetes 2021, 15, 424–427. [Google Scholar] [CrossRef]
  17. Önmez, A.; Gamsızkan, Z.; Özdemir, Ş.; Kesikbaş, E.; Gökosmanoğlu, F.; Torun, S.; Cinemre, H. The effect of COVID-19 lockdown on glycemic control in patients with type 2 diabetes mellitus in Turkey. Diabetes Metab. Syndr. Clin. Res. Rev. 2020, 14, 1963–1966. [Google Scholar] [CrossRef]
  18. Zheng, M.; Wang, X.; Guo, H.; Fan, Y.; Song, Z.; Lu, Z.; Wang, J.; Zheng, C.; Dong, L.; Ma, Y.; et al. The Cytokine Profiles and Immune Response Are Increased in COVID-19 Patients with Type 2 Diabetes Mellitus. J. Diabetes Res. 2021, 2021, 1–8. [Google Scholar] [CrossRef]
  19. Rajpal, A.; Rahimi, L.; Ismail-Beigi, F. Factors leading to high morbidity and mortality of COVID-19 in patients with type 2 diabetes. J. Diabetes 2020, 12, 895–908. [Google Scholar] [CrossRef]
  20. Khademi, F.; Moayedi, S.; Golitaleb, M.; Karbalaie, N. The COVID-19 pandemic and death anxiety in the elderly. Int. J. Ment. Health Nurs. 2020, 30, 346–349. [Google Scholar] [CrossRef]
  21. Song, E.S.; Park, E.J.; Choi, S.K. Unmet needs among patients with hypertension or diabetes during the COVID-19 pandemic by household income. Health Soc. Welf. Rev. 2022, 42, 246–259. [Google Scholar]
  22. Zhou, X.F.; Ni, Y.; Xie, G.; Zhu, W.; Chen, C.; Wang, T.H.; Pan, Z.G. Analysis of the Health Information Needs of Diabetics in China; IOS Press: Amsterdam, The Netherland, 2019; pp. 487–491. [Google Scholar]
  23. Inzucchi, S.E.; Bergenstal, R.M.; Buse, J.B.; Diamant, M.; Ferrannini, E.; Nauck, M.; Peters, A.L.; Tsapas, A.; Wender, R.; Matthews, D.R. Management of Hyperglycemia in Type 2 Diabetes, 2015: A Patient-Centered Approach: Update to a Position Statement of the American Diabetes Association and the European Association for the Study of Diabetes. Diabetes Care 2014, 38, 140–149. [Google Scholar] [CrossRef] [Green Version]
  24. Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar] [CrossRef]
  25. Ko, Y.S.; Lee, S.B.; Cha, M.J.; Kim, S.D.; Lee, J.H.; Han, J.Y.; Song, M. Topic modeling insomnia social media corpus using BERTopic and building automatic deep learning classification model. J. Korean Soc. Inf. Manag. 2022, 39, 111–129. (In Korean) [Google Scholar] [CrossRef]
  26. Thompson, L.; Mimno, D. Topic modeling with contextualized word representation clusters. arXiv 2020, arXiv:2010.12626. [Google Scholar] [CrossRef]
  27. McInnes, L.; Healy, J.; Melville, J. Umap: Uniform manifold approximation and projection for dimension reduction. arXiv 2018, arXiv:1802.03426. [Google Scholar] [CrossRef]
  28. Stewart, G.; Al-Khassaweneh, M. An Implementation of the HDBSCAN* Clustering Algorithm. Appl. Sci. 2022, 12, 2405. [Google Scholar] [CrossRef]
  29. Grootendorst, M. BERTopic: Neural topic modeling with a class-based TF-IDF procedure. arXiv 2022, arXiv:2203.05794. [Google Scholar] [CrossRef]
  30. Kang, J.; Kim, S.; Roh, S. A Topic Modeling Analysis for Online News Article Comments on Nurses' Workplace Bullying. J. Korean Acad. Nurs. 2019, 49, 736–747. [Google Scholar] [CrossRef]
  31. Park, Y.H. About meaning of Korean interrogativity. Acad. Korean Lang. Educ. 1990, 46, 95–104. (In Korean) [Google Scholar]
  32. Xusheng, G.; Likeng, L.; Yuanxia, L.; Heng, W.; Tianyong, H. The construction of a diabetes-oriented frequently asked question corpus for automated question-answering services. In Proceedings of the 2020 Conference on Artificial Intelligence and Healthcare, Taiyuan, China, 23–25 October 2020; pp. 60–66. [Google Scholar]
  33. American Diabetes Association. Frequently Asked Questions: COVID-19 and Diabetes. Available online: https://diabetes.org/coronavirus-covid-19/how-coronavirus-impacts-people-with-diabetes (accessed on 6 September 2022).
  34. Johns Hopkins. FAQs about Diabetes + COVID-19. Available online: https://hopkinsdiabetesinfo.org/faqs-about-diabetes-covid-19/ (accessed on 6 September 2022).
  35. Khademian, F.; Delavari, S.; Koohjani, Z.; Khademian, Z. An investigation of depression, anxiety, and stress and its relating factors during COVID-19 pandemic in Iran. BMC Public Health 2021, 21, 275. [Google Scholar] [CrossRef]
  36. Sjöström, A.; Hajdarevic, S.; Hörnsten, Å.; Öberg, U.; Isaksson, U. Experiences of Online COVID-19 Information Acquisition among Persons with Type 2 Diabetes and Varying eHealth Literacy. Int. J. Environ. Res. Public Health 2021, 18, 13240. [Google Scholar] [CrossRef]
  37. Dikkers, M.F.; Dunning, T.; Savage, S. Information Needs of Family Carers of People with Diabetes at the End of Life: A Literature Review. J. Palliat. Med. 2013, 16, 1617–1623. [Google Scholar] [CrossRef] [Green Version]
  38. Holtz, B.E. Evaluating the most popular diabetes websites in the USA: A content analysis. Health Promot. Int. 2020, 35, 1394–1405. [Google Scholar] [CrossRef]
  39. Na, Y. Self-monitoring Glucometers: How Accurate Are They? J. Korean Diabetes 2013, 14, 143. (In Korean) [Google Scholar] [CrossRef]
  40. Lee, S.Y.; Kim, J.W.; Lee, N.Y. Evaluation of 6 glucose testing systems. Ann. Lab. Med. 2003, 23, 170–179. (In Korean) [Google Scholar]
  41. Lee, Y.-H.; Kim, J.-H.; Kim, J.-K.; Min, K.-P.; Jung, E.-Y.; Park, D.-K. Smart Phone based Personalized Menu Management System for Diabetes Patient. J. Korea Contents Assoc. 2010, 10, 1–9. [Google Scholar] [CrossRef] [Green Version]
  42. Lee, K.A.; Jeon, S.; Kim, W.J.; Jin, H.Y.; Park, J.H.; Baek, H.S.; Park, T.S. The Education Effect of Glucometer Use on the Glucose Levels and the Glucose Value Comparison among Diverse Glucometers. J. Korean Diabetes 2011, 12, 113–121. (In Korean) [Google Scholar] [CrossRef] [Green Version]
  43. Lee, J.Y. Information desk: Social Welfare Information for Patients with Diabetes Mellitus. J. Korean Diabetes 2016, 17, 117. (In Korean) [Google Scholar] [CrossRef] [Green Version]
  44. Kang, E.H. Medical nutrition therapy in diabetes. Korea Food Res. Inst. 2011, 4, 1–5. (In Korean) [Google Scholar]
  45. Nagelkerk, J.; Reick, K.; Meengs, L. Perceived barriers and effective strategies to diabetes self-management. J. Adv. Nurs. 2006, 54, 151–158. [Google Scholar] [CrossRef]
  46. Oh, T.J. Postprandial Hyperglycemia. J. Korean Diabetes 2016, 17, 233–237. (In Korean) [Google Scholar] [CrossRef]
  47. Woerle, H.J.; Szoke, E.; Meyer, C.; Dostou, J.M.; Wittlin, S.D.; Gosmanov, N.R.; Welle, S.L.; Gerich, J.E. Mechanisms for abnormal postprandial glucose metabolism in type 2 diabetes. Am. J. Physiol. Metab. 2006, 290, E67–E77. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Lee, H.S.; Joo, J.; Choue, R. Study on Diet-related Quality of Life in Online Self-help Diabetes Mellitus Patients Who Practice Dietary Regimen. Korean J. Community Nutr. 2011, 16, 136–144. [Google Scholar] [CrossRef] [Green Version]
  49. Delamater, A.M.; Patiño-Fernández, A.M.; E Smith, K.; Bubb, J. Measurement of diabetes stress in older children and adolescents with type 1 diabetes mellitus. Pediatr. Diabetes 2013, 14, 50–56. [Google Scholar] [CrossRef] [PubMed]
  50. Ramsingh, J.; Bhuvaneswari, V. An efficient Map Reduce-Based Hybrid NBC-TFIDF algorithm to mine the public sentiment on diabetes mellitus—A big data approach. J. King Saud Univ. Comput. Inf. Sci. 2018, 33, 1018–1029. [Google Scholar] [CrossRef]
  51. Lee, J.; Kim, Y.; Kwak, E.; Park, S. A study on research trends for gestational diabetes mellitus and breastfeeding: Focusing on text network analysis and topic modeling. J. Korean Acad. Soc. Nurs. Educ. 2021, 27, 175–185. [Google Scholar] [CrossRef]
  52. Ministry of Food and Drug Safety. Temporary Ban Imposed on Manufacture and Sale of 31 Metformin Diabetes Drugs. Available online: https://www.mfds.go.kr/eng/brd/m_61/view.do?seq=24&srchFr=&srchTo=&srchWord=&srchTp=&itm_seq_1=0&itm_seq_2=0&multi_itm_seq=0&company_cd=&company_nm=&page=10 (accessed on 27 September 2022).
  53. Yun, J. Necessity of Providing More Detailed and Specific Information on Drugs Prohibited by the Ministry of Food and Drug Safety. Korean J. Health Promot. 2020, 20, 211–213. (In Korean) [Google Scholar] [CrossRef]
  54. Park, J.H. Diabetes Mellitus and COVID-19. J. Korean Diabetes 2020, 21, 116–119. (In Korean) [Google Scholar] [CrossRef]
  55. Hur, J.W.; Kwon, Y.D. Factors affecting the purchase of indemnity private health insurance and related factors of outpatient healthcare utilization among chronic disease patients. Korean J. Hosp. Manag. 2019, 24, 1–10. (In Korean) [Google Scholar]
  56. Jo, H.S. Development of Health Information and Renewal of Korean National Health Information Portal; Korea Center for Disease Control and Prevention: Osong-si, Republic of Korea, 2018. (In Korean) [Google Scholar]
  57. Calabretta, N. Consumer-driven, patient-centered health care in the age of electronic information. J. Med. Libr. Assoc. 2002, 90, 32–37. [Google Scholar]
  58. Jia, X.; Pang, Y.; Liu, L.S. Online Health Information Seeking Behavior: A Systematic Review. Healthcare 2021, 9, 1740. [Google Scholar] [CrossRef]
  59. Kim, J.A.; Kwon, S.J.; Kang, S.W.; Seo, J.Y. The question analysis method for information retrieval based question-answering systems in biomedical domain. In Proceedings of the Korean Information Science Society Conference, 2016; pp. 783–785. (In Korean). [Google Scholar]
  60. Heo, J.; Ryu, P.M.; Jang, M.G.; Kim, H.K. Search space reduction and answer type classification for open domain Q&A. J. Comput. Sci. Eng. 2012, 39, 118–132. (In Korean) [Google Scholar]
  61. Zhang, Y. Contextualizing consumer health information searching: An analysis of questions in a social Q&A community. In Proceedings of the 1st ACM International Health Informatics Symposium, Arlington, VA, USA, 11–12 November 2010; pp. 210–219. [Google Scholar]
  62. Buckland, S. Unmet needs for health information: A literature review. Health Libr. Rev. 1994, 11, 82–95. [Google Scholar] [CrossRef]
  63. Chiu, M.-H.P.; Wu, C.-C. Integrated ACE model for consumer health information needs: A content analysis of questions in Yahoo! Answers. Proc. Am. Soc. Inf. Sci. Technol. 2012, 49, 1–10. [Google Scholar] [CrossRef]
  64. Hur, K.Y.; Moon, M.K.; Park, J.S.; Kim, S.-K.; Lee, S.-H.; Yun, J.-S.; Baek, J.H.; Noh, J.; Lee, B.-W.; Oh, T.J.; et al. 2021 Clinical Practice Guidelines for Diabetes Mellitus of the Korean Diabetes Association. Diabetes Metab. J. 2021, 45, 461–481. [Google Scholar] [CrossRef]
  65. Daraz, L.; Morrow, A.S.; Ponce, O.J.; Beuschel, B.; Farah, M.H.; Katabi, A.; Alsawas, M.; Majzoub, A.M.; Benkhadra, R.; Seisa, M.O.; et al. Can Patients Trust Online Health Information? A Meta-narrative Systematic Review Addressing the Quality of Health Information on the Internet. J. Gen. Intern. Med. 2019, 34, 1884–1891. [Google Scholar] [CrossRef]
  66. Song, H.; Omori, K.; Kim, J.; E Tenzek, K.; Hawkins, J.M.; Lin, W.-Y.; Kim, Y.-C.; Jung, J.-Y. Trusting Social Media as a Source of Health Information: Online Surveys Comparing the United States, Korea, and Hong Kong. J. Med. Internet Res. 2016, 18, e25. [Google Scholar] [CrossRef]
  67. Tamhane, S.; Rodriguez-Gutierrez, R.; Hargraves, I.; Montori, V.M. Shared Decision-Making in Diabetes Care. Curr. Diabetes Rep. 2015, 15, 1–10. [Google Scholar] [CrossRef]
  68. Morahan-Martin, J.M. How Internet Users Find, Evaluate, and Use Online Health Information: A Cross-Cultural Review. CyberPsychology Behav. 2004, 7, 497–510. [Google Scholar] [CrossRef]
Figure 1. Architecture of KoBERT.
Figure 1. Architecture of KoBERT.
Healthcare 11 01871 g001
Figure 2. KoBERT topic modeling algorism.
Figure 2. KoBERT topic modeling algorism.
Healthcare 11 01871 g002
Figure 3. Scatter plot of document clustering.
Figure 3. Scatter plot of document clustering.
Healthcare 11 01871 g003
Table 1. KoBERT-based Solution.
Table 1. KoBERT-based Solution.
AttributeValue
Features768
Hidden size3072
Number of layers12
Max_length512
Number of classes2
Dropout0.1
Table 2. Top 20 terms by noun topic.
Table 2. Top 20 terms by noun topic.
RankingCOVID-19 Vaccine and ComplicationsTF-IDFDiabetes in Older IndividualsTF-IDFGlycemic ControlTF-IDFDietary ManagementTF-IDFExercise TherapyTF-IDFPregnancy and Pediatric DiabetesTF-IDFMedication ManagementTF-IDFManagement of Diabetic ComplicationsTF-IDFHealthcare Utilization and CostTF-IDF
1COVID-19 vaccine0.58mother0.09blood glucose monitor0.17after meal0.04exercise0.16pregnancy0.14oral hypoglycemic agents0.09pain0.09insurance0.05
2vaccination0.20father0.07use0.07diet0.03squat0.08childbirth0.09dosage0.06toenails0.09examination0.05
3reservation0.11hypoglycemia0.03blood glucose test strips0.06fasting0.03knee0.07children0.07prescription0.04toes0.08hospital0.05
4injection0.05insulin0.03buy0.06snack0.02bicycle0.05babies0.04side effects0.03neuropathy0.07ophthalmology0.03
5patient0.05management0.02continuous glucose meter0.05salad0.02muscular strength0.05management0.03HbA1c0.03sole of the foot0.06retina0.03
6prevention0.04hospital0.02purchase0.04blood glucose check0.02fitness center0.05examination0.03nutritional supplements0.03sensation0.06clinic0.03
7side effects0.04diet0.02consume0.04before meal0.02inside0.04normal0.03blood pressure0.02leg0.05actual expense0.03
8flu0.04prescription0.02blood glucose check0.04carbohydrates0.02after meal0.04fasting0.03hyperlipidemia0.02peripheral0.05surgery0.03
9thrombosis0.03doctor0.02difference0.04high blood sugar0.02aerobic exercise0.04HbA1c0.03vitamin0.02injury0.05complications0.03
10hypertension0.03surgery0.02prescription0.04vegetables0.02effect0.03insulin0.02doctor0.02treatment0.04enrollment0.03
11hyperlipidemia0.02take0.02material0.03eggs0.02stairs0.03lactation0.02stopping the pill0.02inflammation0.03HbA1c0.02
12pain0.02blood pressure0.02recommendation0.03side dishes0.02stepper0.03diet0.02effect0.02part0.03treatment0.02
13children0.02shock0.02pharmacy0.03brown rice0.01treadmill0.03breast milk0.02after meal0.02dermatology0.03reservation0.02
14take0.02complications0.01lancet0.03protein0.01diet0.03impaired glucose tolerance0.02diet0.02circulation0.03actual expense0.02
15antibody0.02treatment0.01finger0.03multigrain rice0.01thigh0.03hospital0.02management0.02complications0.03doctor0.02
16death0.02diagnosis0.01code0.02tofu0.01blood glucose check0.02blood glucose check0.02fasting0.01ankle0.03diagnosis0.02
17disease0.02fasting0.01price0.02chicken breast0.01deadweight0.02after meal0.02weight0.01hands and feet0.03general0.02
18news0.02health0.01accuracy0.02polished rice0.01posture0.02ob-gyn0.02hypoglycemia0.01big toe0.03laser0.01
19muscle pain0.01age0.01finger prick0.02soybean milk0.01waist0.02exercise0.01before meal0.01orthopedics0.03special0.01
20danger0.01weight0.01support0.02bowl0.01too much0.02diabetes0.01lactobacillus0.01corn0.02nature of a disease0.01
TF-IDF = frequency–inverse document frequency.
Table 3. Examples of questions and frequency of noun topics.
Table 3. Examples of questions and frequency of noun topics.
TopicsExamples of Questionsn (%)
COVID-19 vaccine and complicationsCan I get vaccinated?
After getting the COVID-19 vaccine, can I still take insulin as usual?
I have been diagnosed with COVID-19. Do I need to take medication?
Do blood glucose levels change after vaccination?
308(3.6)
Diabetes in older individualsMy mother has diabetes. Is herbal medicine prohibited?
My mother suffered hypoglycemic shock. Please advise.
Can you take a look at the fasting blood glucose level of my mother who is in her 80s?
Please recommend nutritional supplements for my mother who is in her 60s.
270(3.2)
Glycemic controlWhat is a good blood glucose monitor?
Can I get diabetes consumables even on Saturdays?
Is there a relationship between blood glucose monitoring and temperature?
Please tell me where I can buy test papers near Sokcho City Hall.
803(9.5)
Dietary managementCan I eat rice (brown rice, multigrain rice)?
Can I continue to drink bitter melon and pig potato decoction?
What type of noodles should I eat?
Is there any delivery food that I can eat?
2243(26.5)
Exercise therapyHow should I manage my blood glucose level without losing weight?
When my fasting blood glucose level is high, can I exercise on an empty stomach?
Is ankle pumping exercise bad for peripheral neuropathy?
Squats and walking after dinner should be the main exercise, right?
458(5.4)
Pregnancy and pediatric diabetesIs there anyone who had diabetes before pregnancy and stopped breastfeeding after childbirth?
How did you stop breastfeeding?
How are you controlling the blood glucose levels of your child?
If postprandial blood glucose levels are high at week 16 of pregnancy, should I get tested for gestational diabetes right away?
1140(13.5)
Medication managementCan I take anti-inflammatory painkillers if I have a headache?
Can I take diabetes medicine and nutritional supplement together?
Which drug should I take if I want to reduce side effects?
Is constipation a side effect of Diabex XR extended release tablet?
1327(15.7)
Management of diabetic complicationsCan a toe bruise lead to necrosis?
Can overwork cause diabetic complications?
Are myalgia, fever, and diarrhea symptoms of acute diabetes?
Could deterioration of peripheral nerves be due to blood glucose level?
996(11.8)
Healthcare utilization and costIs there a hospital in Bundang that is recommended for kidney tests?
Are there any dental clinics good at treating patients with diabetes?
Is there any private health insurance that patients with diabetes can enroll in?
Which department (hospital) should I go to for impaired fasting glucose test?
925(10.9)
Table 4. Examples of questions related to major verb categories by noun topic.
Table 4. Examples of questions related to major verb categories by noun topic.
TopicsMajor Verb CategoriesExamples of Questions
COVID-19 vaccine and complications(Decision questions) permissionCan I take diabetes medicine after being diagnosed with COVID-19?
(Question word questions) methodWhat should I do about my blood glucose level that went up after being diagnosed with COVID-19?
(Decision questions) possibilityIs it possible for my blood pressure to go up after vaccination?
Diabetes in older individuals(Question word questions) methodHow should I control my mother’s blood glucose level?
(Decision questions) possibilityIs it possible that my father can have an elevated glycated hemoglobin level because of gaining weight?
(Question word questions) definitionMy mother has hypertension. What are the symptoms of hypertension?
Glycemic control(Question word questions) methodHow should I dispose of a lancet?
(Decision questions) possibilityIf I am using a Libre, is there bleeding when exchanging the Libre?
(Decision questions) permissionCan I use it past the expiration date?
Dietary
management
(Decision questions) permissionCan I drink squeezed juice?
(Question word questions) methodWhat proportion of rice should I eat?
(Decision questions) possibilityCan blood glucose levels go up after eating sugar-free snacks?
Exercise therapy(Question word questions) methodMy blood glucose level keeps going up. What exercise should I do?
(Decision questions) permissionCan I do indoor cycling exercise? There are many who say walking exercise is the best.
(Decision questions) possibilityAfter anaerobic (muscle strengthening) exercise, can I experience sharp pain in that area?
Pregnancy and
pediatric diabetes
(Question word questions) methodMy child has high blood glucose levels. How should I manage the blood glucose levels? How should I prepare for pregnancy after being diagnosed with diabetes?
(Decision questions) permissionCan I breastfeed after giving birth even if I have diabetes?
(Decision questions) possibilityI am pregnant. Is there a possibility that diabetes can be passed down to my child?
Medication management (Decision questions) permissionCan I take complete nutritional supplements?
(Question-word questions) methodHow should I take metformin and statins?
(Decision questions) possibilityDoes Champix raise blood glucose levels? I am taking Champix to quit smoking, but my blood glucose levels went up a lot.
Management of diabetic complications (Question word questions) methodWhat are some methods for coping with hypoglycemic shock?
(Decision questions) permissionDuring a neurological examination due to foot numbness, can I tell them that I have diabetes before the examination?
(Decision questions) possibilityIf my blood glucose level is managed well, will I see improvement in cold hands and feet?
Healthcare utilization and cost (Question word questions) methodHow does 00 hospital manage patients with diabetes?
(Decision questions) possibilityCan medication expenses be claimed from private insurance?
(Decision questions) permissionCan I be treated by an endocrinologist?
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Min, S.; Han, J. Topic Modeling Analysis of Diabetes-Related Health Information during the Coronavirus Disease Pandemic. Healthcare 2023, 11, 1871. https://doi.org/10.3390/healthcare11131871

AMA Style

Min S, Han J. Topic Modeling Analysis of Diabetes-Related Health Information during the Coronavirus Disease Pandemic. Healthcare. 2023; 11(13):1871. https://doi.org/10.3390/healthcare11131871

Chicago/Turabian Style

Min, Soyoon, and Jeongwon Han. 2023. "Topic Modeling Analysis of Diabetes-Related Health Information during the Coronavirus Disease Pandemic" Healthcare 11, no. 13: 1871. https://doi.org/10.3390/healthcare11131871

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop