Next Article in Journal
Roles of Social Protection to Promote Health Service Coverage among Vulnerable People toward Achieving Universal Health Coverage: A Literature Review of International Organizations
Previous Article in Journal
Impact of COVID-19 Pandemic on Daily Mobility of the Elderly Living in Small Cities in Lodz Province
Previous Article in Special Issue
Analysis of Caregiver Burden Expressed in Social Media Discussions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Exploring Future Signals of COVID-19 and Response to Information Diffusion Using Social Media Big Data

1
Criminal Justice, Pennsylvania State University, Schuylkill, PA 17972, USA
2
Department of Public Health, Graduate School of Korea University & Transdisciplinary Major in Learning Health Systems, Korea University, Seoul 02841, Republic of Korea
3
School of Industry and Environment, Gachon University, Seoul 13120, Republic of Korea
4
CEO for HealthMax Co., Ltd., Seoul 06078, Republic of Korea
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2023, 20(9), 5753; https://doi.org/10.3390/ijerph20095753
Submission received: 14 March 2023 / Revised: 9 April 2023 / Accepted: 18 April 2023 / Published: 8 May 2023
(This article belongs to the Special Issue Big Data in Public Health: Challenges and Opportunities)

Abstract

:
COVID-19 is a respiratory infectious disease that first reported in Wuhan, China, in December 2019. With COVID-19 spreading to patients worldwide, the WHO declared it a pandemic on 11 March 2020. This study collected 1,746,347 tweets from the Korean-language version of Twitter between February and May 2020 to explore future signals of COVID-19 and present response strategies for information diffusion. To explore future signals, we analyzed the term frequency and document frequency of key factors occurring in the tweets, analyzing the degree of visibility and degree of diffusion. Depression, digestive symptoms, inspection, diagnosis kits, and stay home obesity had high frequencies. The increase in the degree of visibility was higher than the median value, indicating that the signal became stronger with time. The degree of visibility of the mean word frequency was high for disinfectant, healthcare, and mask. However, the increase in the degree of visibility was lower than the median value, indicating that the signal grew weaker with time. Infodemic had a higher degree of diffusion mean word frequency. However, the mean degree of diffusion increase rate was lower than the median value, indicating that the signal grew weaker over time. As the general flow of signal progression is latent signal → weak signal → strong signal → strong signal with lower increase rate, it is necessary to obtain active response strategies for stay home, inspection, obesity, digestive symptoms, online shopping, and asymptomatic.

1. Introduction

The outbreak of the novel coronavirus disease (COVID-19), which causes various respiratory symptoms, became a global issue in January 2020 [1]. Although COVID-19 infections began in Wuhan, China, in December 2019 [2,3], Korea saw its first confirmed patient on 20 January 2020, and group infections emerged due to exposure as confirmed patients moved about. The WHO reported that the majority of people infected with COVID-19 experience mild to moderate respiratory symptoms, and the elderly and people suffering from cardiovascular conditions, diabetes, and chronic respiratory diseases are more likely to develop serious illness [4]. Common COVID-19 symptoms include fever, dry cough, and fatigue, and on rarer occasions, people may experience body aches, sore throat, diarrhea, conjunctivitis, headache, loss of taste of smell, skin rash, discoloration of fingers or toes, and muscle pain. More serious symptoms include difficulty breathing, chest pain and tightness, and speech or movement disorders [3,4,5,6,7]. Key comorbid symptoms of COVID-19 patients include high blood pressure, diabetes, shortness of breath, and vomiting [8,9,10].
It is currently unclear whether complications from the COVID-19 pandemic affect the risk and severity of infection with COVID-19. In Korea, a retrospective controlled study was conducted on 219,961 people aged 18 years or older who were charged for medical expenses for COVID-19 testing by 15 May 2020. The proportion was high at 59.5% (4027 people), and in terms of COVID severity, it was found that older men had a higher proportion than the non-severe group [8].
In [9], Danish patients diagnosed with COVID-19 after the outbreak of COVID-19 until 3 May 2020 were identified to calculate the exposure of interest to comorbidities 10 years before the outbreak of COVID-19. A total of 4480 patients were diagnosed with COVID-19, of which 65% had a Charlson Comorbidity Index Score (CCIS) of 0, 24.8% CCIS 1–2, 6.4% CCIS 3–4, and 3.8% had 4 or more. Overall, 17.8% had serious severe outcomes, of which 9.3% reported death. In the CCIS 0 group, only 9.1% had a serious outcome and 77 died. In the CCIS 1–2 group, 29.7% had a serious outcome and 17.7% died, in the CCIS 3–4 group, 42.7% had a serious outcome and 29.5% died, and in the CCIS 4 or more group, 473 % had serious consequences and 34.3% died [9].
Ref. [10] evaluated the risk of developing serious side effects by stratifying the co-morbidity status of patients with COVID-19. A total of 1590 inpatients were analyzed. Their average female was 48.9 years old, and the number of females was 42.7%, which was 686, and 16% of the population was severe. In addition, 399 patients (25.1%) were found to have at least one comorbidity, with hypertension accounting for 16.9% and diabetes accounting for 8.2% as the most common comorbidity. Patients with one or more comorbidities had a mean age of 44.8 years and were more likely to have shortness of breath, nausea and vomiting [10].
The most significant threats of the spread of the pandemic are potentially high mortality rates and high economic costs [11]. While humankind has seen various infectious diseases such as dengue fever, malaria, influenza, and HIV/AIDS, it is difficult to build appropriate epidemiological models for these diseases, and some scientists consider the spread of the pandemic a complex network of prediction and modeling [12].
As COVID-19 spreads worldwide, it is important to understand epidemiological models of its diffusion, mortality, and recovery [11,13].
In response to the COVID-19 pandemic, most countries adopted social distancing, and the “social distancing” policy was used to limit the spread of COVID-19 in the long term. It is a policy that aims to prevent infection and contain its spread while maintaining daily life, economic, and social activities.
Governments worldwide are continually working to treat infected patients, educate people about social distancing, test potential treatments and vaccines, and reduce the spread of infections. Although governments and organizations face economic and social threats to safely and gradually reopen the society, there is a lack of scientific evidence about how to do this [14]. Since Korea experienced the Middle East Respiratory Syndrome (MERS) in 2015, the Korea Centers for Disease Control and Prevention and many hospitals in Korea have been preparing for an infectious outbreak. When an explosive outbreak occurred in Daegu and Gyeongsangbuk-do in February 2020, Korea successfully conducted group examinations to identify mild or asymptomatic patients to prevent the spread of COVID-19. It also designed and implemented drive-thru testing centers to stop and overcome the spread of COVID-19 [15]. Furthermore, it chose social distancing, a type of nonpharmaceutical intervention used to slow the spread of respiratory viruses [16,17,18], as a basic measure to protect against COVID-19, preventing further spread of infections during this period [15,19].
Social media channels such as Facebook, Twitter, and WeChat [20] have become important sources of health information for global users. With the spread of COVID-19, both correct and false information have spread across social media. The diffusion of misinformation can lead to the masking of healthy behavior and increasing the spread of the virus, leading to negative physical and mental outcomes for individuals (stress, depression, etc.) [21]. Accurate predictions are important to slow and prevent infections [22]. A health informatics monitoring system was developed that utilizes social media information for the early prediction and monitoring of diseases, and the availability of social network media data, particularly that of Twitter, allowed for real-time monitoring of potential outbreaks, including follow-up measures, immediate analyses, and feedback [23]. One of the most important factors in the progression of a pandemic is the rate of its spread and what measures can be taken to slow the spread. Social media has emerged as an important medium that identifies and explains emergency situations for both governments and citizens [24]. In times of crisis, many countries have utilized social media for communication and crisis management [25]. A study by Wukich [26] explained that US state agencies use official social media accounts to disclose different stages of disasters and provide information on early warning and guidance, government measures, and correction of misinformation. Yum [27] conducted an analysis of the influence of public key players on Twitter regarding coronavirus in the United States and found that Barack Obama played an important role in social media, while his centrality was relatively low compared to his followers. Donald Trump, who had fewer followers than Barack Obama, had the greatest influence on social media. Jurgens and Helsloot [28] found that social media platforms allow people to search for and share trusted information, and that social media can improve people’s ability to understand the current situation and cooperatively solve problems due to its connective capabilities.
Najmul Islam et al. [29] analyzed the effects of sharing unconfirmed information as news and personal experiences during an outbreak of an epidemic, and found that those who were engaged in marketing and entertainment and those who lack self-regulation were more likely to share unconfirmed information. A study on the risk of information diffusion related to MERS by Song et al. [30] found that negative information was being spread on Twitter, and positive information was being spread on news channels. Cuan-Baltazar et al. [31] analyzed information quality by collecting data using the keywords “coronavirus” and “Wuhan” through the Google search engine from the outbreak of COVID-10 until 6 February 2020, and found that 70.0% of the websites had low-quality information. Furthermore, there were significant health risks associated with health information online. Governments should develop strategies to regulate health information on the Internet, because wrong information, false news, and misinformation about COVID-19 can create difficulties for public health guidelines and prevention. This is because people are hungry for information and are willing to trust confusing information and spread such information [32].
Today, AI technologies and tools are playing a key role in all aspects of responding to the COVID-19 crisis. As the coronavirus becomes a global pandemic, AI can be used to help manage all stages of the pandemic crisis and its aftermath (detection, prevention, response, and recovery) across society, including policy and health care [33].
Hiltunen (2008) explained the weak signal as a three-dimensional future signal space such as signal, issue, and interpretation using the concept of future sign [34]. Yoon (2012) created keyword portfolios of KEM (Keyword Emergence Map) and KIM (Keyword Issue Map) using word frequency, document frequency, and occurrence frequency increase rate, and selected weak signals using them. KEM can calculate DoV (Degree of Visibility) by showing visibility, and KIM can calculate DoD (Degree of Diffusion) by showing the degree of diffusion [35].
For this study, we examined tweets related to COVID-19 on Twitter, to explore the future signals of COVID-19 in Korea and prepare responses for information diffusion.

2. Materials and Methods

2.1. Data

The collection of tweets utilized the Twitter Application Programming Interface and included users whose last tweet was 365 days ago or less. Using the user information in the Twitter pool, a crawler was used to collect data from original tweets (not retweets) made from 2 February 2020 to 31 May 2020 about relevant topics related to COVID-19. For the natural language processing of the collected tweets (unstructured text documents), morpheme analysis was conducted using methodologies such as head–tail classification, left–right and right–left analysis, and syllable unit analysis. The data refining was performed by keyword extraction through morpheme analysis of the collected documents, and the documents relating to advertisement posts were filtered and excluded.
A total of 1,746,347 posts related to COVID-19 were found, and among them, 373,562 posts based on ontology classification and 5 subjects (symptom, response, prevention, issue, industry) were utilized as targets for the analysis of future signals related to COVID-19.
This study was conducted after receiving the approval of the Institutional Review Board (2-1040781-A-N-012020056HR). The participants of this study cannot be personally identified from the data, ensuring their confidentiality and anonymity.

2.2. Text Mining

Tweets are unstructured text documents with a limit of 140 characters. Ontology is required to extract meaningful keywords from them, extract COVID-19 concepts, and show the relationships between concepts. Ontology, a computer-interpretable knowledge model that formalizes and represents shared concepts of a topic of interest [36], is required to classify and process large-scale unstructured text documents, which can then be analyzed through existing research methodologies. We developed an ontology by analyzing papers and press releases on related topics of COVID-19, and conducted text mining of COVID-19-related tweets. The ontology classification of COVID-19 in this study consisted of 5 subjects: symptoms, response, prevention, issue, and industry. Symptoms were classified into 9 factors: positive case, negative case, fever, respiratory, cold, sore throat, digestive symptoms, and depression. Response was classified into 6 factors: inspection, treatment, quarantine, diagnosis kit, government response, and school closed. Prevention was classified into 5 factors: immunity food, health care, hand cleaner, disinfectant, and mask. Issue was classified into 4 factors: no entry, infodemic, obesity, and stay home. Industry was classified into 5 factors: regional economy, tourism industry, culture industry, online shopping, and airline.

2.3. Future Signal Analysis

To explore future signals by analyzing posts on Twitter, it was necessary to calculate the term frequency and document frequency of the relevant factors therein. Term frequency is the total number of times a word is used in all relevant documents. Document frequency indicates the number of documents in which a specific word appears. The study used “R” program to conduct the analysis.
We then calculated the degree of visibility (DoV), and the degree of diffusion (DoD) [37]. The following equations are used to calculate DoV and DoD. The symbols in the equations signify the following:
DoV i j = ( T F i j / N N j )   ×   { 1     tw ×   ( n j ) }
DoD i j = ( D F i j / N N j )   ×   { 1     tw ×   ( n j ) }
  • NN: total number of documents;
  • TF: term frequency;
  • DF: document frequency;
  • tw: time weight (time weight of 0.05 was applied);
  • n: entire time segment;
  • j: time point;
  • {1 − tw × (n − j)}” is a function that weakens its influence as time passes, thus determining the scale of the time weight [37]. Keyword emergence map (KEM) and Keyword Issue Map (KIM) were examined by visualizing the frequency of emergence and the time-weighted rate of the increase in topics [35,37].

3. Results

As shown in Table 1, this study calculated the mean DoV increases and mean term frequency for each of the major factors of COVID-19 topics and found that the median value of DoV increase was 0.145, indicating that the mention of major factors increased. Stay home, inspection, obesity, depression, diagnosis kit, digestive symptoms, asymptomatic, online shopping, culture industry, negative case, regional economy, quarantine, government response, and treatment showed high frequencies, and the DoV increased to be higher than the median value, indicating that their frequency grew stronger over time. The mean term frequencies of disinfectant, cold, health care, mask, positive case, infodemic, tourism industry, and airline were high, but the DoV increase rates were lower than the median, indicating that their use grew weaker over time.
As shown in Table 2, DoD showed similar trends as DoV. However, the median value of DoD increase was 0.101, indicating that the diffusion of the major factors by each COVID-19 subject increased. Diagnosis kit, negative case, culture industry, regional economy, quarantine, government response, and treatment showed high frequencies, with DoD increasing to be higher than the median, indicating that their signals grew stronger over time. The mean term frequencies of disinfectant, cold, infodemic, hand cleaner, and positive case were high; their DoD increases were lower than the median, indicating that their signals grew weaker over time.
To explore future signals, we set the mean term frequency of DoV and mean document frequency of DoD as the x-axis and the mean increases in DoV and DoD as the y-axis. Then, we divided the median values of each into quadrants—the keywords in Quadrant 2 have weak signals, the keywords in Quadrant 1 have strong signals, the keywords in Quadrant 3 have latent signals, and the keywords in Quadrant 4 have strong signals but with slow increase rates.
As shown in Figure 1 and Figure 2, and Table 3, depression was found to be a strong signal in the KEM, but a weak signal in the KIM in terms of the main factors of COVID-19. This indicated that the influence of the depression factor decreased. The strong signals common to both KEM and KIM (Quadrant 1) included culture industry, quarantine, negative case, diagnosis kit, regional economy, treatment, and government response. Weak signals (Quadrant 2) included stay home, inspection, obesity, digestive symptoms, online shopping, and asymptomatic. Latent signals (Quadrant 3) included fever, sore throat, school closed, respiratory, immunity food, and no entry. Strong signals with slow increase rates in (Quadrant 4) included infodemic, cold, disinfectant, hand cleaner, tourism industry, and mask.

4. Discussion

The COVID-19 pandemic has caused serious problems worldwide and has led to rapid increases in rumors and misinformation about the causes, results, prevention, and treatment of the disease. Collecting and analyzing big data related to COVID-19 on social media could lead to effective responses to COVID-19 by providing correct knowledge and information. Social media data analysis can be used to track incorrect information and unconfirmed rumors and can help in understanding the fear associated with the disease. Furthermore, it can be used for social mobilization, health promotion, and communication with the public about public health interventions.
This study examined 1,746,347 posts on Twitter related to COVID-19 to explore the future signals of COVID-19 and responses to information diffusion. We analyzed the term frequency, document frequency of the key factors, importance, and degree of diffusion to explore future signals.
Depression, digestive symptoms, asymptomatic keywords relating to symptoms, inspection and diagnosis kit relating to response, and stay home and obesity relating to issues showed a high frequency. The results indicate that their signals strengthened over time. Therefore, it is necessary to actively advertise the symptoms of COVID-19 and engage in active testing for early detection of confirmed patients, ensuring that regional outbreaks do not occur. Furthermore, it is necessary to take preemptive measures against asymptomatic patients by understanding the degree of the COVID-19 outbreak by constructing a community sample monitoring system to ensure that outbreaks are not driven by asymptomatic patients.
The DoV mean term frequency was high for disinfectant, health care, and mask, but the signals were growing weaker over time. Reemergence of COVID-19 outbreaks are a concern, given people’s fatigue of wearing masks long term, and the avoidance of mask wearing during the summertime. Therefore, it is necessary to continuously reinforce public communication on prevention measures and mask wearing.
The issue of infodemic had a high DoD term frequency, but the results indicated that the signal weakened over time. The diffusion of misinformation related to COVID-19 is difficult to correct and may lead to excessive fear or psychological issues such as depression and anxiety. Therefore, management of the infodemic issue and fact checking at the governmental level is necessary to provide reliable information.
Stay home and obesity showed high increasing rates, indicating that these keywords may grow into strong signals. As such, active countermeasures are needed for these issues. Regional outbreaks of COVID-19 may lead to obesity as people spend time at home, so it is necessary to provide people with information on exercise methods that can be carried out at home. Moreover, the prevalence of obesity in COVID-19 patients admitted to the intensive care unit was disproportionately high compared to the general public [38], and it has been reported that when obese people are exposed to COVID-19, they are more likely to be hospitalized [39]. Therefore, it is necessary to intervene to improve lifestyles and obesity to reduce the risk of COVID-19, and it is necessary for governments to strengthen social distancing measures for severely obese individuals. Furthermore, the government should provide clear guidelines for risk management for infections in obese patients.
Finally, depression as a symptom was found to be a strong signal in the KEM but a weak signal in the KIM. This indicates that the influence of depression as a symptom and a major factor of COVID-19 continues to decrease, requiring subject-specific psychological treatment and stress management at the government level. Furthermore, as the COVID-19 crisis continues, there are more people stricken with depression from the fear of infections as well as with the overwhelming stress of unemployment, death, and social isolation. The importance of improving the psychological quarantine system should be emphasized as anxiety and fear associated with COVID-19 spreads.
The policy suggestions and limitations of this study are as follows: First, the typical flow of a signal progresses from a latent signal (low frequency and increasing rate) → weak signal (low frequency and high increasing rate) → strong signal (high frequency and increasing rate) → high signal with low increasing rate (high frequency and low increasing rate). When a signal has a low frequency and an increasing rate, it can be interpreted as a latent signal, as it has not emerged as an issue. However, weak signals with low frequency and a high increasing rate are likely to become issues in the future. Furthermore, strong signals with high frequency and an increasing rate are keywords that have already emerged as issues. Therefore, the weak signals of stay home, inspection, obesity, digestive symptoms, online shopping, and asymptomatic can become strong signals within a short period of time, requiring countermeasures.
Second, COVID-19 terms on Twitter may include colloquial speech and slang often found on Twitter as well as the professional terms classified under the theoretical background of ontology; as such, the COVID-19 ontology should be continuously revised and supplemented.
Third, the ontology developed in this study can contribute to the development of a chatterbot that can immediately predict the risk of COVID-19 by learning unstructured data.
Fourth, the COVID-19-related terms used in the subject analysis of this study may differ in definition compared to the COVID-19 terms in existing offline surveys.
In conclusion, as the second wave of the pandemic approaches, there are difficulties associated with developing a COVID-19 vaccine, and even if one is developed, it is likely that its development will happen after the second wave. Until the effectiveness of a COVID-19 vaccine has been proven and its distribution is sufficient for people around the world to be vaccinated and develop immunities, the world has no choice but to prevent COVID-19 through personal quarantines, social distancing, mask wearing, and strict hygiene management. Furthermore, it is necessary for people to strengthen their own individual immunity to reduce the risk of COVID-19 infection. Finally, it is necessary to strengthen the monitoring mechanism through preemptive measures such as rapid diagnosis and the isolation of confirmed patients.

Author Contributions

Conceptualization, J.S., T.M.S. and S.H.L.; Methodology, J.S., T.M.S. and S.H.L.; Software, J.S.; Validation, J.S.; Formal analysis, J.S.; Investigation, J.S., T.M.S. and S.H.L.; Resources, S.H.L.; Data curation, J.S., D.-L.J. and S.H.L.; Writing—original draft, D.-L.J. and T.M.S.; Writing—review & editing, J.S., D.-L.J. and T.M.S.; Funding acquisition, J.S. and S.H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Data Voucher Support Project of the Korea Data Agency.

Institutional Review Board Statement

The Institutional Review Board at the authors’ institution approved the study protocol of this study.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not Applicable.

Conflicts of Interest

Author Tae Min Song received fund from the Data Voucher Support Project of the Korea Data Agency. Author Sang Ho Lee was CEO of the the company HealthMax Co., LTD, Korea. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Liu, P.L. COVID-19 Information Seeking on Digital Media and Preventive Behaviors: The Mediation Role of Worry. Cyberpsychology Behav. Soc. Netw. 2020, 23, 677–682. [Google Scholar] [CrossRef] [PubMed]
  2. Guan, W.-J.; Ni, Z.-Y.; Hu, Y.; Liang, W.; Ou, C.; He, J.; Liu, L.; Shan, H.; Lei, C.; Hui, D.; et al. Clinical Characteristics of Coronavirus Disease 2019 in China. N. Engl. J. Med. 2020, 382, 1708–1720. [Google Scholar] [CrossRef] [PubMed]
  3. Chen, N.; Zhou, M.; Dong, X.; Qu, J.; Gong, F.; Han, Y.; Qiu, Y.; Wang, J.; Liu, Y.; Wei, Y.; et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: A descriptive study. Lancet 2020, 395, 507–513. [Google Scholar] [CrossRef] [PubMed]
  4. World Health Organization. Coronavirus. Available online: www.who.int/health-topics/coronavirus#tab=tab1 (accessed on 12 August 2020).
  5. Pan, L.; Mu, M.; Yang, P.; Sun, Y.; Wang, R.; Yan, J.; Li, P.; Hu, B.; Wang, J.; Hu, C.; et al. Clinical Characteristics of COVID-19 Patients with Digestive Symptoms in Hubei, China: A Descriptive, Cross-Sectional, Multicenter Study. Am. J. Gastroenterol. 2020, 115, 766–773. [Google Scholar] [CrossRef]
  6. Carfì, A.; Bernabei, R.; Landi, F. Group ftGAC-P-ACS. Persistent Symptoms in Patients After Acute COVID-19. JAMA 2020, 324, 603–605. [Google Scholar] [CrossRef]
  7. Tian, S.; Hu, N.; Lou, J.; Chen, K.; Kang, X.; Xiang, Z.; Chen, H.; Wang, D.; Liu, N.; Liu, D.; et al. Characteristics of COVID-19 infection in Beijing. J. Infect. 2020, 80, 401–406. [Google Scholar] [CrossRef]
  8. Ji, W.; Huh, K.; Kang, M.; Hong, J.; Bae, G.H.; Lee, R.; Na, Y.; Choi, H.; Gong, S.Y.; Choi, Y.-H.; et al. Effect of Underlying Comorbidities on the Infection and Severity of COVID-19 in Korea: A Nationwide Case-Control Study. J. Korean Med. Sci. 2020, 35, e237. [Google Scholar] [CrossRef]
  9. Christensen, D.M.; Strange, J.E.; Gislason, G.; Torp-Pedersen, C.; Gerds, T.; Fosbøl, E.; Phelps, M. Charlson Comorbidity Index Score and Risk of Severe Outcome and Death in Danish COVID-19 Patients. J. Gen. Intern. Med. 2020, 35, 2801–2803. [Google Scholar] [CrossRef]
  10. Guan, W.-J.; Liang, W.-H.; Zhao, Y.; Liang, H.-R.; Chen, Z.-S.; Li, Y.-M.; Liu, X.-Q.; Chen, R.-C.; Tang, C.-L.; Wang, T.; et al. Comorbidity and its impact on 1590 patients with COVID-19 in China: A nationwide analysis. Eur. Respir. J. 2020, 55, 2000547. [Google Scholar] [CrossRef]
  11. Salathé, M.; Kazandjieva, M.; Lee, J.W.; Levis, P.; Feldman, M.W.; Jones, J.H. A high-resolution human contact network for infectious disease transmission. Proc. Natl. Acad. Sci. USA 2010, 107, 22020–22025. [Google Scholar] [CrossRef]
  12. He, S.; Peng, Y.; Sun, K. SEIR modeling of the COVID-19 and its dynamics. Nonlinear Dyn. 2020, 101, 1667–1680. [Google Scholar] [CrossRef] [PubMed]
  13. Abou-Ismail, A. Compartmental Models of the COVID-19 Pandemic for Physicians and Physician-Scientists. SN Compr. Clin. Med. 2020, 2, 852–858. [Google Scholar] [CrossRef] [PubMed]
  14. Block, P.; Hoffman, M.; Raabe, I.J.; Dowd, J.B.; Rahal, C.; Kashyap, R.; Mills, M.C. Social network-based distancing strategies to flatten the COVID-19 curve in a post-lockdown world. Nat. Hum. Behav. 2020, 4, 588–596. [Google Scholar] [CrossRef] [PubMed]
  15. Choi, J.Y. COVID-19 in South Korea. Postgrad. Med. J. 2020, 96, 399–402. [Google Scholar] [CrossRef]
  16. Noh, J.Y.; Seong, H.; Yoon, J.G.; Song, J.Y.; Cheong, H.J.; Kim, W.J. Social Distancing against COVID-19: Implication for the Control of Influenza. J. Korean Med. Sci. 2020, 35, e182. [Google Scholar] [CrossRef]
  17. Newbold, S.C.; Finnoff, D.; Thunström, L.; Ashworth, M.; Shogren, J.F. Effects of Physical Distancing to Control COVID-19 on Public Health, the Economy, and the Environment. Environ. Resour. Econ. 2020, 76, 705–729. [Google Scholar] [CrossRef]
  18. Siedner, M.J.; Harling, G.; Reynolds, Z.; Gilbert, R.F.; Haneuse, S.; Venkataramani, A.S.; Tsai, A.C. Social distancing to slow the US COVID-19 epidemic: Longitudinal pretest–posttest comparison group study. PLoS Med. 2020, 17, e1003244. [Google Scholar] [CrossRef]
  19. Central Disaster and Safety Countermeasure Headquarters of the Republic of Korea. Rules and guidelines for distancing in daily life to control coronavirus disease 2019 in Korea: 3rd version, announced on 3 July 2020. J. Educ. Eval. Health Prof. 2020, 17, 20. [Google Scholar] [CrossRef]
  20. Zhao, D.; Sun, J.; Tan, Y.; Wu, J.; Dou, Y. An extended SEIR model considering homepage effect for the information propagation of online social networks. Phys. A Stat. Mech. Its Appl. 2018, 512, 1019–1031. [Google Scholar] [CrossRef]
  21. Tasnim, S.; Hossain, M.; Mazumder, H. Impact of Rumors and Misinformation on COVID-19 in Social Media. J. Prev. Med. Public Health 2020, 53, 171–174. [Google Scholar] [CrossRef]
  22. Zeroual, A.; Harrou, F.; Dairi, A.; Sun, Y. Deep learning methods for forecasting COVID-19 time-Series data: A Comparative study. Chaos Solitons Fractals 2020, 140, 110121. [Google Scholar] [CrossRef] [PubMed]
  23. Tuli, S.; Tuli, S.; Tuli, R.; Gill, S.S. Predicting the growth and trend of COVID-19 pandemic using machine learning and cloud computing. Internet Things 2020, 11, 100222. [Google Scholar] [CrossRef]
  24. Panagiotopoulos, P.; Barnett, J.; Bigdeli, A.Z.; Sams, S. Social media in emergency management: Twitter as a tool for communicating risks to the public. Technol. Forecast. Soc. Chang. 2016, 111, 86–96. [Google Scholar] [CrossRef]
  25. Chatfield, A.T.; Reddick, C.G. All hands on deck to tweet #sandy: Networked governance of citizen coproduction in turbulent times. Gov. Inf. Q. 2018, 35, 259–272. [Google Scholar]
  26. Wukich, C. Government Social Media Messages across Disaster Phases. J. Conting. Crisis Manag. 2016, 24, 230–243. [Google Scholar] [CrossRef]
  27. Yum, S. Social Network Analysis for Coronavirus (COVID-19) in the United States. Soc. Sci. Q. 2020, 101, 1642–1647. [Google Scholar] [CrossRef]
  28. Jurgens, M.; Helsloot, I. The effect of social media on the dynamics of (self) resilience during disasters: A literature review. J. Conting. Crisis Manag. 2018, 26, 79–88. [Google Scholar] [CrossRef]
  29. Islam, A.K.M.N.; Laato, S.; Talukder, S.; Sutinen, E. Misinformation sharing and social media fatigue during COVID-19: An affordance and cognitive load perspective. Technol. Forecast. Soc. Chang. 2020, 159, 120201. [Google Scholar] [CrossRef]
  30. Song, J.; Song, T.M.; Seo, D.C.; Jin, D.L.; Kim, J.S. Social Big Data Analysis of Information Spread and Perceived Infection Risk During the 2015 Middle East Respiratory Syndrome Outbreak in South Korea. Cyberpsychol. Behav. Soc. Netw. 2017, 20, 22–29. [Google Scholar] [CrossRef]
  31. Cuan-Baltazar, J.Y.; Muñoz-Perez, M.J.; Robledo-Vega, C.; Pérez-Zepeda, M.F.; Soto-Vega, E. Misinformation of COVID-19 on the Internet: Infodemiology Study. JMIR Public Health Surveill. 2020, 6, e18444. [Google Scholar] [CrossRef]
  32. Pazzanese, C. Battling the ‘Pandemic of Misinformation’. The Harvard Gazette, 8 May 2020. Available online: https://news.harvard.edu/gazette/story/2020/05/social-media-used-to-spread-create-covid-19-falsehoods/ (accessed on 2 July 2020).
  33. OECD Policy Responses to Coronavirus (COVID-19), Using Artificial Intelligence to Help Combat COVID-19, Updated 23 April 2020. Available online: https://www.oecd.org/coronavirus/policy-responses/using-artificial-intelligence-to-help-combat-covid-19-ae4c5c21/ (accessed on 13 December 2020).
  34. Hiltunen, E. The future sign and its three dimensions. Futures 2008, 40, 247–260. [Google Scholar] [CrossRef]
  35. Yoon, J. Detecting weak signals for long-term business opportunities using text mining of Web news. Expert Syst. Appl. 2012, 39, 12543–12550. [Google Scholar] [CrossRef]
  36. Kim, H.-Y.; Park, H.-A.; Min, Y.H.; Jeon, E. Development of an obesity management ontology based on the nursing process for the mobile-device domain. J. Med. Internet Res. 2013, 15, e130. [Google Scholar] [CrossRef] [PubMed]
  37. Song, J.; Han, Y.; Kim, K.; Song, T.M. Social big data analysis of future signals for bullying in South Korea: Application of general strain theory. Telemat. Inform. 2020, 54, 101472. [Google Scholar] [CrossRef]
  38. Le Brocq, S.; Clare, K.; Bryant, M.; Roberts, K.; Tahrani, A.A. Obesity and COVID-19: A call for action from people living with obesity. Lancet Diabetes Endocrinol. 2020, 8, 652–654. [Google Scholar] [CrossRef] [PubMed]
  39. Sattar, N.; Ho, F.K.; Gill, J.; Ghouri, N.; Gray, S.; Celis-Morales, C.; Katikireddi, S.V.; Berry, C.; Pell, J.P.; Mcmurray, J.; et al. BMI and future risk for COVID-19 infection and death across sex, age and ethnicity: Preliminary findings from UK biobank. Diabetes Metab. Syndr. 2020, 14, 1149–1151. [Google Scholar] [CrossRef] [PubMed]
Figure 1. KEM (Keyword Emergence Map) of main factors related to COVID-19.
Figure 1. KEM (Keyword Emergence Map) of main factors related to COVID-19.
Ijerph 20 05753 g001
Figure 2. KIM (Keyword Issue Map) of main factors related to COVID-19.
Figure 2. KIM (Keyword Issue Map) of main factors related to COVID-19.
Ijerph 20 05753 g002
Table 1. DoV mean increase rate and mean term frequency for main factors related to COVID-19.
Table 1. DoV mean increase rate and mean term frequency for main factors related to COVID-19.
KeywordDoVMean Increase RateMean Term
Frequency
2020
February
2020
March
2020
April
2020
May
Mask27,77144,03417,17313,8120.04125,698
Government Response12,71420,86513,54095720.16214,173
Tourism Industry13,53610,974657365290.0729403
Treatment730213,276970548430.1508782
Health Care11,05111,88466534623−0.0388553
Diagnosis Kit394515,485980129450.5138044
Regional Economy594410,024770452230.2317224
Cold8294640830172537−0.0935064
Infodemic51077627357628040.0564779
Disinfectant6557668937641896−0.1484727
Culture Industry45635396407640570.2604523
Quarantine33826178589721500.2194402
Hand Cleaner39194241281720610.0453260
32044697148620270.2522854
27144387216915000.0552693
Immunity Food22742877192716010.1452170
Airline21022689209112590.0972035
Depression10642618205420640.6001950
Negative Case 32962888693190−0.4971767
Positive Case739147011298240.3331041
Fever12561260639550−0.004926
Stay home34895817785160.958900
Respiratory103711397695630.056877
Sore Throat1043989342325−0.070675
Obesity3696626177390.623597
Digestive Symptom3797223474490.425474
Inspection1536175864120.937442
School Closed555621160135−0.135368
Online Shopping5303402313260.292357
Median0.1452693
Table 2. DoD mean increase rate and mean document frequency for main factors related to COVID-19.
Table 2. DoD mean increase rate and mean document frequency for main factors related to COVID-19.
KeywordDoDMean Increase RateMean Document Frequency
2020
February
2020
March
2020
April
2020
May
Mask19,39927,90013,31410,2250.03117,710
Government Response10,88017,89811,72881150.15012,155
Treatment615011,124807941940.1467387
Tourism Industry10,4778182484746790.0317046
Regional Economy48918227608243070.2165877
Diagnosis Kit29108835521117740.2874683
Cold6898549725092038−0.1214236
Infodemic42666265310023570.0453997
Quarantine30105567522619440.2063937
Culture Industry30903654263527020.2393020
Disinfectant3760398021011206−0.1332762
Negative Case 29144175134818560.2462573
Hand Cleaner29283285228516160.0482529
Positive Case25634060195113950.0442492
Immunity Food20012367150412810.1011788
Depression9902315161416280.5101637
No Entry29212587648181−0.4881584
Airline1289158312767000.0601212
Asymptomatic664131310107780.340941
Fever111711375765110.002835
Stay home33792415964750.857833
Respiratory95710286955220.047801
Health Care1060982615469−0.014782
Sore Throat887866304295−0.058588
Obesity3216045747160.662554
Digestive Symptom3416292953850.400413
Inspection1415085233680.871385
School Closed495572142128−0.107334
Online Shopping4243142072630.214302
Median0.1011788
Table 3. Future signals of main factors related to COVID-19.
Table 3. Future signals of main factors related to COVID-19.
Future
Signal
Latent SignalWeak SignalStrong SignalStrong But Low
Increasing Signal
KEMFever, Sore Throat, School Closed, Positive Case, Respiratory, Immunity Food, No EntryStay home, Inspection, Obesity, Digestive Symptom, Online Shopping, AsymptomaticDepression, Culture Industry,
Quarantine, Negative Case,
Diagnosis Kit, Regional Economy,
Treatment, Government Response
Airline, Infodemic, Cold, Disinfectant, Hand Cleaner
Tourism Industry, Health Care, Mask
KIMFever, Sore Throat,
School Closed, Respiratory,
Immunity Food, No Entry, Airline, Health Care
Stay home, Inspection, Obesity, Digestive Symptom, Online Shopping, Asymptomatic, DepressionCulture Industry,
Quarantine,
Negative Case, Diagnosis Kit, Regional Economy,
Treatment, Government Response
Infodemic, Cold, Disinfectant, Hand Cleaner, Tourism Industry, Mask, Positive Case,
Common
signal
Fever, Sore Throat,
School Closed, Respiratory,
Immunity Food, No Entry
Stay home, Inspection, Obesity, Digestive Symptom, Online Shopping, AsymptomaticCulture Industry,
Quarantine,
Negative Case, Diagnosis Kit, Regional Economy,
Treatment, Government Response
Infodemic, Cold, Disinfectant, Hand Cleaner, Tourism Industry, Mask
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Song, J.; Jin, D.-L.; Song, T.M.; Lee, S.H. Exploring Future Signals of COVID-19 and Response to Information Diffusion Using Social Media Big Data. Int. J. Environ. Res. Public Health 2023, 20, 5753. https://doi.org/10.3390/ijerph20095753

AMA Style

Song J, Jin D-L, Song TM, Lee SH. Exploring Future Signals of COVID-19 and Response to Information Diffusion Using Social Media Big Data. International Journal of Environmental Research and Public Health. 2023; 20(9):5753. https://doi.org/10.3390/ijerph20095753

Chicago/Turabian Style

Song, Juyoung, Dal-Lae Jin, Tae Min Song, and Sang Ho Lee. 2023. "Exploring Future Signals of COVID-19 and Response to Information Diffusion Using Social Media Big Data" International Journal of Environmental Research and Public Health 20, no. 9: 5753. https://doi.org/10.3390/ijerph20095753

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop