Next Article in Journal
Tracing down the Updates on Dengue Virus—Molecular Biology, Antivirals, and Vaccine Strategies
Next Article in Special Issue
Prospective Attitudes towards Respiratory Syncytial Virus (RSV) Vaccination: Validation of a Survey Instrument among Young Females in Jordan Pending Vaccine Authorization
Previous Article in Journal
Protective Anti-HBs Antibodies and Response to a Booster Dose in Medical Students Vaccinated at Childhood
Previous Article in Special Issue
Knowledge and Attitudes of Healthcare Professionals Regarding Perinatal Influenza Vaccination during the COVID-19 Pandemic
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison of Impressions of COVID-19 Vaccination and Influenza Vaccination in Japan by Analyzing Social Media Using Text Mining

1
Department of Hygiene, Faculty of Medicine, Kagawa University, Miki 761-0793, Japan
2
Sakaide City Hospital, Sakaide 762-8550, Japan
3
Institute of Biomedical Sciences, Tokushima University Graduate School, Tokushima 770-8503, Japan
*
Author to whom correspondence should be addressed.
Vaccines 2023, 11(8), 1327; https://doi.org/10.3390/vaccines11081327
Submission received: 17 June 2023 / Revised: 26 July 2023 / Accepted: 3 August 2023 / Published: 5 August 2023

Abstract

:
The aim of this study was to compare impressions of COVID-19 vaccination and influenza vaccination in Japan by analyzing social media (Twitter®) using a text-mining method. We obtained 10,000 tweets using the keywords “corona vaccine” and “influenza vaccine” on 15 December 2022 and 19 February 2023. We then counted the number of times the words were used and listed frequency of these words by a text-mining method called KH Coder. We also investigated concepts in the data using groups of words that often appeared together or groups of documents that contained the same words using multi-dimensional scaling (MDS). “Death” in relation to corona vaccine and “severe disease” for influenza vaccine were frequently used on 15 December 2022. The number of times the word “death” was used decreased, “after effect” was newly recognized for corona vaccine, and “severe disease” was not used in relation to influenza vaccine. Through this comprehensive analysis of social media data, we observed distinct variations in public perceptions of corona vaccination and influenza vaccination in Japan. These findings provide valuable insights for public health authorities and policymakers to better understand public sentiment and tailor their communication strategies accordingly.

1. Introduction

The first case of coronavirus infectious disease, also known as COVID-19, originated in 2019 and was reported in China on 8 December 2019 [1]. Subsequently, on 16 January 2020, the first case of COVID-19 in Japan was confirmed. Since then, the spread of COVID-19 infection has continued to affect Japan [2]. As of 26 February 2023, the cumulative number of novel positive cases of COVID-19 in Japan reached a staggering 33,184,966 [3].
In an ongoing effort to control the spread of the virus, the Japanese government has been actively encouraging its citizens to modify their behavior while also initiating a comprehensive COVID-19 vaccination campaign in February 2021 [4]. As of 23 February 2023, the vaccination coverage for COVID-19 among the entire population stood at 77.9% for the first dose, 77.4% for the second dose, 68.3% for the third dose, 45.9% for the fourth dose, and 23.7% for the fifth dose. Unfortunately, the vaccination rate after the third dose has shown limited progress [5]. Additionally, reports indicate that the vaccination rate remains lower among younger generations compared to older generations [6].
While influenza infection poses a significant public health challenge globally, Japan has not experienced any major influenza epidemics since the emergence of COVID-19 [7]. However, the influenza vaccination rates for the 2021–2022 period were considerably lower compared to pre-pandemic levels [8]. An influenza epidemic was anticipated for the 2022–2023 season, and the number of confirmed influenza cases had been steadily increasing since November 2022. From 13 February 2023 to 4 March, a total of 62,101 cases were reported, marking a substantial increase of approximately 2400-fold compared to the same period in the previous year [9].
Social media platforms have revolutionized communication, enabling individuals to share their thoughts, opinions, and ideas on a global scale. In Japan, these platforms have gained immense popularity, transcending generational boundaries. The Ministry of Internal Affairs and Communications conducted a survey in 2021, revealing that LINE® remains the dominant platform with a staggering 92.5% usage rate, followed by Facebook at 32.6%. Instagram® and Twitter® have also gained considerable traction, with respective usage rates of 48.5% and 46.2% [10]. Twitter®, known for its succinct 140-character messages [11], facilitates connections between users who share similar interests and concerns. With its viral nature, information spreads rapidly through retweets, making it an invaluable tool for disseminating important updates. What sets Twitter® apart is its diverse user base, spanning across different age groups. Astonishingly, 67.4% of teenagers, 78.6% of people in their 20s, 57.9% of individuals in their 30s, 44.8% of those in their 40s, 34.3% of people in their 50s, and even 14.1% of individuals in their 60s utilize Twitter® in Japan [10].
Social media use continues to grow worldwide, and there are numerous tweets about various health problems, including human papillomavirus infection [12], suicide [13], stress [14], air pollution [15], and others [12,13,14,15,16,17,18,19,20]. Keelan et al. examined 303 posts on HPV immunization and classified 157 (52%) as positive, 129 (43%) as negative, and 17 (6%) as ambivalent [12]. Another study investigated the relationship between suicide-related tweets and suicidal behavior in order to identify young individuals at risk on the Internet [13]. Song et al. obtained social data from online news and blogs, which provided insights into adolescent stress [14]. Additionally, a study collected air pollution data from 2210 monitoring sites across China, used keyword-based filtering to identify individual messages related to air pollution and health on Chinese Twitter®, and revealed the correlation between air pollution and related tweets during periods and locations with poor air quality [15]. Therefore, analyzing data from social media can be beneficial for examining health concerns within communities.
Previous studies have examined COVID-19 by analyzing social media [21,22,23,24,25]. Shim et al. utilized Google search trends to investigate the impact of media on skin lesions during the COVID-19 pandemic [21]. Abd-Alrazaq et al. extracted text and metadata (user profile information including the number of likes, retweets, and number of followers) from Twitter®, revealing that citizens were concerned about economic losses during the COVID-19 epidemic [22]. Pobiruchin et al. also analyzed Twitter® based on 16 hashtags about COVID-19 in the European region to examine temporal changes and regional characteristics of COVID-19-related tweets [23]. Alhuzali et al. extracted over 500,000 tweets related to COVID-19 from 48 different cities in the U.K. between February 2020 and November 2021, initially observing positive opinions that later shifted towards an increase in negative opinions [24]. In Japan, Suzuki et al. used Twitter® to examine the Japanese public’s attitude towards discrimination related to COVID-19 [25].
Understanding the extensive reach and popularity of Twitter®, it becomes clear that analyzing Twitter® data can provide invaluable insights, particularly in promoting COVID-19 vaccination among younger generations. By closely examining the sentiments, concerns, and preferences expressed by Twitter® users, effective strategies can be developed to address any doubts or misconceptions related to vaccinations. Engaging with the younger audience through Twitter® can play a pivotal role in significantly increasing vaccination rates in Japan and effectively combatting the ongoing pandemic. Previously, we reported on the initial impressions of medical staff regarding the COVID-19 vaccine using text mining, obtaining text data through a self-reported questionnaire [26,27]. In younger women, the words “pregnancy” and “side effect” were frequently associated with the first vaccine, while the mention of “side effect” decreased for the second vaccine [26]. Additionally, “pregnancy” was frequently mentioned in tweets about the third vaccine by women in their 30s [27]. However, the text data were obtained only from a limited number of medical staff at one hospital in Japan.
Despite the potential usefulness of the analysis of social media, particularly platforms like Twitter®, there has been a significant lack of comparative studies examining public sentiments regarding COVID-19 vaccination and influenza vaccination in Japan. This knowledge gap poses a hindrance to our comprehension of public perceptions and presents a formidable obstacle to the advancement of future COVID-19 vaccination initiatives. In order to bridge this gap and make valuable contributions to the improvement of COVID-19 vaccination efforts, we embarked on a comprehensive text-mining analysis of Twitter® data. Our objective was to delve into and draw comparisons between the general impressions held by the public concerning COVID-19 vaccination and influenza vaccination in Japan.

2. Materials and Methods

2.1. Data Extraction and Adjustment

To gather data for our study, we made use of a tool called the Twitter® application programming interface (API) [28]. The API is a set of rules and protocols that allows us to access information from Twitter®, such as tweets posted by users. Specifically, we used the API to retrieve tweets that were posted within the last seven days. We started the process of extracting data from Twitter® on 15 December 2022, which was before the widespread occurrence of influenza infections [9]. To narrow down our search and focus on the topic of vaccinations related to both the coronavirus and influenza viruses, we used two specific keywords: “corona vaccine” and “influenza vaccine”. Initially, we retrieved 10,000 tweets that were relevant to discussions about vaccinations in the context of both viruses. The purpose of this initial extraction was to capture a wide range of tweets related to our topic of interest. Later, on 19 February 2023, when the influenza virus was more prevalent [9], we performed an additional extraction of 10,000 tweets from Twitter® using the same set of keywords as before. This allowed us to gather more recent data specifically related to discussions about vaccinations during the time when the influenza virus was actively circulating.
To carry out the extraction process, we used a programming language called R® [29], along with an integrated development environment called RStudio® [30]. R® is a popular programming language used for analyzing data, and it provides a wide range of features for manipulating data and performing statistical analysis. On the other hand, RStudio® is a specialized development environment created specifically for working with R®. It offers a user-friendly interface and various tools that make coding and analysis tasks easier to perform. By utilizing R® and RStudio® together, we were able to efficiently work with data and conduct statistical analysis in a convenient and effective manner.
To collect the tweets, we used programming code in R® and RStudio®. We set specific keywords, namely “corona vaccine” and “influenza vaccine”, to filter the tweets. The extraction process was designed to include instances where the keywords appeared separately within a tweet. For example, a tweet could contain “corona” and “vaccine” in different parts of the text or “influenza” and “vaccine” in separate sections.
After the initial extraction, we performed further adjustments on the extracted data using Visual Studio Code® [31]. Visual Studio Code® is a popular source code editor that offers a range of features and extensions to support various programming languages, including R®. In our case, we utilized Visual Studio Code® to perform data cleaning and formatting tasks. During the adjustment phase, we removed certain elements from the extracted data to ensure data privacy and integrity. These elements included the following:
  • Uniform resource locators (URLs): Any web addresses present in the tweets were eliminated, as they were not relevant to our analysis;
  • Twitter® account names: The usernames associated with each tweet were removed to protect the privacy of the individuals who posted them;
  • Blank lines: Empty lines within the data were eliminated, as they did not contribute to the analysis;
  • Phone numbers and zip codes: Any phone numbers and zip codes mentioned in the tweets were removed to maintain privacy and comply with data-protection regulations;
  • Advertisements and suspected advertisements: Tweets that appeared to be advertisements or promotional in nature were excluded from the dataset, as our focus was on genuine discussions and opinions related to vaccination.
By performing these adjustments, we ensured that the data used in our subsequent analysis was clean, reliable, and devoid of any identifiable or irrelevant information.

2.2. Text Mining

Following the data extraction and adjustment process, we carefully conducted a comprehensive analysis of the tweet data using a powerful and widely recognized text-mining method called KH Coder (KH Coder 3.0, Koichi Higuchi, Tokyo, Japan). This methodology has been previously reported [32,33] and is highly regarded for its ability to extract valuable insights from textual data.
We used KH Coder to carefully analyze the tweets we collected about the “corona vaccine” and “influenza vaccine”. Our analysis involved both quantitative and qualitative techniques to thoroughly understand the patterns, recurring themes, and complex relationships in the tweets. To start, we created a detailed frequency table to identify the most commonly used words in the dataset. This information helped us gain a better understanding of the main topics and ongoing discussions related to the vaccines.
In addition to frequency analysis, we took advantage of the advanced multivariate analysis capabilities offered by KH Coder. This allowed us to dive deeper into the tweet data and extract more nuanced insights. To visually represent the relationships between the words, we used a popular technique called multi-dimensional scaling (MDS). MDS arranges the tweets as points in a low-dimensional space, highlighting similarities and differences between them. This MDS plot provided a clear visual representation of how tweets were related, with similar words positioned close together and dissimilar words positioned farther apart. This approach has been successfully used in previous studies to uncover distinct clusters and intricate patterns in text data [34,35]. Cluster analysis was then performed based on the coordinates of the plots obtained by the MDS method. The method is based on the Ward method using Euclidean distance. In this case, words that are placed close to each other on the plot are classified into the same cluster. In this case, eight clusters, which is the default, were used to support the interpretation of the results of the MDS method.
By using these strong analytical methods and techniques, we gained deep understanding from the tweet data. We discovered important discussions, uncovered new patterns and connections, and learned valuable information about the “corona vaccine” and “influenza vaccine” trends in social media conversations. This analysis is a valuable tool for better comprehending the dynamics and trends surrounding these topics.

2.3. Ethics

Ethical approval for this study was obtained from the Ethical Committee of Sakaide City Hospital in Sakaide, Japan (Number: 2022-002, Date: 15 December 2022). The committee carefully reviewed and approved the research methodology, ensuring that the study adhered to ethical guidelines and considerations for the protection of participants’ rights and privacy and the responsible use of social media data.

2.4. English Editing

The original text of this paper was written by the authors. However, English editing was performed by a professional proofreader (Medical English Service, Kyoto, Japan). Chat GPT and DeepL translation software were also used for some of the post-review English editing.

3. Results

On 15 December 2022, a study was conducted to analyze tweets related to the “corona vaccine” and the “influenza vaccine”. Among the 10,000 tweets examined, the phrase “corona vaccine” was used 552,432 times, while “influenza vaccine” was used 576,708 times. Table 1 and Table 2 provide lists of frequently used words among nouns and adjectival nouns extracted from these tweets. In the tweets discussing the “corona vaccine”, the most commonly used word was “vaccination”, which appeared 7625 times. Other frequently used words included “vaccine”, “corona vaccine”, “corona”, “new type”, and “death”. On the other hand, in tweets about the “influenza vaccine”, the word “vaccine” appeared most frequently, with a count of 8429. This was followed by “vaccination”, “influenza vaccine”, “influenza”, “corona”, and “severe disease” (Table 1).
Moving forward to 19 February 2023, the number of tweets mentioning the “corona vaccine” increased to 600,591, while the “influenza vaccine” was mentioned 577,965 times. Among the tweets focusing on the “corona vaccine”, the most commonly used word remained “vaccine”, with a count of 7811. It was followed by “corona vaccine”, “vaccination”, “corona vaccine”, “new type”, and “after effect”. In the case of tweets discussing the “influenza vaccine”, “vaccine” was once again the most frequently used word, appearing 9411 times. Other notable words included “influenza”, “mask”, “WHO”, “world”, and “vaccination” (Table 2).
The collected tweets were subjected to analysis using multi-dimensional scaling (MDS) to identify clusters of related words or concepts. In the analysis conducted in December 2022 (Figure 1), it was observed that tweets mentioning “corona vaccine” formed clusters associated with the word “death”. Additionally, a separate cluster emerged regarding the cost of future vaccinations. In the case of “influenza vaccine” tweets, a cluster was formed around the term “severe disease”, which was distinct from the cluster related to the “influenza vaccine” itself.
Moving on to the analysis performed in February 2023, the “corona vaccine” tweets revealed a separate cluster associated with the term “after effect”, which was distinct from the main cluster representing the “corona vaccine”. This cluster was related to concepts such as the “Ministry of Health, Labor and Welfare”, “medical care”, and “support”. Notably, the usage of “severe disease” was not observed in these tweets, and overall, there were no significant changes compared to the December 2022 analysis (Figure 2).
In summary, the analysis of tweets revealed shifts in the frequency and usage of words related to the “corona vaccine” and the “influenza vaccine” over time. These findings provide insights into the public discourse surrounding these vaccines and the evolving concerns and topics associated with them.

4. Discussion

In the present study, we compared public impressions of corona vaccination and influenza vaccination in Japan by analyzing Twitter® using a text-mining method for the first time. Prior to the influenza pandemic (on 15 December 2022), the impression of younger generations of the corona vaccine was characterized by the word “death”, while that of the influenza vaccine was associated with “severe disease”. During the influenza pandemic (on 19 February 2023), the use of the word “death” decreased, and a new association with “after effect” was identified in relation to the corona vaccine, while the word “severe disease” was no longer detected in tweets discussing the influenza vaccine.
According to impressions of the corona vaccine based on an analysis of social media, Awijen et al. reported an increase in Google search trends related to fear and anxiety with the spread of corona vaccination [36]. In a text-mining study on data related to the corona vaccine from Twitter®, Lyu et al. analyzed 1,499,421 tweets containing the words “vaccine”, “immunization”, and “vaccination”. An emotion analysis revealed that trust was the most predominant emotion, followed by anticipation, fear, and sadness [37]. Ruiz-Núñez examined the tweets of healthcare providers in Spain at two different time periods (one closest to the 2021 suspension of the AstraZeneca vaccine and the other 30 days later), reporting that negative tweets sent by healthcare providers against the corona vaccine had an impact on the prevention of COVID-19 [38]. Ong et al. investigated tweets from Malaysian Twitter® users related to additional doses of vaccines and found that they were interested in the type of vaccine, the effect of the vaccine, and the vaccination schedule [39]. Previous studies also examined the impression of the influenza vaccine using social media [40,41]. Guidry et al. analyzed Twitter® before and during an influenza pandemic to assess changes in attitudes towards influenza vaccination during the influenza season [40]. Signorini et al. used Twitter® to examine disease outbreaks and public interest in the USA during an influenza pandemic, suggesting the potential of automated extraction of epidemiological information using social media [41].
In the present study, we conducted a groundbreaking analysis to investigate the contrasting public impressions of corona vaccination and influenza vaccination in Japan. Utilizing a text-mining method on Twitter®, we embarked on an unprecedented approach, marking a significant advancement in vaccine-perception research. The results of our study revealed a stark divergence between the two vaccines. To contextualize our findings, we referred to a previous survey conducted by Katsiroumpa et al., where 254 high-risk individuals were surveyed regarding influenza vaccination during the COVID-19 epidemic. Their findings indicated that 39.4% expressed a desire to be vaccinated, while 33.9% did not. Intriguingly, individuals who had received a booster dose of the corona vaccine demonstrated a higher inclination to be vaccinated against influenza. However, they also reported that side effects of the corona vaccine and fatigue diminished their willingness to receive the influenza vaccine [42]. Notably, this study was the first to directly compare public impressions of corona vaccination and influenza vaccination in Japan. By harnessing the power of social media analysis, including Twitter®, we were able to promptly access and analyze the opinions and impressions of the general public, with a particular focus on the younger generation. The disparities in public perception of the corona vaccine and influenza vaccine were found to be significant.
Moreover, we made an intriguing observation regarding the association of specific terms with the corona vaccine on Twitter®. On 15 December 2022, the term “death” was categorized alongside the “corona vaccine”. However, by 19 February 2023, a separate cluster had emerged, linking the “after effect” to the “corona vaccine”. This cluster was also associated with other terms such as “Ministry of Health, Labor and Welfare”, “medical care”, and “support”. These findings suggest that the term “after effect” was not directly linked to negative connotations associated with the “corona vaccine”. This insight highlights the value of Twitter® data in informing future policy decisions regarding vaccination. Drawing inspiration from the work of Brownstein et al., who proposed a method for tracking disease outbreaks and public interest through social media analysis, we acknowledge the potential of social media platforms, particularly Twitter®, in enhancing healthcare policies [43]. In some of the literature, the importance of health strategies against COVID-19 has been shown [44,45]. Additionally, it has been reported that not only health strategies but also the importance of economic and global factors [46,47,48,49] are significant in preventing and improving COVID-19. In February 2023, the Ministry of Health, Labor, and Welfare in Japan released its future policies regarding corona vaccination [50]. In line with this, we firmly believe that continuous and systematic analysis of social media, including Twitter®, will be instrumental in raising public awareness about corona vaccination, addressing concerns and misinformation, and shaping future policies.
However, we need to consider and address several limitations in our current study. Firstly, it is important to note that Twitter® usage was most prevalent among individuals in their 20s, accounting for 78.6% of the sample, but only 46.2% across all age groups. As a result, the findings of our study may not equally represent information from all age demographics. Secondly, our study did not collect data on the clinical characteristics of participants, such as their gender, age, and occupation. This limitation restricts our ability to conduct more detailed and comprehensive analyses. Thirdly, Twitter® is a platform where information is constantly updated in real time, and tweets are generated at an extremely fast pace, necessitating prompt data analysis. Real-time analysis requires fast and efficient data processing support. However, there was a certain amount of time required from data acquisition to analysis, interpretation, and the publication of the paper. Lastly, Twitter® tweets have a character limit, and their brevity and use of text abbreviations can make analysis challenging. Furthermore, the wide range of expressions used in tweets, including slang, pictures, and hashtags, adds complexity to the analysis. To address this issue, we established certain rules in our study.
Despite the complexity, our study has shown that analyzing Twitter® can greatly help promote corona vaccination in Japan. To encourage vaccination in the future, it is important to provide the younger generation with more detailed and carefully explained information about the potential effects of the corona vaccine. Additionally, this analysis can be beneficial for shaping future vaccine policies when dealing with unfamiliar infectious diseases. By understanding public opinions and feelings, we can develop strategies to enhance vaccine acceptance. This research sets the stage for future endeavors that utilize social media data for public health initiatives, ultimately resulting in improved healthcare policies and a better-informed population.

5. Conclusions

In conclusion, our comprehensive analysis of Twitter® using a text-mining method revealed distinct and noteworthy differences in the public’s impressions of corona vaccination and influenza vaccination. In Japan, the terms “death” and “after effect” were frequently associated with the corona vaccine, while “severe disease” was commonly mentioned in relation to the influenza vaccine, particularly among the younger generations. These findings show the public discourse surrounding vaccination and emphasize the importance of targeted communication strategies to address concerns and misconceptions.

Author Contributions

Conceptualization, Y.M. (Yoshiro Mori); data curation, Y.M. (Yoshiro Mori) and Y.M. (Yuka Mori); formal analysis, Y.M. (Yoshiro Mori), Y.M. (Yuka Mori) and N.M.; investigation, Y.M. (Yoshiro Mori); methodology, Y.M. (Yoshiro Mori) and N.M.; project administration, K.T. and N.M.; supervision, N.M.; validation, Y.M. (Yoshiro Mori) and N.M.; visualization, Y.M. (Yoshiro Mori), N.M., H.S., Y.M. (Yuka Mori), S.O. and K.T.; writing—original draft, Y.M. (Yoshiro Mori) and N.M.; writing—review and editing, Y.M. (Yoshiro Mori), N.M., H.S., Y.M. (Yuka Mori), S.O. and K.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by Ethics Committee of Sakaide City Hospital, Sakaide, Japan (15 December 2022, approval number: 2022-002).

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

In addition, Chat GPT and DeepL translation software were also used for the post-reviewing process. All authors have read and agreed to the published version of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yu, F.; Du, L.; Ojcius, D.M.; Pan, C.; Jiang, S. Measures for diagnosing and treating infections by a novel coronavirus responsible for a pneumonia outbreak originating in Wuhan, China. Microbes Infect. 2020, 22, 74–79. [Google Scholar] [CrossRef]
  2. Ludwig, S.; Zarbock, A. Coronaviruses and SARS-CoV-2: A Brief Overview. Anesth. Analg. 2020, 131, 93–96. [Google Scholar] [CrossRef] [PubMed]
  3. Watanabe, M. The COVID-19 Pandemic in Japan. Surg. Today 2020, 50, 787–793. [Google Scholar] [CrossRef]
  4. Ministry of Health, Labor and Welfare, Japan. Coronavirus Infections Information. Available online: https://covid19.mhlw.go.jp (accessed on 24 February 2023). (In Japanese)
  5. Hayashi, M.; Morikawa, S.; Goto, Y.; Yoshida, T.; Kimura, Y.; Imaizumi, K. Adverse reactions to mRNA coronavirus disease 2019 (COVID-19) vaccine for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in 576 medical staff. Fujita Med. J. 2022, 8, 79–82. [Google Scholar] [PubMed]
  6. Digital Agency, Japan. Corona Vaccine Status. Available online: https://info.vrs.digital.go.jp/dashboard/ (accessed on 23 February 2023). (In Japanese)
  7. Prime Minister’s Office of Japan. Corona Vaccine Results by Age Group. Available online: https://www.kantei.go.jp/jp/content/nenreikaikyubetsu-vaccination_data.pdf (accessed on 20 April 2023). (In Japanese)
  8. Ministry of Health, Labor and Welfare, Japan. The 30th time health sciences council vaccination, vaccine subcommittee meeting research and development, production and circulation sectional meeting Material. Available online: https://www.mhlw.go.jp/stf/newpage_27660.html (accessed on 27 February 2023). (In Japanese)
  9. Ministry of Health, Labor and Welfare, Japan. 2022/2023 seasons of press materials about influenza. Available online: https://www.mhlw.go.jp/stf/seisakunitsuite/bunya/kenkou_iryou/kenkou/kekkaku-kansenshou01/houdou_00010.html (accessed on 20 February 2023). (In Japanese)
  10. Ministry of Internal Affairs and Communications. Report on Survey on Information and Communication Media Usage Time and Information Behavior in FY2021. Available online: https://www.soumu.go.jp/iicp/research/results/media_usage-time.html (accessed on 5 March 2023). (In Japanese)
  11. Honcharov, V.; Li, J.; Sierra, M.; Rivadeneira, N.A.; Olazo, K.; Nguyen, T.T.; Mackey, T.K.; Sarkar, U. Public Figure Vaccination Rhetoric and Vaccine Hesitancy: Retrospective Twitter Analysis. JMIR Infodemiology 2023, 10, e40575. [Google Scholar] [CrossRef]
  12. Keelan, J.; Pavri, V.; Balakrishnan, R.; Wilson, K. An analysis of the Human Papilloma Virus vaccine debate on MySpace blogs. Vaccine 2010, 10, 1535–1540. [Google Scholar] [CrossRef] [PubMed]
  13. Sueki, H. The association of suicide-related Twitter use with suicidal behaviour: A cross-sectional study of young internet users in Japan. J. Affect. Disord. 2015, 170, 155–160. [Google Scholar] [CrossRef] [PubMed]
  14. Song, J.; Yang, J.; Yoo, S.; Cheon, K.; Yun, S.; Shin, Y. Exploring Korean adolescent stress on social media: A semantic network analysis. PeerJ 2023, 11, e15076. [Google Scholar] [CrossRef]
  15. Ye, B.; Krishnan, P.; Jia, S. Public Concern about Air Pollution and Related Health Outcomes on Social Media in China: An Analysis of Data from Sina Weibo (Chinese Twitter) and Air Monitoring Stations. Int. J. Environ. Res. Public Health 2022, 19, 16115. [Google Scholar] [CrossRef]
  16. Tsuya, A.; Sugawara, Y.; Tanaka, A.; Narimatsu, H. Do cancer patients tweet? Examining the twitter use of cancer patients in Japan. J. Med. Internet Res. 2014, 16, e137. [Google Scholar] [CrossRef]
  17. Fujii, S.; Kunii, Y.; Nonaka, S.; Hamaie, Y.; Hino, M.; Egawa, S.; Kuriyama, S.; Tomita, H. Real-Time Prediction of Medical Demand and Mental Health Status in Ukraine under Russian Invasion Using Tweet Analysis. Tohoku J. Exp. Med. 2023, 259, 177–188. [Google Scholar] [CrossRef] [PubMed]
  18. Hasegawa, S.; Suzuki, T.; Yagahara, A.; Kanda, R.; Aono, T.; Yajima, K.; Ogasawara, K. Changing Emotions About Fukushima Related to the Fukushima Nuclear Power Station Accident-How Rumors Determined People’s Attitudes: Social Media Sentiment Analysis. J. Med. Internet Res. 2020, 22, e18662. [Google Scholar] [CrossRef]
  19. Takayasu, M.; Sato, K.; Sano, Y.; Yamada, K.; Miura, W.; Takayasu, H. Rumor diffusion and convergence during the 3.11 earthquake: A twitter case study. PLoS ONE 2015, 10, e0121443. [Google Scholar] [CrossRef]
  20. Harris, D.; Krishnan, A. Exploring the Association Between Suicide Prevention Public Service Announcements and User Comments on YouTube: A Computational Text Analysis Approach. J. Health Commun. 2023, 17, 1–10. [Google Scholar] [CrossRef] [PubMed]
  21. Shim, J.G.; Ryu, K.H.; Lee, S.H.; Cho, E.A.; Lee, Y.J.; Ahn, J.H. Text Mining Approaches to Analyze Public Sentiment Changes Regarding COVID-19 Vaccines on Social Media in Korea. Int. J. Environ. Res. Public Health 2021, 18, 6549. [Google Scholar] [CrossRef]
  22. Abd-Alrazaq, A.; Alhuwail, D.; Househ, M.; Hamdi, M.; Shah, Z. Top Concerns of Tweeters During the COVID-19 Pandemic: Infoveillance Study. J. Med. Internet Res. 2020, 21, 22. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Pobiruchin, M.; Zowalla, R.; Wiesner, M. Temporal and Location Variations, and Link Categories for the Dissemination of COVID-19-Related Information on Twitter During the SARS-CoV-2 Outbreak in Europe: Infoveillance Study. J. Med. Internet Res. 2020, 22, e19629. [Google Scholar] [CrossRef] [PubMed]
  24. Alhuzali, H.; Zhang, T.; Ananiadou, S. Emotions and Topics Expressed on Twitter During the COVID-19 Pandemic in the United Kingdom: Comparative Geolocation and Text Mining Analysis. J. Med. Internet Res. 2022, 24, e40323. [Google Scholar] [CrossRef]
  25. Suzuki, R.; Iizuka, Y.; Lefor, A.K. COVID-19 related discrimination in Japan: A preliminary analysis utilizing text-mining. Medicine 2021, 100, e27105. [Google Scholar] [CrossRef]
  26. Mori, Y.; Miyatake, N.; Suzuki, H.; Okada, S.; Tanimoto, K. A Text Mining-Based Survey of Pre-Impressions of Medical Staff toward COVID-19 Vaccination in a Designated Medical Institution for Class II Infectious Diseases. Vaccines 2021, 9, 1282. [Google Scholar] [CrossRef]
  27. Mori, Y.; Miyatake, N.; Suzuki, H.; Mori, Y.; Okada, S.; Tanimoto, K. Pre-Impressions of the Third COVID-19 Vaccination among Medical Staff: A Text Mining-Based Survey. Vaccines 2022, 10, 856. [Google Scholar] [CrossRef]
  28. About Twitter’s Application Programming Interface (API). Available online: https://developer.twitter.com/en (accessed on 20 April 2023).
  29. The R Project for Statistical Computing. Available online: https://www.r-project.org (accessed on 20 April 2023).
  30. Description of RStudio and Where to Download It. Available online: https://posit.co/products/open-source/rstudio/ (accessed on 25 April 2023).
  31. Official Home Page for Visual Studio Code. Available online: https://code.visualstudio.com (accessed on 25 April 2023).
  32. Higuchi, K. A Two-Step Approach to Quantitative Content Analysis: KH Coder Tutorial Using Anne of Green Gables (Part I). Ritsumeikan Soc. Sci. Rev. 2016, 52, 77–91. [Google Scholar]
  33. Higuchi, K. A Two-Step Approach to Quantitative Content Analysis: KH Coder Tutorial Using Anne of Green Gables (Part II). Ritsumeikan Soc. Sci. Rev. 2017, 53, 137–147. [Google Scholar]
  34. Tzeng, J.; Lu, H.H.; Li, W.H. Multidimensional scaling for large genomic data sets. BMC Bioinform. 2008, 4, 179. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Taguchi, Y.H.; Oono, Y. Relational patterns of gene expression via non-metric multidimensional scaling analysis. Bioinformatics 2005, 21, 730–740. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Awijen, H.; Ben, Z.Y.; Nguyen, D.K. COVID-19 vaccination, fear and anxiety: Evidence from Google search trends. Soc. Sci. Med. 2022, 297, 114820. [Google Scholar] [CrossRef]
  37. Lyu, J.C.; Luli, G.K. Understanding the Public Discussion About the Centers for Disease Control and Prevention During the COVID-19 Pandemic Using Twitter Data: Text Mining Analysis Study. J. Med. Internet Res. 2021, 23, e25108. [Google Scholar] [CrossRef]
  38. Ruiz-Núñez, C.; Herrera-Peco, I.; Campos-Soler, S.M.; Carmona-Pestaña, Á.; Benítez de Gracia, E.; Peña Deudero, J.J.; García-Notario, A.I. Sentiment Analysis on Twitter: Role of Healthcare Professionals in the Global Conversation during the AstraZeneca Vaccine Suspension. Int. J. Environ. Res. Public Health 2023, 20, 2225. [Google Scholar] [CrossRef]
  39. Ong, S.Q.; Pauzi, M.B.M.; Gan, K.H. Text Mining and Determinants of Sentiments towards the COVID-19 Vaccine Booster of Twitter Users in Malaysia. Healthcare 2022, 10, 994. [Google Scholar] [CrossRef]
  40. Guidry, J.P.D.; Austin, L.L.; O’Donnell, N.H.; Coman, I.A.; Lovari, A.; Messner, M. Tweeting the #flushot: Beliefs, Barriers, and Threats During Different Periods of the 2018 to 2019 Flu Season. J. Prim. Care Community Health 2020, 11, 2150132720932722. [Google Scholar]
  41. Signorini, A.; Segre, A.M.; Polgreen, P.M. The use of Twitter to track levels of disease activity and public concern in the U.S. during the influenza A H1N1 pandemic. PLoS ONE 2011, 6, e19467. [Google Scholar] [CrossRef] [Green Version]
  42. Katsiroumpa, A.; Sourtzi, P.; Kaitelidou, D.; Siskou, O.; Konstantakopoulou, O.; Galanis, P. Predictors of Seasonal Influenza Vaccination Willingness among High-Risk Populations Three Years after the Onset of the COVID-19 Pandemic. Vaccines 2023, 11, 331. [Google Scholar] [CrossRef]
  43. Brownstein, J.S.; Freifeld, C.C.; Madoff, L.C. Digital disease detection—Harnessing the Web for public health surveillance. N. Engl. J. Med. 2009, 360, 2153–2155, 2157. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Su, Z.; Cheshmehzangi, A.; Bentley, B.L.; McDonnell, D.; Šegalo, S.; Ahmad, J.; Chen, H.; Terjesen, L.A.; Lopez, E.; Wagers, S.; et al. Technology-based interventions for health challenges older women face amid COVID-19: A systematic review protocol. Syst. Rev. 2022, 11, 271. [Google Scholar] [CrossRef]
  45. Abbas, J. Crisis management, transnational healthcare challenges and opportunities: The intersection of COVID-19 pandemic and global mental health. Res. Glob. 2021, 3, 100037. [Google Scholar] [CrossRef]
  46. Abbas, J.; Al-Sulaiti, K.; Lorente, D.B.; Shah, S.A.R.; Shahzad, U. Economic Growth and Environmental Quality in a Post-Pandemic World, 1st ed.; Routledge: London, UK, 2023; pp. 177–201. [Google Scholar]
  47. Global Burden of Disease 2021 Health Financing Collaborator Network. Global investments in pandemic preparedness and COVID-19: Development assistance and domestic spending on health between 1990 and 2026. Lancet Glob. Health 2023, 11, e385–e413. [Google Scholar] [CrossRef] [PubMed]
  48. GBD 2019 Iran Collaborators. Health system performance in Iran: A systematic analysis for the Global Burden of Disease Study 2019. Lancet 2022, 399, 1625–1645. [Google Scholar] [CrossRef] [PubMed]
  49. GBD 2019 Pakistan Collaborators. The state of health in Pakistan and its provinces and territories, 1990–2019: A systematic analysis for the Global Burden of Disease Study 2019. Lancet Glob. Health 2023, 11, e229–e243. [Google Scholar] [CrossRef] [PubMed]
  50. Ministry of Health, Labor and Welfare, Japan. Future Policy on Corona Vaccination in Japan. Available online: https://www.mhlw.go.jp/stf/newpage_31004.html (accessed on 22 April 2023). (In Japanese)
Figure 1. Concepts in data based on groups of words that often appear together or groups of documents that contain the same words on 15 December 2022 using multi-dimensional scaling (MDS).
Figure 1. Concepts in data based on groups of words that often appear together or groups of documents that contain the same words on 15 December 2022 using multi-dimensional scaling (MDS).
Vaccines 11 01327 g001
Figure 2. Concepts in data based on groups of words that often appear together or groups of documents that contain the same words on 19 February 2023 using multi-dimensional scaling (MDS).
Figure 2. Concepts in data based on groups of words that often appear together or groups of documents that contain the same words on 19 February 2023 using multi-dimensional scaling (MDS).
Vaccines 11 01327 g002
Table 1. List of frequently used words among nouns and adjectival nouns in a survey performed on 15 December 2022.
Table 1. List of frequently used words among nouns and adjectival nouns in a survey performed on 15 December 2022.
Corona VaccineTimes Influenza VaccineTimes
Used Used
Total 552,432%Total576,708%
1stVaccination76251.38Vaccine84291.46
2ndVaccine72261.31Vaccination60491.05
3rdCorona vaccine68351.24Influenza vaccine52110.90
4thCorona49430.89Influenza51080.89
5thNew type31610.57Corona49160.85
6thDeath28550.52Severe disease 16200.28
7thInfection19990.36Doctor15490.27
8thInformation13310.24Outbreak14860.26
9thMask12550.23Hospitalization14060.24
10thMinistry of Health, Labor, and Welfare9810.18Flu (influenza)13910.24
11thFree9810.18Prevention13690.24
12thFor the time being9010.16Symptoms13630.24
13thWorld8970.16Data12020.21
14thContinuation8720.16Patients11740.20
15thMinister of Health, Labor, and Welfare Kato8590.16Evidence11550.20
16thSince then8440.15Impression11370.20
17thJapan7570.14Epidemic11360.20
18thDoctor7390.13Chairman Omi11320.20
19thFlorida7210.13Tokyo Medical Association11310.20
20thInvestigation7160.13Effect10960.19
Table 2. List of frequently used words among nouns and adjectival nouns in a survey performed on 19 February 2023.
Table 2. List of frequently used words among nouns and adjectival nouns in a survey performed on 19 February 2023.
Corona VaccineTimes Influenza VaccineTimes
Used Used
Total600,591%Total577,965%
1stVaccine78111.30Vaccine94111.63
2ndCorona vaccine62481.04Influenza56820.98
3rdVaccination59440.99Mask54120.94
4thCorona vaccine53860.90WHO39810.69
5thNew type24360.41World37840.65
6thAfter effect17560.29Vaccination32760.57
7thDeath15020.25Manufacture31830.55
8thJapan13260.22Corona29630.51
9thInfection12900.21Avian influenza24210.42
10thInfluenza12270.20Virus24140.42
11thVirus11040.18Pandemic22460.39
12thCommon cold10700.18Organization19680.34
13thEffect10110.17Tedros19290.33
14thResearch9660.16Human19170.33
15thHerpes zoster9420.16Use18960.33
16thUnited States8570.14Prediction18770.32
17thBuy8450.14Preparation18760.32
18thChildren8360.14Monkey pox18760.32
19thGovernment7890.13Fatal18660.32
20thmRNA7780.13In vain18660.32
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mori, Y.; Miyatake, N.; Suzuki, H.; Mori, Y.; Okada, S.; Tanimoto, K. Comparison of Impressions of COVID-19 Vaccination and Influenza Vaccination in Japan by Analyzing Social Media Using Text Mining. Vaccines 2023, 11, 1327. https://doi.org/10.3390/vaccines11081327

AMA Style

Mori Y, Miyatake N, Suzuki H, Mori Y, Okada S, Tanimoto K. Comparison of Impressions of COVID-19 Vaccination and Influenza Vaccination in Japan by Analyzing Social Media Using Text Mining. Vaccines. 2023; 11(8):1327. https://doi.org/10.3390/vaccines11081327

Chicago/Turabian Style

Mori, Yoshiro, Nobuyuki Miyatake, Hiromi Suzuki, Yuka Mori, Setsuo Okada, and Kiyotaka Tanimoto. 2023. "Comparison of Impressions of COVID-19 Vaccination and Influenza Vaccination in Japan by Analyzing Social Media Using Text Mining" Vaccines 11, no. 8: 1327. https://doi.org/10.3390/vaccines11081327

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop