Next Article in Journal
Health System Performance and Resilience in Times of Crisis: An Adapted Conceptual Framework
Previous Article in Journal
“Collapsing into Darkness”: An Exploratory Qualitative Thematic Analysis of the Experience of Workplace Reintegration among Nurses with Operational Stress Injuries
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Depression in South Korean Adolescents Captured by Text and Opinion Mining of Social Big Data

1
Administration of Justice, Pennsylvania State University, Schuylkill, PA 17972, USA
2
Gachon University Graduate School of Industry & Environment, Seoul 13120, Republic of Korea
3
HealthMax Co., Ltd., Seoul 06078, Republic of Korea
4
Department of Applied Health Science, Indiana University School of Public Health, Bloomington, IN 47405, USA
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2023, 20(17), 6665; https://doi.org/10.3390/ijerph20176665
Submission received: 17 July 2023 / Revised: 3 August 2023 / Accepted: 24 August 2023 / Published: 28 August 2023

Abstract

:
Depression in adolescence is recognized as an important social and public health issue that interferes with continued physical growth and increases the likelihood of other mental disorders. The goal of this study was to examine online documents posted by South Korean adolescents for 3 years through the text and opinion mining of collectable documents in order to capture their depression. The sample for this study was online text-based individual documents that contained depression-related words among adolescents, and these were collected from 215 social media websites in South Korea from 1 January 2012 to 31 December 2014. A sentiment lexicon was developed for adolescent depressive symptoms, and such sentiments were analyzed through opinion mining. The depressive symptoms in the present study were classified into nine categories as suggested by the Diagnostic and Statistical Manual for Mental Disorders, 5th Edition (DSM-5). The association analysis and decision tree analysis of data mining were used to build an efficient prediction model of adolescent depression. Opinion mining indicated that 15.5% were emotionally stable, 58.6% moderately stressed, and 25.9% highly distressed. Data mining revealed that the presence of depressed mood most of the day or nearly every day had the greatest effect on adolescents’ depression. Social big data analysis may serve as a viable option for developing a timely response system for emotionally susceptible adolescents. The present study represents one of the first attempts to investigate depression in South Korean adolescents using text and opinion mining from three years of online documents that originally amounted to approximately 3.1 billion documents.

1. Introduction

Even though depression is a familiar part of the vocabulary for men and women of all ages and is referred to as “modern people’s mental cold” in psychopathology, its harmful effects are serious [1]. Depression in adolescence is recognized as an important social and public health issue that interferes with continued physical growth and increases the likelihood of other mental disorders. It usually takes chronic and recurrent courses and often bears an undesirable influence on interpersonal functioning and academic achievements [1]. Depression is a social issue because it is associated with suicidal thoughts and attempts [2,3]. Indeed, many people who commit suicide turn out to have suffered from depression [2,4]. In addition, thoughts of self-harm are considered suicide risk factors and are also highly related to mental illnesses such as major depressive disorder [2,5].
Most people consider depression a “private matter”, and because they do not want to be called a mentally ill patient, they tend to refuse treatments [6,7]. The number of patients with depression is about 350 million worldwide [8]. By using a Korean female adolescent sample, the study suggests that depressive symptoms partially mediate the relationship between ADHD symptoms and suicidal ideation [9]. However, more than 50% of these patients are not utilizing appropriate therapies or medical care [10]. The prevalence of depressive disorders in adults 19 years or older in South Korea was 6.7% in 2014, and only 18.2% of the affected patients sought counseling or treatment for mental problems [11]. Even though the rate of adolescents experiencing depressive mood in South Korea has been overall decreasing from 32.8% in 2011, 30.5% in 2012, 30.9% in 2013, to 26.7% in 2014, the rate is still high [11].
Prior research has shown that a number of sociodemographic, lifestyle, and disease-related factors affect depression. For example, being female [12,13], low social capital in the family [14,15], and experience of divorce or separation [16] are related to a higher likelihood of depression. Meanwhile, as the influence of social networking sites (SNS hereafter) increases, SNS are becoming not only a communication tool for ordinary daily matters or chats but also a space where one can express or listen to depressive feelings, stress, or worries [17].
Especially given that almost all Korean adolescents are using SNS on a daily basis, Korean adolescents who have depressive symptoms or mood are highly likely to leave postings on online media such as Twitter, blogs, cafés, and bulletin boards. The emotional contagion theory states that whatever emotion is expressed by someone, such emotion tends to be transferred similarly to another person who is exposed to it [18]. Morris et al. and Kramer, Guillory and Hancock affirm this, reporting that buzzes posted on Facebook affect the emotions of other users who read them [19,20]. Krasnova, Wenninger, Widjaja and Buxmann noted “Facebook Envy”, which refers to the phenomenon that Facebook postings stir people’s feelings of envy as some use Facebook as an instrument to show off their happy lives rather than as a communication space [21]. Other researchers reported a similar finding that long-term use of Facebook may lead people to feel depressed [22,23]. The American Academy of Pediatrics stated that adolescents’ exposure to social media such as Facebook could help adolescents feel depressed [24]. Lin et al. surveyed 1787 adults ages 19 to 32 about social media use and depression and found that social media use was significantly associated with increased depression [25].
Regardless of the similar or dissimilar feelings that SNS postings stimulate, SNS are increasingly becoming a space where one can express or be exposed to stressful feelings or worries as well as cheerful or pleasant emotions and stories. Thus, it is plausible that an analysis of online postings among Korean adolescents may shed light on the characteristics and patterns of such postings and help identify adolescents who are emotionally susceptible. Better yet, such an analysis could help establish an automatic online monitoring system (e.g., artificial intelligence) that could monitor and capture online postings of emotionally susceptible adolescents and provide them with ballooned texts of helpful information on a real-time basis. Although a prior study was conducted to analyze depression-related Twitter postings, it was based on relatively small amounts of tweets during only a 2-month period [26].
The purpose of this study is to analyze the characteristics and forms of depression in adolescents and to predict risk factors through social big data analysis. Further, this research intends to demonstrate whether social big data analysis may serve well in depicting the probability of depression in adolescents using social media on a regular basis. It may also be an important guide for the development of an online response system for emotionally susceptible adolescents. The present study analyzed online documents posted by South Korean adolescents that contained depression-related words for 3 years from 1 January 2012 to 31 December 2014 through the text and opinion mining of collectable documents in order to capture their depression. The study conducted text mining and opinion mining of online text-based documents in online channels, which allowed for the approximate identification of adolescents who were emotionally susceptible. The study hypothesized that the nine depressive symptoms suggested by the Diagnostic and Statistical Manual for Mental Disorders, 5th Edition (DSM-5), have differential predictability of depression.

2. Materials and Methods

2.1. Sample

The procedures for the collection and classification of online documents related to adolescent emotional susceptibility (i.e., depression-related words) in the present study are illustrated in Figure 1. The sample for this study was online text-based individual documents that contained depression-related words among adolescents, and these were collected from 215 popular social media websites in South Korea (i.e., Twitter, 199 online news sites, 9 online bulletin boards, 4 blogs, and 2 online cafés) from 1 January 2012 to 31 December 2014. Twitter crawling uses the Twitter API (application programming interface) to crawl. To use the Twitter API, register an application on the Twitter development site and access and collect public information.
As shown in Figure 1, out of more than 3.10 billion posts from the 3 years, a total of 3,703,135 contained depression-related words. Noise was minimized by excluding documents that contained predetermined stop-words. In order to limit the study sample to adolescents, only documents that contained such descriptors as adolescents, secondary school students, or ages 19 or younger were selected and included in the sample, which reduced the sample to 161,581. Among the documents from these adolescents, 86,957 documents contained expressions of emotion or sentiment which allowed categorization into emotionally stable, moderately stressed, or highly distressed.

2.2. Measures

For the purpose of collecting and analyzing social big data related to adolescent depression, we constructed an adolescent depression management ontology on the basis of Ontology Development 101 [27] and developed a theme classification system and a terminology system for youth depression management [28]. For the natural language processing of the collected data, morpheme analysis was conducted using methodologies such as head–tail classification, left–right and right–left analysis, and syllable unit analysis. The data refining was performed using keyword extraction through morpheme analysis of the collected documents, and the documents relating to advertisement posts were filtered and excluded.

2.2.1. Dependent Variable

The dependent variable of the present study was depression. Once all of the relevant online text documents were retrieved, composing elements (such as nouns, verbs, adjectives, prepositions, and postpositional words) were identified and classified based on analysis of linguistic morphology of texts through text mining. Then, opinion mining was performed using the ontology developed for analyzing adolescents’ depressive mood, and one of three states of depression (i.e., emotionally stable, moderately stressed, and highly distressed) was assigned to each document. The emotionally stable state (coded 1) applies to documents that mention having no/little stress, no/little depression, or having a sense of happiness or peace. The highly distressed state (coded 3) refers applies to documents that mention having serious stress, worries, fear, insomnia, depression, suicidal thoughts, or victimization to school violence. The moderately stressed state (coded 2) applies to documents that mention things other than codes 1 and 3.

2.2.2. Independent Variables

The major independent variable was the nine binary (yes/no) variables of depressive symptoms suggested by the DSM-5. Specifically, DSM1 (words included in the documents: depressed, depressed feeling, depression tweet, unhappiness, sorrow, cry, deep sadness, grief, gloom, depressive disorder, depression, neurotic depression, psychotic depression, severe depression, and chronic depression) refers to depressed mood most of the day or nearly every day; DSM2 (words included in the documents: psychotic depression, character depression, psychotic depression, mild depression, hypothermia, emotional abnormality, powerless, and asthenia) markedly diminished interest or pleasure in all or almost all activities most of the day or nearly every day; DSM3 (words included in the documents: loss of appetite, weight change, weight increase, weight decrease, fat, obese, corpulent, fleshy, and increased appetite) a significant weight loss or weight gain, or decrease/increase in appetite nearly every day; DSM4 (words included in the documents: dyssomnia, lethargy, sleeping pills, drowsiness, sleep, lack of sleep, excessive sleep, narcolepsy, insomnia, and sleep disorders) insomnia or hypersomnia nearly every day; DSM5 (words included in the documents: hypomania, amnesia, psychiatric symptoms, auditory hallucination, delusion, mental symptoms, insanity, anxiety, jitter, phobia, irritancy, agitation, excitation, psychomotor agitation, nerve-racking, impulsiveness, impulsion, worry, incomplete, psychomotor retardation, anxiety disorder, neurosis, phobia, panic disorder, generalized anxiety disorder, and panic disorder) psychomotor agitation or retardation nearly every day; DSM6 (words included in the documents: fatigue, feeling confused, confusion, chaos, lightheaded, dizziness, inertia, and weariness) fatigue or loss of energy nearly every day; DSM7 (words included in the documents: worthlessness, feeling of guilt, inappropriate guilt, sense of sin, sense of sins, guilty conscience, guilt, remorse, and blame oneself) feelings of worthlessness or excessive or inappropriate guilt; DSM8 (words included in the documents: dependence, dependent, dependent personality, obedient personality, submissive, obedience, decline of concentration, indecisiveness, decline of brain activity, brain, brain activity, decline of brain function, brain function, ADHD, and attention deficit hyperactivity disorder) diminished ability to think or concentrate, or indecisiveness nearly every day; and DSM9 (words included in the documents: suicidal tendency, suicide, suicidal impulse, suicidal ideation, and suicide-related behaviors) recurrent thoughts of death, recurrent suicidal ideation without a specific plan, or a suicide attempt or a specific plan for committing suicide. For statistical analysis, each expression that contained each of the depression symptoms was coded as 1 versus 0.

2.3. Data Analysis

Online documents related to online adolescent depression were collected. After extracting keywords through morphological analysis, risk factors related to adolescent depression were organized by classifying them into nine depressive symptoms suggested by DSM-5.
In the present study, association analysis of data mining and decision tree analysis, which does not require statistical assumptions, were used to build an efficient prediction model, so that adolescents’ depressive symptoms mentioned in online channels could be used to identify depression. For the association analysis, the Apriori principle algorithm proposed by Agrawal and Srikant was used [29]. The Apriori algorithm identifies associations between two or more words included in an online document or transaction. Association rules are predicated on “support” (used to remove rules that appear less frequently) and “confidence” (used to gauge the strength of the association between words). Association analysis involves the generation of frequent item sets that satisfy a minimal support criterion as defined by the researcher. Of these item sets, those that satisfy a minimal confidence criterion and a lift of at least 1 are selected [26]. Evaluation of the association analysis used to predict adolescents’ depression was carried out using the criteria of support 0.001 and confidence 0.1. The analysis algorithm used to form the decision tree in the present study was Chi-squared automatic interaction detection [30]. As a stopping rule, the minimum number of cases for the parent node was set at 100 and for the child node at 50, while the depth was set at 3 [30]. IBM SPSS version 24.0 was used for decision tree analysis and R version 3.5.0 was used for association analysis. In regard to ethical consideration of the research, this study was conducted after obtaining approval from Institutional Review Board of the Korea Institute for Health and Social Affairs (NO. 2014-23). The research used social big data collected by the Korea Institute for Health and Social Affairs. The social big data collected did not include personal identification information, thereby ensuring the anonymity and confidentiality of the subjects.

3. Results

As shown in Table 1, opinion mining of online documents (buzzes) indicated that 15.5% of subjects were emotionally stable, 58.6% were moderately stressed, and 25.9% were highly distressed (25.2% in 2012, 27.9% in 2013, and 24.5% in 2014). Of the nine DSM-5 symptoms, “feeling depressed most of the day or nearly every day” appeared the most frequently (42.4%), followed by “feeling anxiety” (15.9%) and “having recurrent thoughts of death” (12.8%).
As shown in Table 2, of all of the buzzes that showed two or more depressive symptoms, 44.2% were associated with a highly distressed status. In terms of online channels, more emotionally stable buzzes were found in blogs (38.1%) than in other places (29.7% in online cafés, 24.3% in online news, 16.9% in online boards, and 6.5% on Twitter).
The result of decision tree analysis of depression is shown in Figure 2 and a corresponding profit chart is shown in Table 3. The presence of DSM1 (depressed mood most of the day or nearly every day) was found to have the greatest effect on depression. If DSM1 symptoms were present, the likelihood of finding highly distressed individuals increased from 25.9% to 38.3% and that of finding moderately stressed individuals decreased from 58.6% to 37.2%. If both the DSM1 and DSM5 (psychomotor agitation or retardation nearly every day) symptoms were present, the likelihood of finding highly distressed individuals increased from 38.3% to 45.9% and that of finding moderately stressed individuals decreased from 37.2% to 20.5%. If all three symptoms of DSM1, DSM5, and DSM9 (recurrent thoughts of death, recurrent suicidal ideation without a specific plan, or a suicide attempt or a specific plan for committing suicide) were present, then the likelihood of finding highly distressed individuals increased from 45.9% to 51.6% and that of finding moderately stressed individuals decreased from 20.5% to 16.0%.
As shown in Table S1 (Supplementary Materials), the association analysis of depression indicates that highest level of confidence (0.675) was associated with a combination of six variables (DSM2, DSM3, DSM4, DSM6, DSM7, and DSM8) and a highly distressed emotional state with an increase of 2.605. This means that when the six depressive symptoms are mentioned in online documents, the level of confidence of finding a highly distressed emotional state is about 67.5% and the risk for feeling highly distressed is about 2.605 times the risk when these six symptoms are not mentioned.

4. Discussion

4.1. Contributions

The present study is one of the first attempts to investigate the depression of South Korean adolescents using text and opinion mining from three years of online documents that originally amounted to approximate 3.1 billion documents. One of the major hypotheses that drove this investigation was that the text and opinion mining of collectable documents in online channels would allow for the approximate identification of adolescents who are emotionally susceptible. The results of the present study showed that 25.2% in 2012, 27.9% in 2013, and 24.5% in 2014 of the adolescents were highly distressed. These numbers are compared to 30.5% in 2012, 30.9% in 2013, and 26.7% in 2014 of surveyed Korean adolescents who reported feeling sad or depressed in the past two weeks in a national online surveillance among 75,000 students [11]. Although the percentage differences are statistically significant between the national surveillance data and the data from the present study, one could argue that the differences are not large. It appears that there is potential for the text and opinion mining of online documents to identify adolescents who are emotionally susceptible.
The differential association between depression and the nine symptoms of the Diagnostic and Statistical Manual for Mental Disorders, 5th Edition, deserves mention. In the present study, emotionally susceptible adolescents were associated with depressive symptoms in the order of DSM1 (depressed mood most of the day or nearly every day), DSM5 (psychomotor agitation or retardation nearly every day), and DSM9 (recurrent thoughts of death, suicide attempts or a specific plan for committing suicide). This result contrasts with that of Cavazos–Rehg et al. who analyzed Twitter postings and found depressive symptoms to the order of DSM1, DSM7 (feelings of worthlessness or excessive guilt), and DSM9 [31]. For both studies, DSM1 and DSM9 were mentioned as important predictors for emotionally susceptible adolescents. This implies that these two symptoms found in online documents could serve as major diagnostic criteria for depressive symptoms in adolescents.
Measurement of the depression of adolescents should be approached with clinical methods. It is difficult to measure the entire population of adolescents due to the lack of population-based-data. The national surveillance system survey is the most common tool used to measure depression in adolescents. The survey has a high possibility of error due to social stigma and social desirability bias. Adolescents commonly express their feelings, depression, and stress in daily life on social network services. The information related to depression in adolescents appearing online can be collected with less bias than the information related to depression measured in the national surveillance system survey. Therefore, analyzing online documents related to emotional expressions or psychological crisis behaviors of adolescents could be a more effective response to the problems of youth exposed to the risk of depression as well as the prediction of risk signs.
The findings of the present study have policy implications for preventing and responding to depression risk in South Korean adolescents. Given that adolescents express or are exposed to stressful feelings, depressive mood, or even suicidal ideation online, and that such emotion or ideation can be transferred to another person who is exposed to it, it may be necessary to establish an online depression monitoring system. Further, for a timely response to adolescents at risk in cyber space, it would be desirable to build a system that captures the online postings of emotionally susceptible adolescents and provides them with ballooned texts of helpful information on a real-time basis would be needed.

4.2. Limitations

The study has limitations. First, caution is warranted to guard interpretations against ecological fallacy. Given that the present study did not perform analysis on individual characteristics but used the data of the entire group of individual members, an ecological fallacy could occur if the results are applied to individuals [17]. Second, if some of the analyzed online documents (buzzes) that appeared to be written by adolescents were actually posted by adults that used adolescent keywords (e.g., under the age of 19, elementary school students, middle school students, and high school students), the results might have been confounded. Third, this research assumed that individuals did not engage in disinformation in their postings. Even if some individuals posted false information in order to mislead others, the impact on the study findings might be minimal as such individuals, if any, would comprise only a minuscule fraction of the sample given the topic. Despite these limitations, the present study contributes to the literature and the field by suggesting a new analytical method that would allow for building a timely response system to provide assistance to emotionally susceptible adolescents.

5. Conclusions

The present study is one of the first attempts to investigate depression in South Korean adolescents using text and opinion mining from three years of online documents that originally amounted to approximate 3.1 billion documents. It appears that there is potential for the text and opinion mining of online documents to identify adolescents who are emotionally susceptible. The differential association between depression and the nine symptoms of the Diagnostic and Statistical Manual for Mental Disorders, 5th Edition, deserves mention. In the present study, emotionally susceptible adolescents were associated with depressive symptoms to the order of DSM1, namely depressed mood most of the day or nearly every day; DSM5, namely psychomotor agitation or retardation nearly every day; and DSM9, namely recurrent thoughts of death, suicide attempts, or a specific plan for committing suicide. It may be necessary to establish an online depression monitoring system and a timely response to adolescents at risk in cyber space to prevent or help adolescents’ depression risks.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijerph20176665/s1, Table S1: Association Rules of Depressive Symptoms.

Author Contributions

Conceptualization, J.S., T.-M.S. and D.-C.S.; methodology, T.-M.S. and S.L.; software, T.-M.S.; formal analysis, J.S. and T.-M.S.; investigation, J.S. and D.-C.S.; resources, T.-M.S. and S.L.; data curation, T.-M.S. and S.L.; writing—original draft preparation, J.S.; writing—review and editing, D.-C.S.; supervision, D.-C.S.; funding acquisition, T.-M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of the Korea Institute for Health and Social Affairs (protocol code 2014-23).

Informed Consent Statement

Patient consent was waived due to the secondary analysis. The research used social big data collected by the Korea Institute for Health and Social Affairs and the social big data did not include personally identifiable information, thereby ensuring the anonymity and confidentiality of the subjects.

Data Availability Statement

The data can be provided upon request to the Korea Institute for Health and Social Affairs and signing on the data use agreement.

Acknowledgments

Authors appreciate the support the Korea Institute for Health and Social Affairs has provided including permission to use the social big data.

Conflicts of Interest

The authors declare no conflict of interest. Sangho Lee is CEO of HealthMax Co., Ltd.

References

  1. Fergusson, D.M.; Boden, J.M.; Horwood, L.J. Recurrence of major depression in adolescence and early adulthood, and later mental health, educational and economic outcomes. Br. J. Psychiatry 2007, 191, 335–342. [Google Scholar] [CrossRef] [PubMed]
  2. Duffy, M.E.; Twenge, J.M.; Joiner, T.E. Trends in mood and anxiety symptoms and suicide-related outcomes among US undergraduates, 2007–2018: Evidence from two national surveys. J. Adolesc. Health 2019, 65, 590–598. [Google Scholar] [CrossRef] [PubMed]
  3. Lundin, A.; Lundberg, I.; Allebeck, P.; Hemmingsson, T. Psychiatric diagnosis in late adolescence and long-term risk of suicide and suicide attempt. Acta Psychiatr. Scand. 2011, 124, 454–461. [Google Scholar] [CrossRef] [PubMed]
  4. Siu, A.M. Self-harm and suicide among children and adolescents in Hong Kong: A review of prevalence, risk factors, and prevention strategies. J. Adolesc. Health 2019, 64, S59–S64. [Google Scholar] [CrossRef] [PubMed]
  5. Kessler, R.C.; Berglund, P.; Borges, G.; Nock, M.; Wang, P.S. Trends in suicide ideation, plans, gestures, and attempts in the United States, 1990–1992 to 2001–2003. JAMA 2005, 293, 2487–2495. [Google Scholar] [CrossRef] [PubMed]
  6. Bland, R.C.; Newman, S.C.; Orn, H. Help-seeking for psychiatric disorders. Can. J. Psychiatry 1997, 42, 935–942. [Google Scholar] [CrossRef] [PubMed]
  7. Sirey, J.A.; Bruce, M.L.; Alexopoulos, G.S.; Perlick, D.A.; Friedman, S.J.; Meyers, B.S. Stigma as a barrier to recovery: Perceived stigma and patient-rated severity of illness as predictors of antidepressant drug adherence. Psychiatr. Serv. 2001, 52, 1615–1620. [Google Scholar] [CrossRef]
  8. World Health Organization. Depression is a Common Illness and People Suffering from Depression Need Support and Treatment. 2012. Available online: http://www.who.int/mediacentre/news/notes/2012/mental_health_day_20121009/en/ (accessed on 5 March 2020).
  9. Cho, S.C.; Kim, J.W.; Choi, H.J.; Kim, B.N.; Shin, M.S.; Lee, J.H.; Kim, E.H. Associations between symptoms of attention deficit hyperactivity disorder, depression, and suicide in Korean female adolescents. Depress. Anxiety 2008, 25, E142–E146. [Google Scholar] [CrossRef]
  10. World Health Organization. 10 Facts on the State of Global Health. 2014. Available online: www.who.int/features/factfiles/global_burden/en (accessed on 5 March 2020).
  11. Ministry of Health and Welfare of Korea; Korea Centers for Disease Control and Prevention. Korea Health Statistics 2014: Korea National Health and Nutrition Examination Survey (KNHANES VI-2); Ministry of Health and Welfare of Korea: Sejong, Republic of Korea, 2015. [Google Scholar]
  12. Townsend, L.; Musci, R.; Stuart, E.; Heley, K.; Beaudry, M.B.; Schweizer, B.; Ruble, A.; Swartz, K.; Wilcox, H. Gender differences in depression literacy and stigma after a randomized controlled evaluation of a universal depression education program. J. Adolesc. Health 2019, 64, 472–477. [Google Scholar] [CrossRef]
  13. Silverstein, B. Gender differences in the prevalence of somatic versus pure depression: A replication. Am. J. Psychiatry 2002, 159, 1051–1052. [Google Scholar] [CrossRef]
  14. Blum, R.W.; Li, M.; Naranjo-Rivera, G. Measuring adverse child experiences among young adolescents globally: Relationships with depressive symptoms and violence perpetration. J. Adolesc. Health 2019, 65, 86–93. [Google Scholar] [CrossRef]
  15. Ogburn, K.M.; Sanches, M.; Williamson, D.E.; Caetano, S.C.; Olvera, R.L.; Pliszka, S.; Hatch, J.P.; Soares, J.C. Family environment and pediatric major depressive disorder. Psychopathology 2010, 43, 312–318. [Google Scholar] [CrossRef] [PubMed]
  16. Rotermann, M. Marital breakdown. Health Rep. 2007, 18, 33–44. [Google Scholar] [PubMed]
  17. Song, T.M.; Song, J.; An, J.Y.; Hayman, L.L.; Woo, J.M. Psychological and social factors affecting Internet searches on suicide in Korea: A big data analysis of Google search trends. Yonsei Med. J. 2014, 55, 254–263. [Google Scholar] [CrossRef] [PubMed]
  18. Hatfield, E.; Cacioppo, J.T.; Rapson, R.L. Emotional contagion. Curr. Dir. Psychol. Sci. 1993, 2, 96–100. [Google Scholar] [CrossRef]
  19. Morris, M.E.; Consolvo, S.; Munson, S.; Patrick, K.; Tsai, J.; Kramer, A.D. Facebook for health: Opportunities and challenges for driving behavior change. In Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI’11), Vancouver, BC, Canada, 7–12 May 2011; Extended Abstracts on Human Factors in Computing Systems. pp. 443–446. [Google Scholar] [CrossRef]
  20. Kramer, A.D.; Guillory, J.E.; Hancock, J.T. Experimental evidence of massive-scale emotional contagion through social networks. Proc. Natl. Acad. Sci. USA 2014, 111, 8788–8790. [Google Scholar] [CrossRef] [PubMed]
  21. Krasnova, H.; Wenninger, H.; Widjaja, T.; Buxmann, P. Envy on Facebook: A hidden threat to users’ life satisfaction? In Proceedings of the International Conference on Wirtschaftsinformatik (WI2013), Leipzig, Germany, 27 February–1 March 2013; pp. 1477–1491. [Google Scholar] [CrossRef]
  22. Tandoc, E.C., Jr.; Ferrucci, P.; Duffy, M. Facebook use, envy, and depression among college students: Is Facebooking depressing? Comput. Hum. Behav. 2015, 43, 139–146. [Google Scholar] [CrossRef]
  23. Sagioglou, C.; Greitemeyer, T. Facebook’s emotional consequences: Why Facebook causes a decrease in mood and why people still use it. Comput. Hum. Behav. 2014, 35, 359–363. [Google Scholar] [CrossRef]
  24. O’Keeffe, G.S.; Clarke-Pearson, K. The impact of social media on children, adolescents, and families. Pediatrics 2011, 127, 800–804. [Google Scholar] [CrossRef]
  25. Lin, L.Y.; Sidani, J.E.; Shensa, A.; Radovic, A.; Miller, E.; Colditz, J.B.; Hoffman, B.L.; Giles, L.M.; Primack, B.A. Association between social media use and depression among US young adults. Depress. Anxiety 2016, 33, 323–331. [Google Scholar] [CrossRef]
  26. Park, M.; Cha, C.; Cha, M. Depressive moods of users portrayed in Twitter. In Proceedings of the 18th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD 2012), Chicago, IL, USA, 11–14 August 2012; Volume 2012, pp. 1–8. [Google Scholar]
  27. Noy, N.F.; McGuinness, D.L. Ontology Development 101: A Guide to Creating your First Ontology; Stanford University Press: Redwood City, CA, USA, 2001. [Google Scholar]
  28. Jung, H.; Park, H.A.; Song, T.M. Development and Evaluation of an Adolescents’ Depression Ontology for Analyzing Social Data. Stud. Health Technol. Inform. 2016, 225, 442–446. [Google Scholar] [CrossRef]
  29. Agrawal, R.; Srikant, R. Fast algorithms for mining association rules. In Proceedings of the 20th International Conference on Very Large Data Bases (VLDB 1994), Santiago, Chile, 12–15 September 1994; Volume 1215, pp. 487–499. [Google Scholar]
  30. Kass, G.V. An exploratory technique for investigating large quantities of categorical data. J. R. Stat. Soc. Ser. C Appl. Stat. 1980, 29, 119–127. [Google Scholar] [CrossRef]
  31. Cavazos-Rehg, P.A.; Krauss, M.J.; Sowles, S.; Connolly, S.; Rosas, C.; Bharadwaj, M.; Bierut, L.J. A content analysis of depression-related tweets. Comput. Hum. Behav. 2016, 54, 351–357. [Google Scholar] [CrossRef]
Figure 1. Flowchart of sample selection. Note. a Whereas the entire number of online posts during the study period was available for Twitter due to the full crawler retrieval method, it was not available for the other social media websites due to the focused crawler retrieval method.
Figure 1. Flowchart of sample selection. Note. a Whereas the entire number of online posts during the study period was available for Twitter due to the full crawler retrieval method, it was not available for the other social media websites due to the focused crawler retrieval method.
Ijerph 20 06665 g001
Figure 2. Decision Tree Analysis of Emotional Susceptibility.
Figure 2. Decision Tree Analysis of Emotional Susceptibility.
Ijerph 20 06665 g002
Table 1. Distribution of Emotional Susceptibility and Depressive Symptoms.
Table 1. Distribution of Emotional Susceptibility and Depressive Symptoms.
Emotional SusceptibilityN(%)Depressive Symptom 1N(%)
2012Emotionally stable4294(16.8)DSM138,413(42.4)
Moderately stressed14,794(57.9)DSM24886(5.4)
Highly distressed6441(25.2)DSM32227(2.5)
2013Emotionally stable4694(15.9)DSM45971(6.6)
Moderately stressed16,527(56.1)DSM514,352(15.9)
Highly distressed8232(27.9)DSM66508(7.2)
2014Emotionally stable4463(14.0)DSM73127(3.5)
Moderately stressed19,666(61.5)DSM83429(3.8)
Highly distressed7846(24.5)DSM911,624(12.8)
TotalEmotionally stable13,451(15.5)Total90,535
Moderately stressed50,987(58.6)


Highly distressed22,519(25.9)


Total86,957


Note. DSM1: depressed mood most of the day, nearly every day, as indicated by subjective report; DSM2: markedly diminished interest or pleasure in all, or almost all, activities most of the day, nearly every day; DSM3: significant weight loss when not dieting or weight gain, or decrease or increase in appetite nearly every day; DSM4: insomnia or hypersomnia nearly every day; DSM5: psychomotor agitation or retardation nearly every day; DSM6: fatigue or loss of energy nearly every day; DSM7: feelings of worthlessness or excessive or inappropriate guilt; DSM8: diminished ability to think or concentrate, or indecisiveness, nearly every day; DSM9: recurrent thoughts of death, recurrent suicidal ideation without a specific plan, or a suicide attempt or a specific plan for committing suicide. 1 Depressive symptoms may be counted multiple times.
Table 2. Crosstabulation of Emotional Susceptibility with Depressive Symptoms and Online Channels.
Table 2. Crosstabulation of Emotional Susceptibility with Depressive Symptoms and Online Channels.
CharacteristicsEmotionally StableModerately StressedHighly
Distressed
χ2 (p)
Depressive Symptoms13998(16.8)12,259(51.5)7524(31.6)3334.1
(<0.001)
≥26448(30.5)5352(25.3)9336(44.2)
Online ChannelsTwitter3369(6.5)40,634(77.9)8185(15.7)22,354.9
(<0.001)
Blog4586(38.1)3096(31.4)4352(36.2)
Cafe2500(29.7)2650(31.4)3277(38.9)
Board1084(16.9)1850(28.8)3494(54.4)
News1912(24.3)2757(35.0)3211(40.7)
Note. Numbers in parentheses are percentages across rows.
Table 3. Profit Chart in Modeling Depression.
Table 3. Profit Chart in Modeling Depression.
ClassificationNodeProfit IndexCumulative Index
n%Profit (%)Index (%)n%Profit (%)Index (%)
Highly Distressed112920.30.7220.82920.30.7220.8
743375.010.8216.646295.311.5216.9
138010.91.5162.054306.213.0208.8
919632.33.6158.773938.516.6195.5
869318.012.1152.314,32416.528.8174.6
1227923.24.5141.217,11619.733.3169.2
1025,18229.038.8133.942,29848.672.1148.2
1444,65951.427.954.486,957100100.0100.0
Moderately Stressed1444,65951.468.8133.944,65951.468.8133.9
1025,18229.022.778.369,84180.391.4113.9
1227923.22.475.972,63383.593.9112.4
138010.90.771.873,43484.494.5112.0
869318.03.239.880,36592.497.7105.7
919632.30.835.882,32894.798.5104.1
112920.30.135.682,62095.098.6103.8
743375.01.427.386,957100.0100.0100.0
Emotionally Stable919632.35.524519632.35.5245.0
869318.019.2240.6889410.224.7241.6
743375.09.0180.513,23115.233.7221.6
112920.30.5141.713,52315.634.2219.9
1025,1822936.4125.638,70544.570.6158.5
1227923.23.9122.541,49747.774.5156.1
138010.91.0103.342,29848.675.4155.1
1444,65951.424.647.886,957100.0100.0100.0
Note. Node numbers refer to the nodes in Figure 2.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Song, J.; Song, T.-M.; Lee, S.; Seo, D.-C. Depression in South Korean Adolescents Captured by Text and Opinion Mining of Social Big Data. Int. J. Environ. Res. Public Health 2023, 20, 6665. https://doi.org/10.3390/ijerph20176665

AMA Style

Song J, Song T-M, Lee S, Seo D-C. Depression in South Korean Adolescents Captured by Text and Opinion Mining of Social Big Data. International Journal of Environmental Research and Public Health. 2023; 20(17):6665. https://doi.org/10.3390/ijerph20176665

Chicago/Turabian Style

Song, Juyoung, Tae-Min Song, Sangho Lee, and Dong-Chul Seo. 2023. "Depression in South Korean Adolescents Captured by Text and Opinion Mining of Social Big Data" International Journal of Environmental Research and Public Health 20, no. 17: 6665. https://doi.org/10.3390/ijerph20176665

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop