Next Article in Journal
Electric Vehicle Charging Load Prediction Model Considering Traffic Conditions and Temperature
Previous Article in Journal
A Novel Cellular Network Traffic Prediction Algorithm Based on Graph Convolution Neural Networks and Long Short-Term Memory through Extraction of Spatial-Temporal Characteristics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Study on Environmental Trends and Sustainability in the Ocean Economy Using Topic Modeling: South Korean News Articles

1
College of Business Administration, Incheon National University, 119, Academy-ro, Yeonsu-gu, Incheon 22012, Republic of Korea
2
College of Business Administration, Gyeonsang Natonal University, 501, Jinju-daero, Jinju-si 52828, Republic of Korea
3
Port Research Department, Korea Maritime Institute, 301, Haryang-ro, Youngdo-gu, Busan 49111, Republic of Korea
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Processes 2023, 11(8), 2253; https://doi.org/10.3390/pr11082253
Submission received: 8 June 2023 / Revised: 21 July 2023 / Accepted: 24 July 2023 / Published: 26 July 2023

Abstract

:
The ocean economy plays a critical role in global economic growth, yet it confronts substantial environmental risks. This study employs topic modeling of South Korean news articles to analyze the evolving trends of environmental risks and sustainability in ocean economy. A dataset comprising 50,213 articles from 2008 to 2022 is examined, revealing prevalent environmental concerns that have persisted over the years. The findings demonstrate an increasing emphasis on sustainability and marine environmental issues, as evidenced by prominent keywords related to construction, safety, plastic pollution, and ecosystem conservation. Through Latent Dirichlet Allocation (LDA) in topic modeling, 10 distinct themes are identified, encompassing sustainable fisheries management, accident and disaster response, polar environment, carbon neutrality, microplastic pollution, habitat ecosystems, cruise tourism development, nuclear power plant pollution, and infectious diseases. The outcomes highlight the necessity of collaborative efforts and international partnerships, underpinned by diplomatic cooperation, to effectively address transboundary environmental challenges encountered in the ocean-based industries.

1. Introduction

According to the OECD, the ocean economy encompasses a wide range of industries, including shipping, fisheries, and ports, as well as offshore wind power generation, marine biotechnology, marine ecosystems, and marine tourism. It has the potential to contribute to the economic growth, employment, and innovation of various countries worldwide [1]. In 2010, the production value of the ocean economy accounted for 25% of the global gross value added (GVA), reaching $1.5 billion, and it is projected to more than double by 2030 [1]. Furthermore, according to the OECD [2], the six maritime economic activities, including marine fishing, aquaculture, fish processing, shipbuilding, maritime freight transport, and maritime passenger transport, accounted for 2% of the gross domestic product (GDP) of high-income countries, 11% of middle-income countries’ GDP, and 20% of low-income and island countries’ GDP [3]. The ocean economy is rapidly expanding due to factors such as population growth, economic and income level growth, climate and environmental changes, and technological advancements. It is expected to exceed the global economic growth rate in terms of value-added and employment in all aspects by 2030.
However, as the scale of the ocean economy grows worldwide, it is increasingly exposed to numerous risks and faces threats. The recent COVID-19 pandemic that has affected the entire world, coupled with the Russia–Ukraine conflict that erupted in early 2022, has acted as a continuous threat to the global supply chain. This has plunged the world economy into a quagmire of stagflation, characterized by high inflation, high interest rates, unemployment, and economic downturn occurring simultaneously. The global economic growth rate due to COVID-19 decreased by more than 3.1% in 2020, and the world trade volume decreased by more than 8.2% [4]. The global economic situation has inflicted severe damage on the maritime and port industry, which is responsible for the majority of international trade.
Alongside these macroeconomic conditions, the ocean economy also faces significant threats due to the environmental risks associated with the specificity of the ocean. Sumaila et al. [5] stated, “A vibrant ocean economy depends on sustainable and healthy oceans, however, many aspects of current ocean resource use patterns make it unsustainable”. Furthermore, Bennett et al. [6] compiled a list of ten risks that undermine the sustainable ocean economy, including such as “Dispossession, displacement and ocean grabbing”, “Environmental justice concerns from pollution and waste”, and “Environmental degradation and reduction of availability of ecosystem services”. However, it was noted that awareness of these risks is still insufficient [6]. In particular, environmental issues such as accelerated global warming, the occurrence of super typhoons, increasing marine plastic pollution, and the radiation contamination of seafood from nuclear power plant effluents, as well as risks related to technological advancements, such as cyber vulnerabilities in ports, are growing concerns for the ocean economy [7].
The devastating earthquake that occurred in Japan in 2011 had a profound impact on the marine fisheries industry [8]. It led to the closure of 16 ports in Japan for one month, causing a collapse in the global supply chain network of port-utilizing enterprises. The resulting production disruptions in the automotive, electrical, and electronics manufacturing industries alone caused a minimum of $40 billion in damages to the port sector. Similarly, the grounding of the container ship Ever Given in the Suez Canal in March 2021 resulted in a six-day disruption to global east–west shipping routes, causing the suspension of over 200 vessels and approximately 17 million tons of cargo transportation [9]. This incident inflicted significant damage on shipping companies. The estimated damages to Egypt, which manages the Suez Canal, are believed to exceed $1 billion. These examples illustrate how the ocean economy can have significant ripple effects on the global economy when it suffers losses.
Furthermore, the economic effects of large-scale maritime accidents may be limited to the short term, but the damages resulting from environmental pollution are long-term and difficult to estimate in scale. The issue of contaminated water leakage from nuclear power plants following the Great East Japan Earthquake continues to have a significant impact on neighboring countries even after 12 years, and the International Atomic Energy Agency (IAEA) has published reports examining the safety of wastewater disposal. In the case of the Ever Given grounding incident, although the incident lasted for about a week, concentrations of sulfur dioxide (SO2), a prominent pollutant, increased to up to five times the normal levels in the Mediterranean Sea as hundreds of vessels remained anchored for an extended period. Other ripple effects are still under investigation. Thus, ocean economy, as the foundation of various industries in many countries, has the potential to significantly impact both national industries and the global economy when it incurs damages. Among these, environmental damages are of particular concern due to their long-lasting and extensive consequences, highlighting the need for sustained attention.
Therefore, in this study, we aim to analyze the changing environmental risks that threaten the ocean economy from the past to the present and gain insights for the future direction. To achieve this, we collected 15 years of South Korean marine news text data and conducted an analysis using topic modeling.
South Korea’s ocean economy ranks 14th out of 32 domestic industries in terms of total production (as of 2019). It accounts for 71.5% of the size of the Korean automobile industry and is about 2.5 times the size of the agriculture and forestry industry [10]. In particular, the maritime freight transportation volume in South Korea accounts for 99.8% of the national total (as of 2020), amounting to 1.276 billion tons, playing a crucial role in the national economy [10]. Furthermore, the South Korean ocean economy has shown steady growth and maintains a highly competitive position globally. According to the report “The Leading Maritime Nations of the World” published by Menon Economics [11], which evaluates the maritime industry of each country and city using 24 indicators such as shipping, maritime finance and law, maritime technology, and ports and logistics, South Korea ranks 4th in the overall ranking along with Germany and Norway. Notably, it ranks 1st in the field of maritime technology, which has the highest competitiveness globally, and 4th in terms of volume handling [11].
Amidst the projected decrease in global economic growth rates by 0.5% to 0.9% compared to the previous year, the growth rate of South Korea’s economy is predicted to be around 1.5% [12]. The value-added growth rate of the South Korean ocean-based industry is forecasted to be 3.2%, a decrease of 0.7% compared to 2022. While the fisheries, ports, marine leisure tourism, and shipbuilding industries are expected to experience moderate growth, the value-added of the shipping industry is expected to decline due to worsening supply chain conditions caused by geopolitical and economic tensions between countries, global economic slowdown, and the spread of inflation leading to weak consumer demand. Through this study, a systematic analysis of the environmental risks faced by ocean-based industries can provide valuable implications for guiding future development towards a sustainable maritime economy. By identifying and assessing these risks, policymakers, industry stakeholders, and researchers can gain insights into the areas that require attention and intervention.
The remaining sections of this paper are structured as follows. In Section 2, a review of previous studies applying topic modeling to text data related to the marine fisheries field is presented. Section 3 provides an explanation of the topic modeling methodology, while Section 4 describes the data used in this study. The analysis results are summarized in Section 5, and the discussion and conclusion, including the implications of the research, are presented in Section 6.

2. Literature Review

Existing studies that have applied topic modeling to text data in the marine fisheries field include the following.
Rezende and Moretti [13] analyzed research on microplastics in marine and freshwater habitats using topic modeling. A total of 1681 research articles published between January 2010 and May 2021 were collected from the Scopus and Web of Science databases and 15 topics were identified. These topics include “effects on behavior and development of aquatic biota, pollution in freshwater, toxicity tests, ingestion and trophic transfer, adsorption capacity, pollution in urban waters, bioaccumulation in mussels, biofilm colonization, toxicity on aquatic biota, pollution in marine habitats, presence in biotic and abiotic compartments, isolation and quantification protocols, assessment of marine litter, occurrence and removal from wastewater, and transport and sinking behavior” [13].
Hay Mele et al. [14] conducted a study in which they categorized topics encompassing ecological and economic aspects in marine science. They grouped these topics within fields relevant to integrated coastal management (ICM) and examined the flow of knowledge between these fields using an information-flow network. The dataset they utilized included 583 papers from Isi-WoS and 5459 papers from Scopus. The identified representative topics comprised coastal resilience, ecological economics, integrated coastal management, marine ecology, marine/maritime economy, socio-ecological systems, network science, and topic modeling.
In a separate study, Otero et al. [15] employed Twitter data analysis to evaluate public interest in the impact of marine plastic pollution. They scraped a dataset of approximately 140,000 tweets specifically related to marine plastic pollution from Twitter. Their objective was to gain insights into the characteristics of users who tweet about this topic and examine how and when they engage in discussions. The researchers identified six optimal topics, including the impact on wildlife, microplastics and water pollution, estimates and reports, legislation and protection, and recycling and cleaning initiatives.
Tomojiri et al. [16] used topic modeling to analyze 2172 articles, published between 1975 and 2021, on anthropogenic marine debris (AMD). The study identified 50 topics, including “plastic pollution, spatiotemporal dynamics and distribution patterns of marine debris, and interdisciplinary or transdisciplinary research areas”. Keller and Wyles [17] conducted a study that examined the newspaper coverage of marine plastic pollution in four prominent online newspapers in the UK. Using structural topic modeling, the researchers analyzed 943 news articles published in 2019 that discussed marine plastics. These articles were categorized into 36 topics. The identified representative topics included plastic pollution, devices for cleaning the ocean and rivers, plastic straws, plastic bags, and the impact of microplastics on humans and animals. In another study, Yan et al. [18] developed a content-aware corpus-based model to analyze marine accidents. They utilized a dataset consisting of 207 investigation reports of ship collision accidents, which were published by the Maritime Safety Administration of China. For the purpose of topic modeling, they selected 40 topics, including hazards, causes of accidents, and accident scenarios.
Zhou et al. [19] used a text-mining approach to analyze sustainability disclosure for container shipping. The authors applied a hierarchical unsupervised text-mining method to 33 sustainability reports published between 2016 and 2019 from 12 listed container shipping companies. As a result, an integrated framework with three primary dimensions was developed: employee training and management, sustainable business management, and sustainable transportation operations. Each of these primary dimensions was further divided into three secondary sub-dimensions. Shin et al. [20] further explored issues related to sustainability issues in the maritime industry. The authors outlined their data collection and analysis process, which includes selecting 155 papers from SCI and SSCI indexed journals published between 1993 and 2017. They preprocessed the text data, generated a document-term matrix, applied a Latent Dirichlet Allocation (LDA) model to uncover hidden topics and patterns, and conducted a bibliometric analysis to visualize the landscape of sustainability research.
Hwang et al. [21] explored research trends and key issues in marine spatial planning (MSP) using topic modeling and bibliometric analysis. The authors analyzed 1726 articles related to MSP published between 2010 and 2020, identifying 29 topics that represent the intellectual structure of the literature. Additionally, the paper applied the policy readiness level (PRL) framework to examine changes in the core themes of MSP research over time. Jiang et al. [22] conducted a topic modeling analysis of 1726 papers related to hydropower. The authors examined studies published between 1994 and 2013 and established 29 topic models. A topic modeling study related to maritime leisure was also conducted. Sánchez-Franco and Rey-Moreno [23] applied a service and feature-oriented approach to explore the subjective experiences shared by Airbnb guests in their public reviews. Their processed dataset contains 73,557 reviews of Airbnb stays in both coastal and urban destinations between 2017 and 2020. In this study, community #10, which includes topics such as beaches, landscapes, litter, islands, and climate change, is related to maritime areas and is characterized by conservation and a focus on the future of the local community.
Others include maritime transport [24], mobility patterns from ship trajectories [25], ship motion patterns [26], COVID-19 [27], etc. Although there are several studies that have conducted topic modeling analyses of the environment in the marine fisheries sector, most of them are oriented to specific keywords such as microplastics and marine accidents, and there are not many studies that analyze overall trends in the marine environment. To understand the sustainability of the ocean economy, it is necessary to analyze trends and issues from a macro perspective.
According to Ortiz et al. [28], newspaper data are relatively easy to collect and are sometimes the only continuously available source of event data. While there are methods such as surveys or opinion polls to understand public opinions on social issues, they often require significant time and cost, and conducting them repeatedly over a period of time can make time series analysis difficult. On the other hand, news data are convenient for handling historical data and are suitable for research purposes due to their ability to provide objective and independent information based on specialized knowledge.

3. Topic Modeling

Topic modeling is a text mining technique that aims to discover abstract semantic “topics” within a collection of documents [29]. Essentially, it involves grouping words that are likely to belong to the same topic and assigning a topic based on the meaning of these word clusters. The underlying assumption is that words within a document have a high probability of being associated with the same topic, assuming each document has a single topic. This allows for determining the probability of a word belonging to a specific topic and the percentage of that topic within a document.
Various techniques can be used for topic modeling, including Latent Dirichlet Allocation (LDA), Latent Semantic Analysis (LSA), probabilistic LSA (pLSA), and Dirichlet Multinomial Regression (DMR) [29]. LSA constructs a semantic space based on a corpus and uses it to cluster words, sentences, paragraphs, and documents by comparing their similarities [30,31,32,33]. While LSA clusters based on word frequency, pLSA is a probabilistic model that considers the probability of word occurrence. It only relies on the document-term matrix and does not take into account the distribution of topics within a document. On the other hand, LDA considers both the distribution of topics across documents and the distribution of terms within topics. LDA is a Bayesian version of pLSA and employs a Dirichlet distribution with a conjugate prior [34]. DMR, an extension of LSA, incorporates the assumption that the hyperparameter α can vary based on document metadata, such as author, department, or region [35,36].
Currently, LDA is the most widely used topic modeling method in many studies. While some studies have compared LSA and LDA, no clear dominance of one method over the other has been established. LSA offers the advantage of intuitive analysis results based on term frequency, while LDA, as a probability-based model, can reveal new associations that may not be apparent from frequency alone. Given that this study aims to identify environmental and sustainability issues and trends in the marine fisheries sector, the LDA model was employed. The DMR model was excluded as the deviation of information from metadata is not substantial in the case of news data.
The LDA technique is illustrated in Figure 1.
The LDA algorithm employs a process of inferring hidden variables from observed variables in a document to uncover concealed topics [29]. In Figure 1, the observed variable corresponds to a word ( W d , n ). Then, there are hyper parameters α and   η , and hidden parameter β k , which are used to extract the word. Finally, there are hidden variables Z d , n and W d , n that are not directly observed in the document and that we want to infer through the LDA model. In the LDA model, Z d , n is generated from θ d , which is the ratio of topics by document whose value follows the Dirichlet prior weight determined by the value α . Similarly, β k , representing the probability of a word being generated by a topic, is determined by the parameter η . The Dirichlet prior distribution of β k is shaped by η . The word W d ,   n is consequently identified through the combination of Z d , n , which indicates the topic assignment for each word, and β k , representing the word-topic ratio.
The LDA algorithm is summarized as follows (Table 1). In this study, 10 topics were selected and analyzed.

4. Data Description

The news text data utilized in this study were collected through the BigKinds news big data analysis service [37]. BigKinds evolved from the Korea Integrated News Database System (KINDS), a service that allows storage and retrieval of news from broadcasts and major daily newspapers since 1990. It has incorporated big data analysis technology since April 2016. BigKinds offers a database capable of storing, searching, and processing over 10 million news articles from 54 media outlets in South Korea, including 11 national dailies, 8 economic dailies, 28 regional dailies, 5 broadcasting stations, and 2 professional magazines. Additionally, it provides features such as data clustering, keyword analysis, word cloud and network visualization, and information extraction capabilities [38,39].
For this study, we collected and analyzed 50,213 news articles from the 11 national daily newspapers provided by BigKinds over the past 15 years (from 1 January 2008 to 31 December 2022). According to the research objective of analyzing the ocean economy at the national level and focusing on broader issues instead of specific regional concerns, only the aforementioned 11 national daily newspapers were included in the analysis to ensure the homogeneity of the text data. The representative keywords, to search and collect news data, used in the field of marine and fisheries, are: ‘Ocean, Maritime, Fishery, Port, Shipping’. These search terms were selected based on the organizational classification system of the Ministry of Oceans and Fisheries of South Korea, which includes the Marine Policy Office, Fisheries Policy Office, Shipping and Logistics Bureau, and Ports and Harbours Bureau. To search for news data related to the ocean environment, the following terms were utilized: ‘Environment, Biology, Resource, Ecosystem, Climate, Carbon, Mortality, Epidemic, Infection, Hygiene, Trash, Microplastic, Disease, Contaminator, Habitat, Wastewater, Noise’. These terms were selected by extracting general nouns from 215 text-based reports (from 2008 to 2023) related to marine policy and then identifying the top words based on term frequency that are relevant to the environment. The selection of search terms was conducted by researchers affiliated with the Korea Maritime Institute (KMI), a government-funded research institution established for the purpose of formulating marine and fisheries policies.
The number of collected news articles by year is shown in Figure 2. As can be seen from the figure, environmental issues in the marine and fisheries sector have shown a steady increase since 2008 and have experienced significant growth, particularly after 2017. This trend is interpreted to be attributable to the multitude of policies and plans related to the ocean environment announced by the International Maritime Organization (IMO) and the Ministry of Ocean and Fisheries of South Korea since 2017. In 2018, the IMO unveiled the Initial IMO GHG Strategy on the reduction of GHG (greenhouse gas) emissions from ships, setting out a vision which confirms IMO’s commitment to reducing GHG emissions from international shipping and to phasing them out as soon as possible. Likewise, the South Korean government also released the 1st Climate Change Response Plan in December 2016, followed by the 2030 Greenhouse Gas Reduction Roadmap in 2018, both serving as foundational elements for their climate change mitigation efforts. With the establishment of these mid-to-long-term environmental policies, a noticeable surge in interest in the ocean environment has been observed.
Subsequently, the preprocessing steps were performed on the collected data. Following the typical text data preprocessing steps, we first conducted tokenization based on whitespace as the delimiter. Next, part-of-speech analysis was performed to extract only the words corresponding to common nouns. During this process, we applied normalization techniques such as stemming and lemmatization. Subsequently, we removed words with a length of one character or less, as well as specific local names, personal names, positions, titles, and other words that excessively possess generic meanings (e.g., world, people, process, start, manage). Additionally, since it is self-evident that the keywords used for data collection (Ocean, Marine, Maritime, Fishery, Port, Shipping) would frequently appear and contribute little to the interpretation of the results, we also removed these words from the dataset.

5. Results

5.1. Results of Word Clouds

Figure 3 presents word cloud images based on term frequency, organized by year. The word clouds depict the 15-year period from 2008 to 2022, encompassing the entire data collection phase. By observing these visual representations, a general overview of the prevalent environmental issues in the marine and fisheries sector can be obtained.
From 2009 to 2012, keywords such as ‘Construction, Facility, Inspection, Creation, Energy’ were prominently identified. This period was characterized by extensive efforts to construct large-scale facilities and complexes related with ‘tourism, R&D, fishery processing, logistics warehouses’. It was a time before the establishment of a clear national policy focusing on marine environmental issues. Particularly noteworthy is the steady increase in the number of foreign tourists, from 6.89 million in 2008 to 11.14 million in 2012, which fueled the demand for the development of tourism resources, including ‘Ecological Experiences and Cruises’ [40]. Furthermore, the increase in per capita annual seafood consumption from 36.7 kg in the early 2000 s to 54.2 kg in 2012 reflects the growth in infrastructure facilities for seafood production, processing, and distribution [41].
From 2013 to 2017, the keyword ‘Safety’ dominated as a key term. This can be attributed to the aftermath of the 2011 Great East Japan Earthquake (Fukushima nuclear accident) and the Sewol ferry disaster in 2014. When searching for keywords such as ‘Fukushima’ and ‘Marine, Fisheries, Port, Shipping’, it was found that the number of news articles reported was 293 in 2011, followed by 43 articles in 2012, 361 articles in 2013, 81 articles in 2014, 69 articles in 2015, and 39 articles in 2016. In other words, more news coverage was observed in 2013 compared to the year of the accidents in 2011. This can be attributed to the release of reports such as the UNSCEAR 2013 Report [42] investigating the cause and impact of the accidents, which brought them back into the spotlight as social issues in neighboring countries. Particularly, in 2013, the official report confirming the leakage of contaminated water from the Fukushima Daiichi Nuclear Power Plant’s cooling water tank into the ocean had a significant impact. Furthermore, the Sewol ferry sinking on 16 April 2014, which resulted in the deaths of 304 passengers, was a major tragedy in South Korean history and led to a surge in public interest in marine safety accidents for a prolonged period. Since the Sewol incident, there has been increased media exposure and attention to both significant and minor maritime incidents and accidents that previously received less attention. This trend can be interpreted as a reflection of the increased media coverage and public awareness of maritime safety issues following the Sewol incident.
Starting from 2018, keywords such as ‘Trash, Plastic, Contaminator, Discharge’ became prominent. This period marked a time when South Korea, following international trends, became increasingly interested in creating a sustainable marine environment and formulating relevant policies. This can be observed in the Korean-Sustainable Development Goals (K-SDGs) developed in 2019, which were based on the United Nations’ Sustainable Development Goals (SDGs) announced in 2015 [43]. The SDGs encompass 17 goals to be achieved by 2030 for global sustainable development, and the 14th goal specifically expresses the intent to ‘Conserve and sustainably use the oceans, seas, and marine resources for sustainable development’. The five sub-tasks under this goal include ‘Plastic/Marine Pollution, Over-fishing, Acidification, Eutrophication, Ocean Warming’. The K-SDGs align with these objectives and establish seven goals, including ‘Establish marine pollution management system, Manage ecological environment and fishery resources habitat, Minimize marine acidification, Sustainable fishery resources and its use, Designate coastal and marine protection areas, Expand R&D efforts on marine science’ [44].
Lastly, in 2022, keywords such as ‘Ecosystem, Cooperation, Establishment’ indicate a shift from directly addressing factors that pose immediate threats to the environment, such as ‘Trash, Plastic, Contaminator’, towards emphasizing intergovernmental and intercorporate cooperation for the sustainable conservation of marine ecosystems.

5.2. Results of Topic Modeling

Table 2 below presents the titles and descriptions of the 10 topics identified in the topic modeling analysis. The distribution of topic proportions is shown in Figure 4, where Topic 4 (Carbon Neutral, Hydrogen Energy) has the highest share of 18%, followed by Topic 7 (Cruise Tourism Development and Smartization) in second place, and Topics 5 (Micro Plastic, Climate Change) and 10 (Diplomatic Cooperation Between Countries) with proportions of 13% each.
Topic 4 has the largest proportion, which can be interpreted as a result of the increasing interest in marine environment as time goes by. Recent news articles have a relatively higher proportion in the overall dataset, and among them, carbon neutrality and hydrogen energy emerge as the main issues. In particular, greenhouse gas emissions from large vessels have been recognized as a serious problem, reaching approximately 1.1 billion tons annually. In response to this, the International Maritime Organization (IMO) has set a goal to reduce ship carbon emissions by 50% compared to 2008 levels by 2050, with the motto “safe, secure and efficient shipping on clean oceans”. It is expected that this goal will be further revised upwards to 100% at the 80th Marine Environment Protection Committee (MEPC) meeting in July 2023. Consequently, countries worldwide are making efforts to introduce environmentally friendly ships using hydrogen, ammonia fuel cells, and other technologies, or to develop new technologies in this field.
Table 3 presents the probabilistic distribution of words associated with each topic. Topic modeling provides a set of keywords with a high probability of being grouped together within the same topic, as well as the proportion of topics within the documents. Therefore, to interpret the results and derive implications, an additional exploration of the specific content related to each topic in the news articles was conducted.
The initial topic, namely “Sustainable Fisheries and Aquaculture Management”, encompasses instances of abandoned fishing grounds, volatile supply and demand of fishery products, fluctuations in production and prices of fishery products, hygiene concerns regarding fishery products, biofouling, and the aging of fishing grounds and vessels. The second topic, labeled “Accident, Calamity, Disaster Response”, incorporates instances of ship-based oil spills, underwater earthquakes and typhoons, damage to logistics infrastructure, and disruptions in supply chains due to natural disasters. Topic 3, referred to as “Polar Environment”, addresses issues related to environmental degradation in polar regions and transformations in the Arctic shipping environment, while Topic 4, denoted as “Carbon Neutral, Hydrogen Energy”, focuses on matters concerning carbon emissions stemming from ship operations, the development of marine energy sources, and policies aimed at curtailing carbon emissions.
The subsequent topic, designated as “Microplastic, Climate Change (Topic 5)”, encompasses diverse concerns pertaining to the escalation of microplastic waste, global warming, ocean acidification resulting in pH reduction, detrimental effects on fisheries due to climate change, and corresponding response measures. “Tidal Flats, Habitat Ecosystems (Topic 6)” encompasses content related to the destruction and disruption of ecosystems, habitat damage, and the introduction of non-indigenous marine species. The seventh topic, known as “Cruise Tourism Development and Smartization”, addresses issues such as ecosystem damage resulting from tourism infrastructure development, decline in cruise tourism, and job loss due to the implementation of smart technologies.
The eighth topic, titled “Japanese Nuclear Power Plant Pollution”, predominantly explores the issue of pollution arising from Japanese nuclear power plants and its particular relevance to Korea’s geographical characteristics. It includes problems of marine pollution resulting from various wastewater sources. The ninth topic highlights the damages incurred by infectious diseases, with COVID-19 being a representative case. It verifies the emergence of marine economic risks, such as port closures or reduced operations, and alterations in consumption patterns attributable to infectious diseases. Lastly, “Diplomatic Cooperation Between Countries” reflects concerns surrounding carbon reduction, international cooperation for environmental, social, and governance (ESG) management, and conflicts between nations over fishing rights or territorial sovereignty. This underscores the necessity of global collaborative endeavors through diplomatic cooperation to foster the growth of the marine economy, surpassing the limitations of individual national efforts.

5.3. Analysis of Yearly Trends

Following the topic modeling results, a trend analysis was conducted over time. Firstly, the overall trend was assessed by visualizing the data using graphs for the entire period. Subsequently, a quantitative analysis was performed using a linear regression model. In their work, Sun and Yin [45] collected transportation research data from 1991 to 2015 and analyzed it through topic modeling. Sun and Yin [45] defined an index r k using the proportion of topic k in each journal article, denoted as θ k t . Based on this, if the value of r k was less than 1, it was considered a “hot topic”, whereas if it was greater than 1, it was considered a “cold topic”.
r k = t = 1991 1995 θ k t t = 2011 2015 θ k t  
However, using a simple arithmetic mean, such as r k , for analysis has limitations in reflecting the overall trend of the research. Therefore, in this study, a linear regression model was constructed using the method proposed by Griffiths and Steyvers [46] to classify hot and cold topics based on significant regression coefficients. The independent variables were divided into a total of 15 intervals from 2008 to 2022, and the dependent variable was set as the proportion of each topic based on the corresponding year (Figure 5). Significant topics were selected at a p-value of 0.5% and categorized as hot topics if the regression coefficient was positive (+) or cold topics if it was negative (−) (Table 4).
According to the trend analysis conducted using the linear regression model, six topics were classified as hot topics due to their significant increasing trends: 1. Sustainable Fisheries and Aquaculture Management; 4. Carbon Neutral, Hydrogen Energy; 5. Microplastic, Climate Change; 7. Cruise Tourism Development and Smartization; 9. COVID-19; and 10. Diplomatic Cooperation Between Countries. Only one topic, 2. Accident, Calamity, Disaster Response, was classified as a cold topic. This indicates a decline in interest regarding accidents, calamities, and disasters, while environmental issues such as microplastic pollution, carbon reduction, and climate change, as well as diplomatic cooperation between countries, have emerged as global trends.
In particular, in the case of Korea, the lack of large-scale oil spill incidents since the Taean oil spill in 2007, the relatively small scale of recent oil spill accidents, the absence of major maritime accidents since the Sewol ferry disaster in 2014, and the scarcity of natural disasters related to the sea have contributed to the decreased interest in Topic 2 (Accident, Calamity, Disaster Response). However, globally, the frequency of natural disasters and disasters caused by climate change is increasing, emphasizing the need for continuous attention and measures to address these issues.
It Is important to note that the classification of hot and cold topics based on trend analysis does not imply the complete disappearance or lack of importance of certain topics. Rather, it reflects the shifting focus and increasing prominence of specific issues over time.

5.4. Text Network Analysis

To obtain additional implications, network analysis was conducted. Text network analysis is a technique that constructs and analyzes networks based on the relationships between words appearing in texts, drawing on social network analysis [47]. Social network analysis encompasses the process of quantifying, statistically analyzing, and visualizing relationships among individual actors, such as individuals, organizations, companies, and groups, in a network, using nodes and edges [48]. In text network analysis, the co-occurrence matrix of keywords serves as an adjacency matrix to build the network.
In text network analysis, various centrality measures, such as degree centrality, closeness centrality, betweenness centrality, and eigenvector centrality, can be applied to the constructed network for analysis. Eigenvector centrality, in particular, defines the centrality value of each node as the corresponding elements of the eigenvector that corresponds to the largest eigenvalue of the adjacency matrix when representing the network [49]. In this study, utilizing the results of the topic modeling analysis conducted earlier, a network was generated, and eigenvector centrality was measured. Among the top three keywords extracted for each topic, a total of 30 keywords (nodes) were selected (excluding ‘cooperation’, ‘nation’, and ‘Japan’ due to their overly general meanings). The weight of the edge was determined by the number of newspapers in which two different keywords co-occurred. To compare differences based on time periods, the entire data period of 15 years (2008–2023) was divided into three groups of five years each for analysis. The results of eigenvector centrality analysis are presented in Table 5, and the network adjacency matrix is included in the Appendix A.
The analysis results show that the keywords ‘Contaminator, Accident, Fishery, Foreshore, Energy’ consistently had high centrality throughout the entire period. These keywords are among the most central and highly related to various issues and topics related to the marine economy. Their continued presence over the past 15 years suggests the need for sustained attention to ensure the sustainability of the marine economy. Particularly, the keyword ‘Accident’ showed a decrease in centrality ranking from second to seventh place in the most recent period of 2018–2022. This decrease can be attributed to a decrease in the number of major marine accidents. However, policies and financial support for continuous accident prevention and safety measures should still be maintained.
Furthermore, in the period of 2008–2012, the keyword ‘Climate’, which was highly ranked in centrality measures, was replaced by ‘Aquatic products’ in the period of 2013–2017. In the subsequent period of 2018–2022, ‘Climate’ and ‘Carbon’ entered the top ranks instead of ‘Aquatic products’ and ‘Tourism’. These results align with the yearly trend analysis conducted in the previous topic modeling analysis. The emergence of ‘Aquatic products’ reflects the impact of reports on the Fukushima nuclear accident and subsequent issues related to the release of contaminated water, which have been ongoing since 2013, two years after the earthquake in Japan in 2011. The inclusion of ‘Climate’ and ‘Carbon’ keywords in the top ranks demonstrates the increasing importance of greenhouse gas-induced climate change and carbon neutrality goals, particularly among various environmental issues related to the ocean. This trend is expected to continue to develop as a significant topic.
Lastly, the keywords ‘Smart, Micro, Plastic’ demonstrate a consistent upward trend, despite being positioned in the lower to middle ranks of centrality measures. This observation indicates a growing interest in introducing smart technologies to address the environmental issues of microplastics and micro-dust in the ocean economy. Smartization is rapidly progressing across various sectors of the ocean-based industry, including the intelligent transformation of maritime transportation and logistics services, the establishment of sustainable and eco-friendly smart fisheries management systems, the implementation of smart monitoring and response systems for disaster and safety management, and the adoption of IoT devices to facilitate ocean environmental management systems. Particularly, microplastics are identified as a major cause of ocean environmental pollution, driving vigorous technological developments for detection, prevention of inflow, collection, treatment, and hazard assessment. As such, it is anticipated that these efforts will continue to expand in the future.

6. Discussion and Conclusions

In this study, news text data were collected using keywords related to ocean economy as well as the environment. The data were then analyzed using the LDA (Latent Dirichlet Allocation) technique, one of the topic modeling methods, to examine the keyword trends over time based on term frequency. Furthermore, for the 10 resulting topics, significant increasing or decreasing trends over time were identified. Text network analysis was conducted based on the co-occurrence frequency of keywords within each topic to measure the centrality of keywords. Through this research, a systematic analysis of the environmental risks faced by ocean-based industries was carried out, and implications for the future development of sustainable marine economy were obtained.
The topics of Aquaculture, Carbon Neutrality, Microplastic, and Climate Change highlight the environmental issues that span the entire marine economy. Resolving environmental risks is crucial for the future development of the marine economy. Topics such as Climate Change and Carbon Neutrality are shared challenges for all industries based on the ocean, including fisheries, shipping, and ports. Microplastic and Aquaculture represent tasks that must be pursued for sustainable fisheries. Moreover, these topics align with the Sustainable Development Goals (SDGs) announced by the United Nations and the specific items of Korea-SDGs. Carbon Neutrality, in particular, is a prominent keyword in Korea’s marine economy policy, leading to various carbon neutrality measures such as the development of low-carbon and zero-carbon ship technologies in the shipping and logistics sector, the transition from aging fishing vessels to low-carbon vessels, the introduction of low-carbon certification for aquaculture products, and the reduction of greenhouse gas emissions through the recycling of marine waste.
Another hot topic, Cruise Tourism Development and Smartization, represents the cruise industry as a key sector in marine tourism and the trend of smartization in the shipping and port sectors. The cruise industry has consistently shown growth as a prominent industry in the marine economy before the COVID-19 pandemic. However, the significant decrease in global cruise tourists from 29.67 million in 2019 to 5.77 million in 2020 indicates that the cruise industry was severely impacted during the COVID-19 period [50]. The selection of Cruise Tourism Development as a hot topic emphasizes the necessity of rebuilding the cruise industry for the continued growth of the marine economy. Smartization, on the other hand, is a crucial topic in the shipping and port industry, encompassing the automation and digitization of ship operations and port handling processes, as well as environmentally friendly practices through carbon neutrality and fuel conversion. While job reduction is a noted drawback of smartization, it is considered a necessary topic for securing the sustainability of the marine economy in the future.
The final hot topic, “Diplomatic Cooperation Between Countries”, demonstrates that global, cross-border cooperation is essential for achieving goals of environmental protection and sustainable growth. It also highlights that such cooperation is the most effective and only viable approach to addressing environmental issues in the marine and fisheries sector. The topics derived from this study indicate that in order to promote the development of the marine economy, responding to environmental issues and ensuring sustainability are paramount across various ocean-based industries such as fisheries, shipping, and ports. Carbon neutrality, which emerged as a crucial keyword, is not only important for South Korea but for all countries with a significant stake in the marine economy.
As mentioned in the introduction, the marine economy holds great importance, not only for the national economy of South Korea, but also in the global economy. However, recent events such as the COVID-19 pandemic, the Ukraine–Russia conflict, and trade frictions in the era of the new Cold War have posed significant risks to the sustainability of the marine economy. These risks include disruptions in global supply chains, increased volatility in water traffic, congestion and delays, rising shipping costs, and prolonged inflation. Notably, the global economic growth rate decreased by over 3.1% in 2020, and world trade volume declined by over 8.2% compared to the previous year (IMF, 2022).
These low-probability, high-impact problems can only be addressed through short-term response measures and events such as vaccine development or conflict resolution. While predicting the occurrence of such events is practically impossible, strengthening proactive prediction capabilities and formulating response strategies can minimize the adverse effects of extreme events and enhance the resilience of future societies in the face of uncertainties. Particularly, the COVID-19 pandemic has revealed significant challenges in global pandemic response capacities. Pandemics not only affect the health of living organisms but also cause extensive damage across all sectors of the marine economy, including seafood production, the cruise industry, port logistics, and maritime transportation. As the pandemic is being declared under control, it is necessary to reevaluate and thoroughly examine pandemic response measures spanning the entire ocean-based industry.
Furthermore, accidents and natural disasters at sea, such as oil spills, earthquakes, and typhoons, are difficult to predict, but efforts must be continually made to minimize damages. According to the results of this study, topics related to accidents, disasters, and hazards exhibited a continuous declining trend, except during the years when these events occurred. Although the volume of oil spills has decreased, the number of oil spill incidents remains at a similar level each year, and the frequency of natural disasters continues to rise steadily. In light of these findings, proactive preventive measures, system establishment, and regular inspections are necessary to prevent large-scale accidents and disasters.
In addition, environmental issues such as microplastic pollution, climate change, and carbon neutrality pose challenges that cannot be easily resolved without sustained execution of solutions through diplomatic cooperation between nations, despite their immediate impacts not being significant. The Korean government has implemented various policies for achieving carbon neutrality in the marine economy. However, due to the nature of the marine economy, the efforts of a single country alone may not yield substantial results. Hence, international environmental problems with transnational and interconnected characteristics require collaborative efforts from members of the international community, including governments, local authorities, and NGOs. It is necessary to share policies and establish unified policies among all countries with a significant stake in the marine economy.
The data analyzed in this study are confined to “South Korea” and “news”, which presents certain limitations. As mentioned in the introduction, the scale and technological capabilities of South Korea’s marine economy place it as one of the top five globally, making it a suitable research subject. However, there is a limitation in not being able to comprehensively analyze global news data due to language constraints. Considering the nature of the marine economy, it would be beneficial to conduct research targeting the global environment using expanded data sources in the future. Additionally, the study did not consider the diverse media platforms available through various social networking services such as YouTube, Facebook, and Instagram. These social platforms contain data that are more public-friendly compared to news data. Therefore, they can be utilized for research focusing on ideas development and direction setting for addressing environmental issues faced at the individual or small-scale community level, rather than macro-level and national policymaking. In particular, YouTube data, when combined with quantitative and statistical analysis involving metrics such as views, subscribers, number of videos, and video length, hold promise for conducting comprehensive research.

Author Contributions

Methodology, H.J.K.; Formal analysis, H.J.K.; Investigation, C.K. (Chanho Kim); Resources, S.K.; Data curation, S.K.; Writing—original draft, H.J.K.; Writing—review & editing, C.K. (Changhee Kim); Supervision, C.K. (Changhee Kim); Project administration, C.K. (Chanho Kim); Funding acquisition, C.K. (Changhee Kim). All authors have read and agreed to the published version of the manuscript.

Funding

This work was suported by Incheon National University Research Grant in 2023.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Co-occurrence matrix of keywords between 2008–2012.
Table A1. Co-occurrence matrix of keywords between 2008–2012.
KeywordsW1W2W3W4W5W6W7W8W9W10W11W12W13W14W15W16W17W18W19W20W21W22W23W24W25W26W27
W1Fishery 328296184624199102181442514175309867024511222214702529117
W2Aquaculture 137651225556110189451302930663791150181217
W3Aquatic products 1172518334253952682321769421536007317
W4Accident 1142930271314419891695224109188191366420506443
W5Calamity 1543860612245432316319632702226
W6Disaster 1021020224910042021120006
W7Antarctica 975757142268425341110122360117
W8North Pole 675813151390462812112033110128
W9Glacier 3617121175261286402100019
W10Energy 16767334836518478212382439197114164119
W11Carbon 17529030251040716391004020
W12Hydrogen 7219530102211110001
W13Plastic 16102317412123480122
W14Micro 20171349112950345
W15Climate 232733416930221094415564
W16Foreshore 140721862682776003544
W17Ecosystem 71753414915010524
W18Habitat 46725330523
W19Tourism 83311091315471
W20Cruise 41230005
W21Smart 1480417
W22Contaminator 2010241348
W23Nuclear power plant 01127
W24Corona 000
W25Virus 541
W26Infection 5
W27Diplomacy
Table A2. Co-occurrence matrix of keywords between 2013–2017.
Table A2. Co-occurrence matrix of keywords between 2013–2017.
KeywordsW1W2W3W4W5W6W7W8W9W10W11W12W13W14W15W16W17W18W19W20W21W22W23W24W25W26W27
W1Fishery 524507423166155405082812345244200501136114405487638610404672181
W2Aquaculture 2045920179133745121205419139248073197310212613
W3Aquatic products 1891429141635242181341150493010610202681470162232
W4Accident 301363232910154246363968220743110926356722982181681
W5Calamity 17653511251324674617010663511955031049
W6Disaster 67153635121747112462131054313642
W7Antarctica 79624614661683394118101235501312
W8North Pole 5858144111391453028153145402221
W9Glacier 4112458752824813511810225
W10Energy 1213434662619284222012510115794145124
W11Carbon 138248823381242219341403112
W12Hydrogen 68145517151390113
W13Plastic 5915324010110710380534
W14Micro 45253931314118151118
W15Climate 112894312715311103101711101
W16Foreshore 117117143281526359181433
W17Ecosystem 7676911139130101316
W18Habitat 4750618016163
W19Tourism 995811439191688
W20Cruise 413301211
W21Smart 262402218
W22Contaminator 2961333751
W23Nuclear power plant 05235
W24Corona 110
W25Virus 974
W26Infection 7
W27Diplomacy
Table A3. Co-occurrence matrix of keywords between 2018–2022.
Table A3. Co-occurrence matrix of keywords between 2018–2022.
KeywordsW1W2W3W4W5W6W7W8W9W10W11W12W13W14W15W16W17W18W19W20W21W22W23W24W25W26W27
W1Fishery0771779375211304561254132358922217443867221221549742401475179179182193218
W2Aquaculture0035561447101810118482372401291966657126122321683040362918
W3Aquatic products0002263015713474603194469621974471341012234733481576284
W4Accident000020411019311519910171135891702951435610118109775612685061216
W5Calamity000003821262418794334511323158431512541021575294515366
W6Disaster000000014195413028281311547442467521
W7Antarctica000000013512860512412816848866923145321011712
W8North Pole00000000133113664535123552714622101078611181826
W9Glacier00000000081827322522349663315434982116107
W10Energy000000000080743131123183518123590380283833892341948241189
W11Carbon00000000000302267115746175189781658193283142152392187
W12Hydrogen0000000000004167125415031068168110100374628
W13Plastic0000000000000400314218224925347079127133796332
W14Micro0000000000000021310311015869985514466582334
W15Climate000000000000000287349203251211664101911999060253
W16Foreshore0000000000000000197248209298646317135242880
W17Ecosystem00000000000000000200159117129210368553042
W18Habitat00000000000000000011771215173228247
W19Tourism0000000000000000000101313134541834053126
W20Cruise00000000000000000000261871861611
W21Smart00000000000000000000011060112332258
W22Contaminator0000000000000000000000577155135114272
W23Nuclear power plant0000000000000000000000035710324
W24Corona00000000000000000000000032926781
W25Virus000000000000000000000000042337
W26Infection0000000000000000000000000041
W27Diplomacy000000000000000000000000000

References

  1. OECD. The Ocean Economy in 2030; OECD: Paris, France, 2016. [Google Scholar]
  2. OECD. Sustainable Ocean for All: Harnessing the Benefits of Sustainable Ocean Economies for Developing Countries, The Development Dimension; OECD: Paris, France, 2020. [Google Scholar] [CrossRef]
  3. KMI. Korea’s Ocean Economy 2020; Korea Maritime Institute: Busan, Republic of Korea, 2020. [Google Scholar]
  4. IMF. (2022, Jan). World Economic Outlook; IMF: Washington, DC, USA, 2022. [Google Scholar]
  5. Sumaila, U.R.; Walsh, M.; Hoareau, K.; Cox, A.; Teh, L.; Abdallah, P.; Akpalu, W.; Anna, Z.; Benzaken, D.; Crona, B.; et al. Financing a sustainable ocean economy. Nat. Commun. 2021, 12, 3259. [Google Scholar] [CrossRef]
  6. Bennett, N.J.; Blythe, J.; White, C.S.; Campero, C. Blue growth and blue justice: Ten risks and solutions for the ocean economy. Mar. Policy 2021, 125, 104387. [Google Scholar] [CrossRef]
  7. Arora, N.K.; Mishra, I. Ocean sustainability: Essential for blue planet. Environ. Sustain. 2020, 3, 1–3. [Google Scholar] [CrossRef] [Green Version]
  8. Johnson, R. Japan’s 2011 Earthquake and Tsunami: Food and Agriculture Implications; Congressional Research Service: Washington, DC, USA, 2011; p. 18. [Google Scholar]
  9. Kamis, A.S.; Fuad, A.F.A.; Anwar, A.Q.; Monir Hossain, M. A systematic scoping review on ship accidents due to off-track manoeuvring: A systematic scoping review on ship accidents due to off-track manoeuvring. WMU J. Marit. Aff. 2022, 21, 453–492. [Google Scholar] [CrossRef]
  10. KMI. Korea’s Ocean Economy 2021; Korea Maritime Institute: Busan, Republic of Korea, 2021. [Google Scholar]
  11. Menon Economics; DNV GL. The Leading Maritime Capitals of the World; Menon Economics: Oslo, Norway, 2018. [Google Scholar]
  12. IMF. (2023, Apr). World Economic Outlook; IMF: Washington, DC, USA, 2023. [Google Scholar]
  13. Rezende, J.D.; Moretti, M.S. An overview of microplastic research in marine and freshwater habitats using topic modeling. Hydrobiologia 2023, 850, 1413–1426. [Google Scholar] [CrossRef]
  14. Hay Mele, B.; Russo, L.; D’Alelio, D. Combining marine ecology and economy to roadmap the integrated coastal management: A systematic literature review. Sustainability 2019, 11, 4393. [Google Scholar] [CrossRef] [Green Version]
  15. Otero, P.; Gago, J.; Quintas, P. Twitter data analysis to assess the interest of citizens on the impact of marine plastic pollution. Mar. Pollut. Bull. 2021, 170, 112620. [Google Scholar] [CrossRef]
  16. Tomojiri, D.; Takaya, K.; Ise, T. Temporal trends and spatial distribution of research topics in anthropogenic marine debris study: Topic modelling using latent Dirichlet allocation. Mar. Pollut. Bull. 2022, 182, 113917. [Google Scholar] [CrossRef]
  17. Keller, E.; Wyles, K.J. Straws, seals, and supermarkets: Topics in the newspaper coverage of marine plastic pollution. Mar. Pollut. Bull. 2021, 166, 112211. [Google Scholar] [CrossRef] [PubMed]
  18. Yan, K.; Wang, Y.; Jia, L.; Wang, W.; Liu, S.; Geng, Y. A content-aware corpus-based model for analysis of marine accidents. Accid. Anal. Prev. 2023, 184, 106991. [Google Scholar] [CrossRef]
  19. Zhou, Y.; Wang, X.; Yuen, K.F. Sustainability disclosure for container shipping: A text-mining approach. Transp. Policy 2021, 110, 465–477. [Google Scholar] [CrossRef]
  20. Shin, S.H.; Kwon, O.K.; Ruan, X.; Chhetri, P.; Lee, P.T.W.; Shahparvari, S. Analyzing sustainability literature in maritime studies with text mining. Sustainability 2018, 10, 3522. [Google Scholar] [CrossRef] [Green Version]
  21. Hwang, K.W.; Jang, A.R.; Lee, M.S. A Study on the Trends in the Studies on Marine Spatial Planning: Focusing on Topic Modeling. J. Korean Soc. Mar. Environ. Saf. 2021, 27, 954–966. [Google Scholar] [CrossRef]
  22. Jiang, H.; Qiang, M.; Lin, P. A topic modeling based bibliometric exploration of hydropower research. Renew. Sustain. Energy Rev. 2016, 57, 226–237. [Google Scholar] [CrossRef]
  23. Sánchez-Franco, M.J.; Rey-Moreno, M. Do travelers’ reviews depend on the destination? An analysis in coastal and urban peer-to-peer lodgings. Psychol. Mark. 2022, 39, 441–459. [Google Scholar] [CrossRef]
  24. Bai, X.; Zhang, X.; Li, K.X.; Zhou, Y.; Yuen, K.F. Research topics and trends in the maritime transport: A structural topic model. Transp. Policy 2021, 102, 11–24. [Google Scholar] [CrossRef]
  25. Huang, L.; Wen, Y.; Guo, W.; Zhu, X.; Zhou, C.; Zhang, F.; Zhu, M. Mobility pattern analysis of ship trajectories based on semantic transformation and topic model. Ocean. Eng. 2020, 201, 107092. [Google Scholar] [CrossRef]
  26. Li, G.; Liu, M.; Zhang, X.; Wang, C.; Lai, K.H.; Qian, W. Semantic Recognition of Ship Motion Patterns Entering and Leaving Port Based on Topic Model. J. Mar. Sci. Eng. 2022, 10, 2012. [Google Scholar] [CrossRef]
  27. Kang, S.; Kim, S. Lessons Learned from Topic Modeling Analysis of COVID-19 News to Enrich Statistics Education in Korea. Sustainability 2022, 14, 3240. [Google Scholar] [CrossRef]
  28. Ortiz, D.; Myers, D.; Walls, E.; Diaz, M.E. Where do we stand with newspaper data? Mobilization Int. Q. 2005, 10, 397–419. [Google Scholar] [CrossRef]
  29. Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent dirichlet allocation. J. Mach. Learn. Res. 2003, 3, 993–1022. [Google Scholar]
  30. Deerwester, S.; Dumais, S.T.; Furnas, G.W.; Landauer, T.K.; Harshman, R. Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 1990, 41, 391–407. [Google Scholar] [CrossRef]
  31. Foltz, P.W.; Kintsch, W.; Landauer, T.K. The measurement of textual coherence with latent semantic analysis. Discourse Process. 1998, 25, 285–307. [Google Scholar] [CrossRef]
  32. Landauer, T.K.; Dumais, S.T. A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychol. Rev. 1997, 104, 211. [Google Scholar] [CrossRef]
  33. Landauer, T.K.; Foltz, P.W.; Laham, D. An introduction to latent semantic analysis. Discourse Process. 1998, 25, 259–284. [Google Scholar] [CrossRef]
  34. Liu, L.; Tang, L.; Dong, W.; Yao, S.; Zhou, W. An overview of topic modeling and its current applications in bioinformatics. SpringePlus 2016, 5, 1608. [Google Scholar] [CrossRef] [Green Version]
  35. Lee, S.; Song, J.; Kim, Y. An empirical comparison of four text mining methods. J. Comput. Inf. Syst. 2010, 51, 1–10. [Google Scholar]
  36. Bergamaschi, S.; Po, L. Comparing LDA and LSA topic models for content-based movie recommendation systems. In Web Information Systems and Technologies, Proceedings of the 10th International Conference, WEBIST 2014, Barcelona, Spain, 3–5 April 2014; Revised Selected Papers 10; Springer International Publishing: Cham, Switzerland, 2015; pp. 247–263. [Google Scholar]
  37. Available online: https://www.bigkinds.or.kr/ (accessed on 5 June 2023).
  38. Ko, J.; Paek, S.; Park, S.; Park, J. A news big data analysis of issues in higher education in Korea amid the COVID-19 pandemic. Sustainability 2021, 13, 7347. [Google Scholar] [CrossRef]
  39. Jung, J.H.; Shin, J.I. Big data analysis of media reports related to COVID-19. Int. J. Environ. Res. Public Health 2020, 17, 5688. [Google Scholar] [CrossRef]
  40. Available online: https://kosis.kr/ (accessed on 5 June 2023).
  41. Available online: https://www.mof.go.kr/statPortal/main/portalMain.do (accessed on 5 June 2023).
  42. Haridasan, P. Unscear 2013 Report. Radiat. Prot. Environ. 2013, 36, 143. [Google Scholar]
  43. United Nations. The UN Sustainable Development Goals; United Nations: New York, NY, USA, 2015; Available online: http://www.un.org/sustainabledevelopment/summit/ (accessed on 5 June 2023).
  44. Ministry of Environment of the Republic of Korea. A Report on Korea-Sustainable Development Goals(K-SDGs) 2019. Available online: https://ncsd.go.kr/research?content=1&post=2320 (accessed on 5 June 2023).
  45. Sun, L.; Yin, Y. Discovering themes and trends in transportation research using topic modeling. Transp. Res. Part C Emerg. Technol. 2017, 77, 49–66. [Google Scholar] [CrossRef] [Green Version]
  46. Griffiths, T.L.; Steyvers, M. Finding scientific topics. Proc. Natl. Acad. Sci. USA 2004, 101 (Suppl. S1), 5228–5235. [Google Scholar] [CrossRef] [PubMed]
  47. Doerfel, M.L.; Connaughton, S.L. Semantic networks and competition: Election year winners and losers in US televised presidential debates, 1960–2004. J. Am. Soc. Inf. Sci. Technol. 2009, 60, 201–218. [Google Scholar] [CrossRef]
  48. Otte, E.; Rousseau, R. Social network analysis: A powerful strategy, also for the information sciences. J. Inf. Sci. 2002, 28, 441–453. [Google Scholar] [CrossRef]
  49. Bonacich, P. Some unique properties of eigenvector centrality. Soc. Netw. 2007, 29, 555–564. [Google Scholar] [CrossRef]
  50. CLIA. Global Market Report. 2021. Available online: https://cruising.org/en/news-and-research/research/2022/july/clia-global-passenger-report-2021 (accessed on 9 July 2023).
Figure 1. Schematic of the topic modeling algorithm. K : number of topics; α: Dirichlet prior weight of topic k by document, the parameter which determines the value of θ; η: Dirichlet prior weight of word w by document, the parameter which determines the value of β; θd: the ratio of topics by document; βk: the probability that word w will be generated by topic; Zd,n: the topic of the nth word in document d (index); Wd,n: the nth word in document d (variable observed in document, index).
Figure 1. Schematic of the topic modeling algorithm. K : number of topics; α: Dirichlet prior weight of topic k by document, the parameter which determines the value of θ; η: Dirichlet prior weight of word w by document, the parameter which determines the value of β; θd: the ratio of topics by document; βk: the probability that word w will be generated by topic; Zd,n: the topic of the nth word in document d (index); Wd,n: the nth word in document d (variable observed in document, index).
Processes 11 02253 g001
Figure 2. Number of news articles by year.
Figure 2. Number of news articles by year.
Processes 11 02253 g002
Figure 3. Word clouds by year.
Figure 3. Word clouds by year.
Processes 11 02253 g003
Figure 4. Ratios by topic.
Figure 4. Ratios by topic.
Processes 11 02253 g004
Figure 5. Topical research trends by year.
Figure 5. Topical research trends by year.
Processes 11 02253 g005aProcesses 11 02253 g005b
Table 1. Simplified procedure for performing topic modeling.
Table 1. Simplified procedure for performing topic modeling.
1. Decide how many topics you want to categorize.
2. Analyze topic modeling.
    2.1 Randomly assign each word to a topic—for example, say the word ‘A’ is assigned to topic 1.
    2.2 Determine which topic ‘A’ belongs to based on two criteria.
          2.2.1 What is the percentage of the word ‘A’ in topic 1?
          2.2.2 What percentage of the document is currently occupied by topic 1?
    2.3 Assign word ‘A’ to a new topic based on the above criteria.
    2.4 Repeat 2.2 and 2.3 for all words.
3. Give each topic a topic name.
Table 2. Titles and descriptions of the topics.
Table 2. Titles and descriptions of the topics.
Topic NumberTopic TitleDescription
1Sustainable Fisheries and Aquaculture ManagementDamage to Coastal Fisheries and Aquaculture due to Environmental Pollution
2Accident, Calamity, Disaster ResponseDamage and Countermeasures Caused by Oil Leaks, Typhoons, and Earthquakes
3Polar EnvironmentPolar Environment Damage
4Carbon Neutral, Hydrogen EnergyDevelopment and Side Effects of Marine Renewable Energy to Reduce Carbon Emissions
5Micro Plastic, Climate ChangeInfluence and Measures on Climate Change such as Microplastic Waste Problems and Global Warming
6Tidal Flats, Habitat EcosystemsEcosystem Destruction and Disturbance, Habitat Damage
7Cruise Tourism Development and SmartizationEnvironmental Pollution by Tourism Resource Development, Development and Utilization of Smart Technology in the Marine Fisheries Sector
8Japanese Nuclear Power Plant PollutionRadioactive Contamination due to the Release of Japanese Nuclear Power Plant Pollution
9COVID-19Global Supply Chain Paralysis due to Infectious Diseases, Establishing a Prevention System
10Diplomatic Cooperation Between CountriesDiplomatic Cooperation to Achieve Carbon Neutral and ESG Management, Disputes of Maritime Law and Fishing Rights Between Countries
Table 3. Probability distribution of words by topic.
Table 3. Probability distribution of words by topic.
Topic 1Topic 2Topic 3Topic 4Topic 5
Fishery0.0197Accident0.0270Antarctica0.0755Energy0.0116Plastic0.0175
Aquaculture0.0143Calamity0.0260North Pole0.0356Carbon0.0110Micro0.0158
Aquatic products0.0121Disaster0.0215Ice0.0192Hydrogen0.0095Climate0.0146
Sea area0.0112Respond0.0172Glacier0.0181Global0.0082Change0.0104
Fishing0.0112Oil0.0147Penguin0.0161Invest0.0076Effect0.0086
Resource0.0091Outflow0.0145Polar region0.0146Market0.0062Weather change0.0081
Coast0.0080Maritime Police0.0137Sea ice0.0134Wind force0.0058Rise0.0068
Damage0.0077IAEA0.0133Ice shelf0.0101LNG0.0057Earth0.0067
Water temperature0.0074Inspection0.0124Antarctic Sea0.0098Generation0.0049Biology0.0064
Fish species0.0073Nuclear power0.0114Arctic Ocean0.0097Export0.0048Observation0.0062
Topic 6Topic 7Topic 8Topic 9Topic 10
Foreshore0.0167Tourism0.0174Japan0.0473COVID-190.0197Cooperation0.0091
Ecosystem0.0163Attracting0.0122Contaminator0.0391Corona0.0143Nation0.0085
Habitat0.0117Market0.0118Nuclear power plant0.0287Virus0.0112Diplomacy0.0085
Marsh0.0117Growth0.0113Discharge0.0250Occurrence0.0080Meeting0.0079
Inspection0.0113Job0.0094Fukushima0.0225Standard0.0075Summit0.0070
Area0.0109Development0.0078Decision0.0130Prevention0.0065Respond0.0065
Protect0.0090Cruise0.0072Radiation0.0128Infection0.0065Emphasis0.0062
Facility0.0088Smart0.0069Emission0.0123Diffusion0.0059USA0.0059
Development0.0082Future0.0064Material0.0121Respond0.0058Politics0.0057
Disturbance0.0075Application0.0057Process0.0114Damage0.0058Congress0.0056
Table 4. Hot/cold topics.
Table 4. Hot/cold topics.
Topic No.Coefficientp-ValueHot/Cold
17.6390.001hot
2−4.0820.042cold
3−0.2290.358
438.0140.000hot
524.4250.000hot
6−1.9710.543
726.1680.001hot
816.7710.083
924.2680.004hot
1014.3460.008hot
Table 5. Results of text network analysis.
Table 5. Results of text network analysis.
2008~20122013~20172018~2022
KeywordsCentralityKeywordsCentralityKeywordsCentrality
Contaminator0.408Fishery0.471Contaminator0.355
Accident0.374Accident0.386Fishery0.353
Fishery0.373Contaminator0.379Energy0.324
Foreshore0.349Foreshore0.291Climate0.322
Energy0.325Aquatic products0.271Carbon0.255
Climate0.276Tourism0.227Foreshore0.236
Tourism0.254Energy0.225Accident0.236
Aquatic products0.199Aquaculture0.222Plastic0.222
Aquaculture0.192Nuclear power plant0.182Aquatic products0.197
Ecosystem0.168Climate0.170Nuclear power plant0.194
Nuclear power plant0.165Calamity0.169Tourism0.189
Diplomacy0.114Disaster0.159Ecosystem0.170
Calamity0.105Ecosystem0.127Smart0.164
Carbon0.086Diplomacy0.114Micro0.163
Habitat0.079Habitat0.080Aquaculture0.163
North Pole0.066Micro0.065Diplomacy0.137
Antarctica0.061Smart0.060Corona0.124
Glacier0.046Plastic0.056Calamity0.116
Cruise0.038North Pole0.056Hydrogen0.113
Micro0.037Carbon0.050Habitat0.104
Plastic0.037Antarctica0.047Virus0.084
Smart0.033Cruise0.040Infection0.073
Hydrogen0.027Infection0.040North Pole0.065
Disaster0.022Virus0.031Glacier0.052
Virus0.022Glacier0.028Antarctica0.048
Infection0.018Hydrogen0.013Disaster0.024
Corona0.001Corona0.001Cruise0.022
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kang, H.J.; Kim, C.; Kim, S.; Kim, C. A Study on Environmental Trends and Sustainability in the Ocean Economy Using Topic Modeling: South Korean News Articles. Processes 2023, 11, 2253. https://doi.org/10.3390/pr11082253

AMA Style

Kang HJ, Kim C, Kim S, Kim C. A Study on Environmental Trends and Sustainability in the Ocean Economy Using Topic Modeling: South Korean News Articles. Processes. 2023; 11(8):2253. https://doi.org/10.3390/pr11082253

Chicago/Turabian Style

Kang, Hee Jay, Changhee Kim, Sungki Kim, and Chanho Kim. 2023. "A Study on Environmental Trends and Sustainability in the Ocean Economy Using Topic Modeling: South Korean News Articles" Processes 11, no. 8: 2253. https://doi.org/10.3390/pr11082253

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop