Article

Neighborhood Identity Formation and the Changes in an Urban Regeneration Neighborhood in Gwangju, Korea

1 Asia Research Institute, National University of Singapore, Singapore 119260, Singapore
2 Department of Architecture, Mokpo National University, Muan-gun 58554, Republic of Korea
* Authors to whom correspondence should be addressed.
Sustainability 2023, 15(15), 11792; https://doi.org/10.3390/su151511792
Submission received: 11 April 2023 / Revised: 1 July 2023 / Accepted: 18 July 2023 / Published: 31 July 2023
(This article belongs to the Special Issue Geographic Big Data for Sustainable City)

Abstract

Since the Urban Regeneration Act in 2013, central and local Korean governments have endeavored to regenerate deprived urban neighborhoods. This study analyzed how these efforts have changed the nature of neighborhood identity in Yanglim, Gwangju, Korea. The authors analyzed 62,386 Naver blog posts from 2013 to 2022, utilizing an Artificial Intelligence (AI) technique, Topic Modeling (i.e., Latent Dirichlet Allocation). Using trend analysis by topic, three phases were identified: (1) Phase 1: Flourishment (January 2013 to October 2016); (2) Phase 2: Maturation (November 2016 to February 2020); and (3) Phase 3: COVID-19 (March 2020 to October 2022). In the first phase, the collective actions between the local government and citizens to improve the declined neighborhood formed the Yanglim area’s reputation as the “History and Cultural Village” and as “Penguin Village”. The unique identity of the area in the second phase, along with gentrification issues, created a hot spot (e.g., cafés and restaurants), drawing the attention of tourists and locals. More recently, the Yanglim area has become a place for locals’ daily activities with their loved ones, as tourist traffic greatly dropped off due to the COVID-19 outbreak. Until now, the Yanglim area has experienced a process of successful urban regeneration from flourishment to degentrification. AI techniques represent a novel application that can support policy makers and stakeholders in understanding citizens and taking further actions to create economically and socially sustainable neighborhoods.

1. Introduction

The identities of urban spaces or neighborhoods are manifested by their people (e.g., their residents and visitors). These identities are shaped by unique images, not only due to landscapes and buildings but also due to food, music, customs, language, relationships, and political viewpoints. All of these aspects (i.e., physical and cultural) interact with each other to form unique identities [1]. Lynch defines identity as “the extent to which a person can recognize or recall a place as being distinct from other places” [2] (p. 131). Oktay states that “identity is one of the essential goals for the future of good environments. People should feel that some part of the environment belongs to them individually and collectively, some part for which they care and are responsible, whether they own it or not” [3] (p. 261). As stated above, human beings are important stakeholders interacting with built environments. Thus, it is important to research human perceptions and their interactions, identifying the distinctiveness of different neighborhoods.
In recent decades, relevant stakeholders (e.g., local governments, urban planners, community leaders and residents) have been challenged to revitalize dilapidated neighborhood environments as well as to mitigate negative images by collective actions in many countries. The proliferation of collective actions is categorized as “commercial-led regeneration”, “culture-led regeneration”, “tourism-led regeneration”, “design-led regeneration”, or complexes of these terms [4,5,6,7]. As one of the countries implementing urban regeneration policies with a proliferation of projects, South Korea (hereafter, Korea) has worked to revitalize depopulated and economically unsustainable city centers. Accordingly, a number of articles dealing with urban regeneration have been published in Korea; the majority of research has shed light on Seoul and the metropolitan areas, with limited knowledge of other cities such as Gwangju.
In recent years, utilizing social media platforms to mine people’s opinions in order to broaden ideas on urban environments has become widespread. In particular, Natural Language Processing (NLP) and Artificial Intelligence (AI) techniques are applied to mine the opinions of diverse people, overcoming the usual time and budget limitations for data collection [8]. Big data analysis is significant in two respects. First, an overview of social media permits new and chronological trends to be uncovered. Second, this paper illustrates the potential of topic modeling [9]. This technique can offer invaluable insights to relevant stakeholders such as policy makers and planners for the purpose of understanding citizens, which is necessary for urban regeneration. Nonetheless, many previous studies have concentrated on qualitative and phenomenological approaches such as interviews, observations, or mapping to reveal urban or neighborhood identities.
Capitalizing on the benefits of social media data, the authors aim to answer the two following questions:
1. Do social media data allow us to capture a certain point in the changing nature of neighborhood identity through AI techniques?
2. If so, is it possible to capture how the neighborhood identity has changed at the discursive level over time?
To answer these questions, the authors adopted the lens of neighborhood stakeholders and users of the space (including locals, small business owners, tourists, and government officials) who posted their opinions and activities on social media during a certain period of time. These data were then analyzed with a newly emerging computational method.

2. Literature Review

2.1. Understanding and Measuring Neighborhood Identity

In this section, the authors review how research trends on neighborhood identity have changed. Lynch (1960) addressed the image of a city and how urban planners can create memorable city images in the book The Image of the City [10]. The image of a neighborhood space is constructed in a two-way process between human beings and their environment as people select, organize, and endow what they see with meaning. This image differs from person to person, and different neighborhood environments are associated with different processes of image-making. An environmental image may possess three components: identity, structure, and meaning. Among these three components, identity implies distinctiveness compared to others and its recognition as a separate entity. A neighborhood identity is a collective representation created by subjective perceptions of any single individual or group rather than by objective reality in the neighborhood [11,12,13]. Understanding neighborhood identity based on Lynch’s [10] and Suttles’ work [14], researchers have found that people interact with their neighborhood as a creative imposition and that people’s activities are connected to their ‘cognitive maps’ (i.e., their internal representation of the space) [15].
In one stream of urban research, researchers have mainly studied how urban or neighborhood identity is depicted using two methodological approaches: (1) traditional methods, and (2) computational techniques. The first addresses the interdependent identity as perceptions among external observers (e.g., tourists, non-natives), the in-group (e.g., natives, residents rooted in place), and stakeholders (e.g., policy makers) [3,11,16,17,18,19,20]. Their perceptions are measured using traditional methods (e.g., interviews, survey questionnaires, diaries). Moreover, researchers have added their own perspectives through observations. For example, Huovinen and colleagues tried to measure neighborhood identity by using interviews and diaries to gauge residents’ perceptions [20]. Salesses and colleagues created a discursive map with different key words in four different places, then related the identity of places to different perceptions [21]. In addition to methodologies such as interviews and survey questionnaires, the systematic reconstruction of cognitive maps of neighborhood boundaries by relying on residents’ perceptions represents a methodological challenge. The boundaries of neighborhoods should be decided by the average or the largest example [22]. In addition to being expensive, these methods are difficult to validate [23] and do not capture enough of the meanings and identities of neighborhoods that are attached to languages [13].
The second approach uses computational techniques to gather and process high volumes of data through Geographical Information Systems (GIS), space syntax, and programming languages (e.g., R and Python) [1,21,24,25]. For example, geo-tagged digital images collected from four different places through online maps and on-site observations were analyzed in the context of city identity. The safety, uniqueness, and social class of neighborhoods were used as a proxy of city identity as measured by the perceptions of the public (i.e., crowdsourcing). The results were then visualized as maps. Zhou and colleagues analyzed over two million geo-tagged photos from Google Street View, Flickr, and Panoramio to identify city identities from twenty-one different cities [25]. Their study labeled scene attributes with one hundred and two distinctive classifications, such as natural, eating, and open areas, by drawing on deep learning processes. Later, spatial analysis was performed with these characteristics on maps and a similarity network analysis was created.
Beyond image processing for place identity, the analysis of linguistic patterns in big data has not yet realized its full potential. Research has been performed with big data, such as that from Twitter, TripAdvisor, Yelp, and news article data from diverse geographical regions [1,8,13,26]. Such works have explored, for instance, (1) how similar or changed neighborhoods are over time [13], (2) how different city characteristics or urban green space characteristics are from each other [1,8], and (3) how different the topics are based on geo-tagged locations [26]. Certain data types, such as from TripAdvisor and Yelp, only provide insights for commercial destinations and do not cover diverse opinions. Although Twitter covers spatiotemporal information with texts being used by diverse populations, the number of users in Korea is limited and the data do not cover a long-term period. Thus, the authors mainly focused on an analysis of how neighborhood identities have been constructed through an emerging methodology by analyzing big data (i.e., Naver blogs), thereby adding more knowledge to the literature. Through big data analysis, the authors adopted text mining to deal with linguistic patterns for neighborhood changes using data accumulated over a period of almost ten years.

2.2. Understanding the Study Neighborhood

Urban decline is a multidimensional process describing the reduction of local economic opportunities along with a decrease in jobs and a rise in unemployment, depopulation, and dilapidation of the neighborhood environment [27]. Central and local governments in many developed countries have tried to lessen the negative effects of urban and neighborhood shrinkage and to bring back the positive image of cities and neighborhoods. Korea is one of the countries investing tremendous amounts in urban regeneration. As an exemplar of local cities, the Yanglim neighborhood in Gwangju was chosen for this study. Gwangju, the sixth largest city in Korea, is located in the southwest of the country. It covers 501 km² and had almost 1.5 million people in 2021 [28]. It is one of the cities currently experiencing a gentle population decline [29].
Yanglim is a residential area covering 0.68 km² and surrounded by the Sajik and Yanglim mountains in Gwangju. The Gwangju stream is located nearby. While Yanglim is located close to the old city center, prior to 1904 the land around it was affordable, and included a great many graveyards. Taking advantage of affordable land, missionaries from the Presbyterian church in the United States settled in the area, using it as a base camp for their religious missions and social work [30]. Their most outstanding activities were education, medicine, and mission, establishing schools (e.g., Speer Girls’ Schools, the first institute for girls’ education in Gwangju), a hospital (the current Gwangju Christian Hospital), and a church (Yanglim Presbyterian Church). The Yanglim area played a role as a center for the arts (e.g., traditional Korean and Western music and performance), intellectual exchange, and the movement for independence from Japan and was a center for religion, medicine, education, and social work. Until 1970, this place was called a “western village”, “Christian village”, or “cradle for modern culture in Gwangju”, with a modern historical heritage and Korean traditional houses (e.g., western-style buildings and Hanoks). In addition, the Yanglim neighborhood is the home town of famous artists [31]. Since 1976, the lower part of Yanglim has been developed as a residential area to solve the housing shortage; most of the housing in this neighborhood is 30 years old or more, with some vacant and dilapidated houses. The aged houses, narrow streets, public health issues such as streets inaccessible to fire trucks, and lack of parking lots and walking paths have been considered important issues for improvement [32].
In 2009, the local government launched initiatives to develop the Yanglim neighborhood as a historical and cultural village to attract tourists [33]. The local district assigned funding (approximately USD 24 million) to regenerate the neighborhood over a nine-year period (2009–2017). The first initiative was the renovation of the missionary heritage, the formation of a missionary memorial park, and the maintenance of walking paths in the neighborhood. This initiative included the construction of a memorial tower (the current Sajik Observatory Tower) along with renovations of old schools and the medical center [32]. During this period, the local government focused on developing art and exhibition content and constructing several small art galleries [34]. In addition to these local government initiatives, the residents voluntarily began beautifying the neighborhood. One resident started to clean up a burnt-down house, and hung items such as old-style clocks on fences and walls for decoration. Other residents began urban farming in the vacant lots, accelerating the urban regeneration process. The collective actions of the residents, the local artists, and the local government have made the neighborhood, now known as Penguin Village, widely renowned. “Penguin” is the nickname of a resident who waddles like a penguin after a car accident, and who has been actively involved in the neighborhood project [35]. Penguin Village has become renowned, drawing a large volume of tourists and locals. This has functioned as a driver of neighborhood change, stimulating construction and renovation of commercial buildings and houses to suit the demands of both insiders and outsiders. On top of these nine years of improvements, the local government secured additional funding (approximately USD 51.6 million) for the next six-year period (2018–2023). 
The goals in this period are as follows: (1) improvement of the residential area (e.g., funding and program support for housing repairs and securement of parking spaces); (2) improvement of the urban environment (e.g., creation of a smart city and Book Street); (3) vitalization of the local economy (e.g., by attracting small business owners and opening an urban regeneration support center); and (4) social integration (e.g., the management of a local community and citizen art school) [36]. To realize these goals, the local government has incentivized local artists and small business owners to settle in the area by subsidizing rent and offering affordable studios in cooperation with the neighborhood community. These endeavors have included constructing new buildings and launching additional programs for the local economy, arts, and community through incentives and sponsorships.

3. Methodology

This section describes the methodology used to detect neighborhood identity and the changing nature of the Yanglim neighborhood in Gwangju. Three stages were set up for the data collection and analysis process using computational techniques and statistical analysis. The entire process is shown in Figure 1. It is possible to draw out individuals’ impressions and behavioral patterns from the analysis of blog data. Thus, in the first stage, we collected data from the Naver website for a ten-year period. In the second stage, after data cleaning, topic modeling and trend analysis with all data were performed in order to explore the neighborhood identity. In this stage, a great volume of unstructured text data was converted into quantifiable data. In the third stage, after dividing the dataset using the trend analysis, the authors analyzed how the neighborhood identity changed over the study period. The detailed methodology is described below.

3.1. Stage 1: Data Collection

Web scraping is a type of data mining that involves extracting unstructured data from targeted websites, then transforming these data into structured data and storing them as a file or in a database [37]. Using this data collection technique, the keyword Yanglim-dong was searched on the website Naver Blogs for a ten-year period from January 2013 to October 2022. Naver is one of the largest portals in Korea, and has provided a blog service since October 2003. Thus, the accumulated amount of data is quite massive. The blog dataset includes information about what happens in the area, how and where users are, as well as what their opinions are. The users include tourists, visitors, residents, local business owners, policy makers, and tourism stakeholders; their posts can be used to gain information on users’ diverse activities and opinions. Geo-tagged information was collected, if available, in order to determine whether the postings were about Yanglim. The dataset, consisting of 96,356 blog posts in total, was scraped through self-implemented Python programming by the first author, and a total of 62,386 blog posts were left for analysis after the data cleaning process. Data cleaning was performed using keywords (e.g., rental car, potentially tagging various neighborhoods beyond Yanglim) as well as with geotagged information, blog user IDs, and even reading of specific blog lines. Certain bloggers tagged Yanglim or Yanglim-dong in order to make their blogs highly searchable, even when the postings were not relevant to Yanglim at all. Thus, the data cleaning process was important to securing a fine dataset.
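The keyword- and geotag-based cleaning described above might be sketched as follows. This is only an illustration under stated assumptions, not the authors' scraper: the record fields (text, date, geotag), the helper name keep_post, the exclusion list, and the exact end date of the study window are all hypothetical.

```python
from datetime import date

# Hypothetical exclusion keywords; the text mentions "rental car" as one example.
SPAM_KEYWORDS = ("rental car",)

# Assumed day-level bounds for the January 2013 to October 2022 window.
STUDY_START = date(2013, 1, 1)
STUDY_END = date(2022, 10, 31)


def keep_post(post):
    """Return True if a scraped post looks relevant to Yanglim."""
    if not (STUDY_START <= post["date"] <= STUDY_END):
        return False
    text = post["text"].lower()
    if any(keyword in text for keyword in SPAM_KEYWORDS):
        return False
    # If a geotag exists, require that it points at Yanglim;
    # otherwise fall back to the post text.
    geotag = post.get("geotag")
    if geotag is not None:
        return "yanglim" in geotag.lower()
    return "yanglim" in text


posts = [
    {"text": "Walked around Yanglim-dong and Penguin Village",
     "date": date(2016, 5, 3), "geotag": None},
    {"text": "Cheap rental car near Yanglim-dong",
     "date": date(2018, 2, 1), "geotag": None},
    {"text": "Lovely cafe",
     "date": date(2021, 7, 9), "geotag": "Yanglim-dong, Gwangju"},
    {"text": "Yanglim-dong trip",
     "date": date(2012, 12, 30), "geotag": None},
]
cleaned = [p for p in posts if keep_post(p)]
```

Rule-based filters of this kind cannot replace the manual checks mentioned above (blog user IDs and reading of specific lines); they only narrow down the set of posts that need inspection.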

3.2. Stage 2: Topic Modeling for All Data

The topic modeling used Latent Dirichlet Allocation (LDA), an unsupervised Artificial Intelligence (AI) technique [38]. For this analysis, all collected data needed to be written in one language (i.e., Korean). This section illustrates the corpus preparation, model preparation, and topic generation processes. A text corpus is defined as a large and unstructured set of text. Recently, it has been defined as a set of data which is electronically processed and stored for statistical analysis and hypothesis testing that confirms occurrences or validates linguistic rules within a certain language territory [39]. “A corpus does not contain new information about language, but the software offers us a new perspective on the familiar. In order to gain this new perspective, the first analytical steps generally involve two related processes: the production of frequency lists (either in rank order, or sorted alphabetically) and the generation of concordances” [40] (p. 122). In corpus preparation, tokenization is the very first step in text processing. Korean is an agglutinative language, meaning that combinations of nouns with or without whitespaces generate the same meaning. This leads to difficulties in tokenization. MeCab, available through KoNLPy (a Python library for NLP), was originally developed for Japanese morpheme analysis and has been adapted as MeCab-Ko for Korean morpheme analysis [41]. Compared to other libraries, MeCab-Ko provides stronger tokenization in terms of both speed and performance. A major strength of MeCab-Ko is that it offers users the function of an editable library of proper nouns. The blog contents that the first author collected contained a great many proper nouns (e.g., human or business names) and loanwords or English words directly written in Korean. Combinations including these words can result in less accurate tokenization.
Thus, the authors input a proper noun list into the library of MeCab-Ko after reviewing the collected blog contents. To improve the quality of the tokenization and to perform better topic modeling, the raw corpus then needed to be cleaned of special characters, URLs, punctuation, and stopwords which are less significant in semantic contexts. Tokenization was performed by morpheme, then only nouns were selected into the tokenized list. Items from the tokenized list with fewer than three corpora were dropped as well.
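The removal of URLs, special characters, and stopwords described above might look like the following sketch. It is not the authors' pipeline: whitespace splitting stands in for MeCab-Ko morpheme analysis, no noun filtering is performed, and the stopword list and the function name clean_and_tokenize are hypothetical.

```python
import re

URL_PATTERN = re.compile(r"https?://\S+|www\.\S+")
# Keep Hangul syllables, Latin letters, digits, and whitespace; drop the rest.
NON_WORD_PATTERN = re.compile(r"[^\uac00-\ud7a3A-Za-z0-9\s]")

# Hypothetical stopword list, for illustration only.
STOPWORDS = {"the", "a", "and", "to"}


def clean_and_tokenize(text):
    """Strip URLs and special characters, then split and drop stopwords.

    Whitespace splitting is a stand-in for morpheme-level tokenization.
    """
    text = URL_PATTERN.sub(" ", text)
    text = NON_WORD_PATTERN.sub(" ", text)
    tokens = text.lower().split()
    return [t for t in tokens if t not in STOPWORDS]


tokens = clean_and_tokenize(
    "Went to the 양림동 cafe!! https://example.com/post?id=1"
)
```

In a real Korean corpus, the whitespace split would be replaced by a morphological analyzer so that only noun morphemes are retained.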
This study implemented LDA, “a generative probabilistic model of collections of discrete data such as text corpora” [38] (p. 994), to analyze topics. Model preparation started with vectorization of documents from the corpus. To prepare the model, the vectorization process assigned a unique identification number for each word. While the unigram corpora (individual words) do not exactly capture the meaning of two to three words in a sequence, the n-gram analysis made of n words benefits from the co-occurrence observations [42]. By applying the bigram_mode and trigram_mode phrase modeling models in turn, the frequency of the corpus was counted and the result was transformed into a dictionary with the ID as the query key. The tokenized list combined with unigram to trigram was saved into a dictionary and the words or terms from documents were transformed into tuples (i.e., from document to bag-of-words) for their subsequent numerical calculation.
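The phrase modeling and bag-of-words conversion described above can be illustrated with a small standard-library sketch. It simplifies the procedure: frequent bigrams are merged by a raw count threshold (phrase models in NLP libraries typically use a scoring function instead), and the function names and the min_count value are assumptions made for illustration.

```python
from collections import Counter


def merge_frequent_bigrams(docs, min_count=2):
    """Join adjacent token pairs that occur at least min_count times."""
    pair_counts = Counter()
    for doc in docs:
        pair_counts.update(zip(doc, doc[1:]))
    frequent = {pair for pair, n in pair_counts.items() if n >= min_count}

    merged_docs = []
    for doc in docs:
        merged, i = [], 0
        while i < len(doc):
            if i + 1 < len(doc) and (doc[i], doc[i + 1]) in frequent:
                merged.append(doc[i] + "_" + doc[i + 1])
                i += 2
            else:
                merged.append(doc[i])
                i += 1
        merged_docs.append(merged)
    return merged_docs


def build_dictionary(docs):
    """Assign each distinct term an integer ID in order of first appearance."""
    token2id = {}
    for doc in docs:
        for token in doc:
            token2id.setdefault(token, len(token2id))
    return token2id


def doc_to_bow(doc, token2id):
    """Convert a token list into sorted (term_id, count) tuples."""
    counts = Counter(token2id[t] for t in doc)
    return sorted(counts.items())


docs = [
    ["penguin", "village", "photo"],
    ["penguin", "village", "alley", "photo"],
    ["cafe", "coffee", "photo"],
]
phrased = merge_frequent_bigrams(docs)
token2id = build_dictionary(phrased)
bows = [doc_to_bow(doc, token2id) for doc in phrased]
```

Applying the same merge a second time to the phrased documents would yield trigram-like terms, which mirrors applying bigram and trigram models in turn.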
To generate topic modeling, a Term Frequency–Inverse Document Frequency (TF-IDF) matrix was used to compute the frequency of the terms in a corpus of documents (i.e., term X appears in document Y) [43]. LDA starts from the intuition that documents exhibit multiple topics [44], and is a part of the larger field of probabilistic modeling. In generative probabilistic modeling, researchers deal with data arising from a generative process including hidden variables. A joint probability distribution over the observed and hidden random variables is defined by this generative process. Data analysis is performed by harnessing the joint distribution to calculate the conditional distribution, which is called the posterior distribution, of the hidden variables with the observed variables [44]. Under the given requirement, the LDA model groups documents with at least two relevant topics [45]. The LDA model is one of the most important and widely used probabilistic models [46].
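As a rough illustration of the TF-IDF weighting mentioned above, the sketch below uses one common variant (term frequency normalized by document length, and the natural logarithm of the inverse document frequency). The exact variant used in the study is not stated, so this choice and the function name tfidf are assumptions.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per document."""
    n_docs = len(docs)
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(doc))  # count each term once per document
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


docs = [
    ["penguin_village", "photo"],
    ["penguin_village", "alley", "photo"],
    ["cafe", "coffee", "photo"],
]
weights = tfidf(docs)
```

A term such as photo that appears in every document receives a weight of zero under this variant, which is why TF-IDF tends to downweight ubiquitous words.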
Because the number of topics k is a significant parameter for topic modeling, k was determined by the calculation of coherence and perplexity. On the basis of these scores, the better model (i.e., higher value) for the desired topic number was chosen. The authors acquired up to the top thirty most relevant terms from each topic and created visualizations of those terms using word clouds and graphs based on the relative importance of the topic. In particular, the visualization of word clouds aimed to represent the percentage of individual topics that emerged, with the size of individual words standing for the importance within each topic, not across all topics. The importance of words for the LDA was extracted using a document–word matrix wd[w, fd] indicating the importance of a word w in document fd; please refer to the work of Maskeri and colleagues for further details [47].
Whole-topic saliency was computed based on the following formula:
$$\mathrm{Distinctiveness}(w) = \sum_{T} P(T \mid w) \, \log \frac{P(T \mid w)}{P(T)}$$
“For a given word w, we computed its conditional probability P(T|w): the likelihood that observed word w was generated by latent topic T. We computed the marginal probability P(T): the likelihood that any randomly selected word w’ was generated by topic T. We defined the distinctiveness of a word as the Kullback–Leibler divergence [48] between P(T|w) and P(T)” [49] (p. 2). For more detail, please refer to the work of Chuang and colleagues [49].
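The distinctiveness measure quoted above can be computed directly from P(T|w) and P(T), as in the sketch below. The saliency function follows the definition given by Chuang and colleagues [49], which multiplies distinctiveness by the overall word probability P(w); that step is not spelled out in the text above, and the function names and example probabilities are illustrative only.

```python
import math


def distinctiveness(p_topic_given_word, p_topic):
    """KL divergence between P(T|w) and P(T), summed over topics."""
    total = 0.0
    for p_tw, p_t in zip(p_topic_given_word, p_topic):
        if p_tw > 0:  # terms with P(T|w) = 0 contribute nothing
            total += p_tw * math.log(p_tw / p_t)
    return total


def saliency(p_word, p_topic_given_word, p_topic):
    """Saliency per Chuang and colleagues: P(w) times distinctiveness."""
    return p_word * distinctiveness(p_topic_given_word, p_topic)


p_topic = [0.5, 0.5]                                # two equally likely topics
generic = distinctiveness([0.5, 0.5], p_topic)      # word spread evenly
specific = distinctiveness([1.0, 0.0], p_topic)     # word tied to one topic
```

A word distributed across topics exactly like the corpus as a whole has zero distinctiveness, while a word generated by a single topic scores highest.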
Labeling topics and thematic analysis of topics are interactive processes that involve human interpretation, although topic modeling itself is an automated process [50]. The grouping of each topic (i.e., thematic analysis) depends on the results of the Intertopic Distance Map used to determine the topic distances through an automated process. Thus, the authors decided on the labels of the topics and themes considering the keywords and the Intertopic Distance Map. The percentage of each topic was calculated during the generation of the Intertopic Distance Map.

3.3. Stage 3: Topic Changes by Period

The government interventions have been ongoing since 2009, and the interventions during the data collection period were not likely to have had any immediate effects on the responses of the locals and tourists posting on social media. Thus, trend analysis by topic was performed to determine how to categorize a period by topic change instead of dividing the period by government interventions. First of all, the LDA model assumes each document contains more than one topic, as mentioned above. Thus, the percentage contribution by each topic for each document was computed throughout all the documents. After that, each assigned topic was computed by year and month (e.g., January 2013).
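One way to aggregate per-document topic contributions by year and month is sketched below. It assumes the trend is the mean topic proportion per month; counting each document's dominant topic per month would be an equally plausible reading of the procedure. The input layout and the function name are hypothetical.

```python
from collections import defaultdict


def monthly_topic_shares(doc_topics):
    """Average topic proportions per (year, month).

    doc_topics: iterable of (year, month, [proportion per topic]).
    Returns {(year, month): [mean proportion per topic]}.
    """
    sums = {}
    counts = defaultdict(int)
    for year, month, proportions in doc_topics:
        key = (year, month)
        if key not in sums:
            sums[key] = [0.0] * len(proportions)
        for topic_id, share in enumerate(proportions):
            sums[key][topic_id] += share
        counts[key] += 1
    return {
        key: [s / counts[key] for s in totals]
        for key, totals in sums.items()
    }


doc_topics = [
    (2013, 1, [0.8, 0.2]),
    (2013, 1, [0.4, 0.6]),
    (2013, 2, [0.1, 0.9]),
]
trend = monthly_topic_shares(doc_topics)
```

Plotting each topic's monthly value over time gives the kind of trend lines from which period boundaries can be read off.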
Next, the authors used the same procedures for cleaning, tokenization, and bag-of-words (e.g., trigram modeling) detailed in Section 3.2. Because the rate of each topic presented differed by phase, the number of topics and the contents of the topics that emerged were inconsistent for each stage. As described above, the authors computed the perplexity and coherence scores, then selected the better LDA models based on these values. The results were drawn as word clouds and the word counts and importance of topic keywords were illustrated in figures. The individual topics were labeled considering the themes of the keywords, as described above. For instance, the topic containing keywords such as travel, Penguin Village, and alleyway was labeled as tour and culture. The saliency of terms for each phase was computed using the formula stated above.

4. Results

4.1. Topic Modeling for All Data

As described above, all data (January 2013 to October 2022) were analyzed in this stage to detect neighborhood identity for the entire period of time. On the whole, the exclusive identity of Yanglim, as a hot spot and tourist destination for sightseeing, consists of famous cafés, restaurants, and Penguin Village. The most salient terms that emerged were café (96,038), photo (79,606), famous place (54,285), time (44,234), menu (42,083), order (41,758), coffee (32,203), vibe (31,971), recommendation (29,206), and feeling (25,778). Other terms listed in this analysis are travel, Penguin Village, space, food, unni (which means older females of similar age or in a blood relationship in Korean), mom, dessert, bread, beverage, looking around, table, pasta, meat, village, sauce, street, interior, alley, tour, and culture (Table 1).
As explained in the methodology section, the authors calculated coherence and perplexity scores to identify better models with an optimal number of topics; in the end, an LDA model with five topics for all data was calculated. Based on the LDA modeling, word clouds and the count and importance of keywords were generated. Figure 2 illustrates the top thirty keywords composing the five selected topics out of the entire set of postings. During the entire period, the emerging topics were as follows: Topic 0 (daily life: 33.2%), Topic 1 (tour and culture; 10.3%), Topic 2 (café and hot spot; 17.5%), Topic 3 (food and hot spot; 18.6%), and Topic 4 (culture and citizen participation; 20.4%; Appendix A and Appendix B).
Topic 0 is labeled daily life, which includes people or loved ones interacting in daily life. The keywords of importance for Topic 0 are as follows: photo (0.0153), human (0.0109), friend (0.0095), thought (0.0080), time (0.0077), mom (0.0075), unni (0.0073), café (0.0060), oppa (0.0057; opposite gender of unni in Korean), and dinner (0.0056). Topic 1 (tour and culture) is related to historic and cultural resources present or cultivated in the Yanglim area. Topic 1 incorporates keywords such as travel (0.0326), Penguin Village (0.0278), photo (0.0225), time (0.0139), village (0.0109), alley (0.0088), looking around (0.0087), street (0.0086), thought (0.0085), and human (0.0082), in order of importance. Topic 2 is about cafés and hot spots. This topic contains keywords including café (0.0684), coffee (0.0244), photo (0.0165), vibe (0.0137), order (0.0137), dessert (0.0096), bread (0.0092), feeling (0.0089), recommendation (0.0087), and spot (0.0085). Coffee and dessert-related keywords occur as well. Topic 3 is about food and hot spots. The keywords (e.g., famous place, menu, and order) are concurrent with Topic 2, but food-related terms are dominant here. Topic 4 represents culture and citizen participation; the relevant terms are illustrated in Figure 3.
The resulting trends of the topics are shown in Figure 4. In particular, the rate of Topic 4 (i.e., culture and citizen participation) is noticeably higher than others before 2017. Topic 1, tour and culture related to Penguin Village, shows a similar pattern to Topic 4 since 2015. However, other topics are comparable to each other in the first part of the graph, showing only moderate increases or decreases in rate. Moreover, the rate of Topic 4 dramatically dropped after the COVID-19 outbreak in Gwangju. Thus, the authors separated the periods of analysis into three: (1) January 2013 to October 2016 (stage 1: flourishment); (2) November 2016 to February 2020 (stage 2: maturation); and (3) March 2020 to October 2022 (stage 3: COVID-19). The numbers of documents contained in each stage are as follows: 5362 for stage 1, 26,408 for stage 2, and 30,616 for stage 3.
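The period boundaries and document counts reported above can be expressed and checked with a short sketch. The day-level boundaries (first and last day of each month) and the names PHASES and phase_of are assumptions; the text gives the boundaries only at month resolution.

```python
from datetime import date

# (label, first day, last day); day-level bounds are assumed.
PHASES = [
    ("flourishment", date(2013, 1, 1), date(2016, 10, 31)),
    ("maturation", date(2016, 11, 1), date(2020, 2, 29)),
    ("covid-19", date(2020, 3, 1), date(2022, 10, 31)),
]


def phase_of(post_date):
    """Return the phase label for a posting date, or None if out of range."""
    for label, start, end in PHASES:
        if start <= post_date <= end:
            return label
    return None


# Document counts per stage as reported in the text.
stage_counts = {"flourishment": 5362, "maturation": 26408, "covid-19": 30616}
total_documents = sum(stage_counts.values())
```

The three stage counts sum to 62,386, which matches the number of blog posts retained after data cleaning.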
Figure 5 presents the topic counts by documents and the five major terms generated through LDA modeling. As seen in Figure 5, the volume of topics related to culture and citizen participation was higher in the first stage than for others. Over time, the volumes of other topics balanced one another, then the daily life-related topic (Topic 0) dramatically outweighed the others during the COVID-19 period (stage 3). Overall, the quantity of blog postings increased noticeably over time.

4.2. Topic Changes by Period

4.2.1. Stage 1: Flourishment (January 2013–October 2016)

The first phase of urban regeneration was from January 2013 to October 2016. In this stage, the local government worked to revitalize the neighborhood by commodifying its historical and cultural resources, and the area was named Yanglim History and Culture Village. In tandem with these initiatives, the local residents revamped the streets and alleys in the neighborhood through voluntary and cooperative participation. Later on, the Yanglim neighborhood was named Penguin Village and gained fame, being reported on in mass media and going viral.
First of all, the authors identified the theme of Stage 1 as cultural heritage and tourism. In this stage, the keywords related to this theme are history, art, and travel, rather than cafés and hot spots. Specifically, the most salient keywords determined through LDA modeling included photo (4927 times), human (4673), travel (3304), thought (3179), culture (2572), Penguin Village (2550), café (2289), village (2150), story (2049), and area (1833). Other terms are as follows: space, friend, artwork, alley, proceeding, looking around, Korea, coffee, art, Mr., experience, missionary, church, artist, order, menu, movie, China, (musical) performance, and music (Table 1).
In addition, ten topics were identified by perplexity and coherence scores in this stage, bringing up more detailed keywords. The authors organized these ten topics into four themes: (1) culture, travel, and area (36.3%); (2) historic and cultural heritage and the evolution of Penguin Village (23.8%); (3) the creation of the neighborhood (20.9%); and (4) cafés and daily life (19.0%). These classifications are illustrated in Figure 6 and Appendix C.
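For reference, perplexity is conventionally computed on held-out documents, and lower values indicate a better fit. The expression below is the standard textbook definition, given as an aid to the reader; the source does not state which implementation or which coherence measure was used. Here $M$ is the number of held-out documents, $\mathbf{w}_d$ the words of document $d$, and $N_d$ its number of tokens.

```latex
\[
\operatorname{perplexity}(D_{\text{test}})
  = \exp\!\left( - \frac{\sum_{d=1}^{M} \log p(\mathbf{w}_d)}{\sum_{d=1}^{M} N_d} \right)
\]
```

In practice, the number of topics is usually chosen where perplexity is low and topic coherence is high, which is consistent with the selection of ten topics described above.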
The first theme (culture, travel, and area) is related to Topic 5 (8.3%), Topic 6 (12.1%), and Topic 7 (15.9%). The relevant keywords with the highest importance scores are (1) area (0.0125), business (0.0123), tourism (0.0087), support (0.0086), and citizen (0.0074) in Topic 5; (2) culture (0.0286), art (0.0109), performance (0.0107), area (0.0101), and artwork (0.0092) in Topic 6; and (3) travel (0.0230), village (0.0077), Seoul (0.0063), market (0.0059), and time (0.0059) in Topic 7.
The second theme, historic and cultural heritage and the evolution of Penguin Village, is associated with three topics (i.e., Topic 1, 4, and 9). The emerging keywords with the highest importance are as follows: (1) building (0.0268), photo (0.0148), House of Jangwoo Yi (0.0116), appearance (0.0115), and Sajik Park (0.0099) in Topic 1 (7.7%); (2) café (0.0207), travel (0.0199), photo (0.0169), alley (0.0144), and neighborhood (0.0143) in Topic 4 (9.5%); and (3) Penguin Village (0.0595), photo (0.0213), village (0.0185), time (0.0136), and penguin (0.0128) in Topic 9 (6.6%).
The third theme of the Yanglim neighborhood is the creation of the neighborhood story, with two topics (Topic 0 and 3). In addition to commodifying the heritage of the neighborhood, the enthusiastic storytelling about the neighborhood attracted unique small businesses. The relevant keywords by the highest importance scores are human (0.0205), thought (0.0181), photo (0.0179), story (0.0178), and time (0.0111), as well as book, heart, photographing, love, and movie (Topic 0; 9.2%). In Topic 3 (11.7%), human (0.0229), missionary (0.0120), church (0.0116), Mr. (0.0112), and Korea (0.0095) are relevant terms.
The last theme is about cafés and daily life. This theme accounts for merely 19.0% of the data (9.1% for Topic 2 and 9.9% for Topic 8). The important keywords are (1) café (0.0198), order (0.0150), menu (0.0134), coffee (0.0129), and famous place (0.0129; Topic 2) and (2) photo (0.0235), friend (0.0128), time (0.0113), thought (0.0107), and bread (0.0098; Topic 8). On the whole, the keyword with the highest importance score in Stage 1 is Penguin Village (0.0595; Topic 9), as illustrated in Figure 7.
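The theme shares reported for Stage 1 are sums of the shares of their constituent topics. The short check below reproduces that arithmetic; topic numbers and shares follow Appendix C, while the variable names and shortened theme labels are illustrative.

```python
# Topic shares (%) for Stage 1, as listed in Appendix C.
TOPIC_SHARE = {0: 9.2, 1: 7.7, 2: 9.1, 3: 11.7, 4: 9.5,
               5: 8.3, 6: 12.1, 7: 15.9, 8: 9.9, 9: 6.6}

# Topic-to-theme grouping described in the text (labels shortened).
THEMES = {
    "culture, travel, and area": [5, 6, 7],
    "heritage and Penguin Village": [1, 4, 9],
    "creation of the neighborhood": [0, 3],
    "cafés and daily life": [2, 8],
}

# Sum the topic shares within each theme, rounded to one decimal place.
theme_share = {
    name: round(sum(TOPIC_SHARE[t] for t in topics), 1)
    for name, topics in THEMES.items()
}
total_share = round(sum(TOPIC_SHARE.values()), 1)
```

The four sums come to 36.3%, 23.8%, 20.9%, and 19.0%, matching the theme shares given in the text, and the ten topic shares total 100.0%.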

4.2.2. Stage 2: Maturation (November 2016–February 2020)

The strategy for urban regeneration utilizing cultural and historical resources in the neighborhood became successful and attracted both locals and tourists. The newly opened local businesses with their own distinctiveness, such as restaurants and cafés, led the change in the neighborhood identity. Thus, the salient keywords in this stage are related to cafés and famous eateries and to their own vibes. The top keywords categorized by saliency are as follows: café (37,563), photo (31,725), famous place (20,894), menu (18,836), order (18,623), friend (14,871), vibe (14,224), coffee (12,204), recommendation (11,334), and travel (11,028). Feeling, visit, food, bread, table, dessert, pasta, village, beverage, meat, menu (board), oppa, etc., are included as well (Table 1). The theme for Stage 2 was hot spots and cultural tourism.
In this stage, the authors identified five different topics by perplexity and coherence scores; these are grouped into three themes (Figure 8 and Appendix D). The first theme is daily life (Topic 0; 21.8%). The second theme is about cultural activities and tourism. Reflecting the neighborhood’s hard-to-replicate identity as a cultural and historical place, Topic 2 (Penguin Village and travel) and Topic 1 (culture and relevant activities) together account for almost one third (31.2%) of the total. The third theme is an identity as a hot spot with good eateries and cafés (Topics 3 and 4; 47.0%). The topics for famous eateries (Topic 3; 24.5%) and cafés (Topic 4; 22.5%) are the two largest; they share fifteen keywords (e.g., menu, photo, and vibe) among their top thirty and together account for almost half of the total.
First of all, daily-life-related topics account for 21.8% of the data; users’ loved ones, daily activities, and feelings are all mentioned (i.e., photo, human, friend, oppa, time, mom, unni, thought, love, dinner, happiness, daily life, gift, morning, beginning, Starbucks, feel like, drink, preparation, weekend, Seoul, lunch, heart, hand, child, Dongmyungdong, last, hair, dongsaeng (a younger sibling or a close younger friend), and today).
Second, under the theme of cultural activities and tourism, Topic 2 (Penguin Village and travel; 18.4%) illustrates physical characteristics of the neighborhood, with keywords including Penguin Village (0.0190), travel (0.0168), village (0.0129), human (0.0099), photo (0.0079), time (0.0069), street (0.0057), thought (0.0056), alley (0.0056), and appearance (0.0049). Other keywords include name, beginning, tour, space, history, building, neighborhood, looking around, artwork, heart, winter, child, missionary, tourist spot, alleyway, course, visit, reminiscence, museum, and History and Culture Village. The keywords (Topic 1; 12.8%) indicating culture and relevant activities contain experience (0.0084), story (0.0069), culture (0.0068), performance (0.0061), proceeding (0.0061), time (0.0058), area (0.0056), beginning (0.0048), participation (0.0047), and youth (0.0047). Other culture-related keywords are operation, festival, theme, program, event, variety, space, city, afternoon, thought, activity, representative, preparation, charm, site, artist, exhibition, installation, schedule, and narrative.
Last, under the theme of a hot spot with good eateries and cafés, menu and types of foods are important keywords, reflecting the popularity of specific restaurants in this area in Topic 3 (famous eateries). The most important related terms are famous place (0.0284), menu (0.0189), order (0.0146), food (0.0104), thought (0.0083), photo (0.0081), pasta (0.0080), vibe (0.0079), meat (0.0078), and recommendation (0.0072). Other relevant keywords are as follows: sauce, visit, menu (board), friend, eating place, feeling, table, pizza, human, time, meal, sushi, spot, salad, price, today, shrimp, cheese, curry, and Testa (a restaurant in Yanglim). The keywords with the highest importance scores for famous cafés (Topic 4) include café (0.0558), photo (0.0217), coffee (0.0177), vibe (0.0119), order (0.0109), bread (0.0103), recommendation (0.0076), spot (0.0073), feeling (0.0073), and dessert (0.0069). The other keywords, in order, are thought, human, beverage, friend, menu, time, visit, interior, space, famous place, cake, posting, price, Dongmyungdong, hot spot, business owner, table, inside, macaroon, and sensibility. Dongmyungdong, another revitalized neighborhood with its own success, is located near Yanglim, and the two places were frequently mentioned or visited together.
Looking at keyword importance in Stage 2 (i.e., maturation), café is the highest, followed by famous place, photo, and Penguin Village (Figure 9). During this stage, the Yanglim neighborhood became famous for its unique cafés and restaurants, appealing to a growing number of tourists and locals. Gentrification was highlighted in the mass media as evidence of this fame.
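The overlap between the two hot-spot topics can be checked directly from their top-thirty keyword lists. The sketch below copies the two lists from Appendix D (Stage 2, Topics 3 and 4) and intersects them; the variable names are illustrative.

```python
# Top-thirty keywords for Stage 2, Topic 3 (famous eateries), from Appendix D.
topic3 = [
    "famous place", "menu", "order", "food", "thought", "photo", "pasta",
    "vibe", "meat", "recommendation", "sauce", "visit", "menu (board)",
    "friend", "eating place", "feeling", "table", "pizza", "human", "time",
    "meal", "sushi", "spot", "salad", "price", "today", "shrimp", "cheese",
    "curry", "Testa",
]

# Top-thirty keywords for Stage 2, Topic 4 (famous cafés), from Appendix D.
topic4 = [
    "café", "photo", "coffee", "vibe", "order", "bread", "recommendation",
    "spot", "feeling", "dessert", "thought", "human", "beverage", "friend",
    "menu", "time", "visit", "interior", "space", "famous place", "cake",
    "posting", "price", "Dongmyungdong", "hot spot", "business owner",
    "table", "inside", "macaroon", "sensibility",
]

# Keywords that appear in both lists.
shared = set(topic3) & set(topic4)
print(len(shared), sorted(shared))
```

The intersection contains fifteen keywords, including menu, photo, and vibe, in line with the overlap reported for these two topics.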

4.2.3. Stage 3: COVID-19 (March 2020–October 2022)

The third phase of urban regeneration was during the COVID-19 pandemic. The first confirmed case was in January 2020; there were nine confirmed cases in Gwangju by February 2020 [51]. The initiatives and activities of the governments and local businesses related to tourism slowed down to suppress the spread of COVID-19. Several local entrepreneurs closed down their businesses due to the pressure of rent increases, very slow business, and a lack of capital. In accordance with these changing environments, fifteen salient keywords in this phase were associated with cafés, famous places, and daily life rather than travel and activities of local artists. The keywords include café (53,767), famous place (29,929), order (22,162), menu (21,439), coffee (17,525), recommendation (16,528), vibe (15,644), visit (15,193), feeling (14,186), spot (13,719), space (11,321), unni (11,066), mom (10,753), dessert (10,391), and food (10,111; Table 1). Thus, the theme was identified as daily life and a hot spot with loved ones.
Through LDA modeling, six topics were identified after computing perplexity and coherence scores; they are grouped into three major themes of two topics each. The three main themes were daily life (51.3%), cafés and local hot spots (34.0%), and travel, art, and local stories (14.7%; Figure 10 and Appendix E).
First of all, for the main theme, daily life, stories about loved ones in the neighborhood (Topic 0; 29.0%) were the most common postings during this phase. The ten most important keywords were friend (0.0102), human (0.0098), mom (0.0094), unni (0.0094), café (0.0080), dinner (0.0078), oppa (0.0075), happiness (0.0051), morning (0.0049), and drink (0.0049; Figure 11). Along with loved ones and activities, other keywords expressing feelings and happiness (e.g., lunch, time, today, thought, coffee, love, Dongmyungdong, exercise, gift, getting off work, feel like, dad, dog, blog, last, best, beginning, daily life, hair, weather, and tea) were salient. For Topic 1 (activities in the neighborhood; 22.3%), keywords related to neighborhood businesses and activities, including book, flower, writing, tea, and painting, were salient. The top ten keywords were thought (0.0164), time (0.0153), human (0.0136), heart, flower, beginning, story, book, looking around, and travel (see Appendix E for the importance scores). Other relevant terms were love, feeling, child, preparation, name, thanks, appearance, friend, tea, memory, happiness, recommendation, writing, necessity, arrival, painting, gift, feel like, and one day.
For the second theme, keywords associated with cafés and local hot spots are illustrated. For Topic 5 (famous places and food; 17.2%), Western food such as pizza and pasta along with famous places and the vibes were all mentioned. It seems that this neighborhood is famous for Western food rather than local food; because Western food is generally more expensive than local food, this may be evidence of gentrification in Yanglim. The ten most important keywords were famous place (0.0343), menu (0.0233), order (0.0176), food (0.0122), recommendation (0.0112), visit (0.0096), sauce (0.0090), pasta (0.0089), vibe (0.0085), and thought (0.0071). There were twenty additional salient keywords: feeling, friend, meat, pizza, table, meal, eating place, spot, menu (board), soup, time, store, dish, cheese, salad, today, human, steak, waiting, and shrimp. In Topic 4 (16.8%), keywords about cafés and the vibe emerged. The keywords included café (0.0730), coffee (0.0220), order (0.0150), dessert (0.0146), vibe (0.0122), famous place (0.0115), recommendation (0.0106), beverage (0.0101), space (0.0100), and bread (0.0100). Twenty further keywords relate to feelings, friends, and thoughts. The Yanglim neighborhood has worked to combine art and local business. One successful example is the Lee, Leenam studio, a café-cum-media art studio exhibiting the artwork of the media artist Lee, Leenam.
Travel, art, and local stories form the last theme. Topic 2 (travel, Penguin Village, and art) makes up 9.5% of the keywords. Although the influx of tourist traffic slowed down in this phase, artwork, Penguin Village, and travel remained among the top-ranked keywords for this topic. The importance of each keyword was as follows: artwork (0.0107), Penguin Village (0.0096), village (0.0091), travel (0.0089), space (0.0080), area (0.0074), proceeding (0.0073), afternoon (0.0073), introduction (0.0071), and exhibition (0.0063). In addition to art and culture, space-related keywords included street, artist, visit, building, culture, operation, alley, appearance, city, course, representative, art, use, place, participation, experience, site, tour, history, and making. Topic 3 was about the local story and urban regeneration (5.2%). Reflecting the area’s history as a heritage site of Christianity, the relevant terms were human (0.0087), missionary (0.0074), thanks (0.0063), church (0.0062), Korea (0.0061), life (0.0047), school (0.0044), history (0.0042), mother (0.0041), and name (0.0038). Keywords such as love, beginning, Chosun (the last imperial dynasty of Korea), support, pastor, country, activity, story, exercise, urban regeneration, society, progress, real estate, earth, China, America, Mr., spirit, song, and spot were included as well.

5. Discussion

Analyzing big data in urban regeneration areas helps policy makers to better understand citizens’ thoughts and needs and how they position themselves within the cities and neighborhoods. Comprehending neighborhood identity and its changing nature offers a framework for this analysis. Local government policies making good use of historical and cultural resources and citizen participation have become drivers of neighborhood changes and have gradually transformed neighborhood identities, as has the COVID-19 pandemic more recently. In this study, the authors have examined how the identity of the Yanglim area of Gwangju, Korea has changed through urban regeneration initiatives and citizens’ participation and interactions. The authors divided this neighborhood identity into three phases using trend analysis: (1) January 2013 to October 2016 (stage 1: flourishment), (2) November 2016 to February 2020 (stage 2: maturation), and (3) March 2020 to October 2022 (stage 3: COVID-19).
The topics included the issues around events, businesses, emotions and sentiments, users, activities, history, art, and culture in the neighborhood [52]. The topics and salient keywords identified have changed over time.
In the beginning, Yanglim was unnoticed by locals and tourists. However, cooperation between the local government and residents brought about successful urban regeneration. The local government steered the maintenance and improvement of the neighborhood environment and made use of storytelling through historical and cultural resources (i.e., culture-led urban regeneration). In addition, the voluntary involvement of the local residents and the cooperation of local artists in beautifying a burnt house site and the surrounding neighborhood helped it acquire its current reputation as Penguin Village, the new name of the Yanglim area. This cooperation between government and local residents played a crucial role in burgeoning urban regeneration and a new identity introduced by cultural and historical resources for this area.
Second, the growing demands from new clientele became an important driver of neighborhood change [53]. Thanks to the influx of tourists and local visitors for sightseeing, small business entrepreneurs opened shops with their own featured items and foods. During the first phase, businesses taking advantage of affordable rents (e.g., photo studios, flower shops, wedding shops, and guest houses as well as coffee shops, bakeries, and restaurants) were the primary businesses in the area. For example, wedding shops, guest houses, and photo studios lured local customers who desired special experiences combining cultural and modern heritage. As time went by, newly opened food-related businesses such as coffee shops, bakeries, and restaurants became predominant, driven by demand from a new population in the area, such as tourists and young locals. As a result, the neighborhood’s identity changed to that of a commercializing hot spot, with a great many top-rated and must-see restaurants and cafés with their own unique tastes and moods. The distinctive characteristic of urban regeneration projects in Korea is commercialization through the conversion of residential buildings into commercial buildings such as cafés and restaurants. This conversion changes neighborhoods into tourist spots, though it brings up gentrification issues [54]. These food-related businesses function as a proxy for local economic change associated with gentrification in neighborhoods, providing real-time and up-to-date insights into gentrification before official statistics are released [55]. As expected, this identity change in Yanglim has led to gentrification issues, with steadily rising rents reported in the news media. Thus, in 2019, the local government, landlords, and small entrepreneurs (i.e., tenants) reached an agreement to prevent dramatic rent increases, backed by the promise of a government subsidy [56].
Government statistics released later showed that the number of food-related businesses that closed down increased by 50% from 2018 to 2019, confirming these news reports (Table 2). On top of these endeavors, the local government has worked to attract local artists by designating a block as the Yanglim Culture Park (4071 m², 1077 m² for buildings) created between 2018 and 2020 [57]. All of the buildings built or renovated with the Hanok architecture type are occupied by fourteen different studios for arts and crafts [58].
Lastly, in addition to the gentrification issue, COVID-19 has propelled unexpected neighborhood changes. Yanglim’s identity as a hot spot for tourists and locals has been weakened due to the worldwide pandemic. Small business owners (tenants) with insufficient capital closed down their businesses due to slow business as well as rising rents and real estate prices. As seen in the data in Table 2, the number of businesses shutting down has gradually grown since 2017, and almost doubled in 2022 compared to 2018 (i.e., from 14 to 27). As the Yanglim neighborhood has become less famous as a hot spot for the young and tourists following the COVID-19 pandemic, the majority of blog posts now portray it as a place for individuals and their loved ones. The neighborhood now stands at a crossroads between regaining its previous reputation as a renowned tourist spot with abundant heritage and art studios and regaining its reputation as a hot spot whose cafés and eateries draw the attention of the young. It is likely that gentrification is one of the barriers to winning back its fame. Because the Yanglim Culture Park opened during the COVID-19 outbreak, local artists have stated that initiatives embracing their activities (e.g., one-day classes or festivals) are necessary to appeal to more potential tourists and locals. However, the Korean central government has cut the budget for urban regeneration projects, and currently the local government feels pressure to create self-sustainable neighborhoods [59]. Consequently, it is important to strategize for economically and socially sustainable neighborhoods in the era of depopulation and degentrification.

6. Conclusions and Limitations

This study has attempted to analyze how neighborhood identity has changed through linguistic patterns using computational techniques (i.e., text mining). User-generated data provide insights about urban regeneration. Such insights can be valuable for improving current situations and planning for economically and socially sustainable neighborhoods and cities [8].
On the one hand, urban regeneration in the Yanglim area was considered to be successful overall. In the first phase, the identity of the Yanglim neighborhood in Gwangju, Korea, was formed by different stakeholders. The activities of policymakers and local residents led to a shared identity known as the Yanglim History and Culture Village, or Penguin Village. Starting with an identity as a tourist spot, the identity of this area evolved into a hot spot with trendy and unique cafés and restaurants, drawing attention from a considerable number of tourists and local young adults. On the other hand, gentrification became an issue, similar to other urban regeneration sites. The outbreak of the COVID-19 pandemic and the resulting decrease in visitors and tourists changed the neighborhood’s identity to a place for everyday life. The dominant blog postings during that time were about time and activities spent with loved ones such as friends and family members in the neighborhood. This means that the neighborhood has been experiencing degentrification, losing its reputation as a hot spot due to economic crisis [54]. Consequently, it appears that the process of enabling this neighborhood to be economically and socially sustainable has been challenged by successive experiences of gentrification and degentrification.
While blog data allow us to listen to citizens’ voices through massive volumes of data, they have possible biases. First of all, blog postings are often used to promote businesses: entrepreneurs or compensated service users upload promotional posts that generate a high number of keywords, and these can outweigh personal online postings. Second, analyzing blog data may not adequately capture the viewpoints of residents and small business owners in the area, especially those who experience displacement and gentrification. Whereas displacement and gentrification issues may show up in micro-level data analysis, LDA modeling, as a technique for macro-level analysis, may not be able to capture these issues. In addition, the majority of blog or social media users tend to emphasize the positive side of their lives [52]. Thus, complementing this approach with other research methods is recommended for further research.

Author Contributions

H.Y.Y. contributed to the entire research process, including conceptualization, data collection, writing, and revision. H.-a.K. contributed to funding acquisition, project administration, supervision, and review of the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Education of the Republic of Korea through the National Research Foundation of Korea (NRF-2019R1I1A3A01061072).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are not available due to confidentiality.

Acknowledgments

The authors acknowledge financial support from the funder and comments from three anonymous reviewers for the improvement of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Most Important Terms and Blog Posting Numbers by Topic and Year

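| Topic | Keyword 1 | Keyword 2 | Keyword 3 | Keyword 4 | Keyword 5 | 2013 | 2014 | 2015 | 2016 | 2017 | 2018 | 2019 | 2020 | 2021 | 2022 | Sum | % of tokens |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | photo | human | friend | thought | time | 42 | 69 | 240 | 515 | 1601 | 1690 | 1758 | 2160 | 3532 | 5085 | 16,692 | 33.2 |
| 1 | travel | Penguin Village | photo | time | village | 47 | 103 | 386 | 859 | 1237 | 933 | 652 | 530 | 743 | 689 | 6179 | 10.3 |
| 2 | café | coffee | photo | vibe | order | 56 | 37 | 124 | 341 | 1649 | 1940 | 2017 | 2071 | 2832 | 2364 | 13,431 | 17.5 |
| 3 | famous place | menu | order | food | pasta | 32 | 101 | 105 | 385 | 1801 | 2658 | 2375 | 2235 | 2805 | 2684 | 15,181 | 18.6 |
| 4 | human | culture | story | area | beginning | 188 | 354 | 781 | 1204 | 1365 | 1178 | 1165 | 1176 | 1476 | 1192 | 10,079 | 20.4 |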

Appendix B. Most Important Terms by Topic for All Data

| Topic 0: Daily Life (33.2%) | Importance | Topic 1: Tour and Culture (20.4%) | Importance | Topic 2: Café and Hot Spot (18.6%) | Importance | Topic 3: Food and Hot Spot (17.5%) | Importance | Topic 4: Culture and Citizen Participation (10.3%) | Importance |
|---|---|---|---|---|---|---|---|---|---|
| photo | 0.0153 | travel | 0.0326 | café | 0.0684 | famous place | 0.0344 | human | 0.0073 |
| human | 0.0109 | Penguin Village | 0.0278 | coffee | 0.0244 | menu | 0.0234 | culture | 0.0058 |
| friend | 0.0095 | photo | 0.0225 | photo | 0.0165 | order | 0.0170 | story | 0.0054 |
| thought | 0.0080 | time | 0.0139 | vibe | 0.0137 | food | 0.0131 | area | 0.0052 |
| time | 0.0077 | village | 0.0109 | order | 0.0137 | pasta | 0.0093 | beginning | 0.0043 |
| mom | 0.0075 | alley | 0.0088 | dessert | 0.0096 | recommendation | 0.0091 | proceeding | 0.0041 |
| unni | 0.0073 | looking around | 0.0087 | bread | 0.0092 | vibe | 0.0090 | time | 0.0037 |
| café | 0.0060 | street | 0.0086 | feeling | 0.0089 | thought | 0.0080 | thought | 0.0035 |
| oppa | 0.0057 | thought | 0.0085 | recommendation | 0.0087 | meat | 0.0078 | space | 0.0034 |
| dinner | 0.0056 | human | 0.0082 | spot | 0.0085 | sauce | 0.0078 | performance | 0.0034 |
| love | 0.0050 | tour | 0.0079 | beverage | 0.0085 | visit | 0.0067 | history | 0.0033 |
| happiness | 0.0047 | appearance | 0.0073 | space | 0.0082 | eating place | 0.0061 | village | 0.0031 |
| morning | 0.0041 | course | 0.0067 | interior | 0.0081 | photo | 0.0061 | city | 0.0031 |
| beginning | 0.0037 | neighborhood | 0.0067 | menu | 0.0079 | friend | 0.0060 | artist | 0.0030 |
| today | 0.0037 | child | 0.0062 | thought | 0.0067 | feeling | 0.0060 | love | 0.0030 |
| heart | 0.0036 | reminiscence | 0.0059 | famous place | 0.0066 | table | 0.0055 | operation | 0.0030 |
| daily life | 0.0036 | feeling | 0.0053 | visit | 0.0065 | pizza | 0.0054 | Korea | 0.0030 |
| gift | 0.0036 | recommendation | 0.0050 | friend | 0.0060 | meal | 0.0054 | artwork | 0.0028 |
| feel like | 0.0033 | building | 0.0050 | human | 0.0056 | menu (board) | 0.0052 | representative | 0.0027 |
| last | 0.0032 | café | 0.0047 | time | 0.0050 | price | 0.0048 | citizen | 0.0027 |
| drink | 0.0030 | beginning | 0.0046 | business owner | 0.0049 | spot | 0.0047 | event | 0.0026 |
| lunch | 0.0030 | visit | 0.0046 | table | 0.0046 | time | 0.0043 | activity | 0.0026 |
| coffee | 0.0029 | name | 0.0046 | cake | 0.0044 | sushi | 0.0041 | introduction | 0.0025 |
| tea | 0.0029 | bus | 0.0045 | Dongmyungdong | 0.0043 | human | 0.0041 | exhibition | 0.0025 |
| Dongmyungdong | 0.0028 | tea | 0.0043 | price | 0.0042 | steak | 0.0041 | participation | 0.0025 |
| one day | 0.0028 | walk | 0.0043 | macaroon | 0.0042 | cheese | 0.0039 | appearance | 0.0024 |
| weather | 0.0026 | alleyway | 0.0043 | inside | 0.0041 | salad | 0.0039 | missionary | 0.0024 |
| weekend | 0.0026 | heart | 0.0043 | Americano | 0.0040 | store | 0.0039 | art | 0.0024 |
| story | 0.0025 | arrival | 0.0041 | tea | 0.0039 | shrimp | 0.0036 | heart | 0.0023 |
| dongsaeng | 0.0025 | Penguin_Village | 0.0041 | posting | 0.0038 | today | 0.0035 | Mr. | 0.0022 |

Appendix C. Most Important Terms by Topic and Theme for Stage 1

Theme 1: Culture, Travel, and Area (36.3%) covers Topics 5, 6, and 7. Theme 3: The Creation of the Neighborhood (20.9%) covers Topics 0 and 3.

| Topic 5 (8.3%) keyword | Importance | Topic 6 (12.1%) keyword | Importance | Topic 7 (15.9%) keyword | Importance | Topic 0 (9.2%) keyword | Importance | Topic 3 (11.7%) keyword | Importance |
|---|---|---|---|---|---|---|---|---|---|
| area | 0.0125 | culture | 0.0286 | travel | 0.023 | human | 0.0205 | human | 0.0229 |
| business | 0.0123 | art | 0.0109 | village | 0.0077 | thought | 0.0181 | missionary | 0.012 |
| tourism | 0.0087 | performance | 0.0107 | Seoul | 0.0063 | photo | 0.0179 | church | 0.0116 |
| support | 0.0086 | area | 0.0101 | market | 0.0059 | story | 0.0178 | Mr. | 0.0112 |
| citizen | 0.0074 | artwork | 0.0092 | time | 0.0059 | time | 0.0111 | Korea | 0.0095 |
| operation | 0.0073 | event | 0.0088 | history | 0.0058 | book | 0.0109 | love | 0.0075 |
| city | 0.0072 | proceeding | 0.0087 | course | 0.0054 | heart | 0.0099 | school | 0.0074 |
| resident | 0.0059 | artist | 0.0083 | street | 0.0053 | photographing | 0.0094 | mother | 0.0064 |
| planning | 0.0058 | exhibition | 0.0079 | human | 0.0048 | love | 0.0083 | story | 0.0061 |
| representative | 0.0057 | participation | 0.0079 | beginning | 0.0047 | movie | 0.0082 | God | 0.0061 |
| progress | 0.0056 | space | 0.0076 | bus | 0.0047 | beginning | 0.0079 | heart | 0.006 |
| proceeding | 0.0054 | citizen | 0.0071 | culture | 0.0046 | friend | 0.0058 | pastor | 0.0059 |
| education | 0.0052 | festival | 0.0069 | city | 0.0045 | song | 0.0054 | Jesus | 0.0057 |
| necessity | 0.0045 | program | 0.0061 | area | 0.0045 | preparation | 0.0052 | mission | 0.0053 |
| village | 0.0041 | theme | 0.006 | Mudeungsan | 0.0043 | writing | 0.0052 | life | 0.0053 |
| facility | 0.004 | Biennale | 0.006 | experience | 0.0042 | thanks | 0.0051 | beginning | 0.0051 |
| visit | 0.004 | activity | 0.0059 | Daein_Market | 0.004 | music | 0.0048 | name | 0.0051 |
| development | 0.0039 | Korea | 0.0054 | appearance | 0.0039 | Jeongyulseong | 0.0048 | Christianity | 0.0049 |
| effort | 0.0039 | world | 0.0053 | park | 0.0038 | Dongaebi | 0.0048 | thought | 0.0046 |
| participation | 0.0038 | schedule | 0.0052 | Daegu | 0.0035 | child | 0.0047 | word | 0.0045 |
| evaluation | 0.0037 | afternoon | 0.0051 | nature | 0.0035 | appearance | 0.0047 | hospital | 0.0044 |
| activity | 0.0036 | variety | 0.0045 | tour | 0.0034 | work | 0.0042 | country | 0.004 |
| development | 0.0035 | youth | 0.0042 | spot | 0.0032 | happiness | 0.0041 | history | 0.0038 |
| economy | 0.0033 | culture_art | 0.0042 | Pusan | 0.0032 | study | 0.004 | Father | 0.0037 |
| formation | 0.0031 | operation | 0.0042 | alley | 0.0032 | reason | 0.0038 | global society | 0.0035 |
| management | 0.0031 | representative | 0.0041 | view | 0.003 | gift | 0.0036 | worship | 0.0033 |
| thought | 0.0031 | experience | 0.004 | tourist spot | 0.003 | place | 0.0035 | spirit | 0.0033 |
| result | 0.0031 | opening | 0.004 | museum | 0.003 | global society | 0.0034 | America | 0.0033 |
| woman | 0.003 | music | 0.0039 | country | 0.0029 | memory | 0.0034 | Chosun | 0.003 |
| industry | 0.003 | China | 0.0038 | operation | 0.0028 | Mr. | 0.0033 | prayer | 0.0029 |

Theme 2: Historic and Cultural Heritage and the Evolution of Penguin Village (23.8%) covers Topics 1, 4, and 9. Theme 4: Cafés and Daily Life (19.0%) covers Topics 2 and 8.

| Topic 1 (7.7%) keyword | Importance | Topic 4 (9.5%) keyword | Importance | Topic 9 (6.6%) keyword | Importance | Topic 2 (9.1%) keyword | Importance | Topic 8 (9.9%) keyword | Importance |
|---|---|---|---|---|---|---|---|---|---|
| building | 0.0268 | café | 0.0207 | Penguin Village | 0.0595 | café | 0.0198 | photo | 0.0235 |
| photo | 0.0148 | travel | 0.0199 | photo | 0.0213 | order | 0.015 | friend | 0.0128 |
| House of Jangwoo Yi | 0.0116 | photo | 0.0169 | village | 0.0185 | menu | 0.0134 | time | 0.0113 |
| appearance | 0.0115 | alley | 0.0144 | time | 0.0136 | coffee | 0.0129 | thought | 0.0107 |
| Sajik Park | 0.0099 | neighborhood | 0.0143 | penguin | 0.0128 | famous place | 0.0129 | bread | 0.0098 |
| House_of_Missionary_Wilson | 0.0096 | human | 0.0127 | performance | 0.0111 | price | 0.0102 | human | 0.0095 |
| history | 0.0077 | thought | 0.0111 | Penguin_Village | 0.0102 | meat | 0.0094 | mom | 0.0094 |
| Sajik_Park_observatory | 0.0073 | time | 0.0108 | Yanglim Salon | 0.0089 | business owner | 0.009 | unni | 0.0086 |
| house | 0.0068 | looking around | 0.0105 | experience | 0.0082 | food | 0.0088 | posting | 0.0069 |
| History and Culture Village | 0.0066 | village | 0.0099 | artwork | 0.0081 | Tteokbokki | 0.0087 | oppa | 0.0068 |
| flower shop | 0.0066 | space | 0.0093 | tour | 0.0071 | vibe | 0.008 | today | 0.0065 |
| use | 0.0065 | feeling | 0.009 | Penguin Village_Penguin Village | 0.007 | table | 0.0071 | dinner | 0.0064 |
| architecture | 0.0063 | Penguin Village | 0.0081 | resident | 0.0067 | recommendation | 0.0059 | morning | 0.006 |
| observatory | 0.0062 | heart | 0.008 | reminiscence | 0.0065 | interior | 0.0055 | weekend | 0.0059 |
| Hanok | 0.0061 | map | 0.0078 | child | 0.0061 | eating place | 0.0052 | dongsaeng | 0.0058 |
| tree | 0.0061 | street | 0.0077 | item | 0.0059 | spot | 0.0052 | daily life | 0.0055 |
| space | 0.0059 | painting | 0.0069 | flea_market | 0.0059 | menu (board) | 0.005 | tea | 0.0054 |
| Dahyung Tearoom | 0.0057 | appearance | 0.0066 | looking around | 0.0058 | store | 0.0048 | arrival | 0.0054 |
| modern | 0.0057 | vibe | 0.0065 | clock | 0.0056 | feeling | 0.0048 | Yanglim Bakery | 0.0052 |
| Memorial House of Owen | 0.0057 | tea | 0.0063 | beginning | 0.0056 | beverage | 0.0043 | night | 0.0052 |
| missionary_house | 0.0056 | name | 0.0062 | human | 0.0056 | photo | 0.004 | feel like | 0.0049 |
| Modern_History_and_Culture_Village | 0.0052 | alleyway | 0.0061 | proceeding | 0.0055 | thought | 0.0039 | weather | 0.0048 |
| inside | 0.0052 | coffee | 0.0053 | appearance | 0.0053 | friend | 0.0038 | hand | 0.0044 |
| Choi Seunghyo_House | 0.0051 | walk | 0.0052 | name | 0.0051 | tea | 0.0037 | rain | 0.0041 |
| Yanglim Church | 0.005 | building | 0.0049 | Penguin_vegetable garden | 0.0051 | side dishes | 0.0037 | hair | 0.004 |
| time | 0.0046 | discovery | 0.0048 | alley | 0.005 | name | 0.0035 | beginning | 0.0039 |
| beginning | 0.0045 | mural | 0.0046 | vegetable garden | 0.0049 | visit | 0.0034 | lunch | 0.0037 |
| night view | 0.0044 | Songjung_Station_Market | 0.0045 | story | 0.0049 | cheese | 0.0033 | last | 0.0037 |
| name | 0.0044 | friend | 0.0044 | introduction | 0.0048 | taste | 0.0033 | looking around | 0.0036 |
| housing | 0.0043 | art gallery | 0.0044 | thought | 0.0048 | sauce | 0.0033 | memory | 0.0036 |

Appendix D. Most Important Terms by Topic and Theme for Stage 2

Theme 1: Daily Life (21.8%) covers Topic 0. Theme 2: Cultural Activities and Tourism (31.2%) covers Topics 1 and 2. Theme 3: Hot Spot with Good Eateries and Cafés (47.0%) covers Topics 3 and 4.

| Topic 0: Daily Life (21.8%) | Importance | Topic 1: Culture and Relevant Activities (12.8%) | Importance | Topic 2: Penguin Village and Travel (18.4%) | Importance | Topic 3: Famous Eateries (24.5%) | Importance | Topic 4: Famous Cafés (22.5%) | Importance |
|---|---|---|---|---|---|---|---|---|---|
| photo | 0.0145 | experience | 0.0084 | Penguin Village | 0.0190 | famous place | 0.0284 | café | 0.0558 |
| human | 0.0096 | story | 0.0069 | travel | 0.0168 | menu | 0.0189 | photo | 0.0217 |
| friend | 0.0095 | culture | 0.0068 | village | 0.0129 | order | 0.0146 | coffee | 0.0177 |
| oppa | 0.0094 | performance | 0.0061 | human | 0.0099 | food | 0.0104 | vibe | 0.0119 |
| time | 0.0088 | proceeding | 0.0061 | photo | 0.0079 | thought | 0.0083 | order | 0.0109 |
| mom | 0.0081 | time | 0.0058 | time | 0.0069 | photo | 0.0081 | bread | 0.0103 |
| unni | 0.0072 | area | 0.0056 | street | 0.0057 | pasta | 0.0080 | recommendation | 0.0076 |
| thought | 0.0066 | beginning | 0.0048 | thought | 0.0056 | vibe | 0.0079 | spot | 0.0073 |
| love | 0.0056 | participation | 0.0047 | alley | 0.0056 | meat | 0.0078 | feeling | 0.0073 |
| dinner | 0.0052 | youth | 0.0047 | appearance | 0.0049 | recommendation | 0.0072 | dessert | 0.0069 |
| happiness | 0.0050 | operation | 0.0046 | name | 0.0048 | sauce | 0.0062 | thought | 0.0065 |
| daily life | 0.0048 | festival | 0.0044 | beginning | 0.0048 | visit | 0.0062 | human | 0.0065 |
| gift | 0.0042 | theme | 0.0043 | tour | 0.0044 | menu (board) | 0.0060 | beverage | 0.0062 |
| morning | 0.0041 | program | 0.0042 | space | 0.0043 | friend | 0.0059 | friend | 0.0059 |
| beginning | 0.0040 | event | 0.0042 | history | 0.0042 | eating place | 0.0057 | menu | 0.0057 |
| Starbucks | 0.0040 | variety | 0.0040 | building | 0.0040 | feeling | 0.0055 | time | 0.0050 |
| feel like | 0.0037 | space | 0.0039 | neighborhood | 0.0037 | table | 0.0055 | visit | 0.0049 |
| drink | 0.0036 | city | 0.0039 | looking around | 0.0035 | pizza | 0.0054 | interior | 0.0047 |
| preparation | 0.0035 | afternoon | 0.0039 | artwork | 0.0035 | human | 0.0050 | space | 0.0046 |
| weekend | 0.0035 | thought | 0.0037 | heart | 0.0032 | time | 0.0048 | famous place | 0.0046 |
| Seoul | 0.0034 | activity | 0.0037 | winter | 0.0031 | meal | 0.0047 | cake | 0.0043 |
| lunch | 0.0034 | representative | 0.0036 | child | 0.0031 | sushi | 0.0044 | posting | 0.0041 |
| heart | 0.0034 | preparation | 0.0036 | missionary | 0.0030 | spot | 0.0044 | price | 0.0037 |
| hand | 0.0033 | charm | 0.0036 | tourist spot | 0.0030 | salad | 0.0040 | Dongmyungdong | 0.0036 |
| child | 0.0033 | site | 0.0036 | alleyway | 0.0030 | price | 0.0040 | hot spot | 0.0036 |
| Dongmyungdong | 0.0032 | artist | 0.0035 | course | 0.0029 | today | 0.0034 | business owner | 0.0036 |
| last | 0.0032 | exhibition | 0.0034 | visit | 0.0027 | shrimp | 0.0034 | table | 0.0035 |
| hair | 0.0031 | installation | 0.0032 | reminiscence | 0.0027 | cheese | 0.0033 | inside | 0.0035 |
| dongsaeng | 0.0030 | schedule | 0.0031 | museum | 0.0027 | curry | 0.0032 | macaroon | 0.0033 |
| today | 0.0029 | narrative | 0.0030 | History and Culture Village | 0.0026 | Testa | 0.0031 | sensibility | 0.0033 |

Appendix E. Most Important Terms by Topic and Theme for Stage 3

| Theme 1: Daily Life (52.3%) | | | | Theme 3: Travel, Art, and Local Stories (14.7%) | | | | Theme 2: Cafés and Local Hot Spots (34.0%) | | | |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Topic 0: Loved Ones in the Neighborhood (29.0%) | | Topic 1: Activities in the Neighborhood (22.3%) | | Topic 2: Travel, Penguin Village, and Art (9.5%) | | Topic 3: Local Story and Urban Regeneration (5.2%) | | Topic 4: Café and Vibe (16.8%) | | Topic 5: Famous Places and the Food (17.2%) | |
| Keyword | Importance | Keyword | Importance | Keyword | Importance | Keyword | Importance | Keyword | Importance | Keyword | Importance |
| friend | 0.0102 | thought | 0.0164 | artwork | 0.0107 | human | 0.0087 | café | 0.0730 | famous place | 0.0343 |
| human | 0.0098 | time | 0.0153 | Penguin Village | 0.0096 | missionary | 0.0074 | coffee | 0.0220 | menu | 0.0233 |
| mom | 0.0094 | human | 0.0136 | village | 0.0091 | thanks | 0.0063 | order | 0.0150 | order | 0.0176 |
| unni | 0.0094 | heart | 0.0097 | travel | 0.0089 | church | 0.0062 | dessert | 0.0146 | food | 0.0122 |
| café | 0.0080 | flower | 0.0056 | space | 0.0080 | Korea | 0.0061 | vibe | 0.0122 | recommendation | 0.0112 |
| dinner | 0.0078 | beginning | 0.0055 | area | 0.0074 | life | 0.0047 | famous place | 0.0115 | visit | 0.0096 |
| oppa | 0.0075 | story | 0.0054 | proceeding | 0.0073 | school | 0.0044 | recommendation | 0.0106 | sauce | 0.0090 |
| happiness | 0.0051 | book | 0.0052 | afternoon | 0.0071 | history | 0.0042 | beverage | 0.0101 | pasta | 0.0089 |
| morning | 0.0049 | looking around | 0.0046 | introduction | 0.0063 | mother | 0.0041 | space | 0.0100 | vibe | 0.0085 |
| drink | 0.0049 | travel | 0.0045 | exhibition | 0.0063 | name | 0.0038 | bread | 0.0100 | thought | 0.0071 |
| lunch | 0.0047 | love | 0.0043 | street | 0.0058 | love | 0.0034 | spot | 0.0096 | feeling | 0.0069 |
| time | 0.0043 | feeling | 0.0041 | artist | 0.0056 | beginning | 0.0031 | feeling | 0.0091 | friend | 0.0064 |
| today | 0.0042 | child | 0.0041 | visit | 0.0054 | Chosun | 0.0031 | visit | 0.0087 | meat | 0.0061 |
| thought | 0.0040 | preparation | 0.0041 | building | 0.0051 | support | 0.0031 | menu | 0.0085 | pizza | 0.0059 |
| coffee | 0.0039 | name | 0.0037 | culture | 0.0051 | pastor | 0.0031 | interior | 0.0071 | table | 0.0058 |
| love | 0.0039 | thanks | 0.0037 | operation | 0.0050 | country | 0.0030 | friend | 0.0058 | meal | 0.0054 |
| Dongmyung-dong | 0.0038 | appearance | 0.0036 | alley | 0.0043 | activity | 0.0030 | thought | 0.0055 | eating place | 0.0053 |
| exercise | 0.0035 | friend | 0.0034 | appearance | 0.0042 | story | 0.0030 | human | 0.0055 | spot | 0.0053 |
| gift | 0.0035 | tea | 0.0034 | city | 0.0041 | exercise | 0.0030 | inside | 0.0051 | menu (board) | 0.0048 |
| getting off work | 0.0034 | memory | 0.0034 | course | 0.0040 | urban_regeneration | 0.0029 | cake | 0.0051 | soup | 0.0043 |
| feel like | 0.0032 | happiness | 0.0030 | representative | 0.0040 | society | 0.0029 | Leeleenam Studio | 0.0046 | time | 0.0042 |
| dad | 0.0032 | recommendation | 0.0029 | art | 0.0038 | progress | 0.0028 | business owner | 0.0043 | store | 0.0041 |
| blog | 0.0032 | today | 0.0028 | use | 0.0037 | real estate | 0.0028 | table | 0.0040 | dish | 0.0036 |
| last | 0.0032 | writing | 0.0028 | place | 0.0037 | earth | 0.0027 | tea | 0.0040 | cheese | 0.0036 |
| best | 0.0031 | necessity | 0.0025 | participation | 0.0036 | China | 0.0027 | Americano | 0.0040 | salad | 0.0035 |
| beginning | 0.0030 | arrival | 0.0025 | experience | 0.0035 | America | 0.0027 | sensibility | 0.0037 | today | 0.0034 |
| daily life | 0.0030 | painting | 0.0024 | site | 0.0034 | Mr. | 0.0026 | latte | 0.0035 | human | 0.0034 |
| hair | 0.0029 | gift | 0.0024 | tour | 0.0033 | spirit | 0.0025 | time | 0.0034 | steak | 0.0033 |
| weather | 0.0029 | feel like | 0.0024 | history | 0.0033 | song | 0.0024 | cup | 0.0033 | waiting | 0.0033 |
| tea | 0.0029 | one day | 0.0023 | making | 0.0033 | spot | 0.0024 | looking around | 0.0032 | shrimp | 0.0032 |

References

  1. de Oliveira Capela, F.; Ramirez-Marquez, J.E. Detecting urban identity perception via newspaper topic modeling. Cities 2019, 93, 72–83. [Google Scholar] [CrossRef]
  2. Lynch, K. Good City Form; MIT Press: Cambridge, MA, USA, 1984. [Google Scholar]
  3. Oktay, D. The quest for urban identity in the changing context of the city: Northern Cyprus. Cities 2002, 19, 261–271. [Google Scholar] [CrossRef]
  4. Guimarães, P.P.C. An evaluation of urban regeneration: The effectiveness of a retail-led project in Lisbon. Urban Res. Pract. 2017, 10, 350–366. [Google Scholar] [CrossRef]
  5. Shin, H.; Stevens, Q. Debates around cultural re-imaging and culture-led Urban regeneration: The politics of two festivals in Gwangju and Glasgow. Asian J. Soc. Sci. 2014, 41, 628–652. [Google Scholar] [CrossRef]
  6. Mbhiza, M.H. Newtown Cultural Precinct as a Tool for Urban Regeneration within the Johannesburg Inner-City. Masters Thesis, University of Johannesburg, Johannesburg, South Africa, 2013. [Google Scholar]
  7. Bell, D.; Jayne, M. ‘Design-led’ urban regeneration: A critical perspective. Local Econ. 2003, 18, 121–134. [Google Scholar] [CrossRef]
  8. Ghahramani, M.; Galle, N.J.; Ratti, C.; Pilla, F. Tales of a city: Sentiment analysis of urban green space in Dublin. Cities 2021, 119, 103395. [Google Scholar] [CrossRef]
  9. Huang, R. Network fields, cultural identities and labor rights communities: Big data analytics with topic model and community detection. Chin. J. Sociol. 2019, 5, 3–28. [Google Scholar] [CrossRef] [Green Version]
  10. Lynch, K. Image of the City; M.I.T. Press: Cambridge, MA, USA, 1960. [Google Scholar]
  11. Matlovičová, K.; Kormaníková, J. City brand-image associations detection. Case study of Prague. In Proceedings of the SGEM International Multidisciplinary Scientific Conferences on Social Sciences and Arts, SGEM 2014, Psychology & Psychiatry, Sociology & Healthcare, Education, Conference Proceedings, Albena, Bulgaria, September 2014; pp. 139–146. [Google Scholar]
  12. Matlovičová, K.; Mocák, P.; Kolesárová, J. Environment of estates and crime prevention through urban environment formation and modification. Geogr. Pannonica 2016, 20, 168–180. [Google Scholar] [CrossRef]
  13. Olson, A.W.; Calderon-Figueroa, F.; Bidian, O.; Silver, D.; Sanner, S. Reading the city through its neighbourhoods: Deep text embeddings of Yelp reviews as a basis for determining similarity and change. Cities 2021, 110, 103045. [Google Scholar] [CrossRef]
  14. Suttles, G.D. The Social Construction of Communities; University of Chicago Press: Chicago, IL, USA, 1972. [Google Scholar]
  15. Phillips, N.E.; Levy, B.L.; Sampson, R.J.; Small, M.L.; Wang, R.Q. The social integration of American cities: Network measures of connectedness based on everyday mobility across neighborhoods. Sociol. Methods Res. 2021, 50, 1110–1149. [Google Scholar] [CrossRef]
  16. Bernardo, F.; Almeida, J.; Martins, C. Urban identity and tourism: Different looks, one single place. Proc. Inst. Civ. Eng.-Urban Des. Plan. 2017, 170, 205–216. [Google Scholar] [CrossRef]
  17. Casakin, H.; Hernández, B.; Ruiz, C. Place attachment and place identity in Israeli cities: The influence of city size. Cities 2015, 42, 224–230. [Google Scholar] [CrossRef]
  18. Ragab, T.S. The crisis of cultural identity in rehabilitating historic Beirut-downtown. Cities 2011, 28, 107–114. [Google Scholar] [CrossRef]
  19. Riza, M.; Doratli, N.; Fasli, M. City branding and identity. Procedia Soc. Behav. Sci. 2012, 35, 293–300. [Google Scholar] [CrossRef] [Green Version]
  20. Huovinen, A.; Timonen, E.; Leino, T.; Seppälä, T. Changing urban identities on a discursive map. City Cult. Soc. 2017, 11, 20–28. [Google Scholar] [CrossRef]
  21. Salesses, P.; Schechtner, K.; Hidalgo, C.A. The collaborative image of the city: Mapping the inequality of urban perception. PLoS ONE 2013, 8, e68400. [Google Scholar] [CrossRef] [Green Version]
  22. Hwang, J. The social construction of a gentrifying neighborhood: Reifying and redefining identity and boundaries in inequality. Urban Aff. Rev. 2016, 52, 98–128. [Google Scholar] [CrossRef]
  23. Jenks, M.; Dempsey, N. Defining the neighbourhood: Challenges for empirical research. Town Plan. Rev. 2007, 78, 153–177. [Google Scholar] [CrossRef]
  24. Filomena, G.; Verstegen, J.A.; Manley, E. A computational approach to ‘The Image of the City’. Cities 2019, 89, 14–25. [Google Scholar] [CrossRef]
  25. Zhou, B.; Liu, L.; Oliva, A.; Torralba, A. Recognizing city identity via attribute analysis of geo-tagged images. In Computer Vision—ECCV 2014, PT III; Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T., Eds.; Springer International Publishing: Cham, Switzerland, 2014; Volume 8691, pp. 519–534. [Google Scholar]
  26. Lansley, G.; Longley, P.A. The geography of Twitter topics in London. Comput. Environ. Urban Syst. 2016, 58, 85–96. [Google Scholar] [CrossRef] [Green Version]
  27. Fol, S. Urban shrinkage and socio-spatial disparities: Are the remedies worse than the disease? Built Environ. 2012, 38, 259–275. [Google Scholar] [CrossRef]
  28. KOSIS Area by Administrative Unit. Available online: https://kosis.kr/statHtml/statHtml.do?orgId=101&tblId=DT_1ZGA17 (accessed on 23 March 2023).
  29. SGIS Registered Population. Available online: https://sgis.kostat.go.kr/view/thematicMap/thematicMapMain?stat_thema_map_id=sI8uqsnppF20160121115806975w5w92MFI5L&theme=CTGR_001&mapType=05&CTGRS=CTGR_001:recommend,CTGR_002:recommend,CTGR_003:recommend,CTGR_004:recommend,CTGR_005:recommend,CTGR_006:recommend (accessed on 23 March 2023).
  30. Unknown Yanglim History and Culture Village. Available online: https://visityangnim.kr (accessed on 28 February 2023).
  31. Jung, S.-M.; Kang, S.-K. Stakeholder’s network structure analysis in community tourism development: Based on the Yanglimdong’s urban regeneration process. J. Tour. Stud. 2015, 27, 215–239. [Google Scholar]
  32. Beak, A.-K. A Study on the Revitalzation of Culture and Tourism for Gwangju Yanglim History and Culture Town; Chonnam National University: Gwangju, Republic of Korea, 2013. [Google Scholar]
  33. Jung, J.-Y. Gwangju has a plan to develop missionary heritages as historical and cultural village. Christian Newspaper, 7 May 2010. [Google Scholar]
  34. Choi, Y.H. The White Paper on Nam-Gu District 2016. Available online: https://www.namgu.gwangju.kr/upload/whitePaper/2016_backseo/EBook.htm (accessed on 25 May 2023).
  35. Park, C.; Kang, S. A nostalgic old time, the memory of Penguine Village, Yanglim, Gwangju. Hansan News, 9 July 2021. [Google Scholar]
  36. Nam, S.J. White Paper on Urban Regeneration in Gwangju 2022; Gwangju Urban Regeneration Communiy Center: Gwangju, Republic of Korea, 2022. [Google Scholar]
  37. Sirisuriya, D.S. A Comparative Study on Web Scraping. In Proceedings of the 8th International Research Conference, General Sir John Kotelawala Defence University, Sri Lanka, November 2015; pp. 135–140. Available online: https://scholar.google.com/citations?view_op=view_citation&hl=en&user=6zZ7vcIAAAAJ&citation_for_view=6zZ7vcIAAAAJ:kNdYIx-mwKoC (accessed on 10 April 2023).
  38. Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent dirichlet allocation. J. Mach. Learn. Res. 2003, 3, 993–1022. [Google Scholar]
  39. Unknown Language Corpora. Available online: https://guides.library.uq.edu.au/research-techniques/text-mining-analysis/language-corpora (accessed on 23 May 2023).
  40. Evison, J. What are the basics of analysing a corpus? In The Routledge Handbook of Corpus Linguistics; Routledge: London, UK, 2010; pp. 122–135. [Google Scholar]
  41. Park, E.L.; Cho, S. KoNLPy: Korean natural language processing in Python. In Proceedings of the 26th Annual Conference on Human & Cognitive Language Technology, Chuncheon, Republic of Korea, 10–11 October 2014; pp. 133–136. [Google Scholar]
  42. Mikolov, T.; Sutskever, I.; Chen, K.; Corrado, G.; Dean, J. Distributed Representations of Words and Phrases and their Compositionality; Cornell University Library, arXiv.org: Ithaca, NY, USA, 2013. [Google Scholar]
  43. Ramos, J. Using TF-IDF to determine word relevance in document queries. In Proceedings of the First Instructional Conference on Machine Learning, Piscataway, NJ, USA, 3 December 2003; pp. 1–4. [Google Scholar]
  44. Blei, D. Probabilistic Topic Models; ACM: New York, NY, USA, 2012; Volume 55, pp. 77–84. [Google Scholar]
  45. Moro, S.; Cortez, P.; Rita, P. Business intelligence in banking: A literature analysis from 2002 to 2013 using text mining and Latent Dirichlet Allocation. Expert Syst. Appl. 2015, 42, 1314–1324. [Google Scholar] [CrossRef] [Green Version]
  46. David Sontag, D.R. Complexity of Inference in Latent Dirichlet Allocation. In Proceedings of the Part of Advances in Neural Information Processing Systems 24, Granada, Spain, 12–15 December 2011; pp. 1–9. [Google Scholar]
  47. Maskeri, G.; Sarkar, S.; Heafield, K. Mining business topics in source code using latent dirichlet allocation. In Proceedings of the 1st India Software Engineering Conference, Hyderabad, India, 19–22 February 2008; pp. 113–120. [Google Scholar]
  48. Kullback, S.; Leibler, R.A. On Information and Sufficiency. Ann. Math. Stat. 1951, 22, 79–86. [Google Scholar] [CrossRef]
  49. Chuang, J.; Manning, C.; Heer, J. Termite: Visualization Techniques for Assessing Textual Topic Models; Tortora, G., Levialdi, S., Tucci, M., Eds.; ACM: New York, NY, USA; pp. 74–77.
  50. Gillies, M.; Murthy, D.; Brenton, H.; Olaniyan, R. Theme and topic: How qualitative research and topic modeling can be brought together. arXiv 2022, arXiv:2210.00707. [Google Scholar]
  51. Unknown [COVID-19 Briefing] COVID-19 Report in Gwangju. Available online: https://www.gwangju.go.kr/c19/boardView.do?pageId=corona64&boardId=BD_0000000027&seq=5019&movePage=133&searchTy=T&searchQuery=광주광역시%20코로나19 (accessed on 31 March 2023).
  52. Park, Y.; Kim, M.; Shin, J.; Heim LaFrombois, M.E. Changing trends in long-term sentiments and neighborhood determinants in a shrinking city. J. Plan. Educ. Res. 2021, 1–14. [Google Scholar] [CrossRef]
  53. Yun, H.Y.; Kim, J.; Ho, K.C. Spatial capital, cultural consumption and expatriate neighbourhoods in Hanoi, Vietnam. Asia Pac. Viewp. 2022, 63, 426–440. [Google Scholar] [CrossRef]
  54. Han, S.; Bohannon, C.L.; Kwon, Y. Degentrification? Different aspects of gentrification before and after the COVID-19 pandemic. Land 2021, 10, 1234. [Google Scholar] [CrossRef]
  55. Zukin, S.; Lindeman, S.; Hurson, L. The omnivore’s neighborhood? Online restaurant reviews, race, and gentrification. J. Consum. Cult. 2017, 17, 459–479. [Google Scholar] [CrossRef]
  56. Chung, J. Let’s stop gentrification. Convention of reciprocal cooperation between landlords and tenants in Yanglim. Yunhan New, 26 February 2019. [Google Scholar]
  57. Lee, Y. Craft Street, Penguin Village, Yanglim, September 2020. In Architect; Korea Institute of Registered Architects: Seoul, Republic of Korea, 19 January 2023; Available online: https://kiramonthly.com/871#:~:text=2020년%20현재%20양림동은,한옥이%20자리하고%20있다 (accessed on 10 April 2023).
  58. Unknown Yanglim Culture Park. Available online: http://craftst.or.kr/html/park_introduction (accessed on 31 March 2023).
  59. Park, Y. Gwangju district office: Urban regeneration projects face setbacks and reductions... No new projects selected. Nocutnews, 15 January 2023. [Google Scholar]
Figure 1. Research framework of this study.
Figure 2. Word clouds for all data.
Figure 3. Importance (weight) of keywords by each topic and total frequency of keywords for all data.
Figure 4. Trends for each topic from 2013 to 2022.
Figure 5. Number of blog posts by topic and month.
Figure 6. Word cloud for Stage 1. Topic numbers under the same theme have the same color for readability.
Figure 7. Importance (weight) of keywords by each topic and total frequency of the keywords for Stage 1.
Figure 8. Word clouds for Stage 2. Topic numbers under the same theme have the same color for readability.
Figure 9. Importance (weight) of keywords by each topic and total frequency of the keywords for Stage 2.
Figure 10. Word clouds for Stage 3. Topic numbers under the same theme have the same color for readability.
Figure 11. Importance (weight) of keywords by each topic and total frequency of the keywords for Stage 3.
Table 1. Most salient terms and their frequencies by stages.
| All | | Stage 1 | | Stage 2 | | Stage 3 | |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Hot Spot and Tourist Destination | | Cultural Heritages and Tourism | | Hot Spot and Cultural Tourism | | Daily Life and Hot Spot with Loved Ones | |
| Keywords | Frequency | Keywords | Frequency | Keywords | Frequency | Keywords | Frequency |
| café | 96,038 | photo | 4927 | café | 37,563 | café | 53,767 |
| photo | 79,606 | human | 4673 | photo | 31,725 | famous place | 29,929 |
| famous place | 54,285 | travel | 3304 | famous place | 20,894 | order | 22,162 |
| time | 44,234 | thought | 3179 | menu | 18,836 | menu | 21,439 |
| menu | 42,083 | culture | 2572 | order | 18,623 | coffee | 17,525 |
| order | 41,758 | Penguin Village | 2550 | friend | 14,871 | recommendation | 16,528 |
| coffee | 32,203 | café | 2289 | vibe | 14,224 | vibe | 15,644 |
| vibe | 31,971 | village | 2150 | coffee | 12,204 | visit | 15,193 |
| recommendation | 29,206 | story | 2049 | recommendation | 11,334 | feeling | 14,186 |
| feeling | 25,778 | area | 1833 | travel | 11,028 | spot | 13,719 |
| travel | 23,915 | space | 1763 | feeling | 10,209 | space | 11,321 |
| Penguin Village | 23,385 | friend | 1671 | visit | 9561 | unni | 11,066 |
| space | 20,242 | artwork | 1274 | food | 8732 | mom | 10,753 |
| food | 19,521 | alley | 1263 | bread | 7552 | dessert | 10,391 |
| unni | 18,183 | proceeding | 1212 | table | 6318 | food | 10,111 |
| mom | 17,958 | looking around | 1173 | dessert | 6148 | oppa | 9090 |
| dessert | 16,834 | Korea | 1160 | pasta | 5917 | travel | 8643 |
| bread | 16,125 | coffee | 1103 | village | 5904 | bread | 8288 |
| beverage | 14,228 | art | 1035 | beverage | 5768 | beverage | 7969 |
| looking around | 14,143 | Mr. | 1003 | meat | 5285 | table | 7053 |
| table | 13,809 | experience | 993 | menu (board) | 5108 | Penguin Village | 6901 |
| pasta | 12,878 | missionary | 941 | oppa | 4852 | pasta | 6251 |
| meat | 11,769 | church | 863 | sauce | 4772 | meat | 6181 |
| village | 11,523 | artist | 856 | eating place | 4451 | sauce | 6059 |
| sauce | 10,743 | order | 799 | cake | 3567 | pizza | 5615 |
| street | 9816 | menu | 690 | pizza | 3393 | artwork | 4725 |
| interior | 9537 | movie | 619 | sushi | 2861 | exhibition | 3736 |
| alley | 7942 | China | 528 | steak | 2349 | village | 3662 |
| tour | 6929 | (musical) performance | 179 | Testa | 2277 | artist | 3235 |
| culture | 6675 | music | 166 | experience | 2222 | soup | 2610 |
Table 2. The number of small businesses that closed down between 2013 and 2022 in Yanglim.
Year: 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022
N of businesses closed down (café, bakery, restaurant, etc.): 11887111421232027
Data obtained from https://www.localdata.go.kr/main.do (accessed on 22 March 2023).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yun, H.Y.; Kwon, H.-a. Neighborhood Identity Formation and the Changes in an Urban Regeneration Neighborhood in Gwangju, Korea. Sustainability 2023, 15, 11792. https://doi.org/10.3390/su151511792