Next Article in Journal
Reconstructing Rural Settlements Based on Structural Equation Modeling—Taking Hongshanyao Town of Jinchang City as an Example
Next Article in Special Issue
A Composite Resilience Index (CRI) for Developing Resilience and Sustainability in University Towns
Previous Article in Journal
Simulation Research on Energy Evolution and Supply Law of Rock–Coal System under the Influence of Stiffness
Previous Article in Special Issue
Scientometric Analysis of the Global Scientific Literature on Circularity Indicators in the Construction and Built Environment Sector
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Novel Use of Social Media Big Data and Artificial Intelligence for Community Resilience Assessment (CRA) in University Towns

by
Mohammed Abdul-Rahman
1,2,3,*,
Mayowa I. Adegoriola
1,
Wilson Kodwo McWilson
1,
Oluwole Soyinka
4 and
Yusuf A. Adenle
5
1
Department of Building and Real Estate, The Hong Kong Polytechnic University, Hong Kong, China
2
Department of Urban and Regional Planning, University of Lagos, Lagos 101017, Nigeria
3
AI Africa Lab, Lagos 101017, Nigeria
4
School of Public Policy and Global Affairs, University of British Columbia, Vancouver Campus, Vancouver, BC V6T 1Z2, Canada
5
Department of Geography and Resource Management, Chinese University of Hong Kong, Hong Kong, China
*
Author to whom correspondence should be addressed.
Sustainability 2023, 15(2), 1295; https://doi.org/10.3390/su15021295
Submission received: 10 November 2022 / Revised: 30 December 2022 / Accepted: 4 January 2023 / Published: 10 January 2023

Abstract

:
University towns face many challenges in the 21st century due to urbanization, increased student population, and higher educational institutions’ inability to house all their students on-campus. For university towns to be resilient and sustainable, the challenges facing them must be assessed and addressed. To carry out community resilience assessments, this study adopted a novel methodological framework to harness the power of artificial intelligence and social media big data (user-generated content on Twitter) to carry out remote studies in six university towns on six continents using Text Mining, Machine Learning, and Natural Language Processing. Cultural, social, physical, economic, and institutional and governance community challenges were identified and analyzed from the historical big data and validated using an online expert survey. This study gives a global overview of the challenges university towns experience due to studentification and shows that artificial intelligence can provide an easy, cheap, and more accurate way of conducting community resilience assessments in urban communities. The study also contributes to knowledge of research in the new normal by proving that longitudinal studies can be completed remotely.

1. Introduction

As the world experiences geometric growth in population and youth bulge in the 21st century, radical changes had to be made to higher education funding in most countries to meet the increasing demand for university education [1,2]. In most countries, such as the United Kingdom and the United States, these changes have also led to a shift in the funding of most Higher Educational Institutions (HEIs) away from the state, which increased the marketization of higher education [1,3]. According to Brooks, Byford, and Sela [1], the United Kingdom’s commercialization of higher education has changed the narratives. Students now “see degrees as private investments rather than public good”. To obtain the best “investment”, students now travel far away from home in search of “quality” when making their higher education choices. Related to this, Kinton, Smith, Harrison, and Culora [2] emphasised that global competition among HEIs for student “customers” have made universities more responsive, increased their teaching quality and focused on providing more conducive learning environments. For students, framing “students-as-consumers” clearly extends beyond the selection of universities and courses to other aspects of university life, such as residential decision-making, cost of living and students’ lifestyle. As a result of the above, there has been a growing global debate on the changing trends of student geographies. Housing developments are changing from traditional living pathways (on-campus accommodation) to off-campus shared Housing with Multiple Occupancies (HMOs) and Purpose-Built Students Accommodation (PBSA) enclaves, which gradually change the morphology of university towns and affect their sustainability [2,4,5].
“Studentification”, a term coined by British geographer Darren P. Smith in 2002, has been globally used to describe the significant processes of urban change and the challenges university towns face due to the growing students’ concentration off-campus. This is due to the inability of universities to house all their students within their campuses [4,6,7,8]. Some of the impacts of studentification have been well documented in the research corpus for the last two decades, but they were mainly woven around housing studies. Hence, most existing studies mainly discuss the economic, social, and environmental negative impacts of housing and students’ accommodation and proffer solutions around the same issues using human geography and social theories [2,9,10,11,12,13,14,15]. For university towns to be sustainable, they have to be resilient against the chronic stresses and shocks affecting them [16]. Building resilience requires a holistic assessment in all the dimensions of resilience [17,18]. Review of extant studentification literature shows that there are no studies looking at the negative impacts of studentification from the community resilience perspective, providing holistic community assessment, or identifying community challenges from textual big data using artificial intelligence [19].
To fill this identified research gap, this study proposed a novel Community Resilience Assessment (CRA) framework that uses Artificial Intelligence (AI) tools to identify and holistically assess community challenges within university towns. The research answered the questions of the possibility of using AI and textual big data to assess community challenges and the reliability of using such an assessment in university towns suffering from the negative impacts of studentification. We chose six university towns as case studies. Namely: Loughborough in Leicestershire, UK; Akoka in Lagos, Nigeria; Ann Arbor in Michigan, USA; Hung Hom in Kowloon, Hong Kong; Sydney in New South Wales, Australia; and Aguita de la Perdiz in Concepcion, Chile. These towns were selected because they have the highest studentification user-generated content in each continent based on Twitter’s big data. Figure 1 shows the geo-location of the six case studies.
This study gives a global overview of university towns’ challenges due to studentification beyond the housing issues often discussed in the literature. It also shows that AI and textual big data from microblogs can provide an easy, cheap, and more accurate way of conducting community resilience assessments. Section 2 of this paper shows the literature review and other related work, Section 3 explains the methodology, Section 4 shows the results from the case studies, Section 5 discusses the findings, and Section 6 gives the summary and conclusion as well as the limitations and areas for future research.

2. Theoretical and Conceptual Background

2.1. Studentification: Practical Challenges and Benefits

Studentification leads to urban changes over time. According to Smith [20] and Situmorang et al. [21] these changes have five key dimensions: social, cultural, physical, economic, and governance. Socially, studentification leads to structural gentrification and segregation. Culturally, the social clusters or concentrations of youths with shared students’ culture, lifestyle, and consumption practices lead to the introduction of new sub-cultures in the area. Physically, the environment may either be upgraded to cater to the new teaming customers (especially in retail and service infrastructure) or downgraded to a slum over time. And economically, housing stock changes to accommodate the student population lead to higher densities and inflation of property and rental prices. Local businesses also change their models over time to satisfy the needs of the students. With such rapid new complexities in the university towns, governance issues gradually manifest.
Although studentification is often portrayed as a negative phenomenon in the media and research, the town-gown relationship is not all parasitic. Some of the benefits of studentification to the university towns and their residents include the following: the provision of a young and educated workforce, cheaper labour and increased volunteerism [22]; adding more diversity and vibrancy to local cultures and raising the aspirations of the local youths [23]; enhancing the spending power, improving the local economy, creating more jobs and sustaining the local retail businesses [24]; supporting the local real estate sector and its associated trades (agency, insurance, finance, etc.) and driving up demands for quality housing provision [25]; as well as making the town more attractive to tourists and investors [26]. However, this study only looks at the practical challenges studentification has on university towns and their residents.

2.2. The Concept of Sustainability, Resilience, and Community Resilience Assessment

Defining sustainability depends on the framing and dimension. A common framework with substantial nexus with resilience is “the triple bottom line”, which conceptualizes that societies should not make decisions about their future based only on economic returns but also on environmental protection, social justice, and equity [27]. The principle of the triple bottom line suggests that human settlements must be environmentally bearable, socially equitable, and economically viable for the current generations and the future ones yet unborn [28]. According to UN-Habitat [29], resilience is essential to sustainability. That is why United Nations Sustainable Development Goal 11 (UNSDG 11) categorically mandated the 193 UN member nations to strive to make their human settlements inclusive, safe, resilient, and sustainable. In urban planning, the “concept of resilience” is defined as the ability of human settlements to prepare and plan for, absorb, recover from, and more successfully adapt to environmental, social, and economic adverse events [30]. Community resilience, therefore, is learning from the past, understanding current situations and using that information to minimize future negative impacts. Influenced by the above philosophy and the global call to develop a sustainable world, as well as the increasing challenges of human settlements, resilience research and the concept of community resilience assessment are fast becoming popular in global policy and scientific research and discourse [31].
Community Resilience Assessment (CRA) is an assessment carried out to identify and analyze the challenges human communities face [32]. CRAs are summative or formative toolkits, indexes, scorecards, and frameworks that identify and analyze socio-cultural, economic, environmental, and institutional community resilience challenges [31]. Sharifi [31] posited that good CRA methodologies should be able to identify community challenges in all dimensions of resilience, capture spatiotemporal dynamism, address uncertainties, and seek the opinions of the people involved. In the last two decades, more than 100 CRA methodologies (toolkits, indexes, scorecards, and frameworks) have been created by different organizations for different purposes, countries, or regions. No CRA methodology was explicitly developed to identify or assess community challenges in university towns. However, few can be modified to identify and evaluate specific challenges within university towns, such as natural disasters and climate change impacts.

2.3. The Use of Artificial Intelligence and User-Generated Content from Social Media Microblogs in Community Resilience Assessment

Processes in the built environment have seen a lot of disruptions in the 21st century [33]. This is mainly due to the new challenges human settlements face in the 21st century, coupled with the drive for smarter cities, the widespread use of AI, and the explosive data generation in the fourth industrial revolution [34]. Today, billions of data points are generated in cities globally because of the increase in internet usage and smart gadgets (Internet of Things) [35]. The rising complexities and challenges of our cities in this information age require new innovative methods because most traditional approaches can no longer harness the potential of the big data generated in our cities [36]. To rise to the occasion, professionals and researchers in the built environment now use AI systems to automate traditional processes and make them more efficient and smarter [37].
In simple terms, the vast and constantly expanding field of AI refers to machines or computers mimicking cognitive functions that humans associate with the human mind, such as learning and solving problems [38]. AI applications are being used in almost every sector. In urban planning, AI is used in security surveillance and smart transport systems (including traffic management) [39], robotics, automation and installation of infrastructure [40], health care delivery [40], garbage collection [41], air quality monitoring [42], and disaster management [43], among others. On the other hand, Machine Learning (ML) is a subfield of AI that trains machines to learn from experiences and make intelligent decisions with or without supervision [44]. One of such functions is learning human languages, communicating with humans, and reading human emotions [45]. This subfield of ML is called Natural Language Processing (NLP). Figure 2 summarizes the AI, ML, and NLP relationships.
Social media microblogs have become a key medium of communication and expression with the increased use of Internet of Things (IoT) and smartphones. This has made User-Generated Content (UGC) from Twitter, WeChat, Facebook, and Instagram a huge part of research in areas such as marketing, commerce, tourism, and health [46]. For example, Alharbi et al. [47] used Twitter big data, ML, and NLP methods to study the opinions of Apple phone users. Their research examines users’ sentiments to determine if they are happy or sad about using the new iPhones. Using a similar methodology and Twitter big data, Asghar et al. [48] also studied people’s automobile preferences. Generally, in commerce and marketing, companies use UGC to understand customers’ perceptions and satisfaction and how their goods and services are compared with other similar products in the market [49].
In the health and human settlements nexus, Carlos et al. [50] used Twitter data to study the outbreaks of dengue fever in Brazil, while Shah et al. [51] used data from medical microblogs to analyse the sentiments patients have toward their physicians in the UK. And in travel and tourism, Nilashi et al. [52] used data from social media microblogs and ML to study travellers’ decision-making processes and develop a system to recommend hotels tailored to their preferences. Similarly, Sun et al. [53] also used big data from social media to study trends and tourists’ opinions in China. Ahani et al. [54] also used a similar methodology to study customer behaviour and customer satisfaction in the hotel industry to develop a better marketing plan and recommend strategies for hotel owners to increase customer satisfaction and retention.
In an attempt to use similar methodologies above for urban planning and management Abdul-Rahman et al. [55] developed a framework to simplify pre-processing of social media big data using Text Mining ™, ML, and NLP. Their study showed that UGC from Twitter can be used to identify community challenges using AI. Similar to studies in marketing and tourism, people also share their opinions and sentiments on how they feel about their communities, what challenges their communities experience, and what they think the solutions are. This study expanded Abdul-Rahman, Chan, Wong, Irekponor, and Abdul-Rahman [55] study and methodology to develop a CRA framework for university towns. Apart from its efficiency, the novelty of this proposed framework lies in its ability to provide spatiotemporal analysis of community challenges among the five dimensions of resilience.
Among all social media microblogs, Twitter is commonly used for text mining because of the rich textual UGC, the size of the data, and the ease of using the Twitter API [56]. This study also used big data from Twitter.

3. Materials and Methods

Since this study adopted an existing AI-based framework with high accuracy [55], only key modified codes and procedures were repeated here. However, apart from adapting the framework to identify and assess the negative impacts of studentification in multiple case studies, the original approach’s validation step was also modified to online experts’ validation. This makes validation easier, faster, and cheaper.
The methodological framework in Figure 3 comprises the following steps:
(a)
Getting started—The user connects the computer (Local Host) to the internet.
(b)
Connecting to case study and Python environment—User receives geographical coordinates from case study and launches Python v3 (or a newer version) (Python Software Foundation, Beaverton, OR, USA), launches PyQuery, and Lxml.
(c)
Text mining—The User downloads the Optimized-Modified-GetOldTweets3-OMGOT (https://github.com/marquisvictor/Optimized-Modified-GetOldTweets3-OMGOT, accessed on 24 December 2022) library from GitHub and follows the instructions in the ReadMe file to mine public UGC from Twitter. Optimized-Modified-GetOldTweets3-OMGOT is a python-based open-source tool containing a set of programmatic algorithms designed by Abdul-Rahman, Chan, Wong, Irekponor, and Abdul-Rahman [55] to streamline searches and bypass the rate limits of the Twitter APIs, allowing the download of unlimited historic tweets generated from a specific geo-location using the PyQuery tool, from terminal or command prompt. The algorithms download both the UGC (tweets) and their metadata into Microsoft Excel files (.csv) directly to the Local Host. Since the data is downloaded to .csv file(s), it can easily be transferred outside of the Python environment for further data analysis. In this study, only tweets in the English language were downloaded.
(d)
Topic ModellingLatent Dirichlet Allocation (LDA) (https://github.com/lda-project/lda, accessed on 24 December 2022) An ML and NLP Python-based tool were used to split the big data downloaded in step (c) into major topics. These topics represent major discussion themes within the selected case study areas based on Twitter UGC. 45 themes (topics) were identified. The 45 topics were converted to keywords and used to re-mine the textual data “per topic” using the Use Cases in the ReadMe file. Data from each topic was then saved in a separate .csv file. This step helps to validate the previously mined data and break down the big data into manageable sizes for further analysis. Blei et al. [57], Chuang et al. [58], Sievert–Shirley [59], Moody et al. [60], Momtazi [61], Abdul-Rahman, Chan, Wong, Irekponor, and Abdul-Rahman [55] and Asghari, et al. [62] all published great papers on how to use LDA.
(e)
Sentiments Analysis—Each topic folder in step (d) was analyzed for sentiment polarity using Valence Aware Dictionary and sEntiment Reasoner (VADER) (https://github.com/cjhutto/vaderSentiment, accessed on 24 December 2022). VADER is an ML and NLP open-source tool that analyses textual data according to their sentiment polarity (positive, negative, and neutral) and intensity [63]. Negative comments from the community residents and visitors represent displeasure and community challenges. Due to the unstructured nature of the social media data, VADER is one of the best NLP tools for analysing sentiments from social media UGC [47,48].
(f, g and h)
Survey and Data Validation—VADER is trained and validated by the developers [64], and Abdul-Rahman, Chan, Wong, Irekponor, and Abdul-Rahman [55] showed that the output has high accuracy. However, to further reduce bias and narrow the error margin, the assumption that the residents, workers, and visitors’ displeasures about a community (negative polarities) represent the community’s challenges needs to be re-validated. Physical distribution of the questionnaire survey as used by Abdul-Rahman, Chan, Wong, Irekponor, and Abdul-Rahman [55] slows down the process, therefore, this study proposed an online survey via email and twitter to experts identified through research databases and some identified from the big data based on their work on studentification and community resilience, sustainability and artificial intelligence in the 6 countries of the case studies. The survey instrument was designed and tested followed techniques used by Darko [65]. A pilot survey was carried out before the main questionnaire survey. The purpose of the pilot survey was to test the survey procedures and verify the comprehensiveness and the use of technical language [66]. The pilot survey was administered to five participants: two professors, one chief resilience officer, one post-doctoral researcher, and a doctoral researcher. These participants are all well knowledgeable in the field of CRA and the use of artificial intelligence for big data mining and natural language processing. After the pre-testing phase, the survey instrument was perfected and administered to experts for seven months, from June 2020 to February 2021. The experts were asked to forward the questionnaire link to others they feel are eligible to answer the questionnaire within their network, including experts outside of their countries and copy the research team. A total of 392 valid responses were received. Figure 4 shows the number of responses received for validation and the extra 17 countries the survey snowballed to. The questionnaire used for this study is available online via https://theses.lib.polyu.edu.hk/handle/200/11732 (pg. 99–203), accessed on 24 December 2022. Only sections A and B were used for validation in this study. Section A was used to collect the respondents’ biodata. In contrast, section B collected data on the respondents’ countries and the respondents’ agreements on the data grouped under the five dimensions of studentification (cultural, social, physical, economic, and institutional and governance challenges). A 5-point Likert scale (1 = strongly disagree; 2 = somewhat; disagree; 3 = neither agree nor disagree; 4 = somewhat agree; 5 = strongly agree). Four data analysis methods were used: (1) The reliability of the scales was measured using Cronbach’s alpha; (2) Ranking was performed using Mean value ranking; (3) Standard Deviation scores; (4) The Mean values were normalized (Normalized value = (mean—minimum mean)/(maximum mean—minimum mean)). SPSS v26 and Python v3.10.8 were used for the validation analysis.

4. Results

4.1. Data Mining using the Optimized-Modified-GetOldTweets3-OMGOT Library

Ten years of Twitter’s historic UGC within the six case study areas was downloaded (from 1 January 2010, to 31 December 2020). A total of 4,577,107 tweets containing slags and emojis and their metadata (usernames, permalinks, replies, favourites, dates, etc.) were mined from all case studies. See Table 1 in Supplementary Data for the breakdown of the tweets per case study and Appendix A for the codes used for text mining and data cleaning.

4.2. Topic Modelling and Identifying Community Challenges Using Latent Dirichlet Allocation

A total of 45 topics were identified from the first mining datasets combined (total) using LDA. The topic modelling was also performed per case study. A total of 31 of the 45 issues match those from Loughborough’s data, 28 from Ann Arbor, 35 from Akoka, 18 from Hung Hom, 22 from Sydney, and 17 from Aguita de la Perdiz. The data mining was then repeated in the case studies based on each topic found in the case studies using case 3 of the Optimized-Modified-GetOldTweets3-OMGOT library (see Abdul-Rahman, Chan, Wong, Irekponor and Abdul-Rahman [55]). A total of 4,561,311 tweets were mined under the 45 topics (99.65% of the first mining). A total of 15,796 tweets were automatically excluded because they did not fit into any of the primary 45 topic clusters, and the topics they were under didn’t have significant data under them. See Table 2 for the final output and Appendix B for the coding scripts.

4.3. Sentiments Analysis Using VADER

Each tweet within each topic was analysed and classified using the sentiment index in Table 3. Generally, tweets with sentiment matric scores of 0.674 (67%) are regarded as positive. This means the authors (residents or visitors) are satisfied with the situation in the community. Tweets with scores of 0.0326 (33%) are recorded as neutral, meaning the authors (residents and visitors) are indifferent about the situation. On the other hand, tweets with 0.000 scores are negative and represent complaints or displeasure from residents and visitors [63]. The three scores sum up to 1. For better accuracy, the standardized compound matric scores (sums of all the lexicon ratings) are normalized between −1 and +1 [64]. This means = or >0.05 is a positive sentiment polarity, >−0.05 and <0.05 is neutral, and = or <−0.05 is negative.
Within each of the identified topics in each case study, there were positive, neutral, and negative UGC tweets. Table A1 in Appendix D contains the summations of all normalized and weighted composite scores (sentiment polarity) for each topic. Table 3 shows the identified community challenges and their ranks based on the frequency of their negative sentiment polarity. While Figure 4, Figure 5 and Figure 6 show the sentiments polarities in each case study, the thematic cluster of community challenges and the intensity of community challenges in each case study, respectively.
The codes used for the VADER sentiment analysis are also contained in Appendix C. See Hutto and Gilbert [63] for more information on the parameters and scoring of the VADER model on Python.

4.4. Result Validation

To test the reliability of the scales, Cronbach’s Alpha (CA) was calculated using Howard [67] Python methodology. The CA values for the subscales were 0.799 (cultural), 0.972 (social), 0.957 (physical), 0.869 (economic), and 0.798 (institutional and governance). By statistical standards, CA scores above 0.7 are said to have good internal consistency [68]; therefore, the validation data is reliable. Table 4 shows the respondents’ profile, while the mean values, standard deviation scores, normalized mean values, and ranking of all community challenges are shown in Table 5. All the mean and normalized mean values were more than the 3.5 and 0.5 average [65], respectively. This means none of the 45 community challenges was collectively rejected by the 392 experts, who were mainly from academia or research institutes and had more than 5 years of experience working as researchers, urban planners, or in the community resilience domain. The majority of the experts also have experience either developing CRA methodology or using one.

5. Discussion

5.1. General Overview of Community Challenges in University Towns

The UGC from the six case studies shows that university towns face similar challenges globally. This was confirmed by the experts’ validation since none of the community challenges was rejected. Some of the community challenges, such as increased racism, tribalism, and religious challenges (C09) and increased levels of prostitution and sexually transmitted diseases (S04) were unique to only Akoka (Nigeria). At the same time, the lack of social interactions among groups (S12) was unique to only Hung Hom (Hong Kong). The rest of the community challenges were reported in at least two case studies, as seen in Table 2.
Loughborough, with the highest number of mined UGC (see Table 1), has the highest negative polarity (complaints), followed by Ann Arbor, then Akoka, Hung Hom, Sydney, and Aguita de la Perdiz (see Figure 5). But overall, Akoka has the highest number of community challenges (35 challenges), followed by Loughborough (31 challenges), Ann Arbor (28 challenges), Sydney (22 challenges), Hung Hom (18 challenges), and Aguita de la Perdiz (17 challenges). Thematically, the challenges were grouped into cultural, social, physical (environmental), economic, and institutional and governance challenges. Figure 6 shows that most community challenges identified were physical/environmental, followed by social, economic, cultural, and institutional and governance challenges. However, no institutional and governance challenges were identified from the data in Sydney and Aguita de la Perdiz. Figure 7 shows that 47.8% of the community challenges identified in Loughborough were physical/environmental, 25.1% had to do with the community’s economy, 19.4% were social, 5.8% were cultural, and only 1.9% of the community challenges were institutional and governance challenges. In Ann Arbor, 42.1% were physical, 33.3% were social, 16.8% were economic, 6% were cultural, and only 1.8% were institutional and governance challenges. Akoka has 35.9% of her identified community challenges as physical, 28% economic, 23.8% social, 6.6% cultural, and 5.7% institutional and governance issues. Hung Hom has more than half of her community challenges (53.9%) as physical, 22% as social, 18.1% as economic, and 6% as institutional and governance-related challenges (due to studentification). Sydney has 43% social challenges, 39.5% economic, 9.6% physical, and 7.9% cultural. Lastly, Aguita de la Perdiz has 36% economic challenges, 31.7% social, 20.3% physical, and 12% cultural.
Generally, the overall ranking by the sentiment analyzer (VADER), the ranking by the experts in the 23 countries (total), and those from the 6 case study countries do not differ much. Although the community challenges were ranked slightly differently in the three separate rankings, as shown in Table 5, the top 10 community challenges remain the same across the three rankings. These top 10 community challenges include the following: the illegal subdivision of family homes and apartments into housing with multiple occupancies (P01); high rental prices (E01); high environmental pollution (noise, air pollution and indiscriminate waste/garbage disposal (P07)); increased anti-social behaviour and social disorder (S01); high cost of living (E04); defacing neighbourhoods with graffiti, posters, writings and rental boards and advertisements (P04); increased level of alcoholism, drugs peddling, and abuse (S03); community slumification due to the decline in housing renovations and environmental maintenance (P03); displacement/replacement of established residents (gentrification) (S07); and on-street parking and traffic congestion (P10).
These results show that the intensity of community challenges varies from one community to the other, but overall most university towns experience similar challenges due to studentification. This points to the fact that students have similar behaviours regardless of the country or region [69,70]. This novel CRA framework allows university towns to collaborate and co-produce solutions against studentification challenges, share best practices and learn coping mechanisms from one another, especially those with similar challenges [71].

5.2. Novelty and Implications of the Proposed CRA Methodology

a.
Assessment of all major community resilience dimensions
Communities have multiple complex dimensions [72]. This novel framework identified and analysed challenges under the five major dimensions of resilience (cultural, social, physical/environmental, economic, and institution and governance) in all the university towns. This allows community planners and managers to study community resilience challenges holistically and zoom deeper into individual community challenges or resilience dimensions.
b.
Assessing the spatiotemporal dynamism of the community challenges
Capturing time horizons and knowing the specific areas where the residents’ and visitors’ sentiments were generated will help the community managers better assess the challenges and focus on “hotspots”. Since the UGC big data from microblogs such as Twitter come with metadata that contains the date and time of tweets generated within a specified spatial radius, the negative polarities can be modelled further after sentiments analysis using Microsoft Excel 3-D Clustered Columns to show spatiotemporal dynamics. Figure 8 shows a polarity-based model of residents’ monthly complaints from 1 January 2010 to 31 December 2020 in Loughborough, UK. The data for P07 (negative sentiments for Loughborough = 98,852 tweets) from figure of Appendix C was grouped into months before it was modelled. The model shows a clear pattern that follows the term periods of Loughborough University and College. The complaints reduced during the summer term and semester three (April to August) and also in December when the university town was almost empty. Over the last 10 years, the complaints about noise and indiscriminate waste disposal have increased in line with the growth of student residents in the town. This model can be generated to analyse any of the community challenges identified.
c.
Addressing uncertainties and ensuring public participation
Carrying out longitudinal studies to understand historical events and analysing patterns help to develop better action plans and reduce uncertainties [73]. This framework gives room for such assessments and provides an opportunity for sampling the opinions of millions of people concerning community issues. The sampled opinions were from residents, workers, and visitors, regardless of gender, race, age, religion, etc.

6. Summary and Conclusions

By adapting and modifying the novel framework for pre-processing location-based social media data by Abdul-Rahman, Chan, Wong, Irekponor, and Abdul-Rahman [55], this study demonstrated UGC from microblogs can be used to identify community challenges using AI (ML and NLP) tools such as LDA and VADER. Six university towns were used as case studies.
First, a programmatic algorithm was used to mine the big data using the Twitter API and search engine. Then LDA was used to extract major topics from the data of each case study and the combined big data. These topics were used to re-mine the data, and VADER was used to analyse the sentiment polarity under each issue. The negative Normalized Weighted Composite Scores (NWCS) frequencies were used to rank the identified community challenges. An online expert survey was conducted to validate and rank the negative impacts of studentification. Mean ranking, standard deviation, and normalized mean values were used to rank the community challenges. The statistical results showed that all 45 challenges clustered around the 5 community resilience dimensions were accepted as negative impacts of studentification.
Apart from being comprehensive enough to identify cultural, social, physical/environmental, economic, and institutional and governance challenges in the university towns, this novel framework also provides a deeper spatiotemporal analysis of each community challenge. Using a large opinion poll (sample size) helps minimize errors and increases the accuracy of the data.
This study contributes to the community resilience body of knowledge by providing a simple, fast, cheap, and efficient way of conducting CRA remotely. This novel methodology can be used by urban planners, community managers, community-based organizations, and universities. It can be used to identify community challenges and make university towns resilient and sustainable.
This methodological framework works better in well-connected urban university towns where more people are connected to the internet and the use of social media is high. This limitation will not render the methodology useless, but it will affect the amount of data available for analysis if the framework is used in a rural community with low Internet connectivity. Future works may include using APIs from other microblogs such as WeChat and Facebook. The framework can also be improved to predict future trends based on historical data. Geographic Information System (GIS) can also be used to overlay the data on the base maps of the case studies to run more analysis and visualization.

Author Contributions

Conceptualization, M.A.-R. and Y.A.A.; methodology, M.A.-R., Y.A.A. and O.S.; software, M.A.-R.; validation, W.K.M. and M.I.A.; formal analysis, visualization and data curation, M.A.-R., Y.A.A. and W.K.M.; writing—original draft preparation and writing—review and editing, M.A.-R., O.S. and M.I.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research work was part of a larger doctoral study titled “A community Resilience Assessment Framework for University Towns” supported by a PhD studentship from the Research Institute for Sustainable Development (RISUD) and the Department of Building and Real Estate of the Hong Kong Polytechnic University [research grant: G-R006.RJET].

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all respondents involved in the study.

Data Availability Statement

Not applicable.

Acknowledgments

The authors acknowledge Professor Edwin H.W. Chan and Professor Man Sing Wong’s supervision, for their advice and mentorship for the PhD thesis “A community Resilience Assessment Framework for University Towns”, which led to the development of this manuscript.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A

1.
Data mining codes.
Sustainability 15 01295 i001
2.
Codes for text cleaning
a.
Loading the dataset
Sustainability 15 01295 i002
b.
Data cleaning and noise reduction
Sustainability 15 01295 i003
c.
Convert the corpus into a document-term matrix.
Sustainability 15 01295 i004

Appendix B

Codes for Topic Modelling using LDA
Sustainability 15 01295 i005

Appendix C

Codes for Sentiment Analysis Using VADER
Sustainability 15 01295 i006

Appendix D

Table A1. Sentiment polarity for 4,561,311 tweets mined from six university towns in the UK, US, Nigeria, Hong Kong, Australia and Chile from 1 January 2010, to 31 December 2020.
Table A1. Sentiment polarity for 4,561,311 tweets mined from six university towns in the UK, US, Nigeria, Hong Kong, Australia and Chile from 1 January 2010, to 31 December 2020.
Case StudyS/NTopicsnegTweetsneuTweetsposTweets Tweets
Loughborough, UK Cultural
1C0110,52414,756489830,178
2C0216,97614925818,526
3C0315,02610217716,124
4C0656527635196415,251
5C0813,769718430821,261
Social
6S01110,4575715180116,352
7S0359,8204602102265,444
8S0589764792124115,009
9S0710,8726197104218,111
10S0899922824207314,889
11S1057663624536214,752
Physical
12P01122,82517,9242109142,858
13P0215,1844625120721,016
14P0356,62512,612176671,003
15P0478,56311,162152691,251
16P0646,021601214,48866,521
17P0798,852164529100,526
18P0911,524516796217,653
19P1068,0665160102574,251
20P1283832186142411,993
Economic
21E0191,251316485295,267
22E029391152686511,782
23E0338,7263784152144,031
24E0455,69215062257,220
25E0513,66811,11415,52640,308
26E068644228258011,506
27E0710,536301444113,991
28E088904251160112,016
29E0929,413572334235,478
Institution & Governance
30I0297253516142514,666
31I0310,526152672512,777
Total1,060,349166,00165,6611,292,011
Ann Arbor, USA Cultural
1C0111,2418251206121,553
2C0416,340556175122,652
3C0611,51412,351200725,872
3C0813,005310290117,008
Social
5S0170,04524149672,555
6S0220,9614669125126,881
7S0353,8172619120157,637
8S0514,7699226332627,321
9S0623,1415622200130,764
10S0748,32316275250,002
11S0816,5219523562231,666
12S0993134098100314,414
13S1023,8165783351233,111
14S1187844531421117,526
Physical
15P0197,2342152983100,369
16P0210,2384242185616,336
17P0340,711140062142,732
18P0479,5184343251486,375
19P0627,825755111,41246,788
20P0773,51210085674,576
21P098075152357110,169
22P1028,029516181134,001
Economics
23E0192,5625044215599,761
24E0311,1643509142116,094
25E0436,5432131101739,691
26E0557291217450611,452
Institution & Governance
27I018674245188212,007
28I036751232899310,072
Total868,155123,43757,7931,049,385
Akoka, Nigeria Cultural
1C014526264310048173
2C0370221012288062
3C0413,352447842118,251
4C055521610471012,335
5C0752023758111210,072
6C0871581395338586
7C098520124185010,611
Social
8S0161,503810944370,055
9S0281117335129356
10S0344,874201212847,014
11S0428,77712,824102442,625
12S0711,741553987218,152
13S085545236812639176
14S09479930002048003
15S1011,9003776211117,787
16S118012365210911,773
Physical
17P0179,7212254645188,426
18P0231,0411782197234,795
19P0318,955664529225,892
20P0485635172251616,251
21P058934444162313,998
22P065662212012239005
23P0757,204121710358,524
24P08472635129849222
25P1148,461258315251,196
26P1215,9653026102320,014
Economic
27E0179,176132665181,153
28E0210,6721231364815,551
29E0319,9802641100223,623
30E0474,590132010176,011
31E0524,432562411829,112
32E087821108510319937
Institution & Governance
33I0128,731920499238,927
34I0288824516186315,261
35I03699316822188893
Total777,072118,96339,787935,822
Hung Hom, Hong Kong Cultural
1C01663220,571212129,324
2C0616,26115,75116,75248,764
Social
3S02230116,304840827,013
4S03601510,50275417,271
5S0718,0276232270326,962
6S1114,22218,150719139,563
7S1243,4525798178351,033
Physical
8P0129,52216,025617651,723
9P0311,0324002101216,046
10P0426,8219991451241,324
11P0531,992201687534,883
12P0747,88523,65317,72389,261
13P0856,62318,63715,96291,222
14P10216220,015324425,421
Economic
15E0134,11210,681120645,999
16E03257218,618187123,061
17E0428,190707448835,752
18E054332980113,02127,154
Total382,153233,821105,802721,776
Sydney, Australia Cultural
1C0111,4652215732321,003
2C03601321202198352
3C072190454212517983
Social
4S0133,231655923140,021
5S0322,338245176225,551
6S0611,5265632140418,562
7S0733,2434237214339,623
8S08404625165017063
9S102981162123926994
Physical
10P0147,031302047150,522
11P03423412511425627
12P0425,1233203102529,351
13P0542206142205112,413
14P0626,410517152132,102
15P0748,921242271852,061
16P092722142110225165
17P1011,1021403150114,006
Economic
18E0143,4841206231247,002
19E03296113417245026
20E0439,9912871101143,873
21E0584204350325116,021
22E0637714520186110,152
Total395,42370,21431,836498,473
Aguita de la Perdiz, Chile Cultural
1C0215217452542520
2C0520114715102992
Social
3S0231248591614144
4S03286110503414252
5S07340510151724592
Physical
6P01541212511086771
7P034439721915251
8P0450396812115931
9P051424742552221
10P0756971462617220
11P1112655641131942
12P1211235583332014
Economic
13E014571424375032
14E043961751914803
15E055162249611701
16E071003350881441
17E095713351111017
Total47,94312,203369863,844

References

  1. Brooks, R.; Byford, K.; Sela, K. Students’ unions, consumerism and the neo-liberal university. Br. J. Sociol. Educ. 2016, 37, 1211–1228. [Google Scholar] [CrossRef] [Green Version]
  2. Kinton, C.; Smith, D.P.; Harrison, J.; Culora, A. New frontiers of studentification: The commodification of student housing as a driver of urban change. Geogr. J. 2018, 184, 242–254. [Google Scholar] [CrossRef]
  3. Brooks, R. The social construction of young people within education policy: Evidence from the UK’s Coalition government. J. Youth Stud. 2013, 16, 318–333. [Google Scholar] [CrossRef] [Green Version]
  4. Smith, D.P.; Sage, J.; Balsdon, S. The geographies of studentification:here, there and everywhere? Geography 2014, 99, 116. [Google Scholar] [CrossRef]
  5. Holton, M.; Riley, M. Talking on the move: Place-based interviewing with undergraduate students. Area 2014, 46, 59–65. [Google Scholar] [CrossRef]
  6. Hubbard, P. Regulating the Social Impacts of Studentification: A Loughborough Case Study. Environ. Plan. A: Econ. Space 2008, 40, 323–341. [Google Scholar] [CrossRef] [Green Version]
  7. Smith, D.P.; Hubbard, P. The segregation of educated youth and dynamic geographies of studentification. Area 2014, 46, 92–100. [Google Scholar] [CrossRef]
  8. Sage, J.; Smith, D.; Hubbard, P. The Diverse Geographies of Studentification: Living Alongside PeopleNotLike Us. Hous. Stud. 2012, 27, 1057–1078. [Google Scholar] [CrossRef]
  9. Baron, M.G.; Kaplan, S. The Impact of Studentification on the Rental housing Market. In Proceedings of the 50th Congress of the European Regional Science Association, Jönköping, Sweden, 19–23 August 2010. [Google Scholar]
  10. Donaldson, R.; Benn, J.; Campbell, M.; De Jager, A. Reshaping urban space through studentification in two South African urban centres. Urbani Izziv 2014, 25, S176–S188. [Google Scholar] [CrossRef] [Green Version]
  11. Foote, N.S. Beyond studentification in United States College Towns: Neighborhood change in the knowledge nodes, 1980–2010. Environ. Plan. A 2017, 49, 1341–1360. [Google Scholar] [CrossRef]
  12. Haghighi, F. Study. Be silent. Die: Indeterminate architecture and the dispositif of studentification. J. Cult. Res. 2018, 22, 55–72. [Google Scholar] [CrossRef]
  13. Holton, M. Living together in student accommodation: Performances, boundaries and homemaking. Area 2016, 48, 57–63. [Google Scholar] [CrossRef]
  14. Hubbard, P. Geographies of Studentification and Purpose-Built Student Accommodation: Leading Separate Lives? Environ. Plan. A: Econ. Space 2009, 41, 1903–1923. [Google Scholar] [CrossRef]
  15. Kinton, C.; Smith, D.P.; Harrison, J. De-studentification: Emptying housing and neighbourhoods of student populations. Environ. Plan. A: Econ. Space 2016, 48, 1617–1635. [Google Scholar] [CrossRef] [Green Version]
  16. Seeliger, L.; Turok, I. Towards sustainable cities: Extending resilience with insights from vulnerability and transition theory. Sustainability 2013, 5, 2108–2128. [Google Scholar] [CrossRef] [Green Version]
  17. Burroughs, S. Development of a tool for assessing commercial building resilience. Procedia Eng. 2017, 180, 1034–1043. [Google Scholar] [CrossRef]
  18. Schipper, E.L.F.; Langston, L. A comparative overview of resilience measurement frameworks. In Analyzing Indicators and Approaches; Overseas Development Institute: London, UK, 2015; Volume 422. [Google Scholar]
  19. Abdul-Rahman, M.; Chan, E.H.W.; Li, X.; Wong, M.S.; Xu, P. Big Data for Community Resilience Assessment: A Critical Review of Selected Global Tools. In Proceedings of the 24th International Symposium on Advancement of Construction Management and Real Estate, Chongqing, China, 19–22 November 2021; pp. 1345–1361. [Google Scholar]
  20. Smith, D.P. Studentification. In The Wiley Blackwell Encyclopedia of Urban and Regional Studies; Wiley Blackwell: New York, NY, USA, 2006; pp. 1–3. [Google Scholar]
  21. Situmorang, R.; Sudikno, A.; Surjono, S.; Wicaksono, A.D. Conceptual Framework of Studentification Impacts in Malang City, Indonesia. Int. J. Adv. Sci. 2020, 29, 585–593. [Google Scholar]
  22. Smith, D.P. Studentification: A Guide to Opportunities, Challenges and Practice; Universities UK: London, UK, 2006; Volume 52. [Google Scholar]
  23. Smith, D.P.; Fox, M. Studentification Guide for North America: Delivering Hermonious Town and Gown Associations; Loughborough University UK and Mount Allison University, Canada: Loughborough, UK, 2019; p. 73. [Google Scholar]
  24. Holton, M. Adapting relationships with place: Investigating the evolving place attachment and ‘sense of place’ of UK higher education students during a period of intense transition. Geoforum 2015, 59, 21–29. [Google Scholar] [CrossRef]
  25. Laidley, T.M. The Privatization of College Housing: Poverty, Affordability, and the U.S. Public University. Hous. Policy Debate 2014, 24, 751–768. [Google Scholar] [CrossRef]
  26. He, S. Consuming urban living in ‘villages in the city’: Studentification in Guangzhou, China. Urban Stud. 2014, 52, 2849–2873. [Google Scholar] [CrossRef]
  27. Marshall, J.D.; Toffel, M.W. Framing the elusive concept of sustainability: A sustainability hierarchy. Environ. Sci. Technol. 2005, 39, 673–682. [Google Scholar] [CrossRef] [PubMed]
  28. Hoosain, M.S.; Paul, B.S.; Ramakrishna, S. The Impact of 4IR Digital Technologies and Circular Thinking on the United Nations Sustainable Development Goals. Sustainability 2020, 12, 10143. [Google Scholar] [CrossRef]
  29. UN-Habitat. Urbanization and development: Emerging Futures; United Nations Human Settlements Programme (UN-Habitat): Nairobi, Kenya, 2016; Volume 5, p. 49. [Google Scholar]
  30. National Research Council. Disaster Resilience: A National Imperative; The National Academies Press: Washington, DC, USA, 2012; p. 260.
  31. Sharifi, A. A critical review of selected tools for assessing community resilience. Ecol. Indic. 2016, 69, 629–647. [Google Scholar] [CrossRef] [Green Version]
  32. Houston, J.B. Bouncing Forward: Assessing Advances in Community Resilience Assessment, Intervention, and Theory to Guide Future Work. Am. Behav. Sci. 2014, 59, 175–180. [Google Scholar] [CrossRef]
  33. Yigitcanlar, T.; Wilson, M.; Kamruzzaman, M. Disruptive impacts of automated driving systems on the built environment and land use: An urban planner’s perspective. J. Open Innov. Technol. Mark. Complex. 2019, 5, 24. [Google Scholar] [CrossRef] [Green Version]
  34. Lavalle, A.; Teruel, M.A.; Maté, A.; Trujillo, J. Improving Sustainability of Smart Cities through Visualization Techniques for Big Data from IoT Devices. Sustainability 2020, 12, 5595. [Google Scholar] [CrossRef]
  35. Silva, B.N.; Khan, M.; Jung, C.; Seo, J.; Muhammad, D.; Han, J.; Yoon, Y.; Han, K. Urban Planning and Smart City Decision Management Empowered by Real-Time Data Processing Using Big Data Analytics. Sensors 2018, 18, 2994. [Google Scholar] [CrossRef] [Green Version]
  36. Batty, M. Big data, smart cities and city planning. Dialogues Hum. Geogr. 2013, 3, 274–279. [Google Scholar] [CrossRef]
  37. Yigitcanlar, T.; Cugurullo, F. The Sustainability of Artificial Intelligence: An Urbanistic Viewpoint from the Lens of Smart and Sustainable Cities. Sustainability 2020, 12, 8548. [Google Scholar] [CrossRef]
  38. Kassens-Noor, E.; Hintze, A. Cities of the future? The potential impact of artificial intelligence. AI 2020, 1, 192–197. [Google Scholar] [CrossRef]
  39. Nikitas, A.; Michalakopoulou, K.; Njoya, E.T.; Karampatzakis, D. Artificial intelligence, transport and the smart city: Definitions and dimensions of a new mobility era. Sustainability 2020, 12, 2789. [Google Scholar] [CrossRef]
  40. Macrorie, R.; Marvin, S.; While, A. Robotics and automation in the city: A research agenda. Urban Geogr. 2020, 42, 1–21. [Google Scholar] [CrossRef] [Green Version]
  41. Barns, S. Platform Urbanism: Negotiating Platform Ecosystems in Connected Cities; Springer: Berlin/Heidelberg, Germany, 2019. [Google Scholar]
  42. Caprotti, F.; Liu, D. Emerging Platform Urbanism in China: Reconfigurations of Data, Citizenship and Materialities; Elsevier: Amsterdam, The Netherlands, 2019. [Google Scholar]
  43. Zahra, K.; Imran, M.; Ostermann, F.O. Automatic identification of eyewitness messages on twitter during disasters. Inf. Process. Manag. 2020, 57, 102107. [Google Scholar] [CrossRef]
  44. Zhou, Z.-H. Machine Learning; Springer Nature: Berlin/Heidelberg, Germany, 2021. [Google Scholar]
  45. Chowdhary, K. Natural language processing. Fundam. Artif. Intell. 2020, 1, 603–649. [Google Scholar]
  46. Kennedy, H.; Moss, G. Known or knowing publics? Social media data mining and the question of public agency. Big Data Soc. 2015, 2, 2053951715611145. [Google Scholar] [CrossRef] [Green Version]
  47. Alharbi, A.N.; Alnnamlah, H.; Liyakathunis. Classification of Customer Tweets Using Big Data Analytics. In Advances in Intelligent Systems and Computing, Proceedings of the 5th International Symposium on Data Mining Applications, Riyadh, Saudi Arabia, 20–22 July 2018; Springer: Berlin/Heidelberg, Germany, 2018; pp. 169–180. [Google Scholar]
  48. Asghar, Z.; Ali, T.; Ahmad, I.; Tharanidharan, S.; Nazar, S.K.A.; Kamal, S. Sentiment Analysis on Automobile Brands Using Twitter Data. Commun. Comput. Inf. Sci. 2019, 932, 76–85. [Google Scholar] [CrossRef]
  49. Abumalloh, R.A.; Ibrahim, O.; Nilashi, M. Loyalty of young female Arabic customers towards recommendation agents: A new model for B2C E-commerce. Technol. Soc. 2020, 61, 101253. [Google Scholar] [CrossRef]
  50. Carlos, M.A.; Nogueira, M.; Machado, R.J. Analysis of dengue outbreaks using big data analytics and social networks. In Proceedings of the 2017 4th International Conference on Systems and Informatics (ICSAI), Hangzhou, China, 11–13 November 2017; pp. 1592–1597. [Google Scholar]
  51. Shah, A.M.; Yan, X.; Tariq, S.; Ali, M. What patients like or dislike in physicians: Analyzing drivers of patient satisfaction and dissatisfaction using a digital topic modeling approach. Inf. Process. Manag. 2021, 58, 102516. [Google Scholar] [CrossRef]
  52. Nilashi, M.; Ibrahim, O.; Yadegaridehkordi, E.; Samad, S.; Akbari, E.; Alizadeh, A. Travelers decision making using online review in social network sites: A case on TripAdvisor. J. Comput. Sci. 2018, 28, 168–179. [Google Scholar] [CrossRef]
  53. Sun, Y.; Ma, H.; Chan, E.H.W. A Model to Measure Tourist Preference toward Scenic Spots Based on Social Media Data: A Case of Dapeng in China. Sustainability 2018, 10, 43. [Google Scholar] [CrossRef] [Green Version]
  54. Ahani, A.; Nilashi, M.; Ibrahim, O.; Sanzogni, L.; Weaven, S. Market segmentation and travel choice prediction in Spa hotels through TripAdvisor’s online reviews. Int. J. Hosp. Manag. 2019, 80, 52–77. [Google Scholar] [CrossRef]
  55. Abdul-Rahman, M.; Chan, E.H.W.; Wong, M.S.; Irekponor, V.E.; Abdul-Rahman, M.O. A framework to simplify pre-processing location-based social media big data for sustainable urban planning and management. Cities 2020, 102986. [Google Scholar] [CrossRef]
  56. Sykora, M.; Elayan, S.; Jackson, T.W. A qualitative analysis of sarcasm, irony and related #hashtags on Twitter. Big Data Soc. 2020, 7, 2053951720972735. [Google Scholar] [CrossRef]
  57. Blei, D.M.; Carin, L.; Dunson, D. Probabilistic Topic Models: A focus on graphical model design and applications to document and image analysis. IEEE Signal Process. Mag. 2010, 27, 55. [Google Scholar] [PubMed] [Green Version]
  58. Chuang, J.; Manning, C.D.; Heer, J. Termite: Visualization techniques for assessing textual topic models. In Proceedings of the International Working Conference on Advanced Visual Interfaces, Capri Island, Italy, 21–25 May 2012; pp. 74–77. [Google Scholar]
  59. Sievert, C.; Shirley, K. LDAvis: A method for visualizing and interpreting topics. In Proceedings of the Workshop on Interactive Language Learning, Visualization, and Interfaces, Baltimore, MD, USA, 27 June 2014; pp. 63–70. [Google Scholar]
  60. Moody, C.; Johnson, R.; Zhang, T. Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec. arXiv 2016, arXiv:1605.02019. [Google Scholar]
  61. Momtazi, S. Unsupervised Latent Dirichlet Allocation for supervised question classification. Inf. Process. Manag. 2018, 54, 380–393. [Google Scholar] [CrossRef]
  62. Asghari, M.; Sierra-Sosa, D.; Elmaghraby, A.S. A topic modeling framework for spatio-temporal information management. Inf. Process. Manag. 2020, 57, 102340. [Google Scholar] [CrossRef]
  63. Hutto, C.J.; Gilbert, E.E. VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Tweet. In Proceedings of the Eighth International Conference on Weblogs and Social Media (ICWSM-14), Ann Arbor, MI, USA, 2–4 June 2014. [Google Scholar]
  64. Kumar, C.V.; Ashish, B.; Amita, G. Twitter Sentiment Analysis Using Vader. Int. J. Adv. Res. Ideas Innov. Technol. 2018, 4, 485–490. [Google Scholar]
  65. Darko, A. Adoption of green building technologies in Ghana: Development of a model of green building technologies and issues influencing their adoption. In Green Building in Developing Countries; Springer: Berlin/Heidelberg, Germany, 2019. [Google Scholar]
  66. Soyinka, O.; Adenle, Y.A.; Abdul-Rahman, M. Urban informality and sustainable design of public space facilities: A case study of Hong Kong SAR of China in 2018. Environ. Dev. Sustain. 2021, 23, 16560–16587. [Google Scholar] [CrossRef]
  67. Howard, M.C. Calculating Cronbach’s Alpha in Python. 2022. [Google Scholar]
  68. Norusis, M.J. PASW Statistics 18 Guide tso Data Analysis; Prentice Hall Press: Hoboken, NJ, USA, 2010. [Google Scholar]
  69. Talavera, L.; Gaudioso, E. Mining student data to characterize similar behavior groups in unstructured collaboration spaces. In Workshop on Artificial Intelligence in CSCL, Proceedings of the 16th European Conference on Artificial Intelligence, Valencia Spain, 22–27 August 2004; pp. 17–23.
  70. Limanond, T.; Butsingkorn, T.; Chermkhunthod, C. Travel behavior of university students who live on campus: A case study of a rural university in Asia. Transp. Policy 2011, 18, 163–171. [Google Scholar] [CrossRef]
  71. Mathooko, F.M.; Ogutu, M. Coping strategies adopted by public universities in Kenya in response to environmental changes. J. Manag. Strategy 2014, 5. [Google Scholar] [CrossRef] [Green Version]
  72. Cimellaro, G.P.; Renschler, C.; Reinhorn, A.M.; Arendt, L. PEOPLES: A Framework for Evaluating Resilience. J. Struct. Eng. 2016, 142, 04016063. [Google Scholar] [CrossRef]
  73. Pringle, P. AdaptME: Adaptation monitoring and evaluation. In Adaptme: Adaptation Monitoring and Evaluation; UK Climate Impacts Programme (UKCIP): Oxford, UK, 2011. [Google Scholar]
Figure 1. Map showing the location of the six case studies. Source: Authors’ fieldwork.
Figure 1. Map showing the location of the six case studies. Source: Authors’ fieldwork.
Sustainability 15 01295 g001
Figure 2. The AI—ML—NLP nexus.
Figure 2. The AI—ML—NLP nexus.
Sustainability 15 01295 g002
Figure 3. The methodological framework adapted from Abdul-Rahman, Chan, Wong, Irekponor, and Abdul-Rahman [55].
Figure 3. The methodological framework adapted from Abdul-Rahman, Chan, Wong, Irekponor, and Abdul-Rahman [55].
Sustainability 15 01295 g003
Figure 4. Network showing how the questionnaire survey snowballed from the six countries into 23.
Figure 4. Network showing how the questionnaire survey snowballed from the six countries into 23.
Sustainability 15 01295 g004
Figure 5. Sentiment polarities calculated from the Normalized Weighted Composite Scores (NWCS).
Figure 5. Sentiment polarities calculated from the Normalized Weighted Composite Scores (NWCS).
Sustainability 15 01295 g005
Figure 6. Thematic clusters of community challenges in university towns.
Figure 6. Thematic clusters of community challenges in university towns.
Sustainability 15 01295 g006
Figure 7. Charts showing the intensity of community challenges in percentages in the case studies.
Figure 7. Charts showing the intensity of community challenges in percentages in the case studies.
Sustainability 15 01295 g007
Figure 8. Polarity-based model for high environmental pollution (Noise and indiscriminate waste/garbage disposal) in Loughborough, UK.
Figure 8. Polarity-based model for high environmental pollution (Noise and indiscriminate waste/garbage disposal) in Loughborough, UK.
Sustainability 15 01295 g008
Table 1. Case studies and the number of tweets downloaded.
Table 1. Case studies and the number of tweets downloaded.
S/N Case Study Number of Tweets (UGC) Mined
First MiningBased on Topics
1Loughborough, UK1,297,1121,292,011
2Ann Arbor, USA1,052,4251,049,385
3Akoka, Nigeria936,575935,822
4Hung Hom, Hong Kong724,055721,776
5Sydney, Australia502,615498,473
6Aguita de la Perdiz, Chile64,32563,844
Total4,577,1074,561,311
Table 2. 45 topics generated from the big data mined from the 6 case study areas.
Table 2. 45 topics generated from the big data mined from the 6 case study areas.
Theme Code Generated Topics Number of Mined Tweets per Case Study
Lough-BoroughAnn ArborAkokaHung HomSydneyAguita de la Perdiz
CulturalC01Demographic changes leading to more youths30,17821,553817329,32421,003-
C02Declining moral and community values18,526----2520
C03Lack of community cohesion & integration due to the transient nature of the student population16,124-8062-8352-
C04Aversion of crime and barriers to community policing caused by a transient population-22,65218,251---
C05Differing standards of acceptable behaviours by different social groups--12,335--2992
C06Cultural diversity and lifestyle conflicts15,25125,872-48,764--
C07Divergent perceptions on what makes up communal obligations--10,072-7983-
C08Inconsideration and lack of place attachment21,26117,0088586---
C09Increased racism, tribalism and religious challenges--10,611---
SocialS01Increased anti-social behaviour and social disorder.116,35272,55570,055-40,021-
S02High level of crime due to the vulnerability & carelessness of the youthful population-26,881935627,013-4144
S03Increased level of alcoholism, drugs peddling and abuse.65,44457,63747,01417,27125,5514252
S04Increased level of prostitution and sexually transmitted diseases--42,625---
S05Loss of social services such as reduction in catchment areas for public schools & elderly care15,00927,321----
S06Marginalization of permanent residents-30,764--18,562-
S07Displacement/replacement of established residents (gentrification)18,11150,00218,15226,96239,6234592
S08Increased competition for privately rented apartments14,88931,6669176-7063-
S09Lack of year-round goods & services due to the resort-economy nature of the community-14,4148003---
S10Establishments of night-time entertainment ventures at the detrimental impacts of residential amenities14,75233,11117,787-6994-
S11Segregation and social stratification-17,52611,77339,563--
S12Lack of social interactions among groups---51,033--
PhysicalP01Illegal subdivision of family homes & apartments into housing with multiple occupancies142,858100,36988,42651,72350,5226771
P02Changes in community land use21,01616,33634,795---
P03Community slumification due to the decline in housing renovations and environmental maintenance.71,00342,73225,89216,04656275251
P04Defacing neighbourhoods with graffiti, posters, writings and rental boards and advertisements91,25186,37516,25141,32429,3515931
P05Congestion and overcrowding on the streets and in public places including shops.--13,99834,88312,4132221
P06Increased population density66,52146,7889005-32,102-
P07High environmental pollution—Noise, air pollution and indiscriminate waste/garbage disposal100,52674,57658,52489,26152,0617220
P08Increased incidents of protests leading to vandalism of the physical environment.--922291,222--
P09Increased pressure on urban basic services due to higher population than planned for17,65310,169--5165-
P10On-street parking and traffic congestion74,25134,001-25,42114,006-
P11Pressure on public transport--51,196--1942
P12Ghost community during off-term periods11,993-20,014--2014
EconomicE01High rental prices95,26799,76181,15345,99947,0025032
E02Lucrative student housing business deters access to affordable housing for non-student residents.11,782-15,551---
E03Change in consumer behaviour & taste leading to changes in business models & structures.44,03116,09423,62323,0615026-
E04High cost of living (goods and services)57,22039,69176,01135,75243,8734803
E05High influx of commercial activities40,30811,45229,11227,15416,0211701
E06Seasonal demand for students’ accommodation11,506---10,152-
E07Seasonal scarcity of manpower in shops, restaurants, bars, etc.13,991----1441
E08Seasonal customer base (on and off term periods)12,016-9937---
E09Low tax generation from the community since students are exempted from taxation.35,478----1017
Institution & GovernanceI01Weak and disjointed community leadership-12,00738,927---
I02Neglect by politicians due to low voting power.14,666-15,261---
I03Challenges to existing urban plans and policies12,77710,0728893---
Total Tweets1,292,0111,049,385935,822721,776498,47363,844
No of Topics312835182217
Table 3. Identified community challenges and their ranks based on the frequency of their negative sentiment polarity from VADER.
Table 3. Identified community challenges and their ranks based on the frequency of their negative sentiment polarity from VADER.
CodeCommunity ChallengesFrequency (Negative Sentiment Polarity)Ranking within Case StudiesVADER Overall Rank
Lough-BoroughAnn ArborAkokaHung HomSydneyAguita de la Perdiz
P01Illegal subdivision of family homes & apartments into housing with multiple occupancies381,7451113221
E01High rental prices345,1564226352
P07High environmental pollution—Noise, air pollution & indiscriminate waste disposal332,0713452113
S01Increased anti-social behaviour and social disorder.275,236254-5-4
E04High cost of living (goods and services)238,96710939465
P04Defacing neighbourhoods with graffiti, posters, writings & rental boards & advertisements223,62753187836
S03Increased level of alcoholism, drugs peddling and abuse.189,72596717987
P03Community slumification due to decline in housing renovations & environ. maintenance135,9967812182048
S07Displacement/replacement of established residents (gentrification)125,6111871614679
P10On-street parking and traffic congestion109,359611-1513-10
P06Increased population density105,91881030-7-11
E03Change in consumer behaviour and taste leading to changes in business models & structures.75,4031123131622-12
P08Increased incidents of protests leading to vandalism of the physical environment.61,349--281--13
E05High influx of commercial activities57,09712261112121514
P02Changes in community land use56,463162210---15
P11Pressure on public transport49,726--6--1416
P05Congestion and overcrowding on the streets and in public places including shops.46,570--2110141217
S10Establishments of night-time ent. ventures at the detrimental impacts of residential amenities44,463241217-19-18
C01Demographic changes leading to more youths44,3881419331110-19
S12Lack of social interactions among groups43,452---4--20
I01Weak and disjointed community leadership37,405-259- -21
S08Increased competition for privately rented apartments36,104231329-18-22
S06Marginalization of permanent residents34,667-14--11-23
S02High level of crime due to the vulnerability & carelessness of the youthful population34,497-162713-924
C08Inconsideration and lack of place attachment33,932152132 --25
C06Cultural diversity and lifestyle conflicts33,4272117 5--26
S11Segregation and social stratification31,018 20238--27
E09Low tax generation from the community since students are exempted from taxation.29,98413----1728
C04Aversion of crime and barriers to community policing caused by a transient population29,692-1815---29
S04Increased level of prostitution and sexually transmitted diseases28,777--8---30
C03Lack of community cohesion & integration due to the transient nature of the population28,06120-34-16-31
P12Ghost community during off-term periods25,47129-14--1332
I03Challenges to existing urban plans and policies24,270272831---33
S05Loss of social services such as reduction in catchment areas for public schools, elderly care, etc.23,7452215----34
P09Increased pressure on urban basic services due to higher population than planned for22,3211927--21-35
E02Lucrative student housing business deters access to affordable housing for non-students20,06330-19---36
I02Neglect by politicians due to low voting power.18,60725-20---37
C02Declining moral and community values18,49717----1138
E08Seasonal customer base (on and off term periods)16,72528-26---39
S09Lack of year-round goods & services due to the resort-economy nature of the community14,112-2435---40
E06Seasonal demand for students’ accommodation12,41531---15-41
E07Seasonal scarcity of manpower in shops, restaurants, bars, etc.11,53926----1642
C09Increased racism, tribalism and religious challenges8520--24----43
C05Differing standards of acceptable behaviours by different social groups7532--22--1044
C07Divergent perceptions on what makes up communal obligations7392--25-17-45
Table 4. Respondents’ profiles for validation.
Table 4. Respondents’ profiles for validation.
Data on Survey RespondentsResponsesPercentage
Category
Academia/research institute18948.2
Consulting/private sector4210.7
Public sector/government agency or department369.2
Intergovernmental organization/international NGO9724.8
Others287.1
Profession
Academic/researcher12832.7
Urban planner11228.6
Resilience project manager/officer5113.0
Architect297.4
Economist/development economist123.0
Sociologist225.6
Engineer (civil, construction, etc.)276.9
Others112.8
Years of experience
1–5 years369.2
6–10 years9123.2
11–15 years10226.0
16–20 years5514.0
Above 20 years10827.6
Type of involvement in community resilience & Sustainability
Development of as assessment methodology19148.7
Use of an assessment method13835.2
Both of the above5113.0
Others123.1
Table 5. Validated and ranked community challenges in university towns.
Table 5. Validated and ranked community challenges in university towns.
CodeCommunity ChallengesVADER Overall RankRanking by Experts in all 23 CountriesRanking by Experts in the 6 Countries
Mean ValueStandard DeviationNormalized Mean ValueRankMean ValueStandard DeviationNormalized Mean ValueRank
P01Illegal subdivision of family homes & apartments into housing with multiple occupancies14.1721.2410.97624.1901.0620.9983
E01High rental prices24.1860.9621.00014.1910.2241.0001
P07High environmental pollution—Noise, air pollution and indiscriminate waste/garbage disposal34.1560.9210.94954.1900.8620.9982
S01Increased anti-social behaviour and social disorder.44.1601.2310.95634.1580.2510.9456
E04High cost of living (goods and services)54.1560.2880.94944.1810.4130.9834
P04Defacing neighbourhoods with graffiti, posters, writings and rental boards and advertisements64.1490.0810.93774.1400.6130.9158
S03Increased level of alcoholism, drugs peddling and abuse74.1470.1120.93494.1310.2510.90010
P03Community slumification due to the decline in housing renovations and environmental maintenance84.1520.1770.94264.1730.5710.9705
S07Displacement/replacement of established residents (gentrification)94.1410.1670.924104.1350.1550.9079
P10On-street parking and traffic congestion104.1490.2310.93784.1410.3520.9177
P06Increased population density114.1321.0030.908134.1221.2160.88512
E03Change in consumer behaviour and taste leading to changes in business models & structures.124.1190.3150.886174.1281.0080.89511
P08Increased incidents of protests leading to vandalism of the physical environment.134.1010.4320.856204.0930.2510.83717
E05High influx of commercial activities144.1390.1520.920114.1150.6240.87413
P02Changes in community land use154.1291.0850.903144.1000.2630.84915
P11Pressure on public transport164.1250.1550.896154.1090.2130.86414
P05Congestion and overcrowding on the streets and in public places including shops.174.1350.2620.913124.0960.3620.84216
S10Establishments of night-time entertainment ventures at the detrimental impacts of residential amenities184.1120.3320.874184.0871.2010.82719
C01Demographic changes leading to more youths194.1220.4210.891164.0810.5210.81720
S12Lack of social interactions among groups204.1121.0250.874194.0900.2410.83218
I01Weak and disjointed community leadership214.0841.0450.827254.0550.9140.77428
S08Increased competition for privately rented apartments224.0900.1280.837234.0690.2690.79724
S06Marginalization of permanent residents234.0550.2610.778304.0790.3230.81421
S02High level of crime due to the vulnerability & carelessness of the youthful population244.0580.3830.783294.0580.8240.77927
C08Inconsideration and lack of place attachment254.0810.0560.822264.0610.7310.78426
C06Cultural diversity and lifestyle conflicts264.0870.1990.832244.0750.5180.80722
S11Segregation and social stratification274.0991.0740.852214.0700.4190.79923
E09Low tax generation from the community since students are exempted from taxation.284.0910.3610.839224.0460.9820.75931
C04Aversion of crime and barriers to community policing caused by a transient population294.0401.0420.752334.0501.0430.76630
S04Increased level of prostitution and sexually transmitted diseases304.0311.4270.737344.0411.0990.75132
C03Lack of community cohesion and integration due to the transient nature of the student population314.0531.0540.774314.0511.0110.76729
P12Ghost community during off-term periods324.0771.1180.815274.0631.2310.78725
I03Challenges to existing urban plans and policies334.0691.2260.801284.0191.3060.71440
S05Loss of social services such as reduction in catchment areas for public schools, elderly care, etc.344.0111.1180.703364.0381.0820.74634
P09Increased pressure on urban basic services due to higher population than planned for354.0431.3010.757324.0381.0550.74633
E02Lucrative student housing business deters access to affordable housing for non-student residents.364.0111.2300.703384.0271.0700.72838
I02Neglect by politicians due to low voting power.374.0271.3770.730354.0271.1030.72839
C02Declining moral and community values383.9831.4010.655414.0291.1900.73137
E08Seasonal customer base (on and off term periods)394.0081.2310.698393.8991.2220.51543
S09Lack of year-round goods & services due to the resort-economy nature of the community403.9521.0010.603424.0341.0260.73936
E06Seasonal demand for students’ accommodation413.9521.0070.603434.0071.0090.69441
E07Seasonal scarcity of manpower in shops, restaurants, bars, etc.424.0111.1800.703374.0381.2310.74635
C09Increased racism, tribalism and religious challenges433.9901.2200.667403.9891.0250.66442
C05Differing standards of acceptable behaviours by different social groups443.5971.1530.000453.8991.3020.51544
C07Divergent perceptions on what makes up communal obligations453.9211.0320.550443.5891.2470.00045
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Abdul-Rahman, M.; Adegoriola, M.I.; McWilson, W.K.; Soyinka, O.; Adenle, Y.A. Novel Use of Social Media Big Data and Artificial Intelligence for Community Resilience Assessment (CRA) in University Towns. Sustainability 2023, 15, 1295. https://doi.org/10.3390/su15021295

AMA Style

Abdul-Rahman M, Adegoriola MI, McWilson WK, Soyinka O, Adenle YA. Novel Use of Social Media Big Data and Artificial Intelligence for Community Resilience Assessment (CRA) in University Towns. Sustainability. 2023; 15(2):1295. https://doi.org/10.3390/su15021295

Chicago/Turabian Style

Abdul-Rahman, Mohammed, Mayowa I. Adegoriola, Wilson Kodwo McWilson, Oluwole Soyinka, and Yusuf A. Adenle. 2023. "Novel Use of Social Media Big Data and Artificial Intelligence for Community Resilience Assessment (CRA) in University Towns" Sustainability 15, no. 2: 1295. https://doi.org/10.3390/su15021295

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop