Space–Time Analysis of the COVID-19 Pandemic and Its Relationship with Socioeconomic and Demographic Variables in the Metropolitan Region of São Paulo, Brazil

Santana, Keila Valente de Souza de; Marino, Aluízio; Martins, Gabriela Rosa; Lima, Pedro Henrique Barbosa Muniz; Mendonça, Pedro Henrique Rezende; Rolnik, Raquel

doi:10.3390/ijgi13110397

Open AccessArticle

Space–Time Analysis of the COVID-19 Pandemic and Its Relationship with Socioeconomic and Demographic Variables in the Metropolitan Region of São Paulo, Brazil

by

Keila Valente de Souza de Santana

^1,*

,

Aluízio Marino

¹

,

Gabriela Rosa Martins

²,

Pedro Henrique Barbosa Muniz Lima

¹,

Pedro Henrique Rezende Mendonça

³ and

Raquel Rolnik

¹

Faculty of Architecture and Urban Planning, University of São Paulo, São Paulo 05508-080, Brazil

²

Faculty of Philosophy, Literature and Human Sciences, University of São Paulo, São Paulo 05508-080, Brazil

³

Institute of Mathematics and Statistics, University of São Paulo, São Paulo 05508-090, Brazil

^*

Author to whom correspondence should be addressed.

ISPRS Int. J. Geo-Inf. 2024, 13(11), 397; https://doi.org/10.3390/ijgi13110397

Submission received: 8 August 2024 / Revised: 23 October 2024 / Accepted: 31 October 2024 / Published: 7 November 2024

(This article belongs to the Special Issue HealthScape: Intersections of Health, Environment, and GIS&T)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

This study sought to identify clusters of a high and low risk of incidence and mortality from COVID-19 throughout the pandemic period, from 2020 to 2022, in the Metropolitan Region of São Paulo (MRSP), analyzing their relationship with socioeconomic and demographic variables. Spatiotemporal and temporal variations in the clusters were determined using scan statistics, a multidimensional point process that performs multiple tests for each geographic point analyzed, in SaTScan v10.0. Socioeconomic and demographic differences were analyzed using the nonparametric Mann–Whitney and Kruskal–Wallis tests. Temporal clusters of high incidence and high mortality were observed in May 2020 and March to June 2021. In the spatiotemporal analysis, the clusters of high incidence and high mortality were concentrated in the city of São Paulo and neighboring cities, indicating that the capital was an area of influence and convergence at all times during the COVID-19 pandemic. Clusters of low mortality were found in the central region of the capital, which concentrates the highest incomes and the lowest percentages of Black, mixed-race, and Indigenous people in the MRSP. All clusters were identified in densely occupied areas and point to a pattern of disease spread that is related to income and ethnicity, as well as to the circulation dynamics of a metropolitan region.

Keywords:

COVID-19; space–time cluster; Metropolitan Region of São Paulo; population density; socioeconomic variables

1. Introduction

The COVID-19 (coronavirus disease 2019) pandemic, an infectious disease caused by the highly transmissible coronavirus SARS-CoV-2 (Severe Acute Respiratory Syndrome—Coronavirus-2), was first detected in December 2019 in the People’s Republic of China [1]. In Brazil, at the national level, measures such as social distancing, lockdowns, and extensive testing to track the disease were severely hampered by political bias. The number of deaths and people infected by COVID-19 increased steadily in the first months of the pandemic [2], and the rapid spread of the virus triggered heterogeneous health and social repercussions across Brazil’s states and municipalities [3].

São Paulo, Brazil’s most populous state (45 million inhabitants), was severely affected by COVID-19, leading the state government to declare the closure of businesses, schools, and other non-essential services. Initially, the virus spread rapidly in the capital, also called São Paulo, and its metropolitan region, where the first case of the disease was identified in Brazil. Although the capital was a hotspot for the disease, with the highest number of cases at the beginning of the pandemic, cases quickly spread to neighboring municipalities and the interior of the state, which had the highest rate in the country during this period [3,4].

Without a country-wide policy, measures to restrict the movement of people were adopted by some states and others by municipalities, which generated different epidemiological scenarios. The strategies adopted to control the COVID-19 pandemic in the Metropolitan Region of São Paulo (MRSP), such as social isolation, were sometimes established independently by the municipalities that comprise it, without taking into account territorial specificities and the links between them. MRSP, with 20 million inhabitants, is the largest city in the country and is characterized by a high level of interdependence between the municipalities and high levels of circulation density in the public transportation system, with millions of commuters using buses, trains, and the subway daily [5]. Jobs, businesses, and economic opportunities are highly concentrated in the capital, the center of MRSP, where a higher-income population lives. The combination of a high degree of internal circulation density with exclusively municipal restrictions, which did not consider this metropolitan area’s territorial dynamics, may have contributed to the ineffectiveness in controlling the spread of the disease.

The inclusion of the territorial dimension as a non-pharmacological action to combat the COVID-19 pandemic is still essential today, given the risk of the emergence and re-emergence of diseases. Therefore, the spatial analysis of the spread of the disease, through geographic information systems (GIS), not only contributes to the elaboration of territorial scenarios that could have guided and enabled strategic actions but can also guide current policies that mitigate the effects of socio-territorial inequalities on health [6]. Therefore, this study sought to identify areas with a high and low risk of incidence and mortality from COVID-19 throughout the pandemic period, from 2020 to 2022, in the MRSP, analyzing their relationship with socioeconomic and demographic variables and considering the urban territorial dynamics of a metropolis. Our hypothesis is that the use of a congested public transportation system which had not put in place any protection measures by inhabitants of peripheral regions may have influenced the dynamics of contamination of the local population through community transmission of COVID-19.

Through the development of geoinformation technologies, detailed analyses and visualizations of the propagation patterns of the COVID-19 pandemic still play an important role in understanding spatial clusters and trends in SARS-CoV-2 transmission. This article used the free software SaTScan (an abbreviation for Space and Time Scan Statistics) that can detect increased disease activity without a priori specification of the time period, geographic location, or size. As a recognized surveillance tool, the detection of “active” and “emerging” spatiotemporal clusters of COVID-19 in Brazil was mainly carried out at the municipal scale during the COVID-19 pandemic [4,7,8]. Through prospective spatiotemporal scanning analysis of the disease, they assessed whether the mortality rate, the GINI index, and social inequality were predictors of the relative risk of each cluster through a Generalized Linear Model (GLM) among Brazilian municipalities [8]. Our study is the first in Brazil to detect spatial–temporal clusters of COVID-19 cases and deaths on a more detailed scale, in areas of the São Paulo Metropolitan Region, and to assess the socioeconomic and demographic differences between them throughout the pandemic period. Despite being a retrospective study, we seek to disseminate a viable method of health surveillance that can be carried out during health emergencies and that considers the particularities of a metropolitan urban territory in local decision-making, where regional dynamics are sometimes disregarded.

2. Study Site

The MRSP is the largest city in South America and the sixth largest in the world, according to a 2014 United Nations (UN) report [5]. The MRSP is made up of 39 municipalities (Figure 1), including São Paulo, the capital of the state of São Paulo and the MRSP’s center. The municipalities in this region, which are distributed into five sub-regions, are listed below:

i.: North: Caieiras, Cajamar, Francisco Morato, Franco da Rocha, and Mairiporã.
ii.: East: Arujá, Biritiba-Mirim, Ferraz de Vasconcelos, Guararema, Guarulhos, Itaquaquecetuba, Mogi das Cruzes, Poá, Salesópolis, Santa Isabel, and Suzano.
iii.: Southeast: Diadema, Mauá, Ribeirão Pires, Rio Grande da Serra, Santo André, São Bernardo do Campo, and São Caetano do Sul.
iv.: Southwest: Cotia, Embu, Embu-Guaçu, Itapecerica da Serra, Juquitiba, São Lourenço da Serra, Taboão da Serra, and Vargem Grande Paulista.
v.: West: Barueri, Carapicuíba, Itapevi, Jandira, Osasco, Pirapora do Bom Jesus, and Santana de Parnaíba.

The capital is home to 12,330,000 inhabitants; it is the economic epicenter of the country and is a global city. In addition to São Paulo, the municipalities of Guarulhos, São Bernardo, Santo André, and Osasco have significant populations exceeding 700,000 inhabitants. Two cities in the region have the highest population densities in Brazil: Taboão da Serra and Diadema, with 13,400 and 12,800 inhabitants per km², respectively [6].

It is worth noting that population and economic activity are not evenly distributed throughout the metropolitan area. The MRSP is home to municipalities with a very complex economy, such as São Paulo, Guarulhos, Osasco, Barueri and the municipalities of ABC (santo André, Sao Bernardo, São Caetano, Mauá and Diadema), as well as municipalities with less economic weight, such as Francisco Morato, and mostly ones like Juquitiba. Most of those more densely populated municipalities also have the highest numbers of daily commuters to the capital [9].

3. Materials and Methods

This is an ecological and descriptive study assessing secondary data about the incidence and mortality of COVID-19 in the 633 weighting areas of the 39 municipalities that comprise the MRSP. Weighting areas (WA) are territorial units identified by sets of contiguous census sectors belonging to the same district, for the purpose of weighting the results of the population census sample questionnaire. A census sector is a territorial unit established for survey control purposes, comprising a continuous area located in a single urban or rural block, with a size and number of households that allow the survey by a census agent [10].

Information from the period March 2020 to February 2022 about the date of notification, sex, age, disease progression, and postal code of each patient with COVID-19 who recovered or died was accessed through a partnership with the Data Center of the State of São Paulo (CDESP), which provided data from the Epidemiological Surveillance System (SIVEP-Gripe) of the State Epidemiological Surveillance Center. These data were grouped by postal code (first five digits of the postal code) and georeferenced using the postal code database of the Centro de Estudos da Metrópole (Center for Metropolitan Studies) [11]. Linear geometries of the postal code grouping system were intersected with the weighting areas from the 2010 IBGE Census, with cases assigned proportionally to the length of the intersected lines. This study was approved by the Research Ethics Committee of the School of Psychology, Universidade de São Paulo, report number CAAE: 71605223.2.0000.5561, 14 August 2023.

The socioeconomic variables—per capita income, persons per household, and percentage of Black, Brown (mixed-race), and Indigenous people (BBIP)—by WA were built based on data from the Brazilian Institute of Geography and Statistics (IBGE), according to the 2010 census, the most recent census available. The variables were selected according to bibliographic references that analyzed the relationship between COVID-19 spread and socioeconomic factors [6,7,8].

Dasymetric mapping techniques were used to analyze the population density, which subdivide areas of origin into smaller spatial units so that there is greater internal consistency of the variable being mapped [12]. In this study, the variable of population density was calculated by dividing the number of inhabitants in WA by the total area built for residential purposes in that area. This analysis used Google Open Buildings, a large-scale open dataset that contains the vectorization of building roof contours generated from a deep learning model that was trained to determine building areas from high-resolution satellite images. Data are available under the Creative Commons Attribution license (CC BY-4.0) and the Open Data Commons Open Database License (ODbL) v1.0 [13].

The analyses were performed using the incidence and mortality rates for COVID-19 obtained for the 633 weighted areas of the 39 municipalities of the MRSP from March 2020 to February 2021, Year 1, and from March 2021 to February 2022, Year 2. To detect spatiotemporal clusters, the SaTScan v10.0 software was used, which uses a scanning window that varies in both space and time. This window spans examining different geographic regions and periods to identify where there is an anomalous concentration of the event [14]. Thus, the scanning window is an interval in time, a circle or an ellipse in space, or a cylinder with a circular or elliptical base in space–time, as in our study in which multiple different window sizes were used. The Poisson probability distribution model was used, which counts cases and deaths in space and time [15]. The cluster analysis model was built with the following conditions: COVID-19 cases and deaths were grouped by month, without cluster overlap, with circular clusters; the proportion of the population considered was 10% for the spatial scanning window, calculated by the Gini index in SatScan for purely spatial analysis. This option encourages the search for smaller true clusters and can be characterized as a coefficient of population inequality [14]. We also calculated the RR (relative risk) of COVID-19 occurrence and mortality, considering each WA and clusters in relation to the surrounding areas.

In SaTScan, the expected number of cases is estimated based on the spatial and temporal distribution of the population. In our study, no population adjustment was necessary, since the population did not vary substantially in the territory and period analyzed. However, the rates were adjusted for sex and age, as they are potential confounders for the outcome analyzed. Thus, the software calculated the expected cases in each location, taking into account the expected cases and deaths in each demographic group. This means that the expected number of cases was adjusted to reflect the age and gender structure of the population in each location, by comparing the proportion of observed cases with those that would be expected for that demographic composition. In the case of COVID-19, the risk catching of the disease was higher among the elderly, and the software adjusted the expected cases to take this into account when a location has a predominantly elderly population. This type of adjustment ensures that the clusters identified reflect a real risk, and not just differences in the demographic structure of the population [14,15].

Statistical tests were calculated using the likelihood ratio. The null hypothesis (H0) is that the observed number of cases is the same as the expected number. The alternative hypothesis (H1) is that the number of observed cases and deaths exceeds the expected number of cases derived from the null model. The window with the maximum likelihood is the most likely cluster, meaning that the observed data are more likely under the hypothesis that a cluster exists, indicating a possible focus of concentration of cases. SaTScan uses this measure to assess whether the number of events in a region or period is higher than expected, taking into account the overall incidence rate outside the scanning window. The likelihood ratio therefore serves as the criterion for identifying where and when clusters are present, with the cluster with the highest likelihood ratio being identified as the most likely. A p-value is assigned to the cluster. Results with a p-value < 0.05 using 999 Monte Carlo simulations were considered significant [15].

After identifying the space–time clusters, we statistically compared the values of the demographic and socioeconomic variables of the group of WAs belonging to high-mortality clusters to low-mortality clusters and between high-incidence clusters. Then, we compared the values of the groups for variables using the Mann–Whitney and Kruskal–Wallis non-parametric tests for the non-normal distribution of data. The null hypothesis was that the medians and interquartile ranges for the same variable were equal, with a significance level of 5%.

SaTScan™ version 10.0.1 (Kulldorff, Harvard Medical School, Boston, MA, USA), which uses geographical coordinates [14], was used to identify cases grouped in space–time and time. Maps with significant clusters and their relative risks from the space–time analyses were generated in QGIS 3.28. Temporal trends were obtained in SatScan. The significance level was set at p = 0.05. R 4.3.2 for Mac was used for database manipulation and statistical analysis.

4. Results

A total of 191,083 cases of COVID-19 were reported in the MRSP between March 2020 and February 2022. According to the progression of the patient’s condition, the notifications were either recovery or death. This study was conducted in two periods, from March 2020 to February 2021 and from March 2021 to February 2022. In the first period of the pandemic, 100,587 cases were reported, with an annual rate of 512.9 cases per 100,000 inhabitants. In the second period of the pandemic, 89,783 cases were reported, with an annual rate of 457.8 cases per 100,000 inhabitants.

4.1. Incidence Clusters

In temporal terms (Figure 2A), a high-incidence cluster was identified in the first period of the pandemic in May (relative risk [RR] = 3.07), while a low-incidence cluster was present from August to October 2020 (RR = 0.47). The high-incidence cluster interval identified in the second period of the pandemic was from March to June, with RR ranging from 1.78 to 3.58. A low-incidence period was also identified, beginning in July (RR = 0.87) (Figure 2B).

In the first period of the pandemic, three significant high-incidence clusters were identified using the space–time scan statistics of the total number of cases (Figure 3A). All were located in the capital, the municipality of São Paulo, and neighboring cities, particularly in May 2020. Cluster 1, located in the south region of the city of São Paulo and including the cities of Taboão, Embu, Itapecerica da Serra, Embu-Guaçu, Diadema, and São Bernardo do Campo, showed the highest relative risk (RR = 3.2; p-value < 0.001), followed by cluster 3 (RR = 2.2; p-value < 0.001), located in the northwest region of the capital and including part of the cities of Cajamar and Osasco, and cluster 2 (RR = 2; p-value < 0.001), located in the northeast region of the capital and including part of the city of Guarulhos and a smaller part of the city of Ferraz de Vasconcelos. No low-risk cluster was identified in this period.

In the second period of the pandemic, three space–time clusters located in the capital São Paulo and neighboring municipalities were also observed, from March to June 2021 (Figure 3B). Cluster 1, located in the south region of São Paulo and including the entire municipality of Diadema and part of Embu, Itapecerica da Serra, Embu-Guaçu and São Bernardo do Campo, displayed a high relative risk (RR = 2.9; p-value < 0.001), followed by cluster 2 (RR = 2.9; p-value < 0.001), in the east region of the capital and including part of Guarulhos and a small portion of Ferraz de Vasconcelos, and cluster 3 (RR = 2.6; p-value < 0.001), located in the northwest region of the capital, including all municipalities of Barueri, Carapicuíba, and Osasco in this period, as well as a large part of Cajamar, Santana de Parnaíba, and Taboão da Serra, and to a smaller extent, the cities of Cotia and Itapevi.

The statistical comparison of the demographic and socioeconomic variables from the weighting areas presented in the space–time high-incidence clusters showed important differences between the three groups (Table 1). In the first period of the pandemic, all analyses resulted in significant p-values, except for population density. The areas in clusters 1 and 2 had a lower per capita income, a higher percentage of BBIP, and a higher number of people per household. The median per capita income of cluster 3 was more than 50% higher than those of clusters 1 and 2. The interquartile range of cluster 3 was also much higher than the first and especially the third quartile of clusters 1 and 2.

In the second period of the pandemic, all analyses resulted in significant p-values, except for the number of people per household. The areas in clusters 1 and 2 had lower per capita income when compared to cluster 3. The median per capita income in cluster 3 remained higher than those of clusters 1 and 2, by just over 10%.

4.2. Mortality Clusters

In temporal terms (Figure 4A), a high-mortality cluster was identified in the first year of the pandemic in May (relative risk [RR] = 3.7), while a low-risk cluster was present from August 2020 to January 2021 (RR = 0.53). The high-mortality interval identified in the second period of the pandemic was in March and April, with RR ranging from 5.47 to 2.77. A low-mortality period was also identified, beginning in July (RR = 0.67) (Figure 3B).

In the first year of the pandemic, seven significant high-mortality clusters were identified using the space–time scan statistics of total deaths (Figure 5A). All clusters were located in the capital or neighboring cities, in particular in May 2020. In all high-mortality clusters, the relative risk was higher than 2.7. Two low-mortality clusters were identified in the capital, one in the central area (RR = 0.33; p-value < 0.001), and one in the north region of the capital, near Guarulhos, from July to December 2020 (RR = 0.35; p-value < 0.001).

In the second year of the pandemic, four space–time high-mortality clusters were observed in the capital and neighboring cities, from March to May 2021 (Figure 5B). In all high-mortality clusters, the relative risk was higher than 3.4. A low-mortality cluster was identified in the central-south region of the capital from July to December 2021.

The statistical comparison of the demographic and socioeconomic variables of the weighting areas included in the space–time high-mortality clusters for COVID-19 mortality showed important differences between the high- and low-risk groups (Table 2). In the first year of the pandemic, all analyses resulted in significant p-values, except for the variable of population density. The areas in the high-mortality clusters had lower per capita income, a higher percentage of BBIP, and a higher number of people per household. The median per capita income of low-risk clusters was three times higher than that of the high-mortality clusters.

In the second year of the pandemic, all analyses resulted in significant p-values. The areas in the high-mortality clusters had a lower per capita income than the low-mortality clusters. High-mortality clusters also had a higher percentage of BBIP and a higher number of people per household. There was no significant difference between the clusters in terms of population density in the first period of the study. In the second period of the study, the low-mortality cluster appeared to have a lower population density than the high-mortality cluster. In this case, both the internal variability of the cluster and the urban dynamics of the greater concentration of buildings in central areas need to be assessed.

5. Discussion

Using the multidimensional point scanning method, we identified temporal and spatiotemporal clusters of case and death notifications that demonstrated that the spread of the COVID-19 pandemic did not occur randomly or homogeneously in the MRSP. Based on surveillance data, we found that a spatiotemporal pattern of incidence and risk of death from COVID-19 during the pandemic was related to social and demographic factors and to the insertion of specific locations in the dynamics of metropolitan circulation of people and goods. The significant socioeconomic differences between the clusters express that in addition to sex, age, and comorbidities, widely discussed in the literature as mortality risk variables in relation to COVID-19, social determinants and territorial relations are also variables that can explain such an impact [4,7,8].

In the purely temporal analysis, four notable moments were identified during the two periods analyzed. In the first half of both 2020 and 2021, high-incidence clusters were identified, followed by the second half of each year, displaying low-risk clusters. In the first period (March 2020 to February 2021), after just over 2 months from the first recorded case, in May 2020, a high-incidence cluster with RR > 3 was detected, followed by a prominent decrease in risk, leading to a greater relaxation of control measures in the second half of 2020 [16].

In Brazil, although with the mandatory use of masks, crowds were promoted in the pre-election and election periods in November, in addition to the reopening of businesses and the permitting of travel in the second half of 2020 [16]. These measures, combined with the circulation of the Alpha, Gamma and Delta variants, influenced the high rate of transmission of COVID-19 in the population in early 2021 [17]. The present study reinforces this premise by identifying that the months of March to June 2021 stood out with the highest risk of incidence and mortality from the disease. The decrease in the number of cases and deaths in the region occurred from the second half of 2021, when a first dose of the vaccine had been administered to more than 50% of the population [17].

Vaccination began in February 2021, but as there were few doses available, priority was given to people with comorbidities and elderly people and did not consider socioeconomic and professional aspects, with the exception of prioritizing health professionals [18]. The poorest people who live in peripheral municipalities and needed to be at work in person and made greater use of public transportation, even though they were territorially more vulnerable and exposed to the virus, were not prioritized in the vaccination process.

In the spatiotemporal analysis, this study demonstrated that the high-incidence and high-mortality clusters were concentrated in the WA of São Paulo and neighboring municipalities, indicating that the capital was an area of influence and convergence at all times during the pandemic. Studies have already shown that COVID-19 cases began in the capital, São Paulo, and that they dispersed due to spatial contiguity, shortly after the start of the pandemic in March 2020. However, the scale of analysis in these studies was intra-urban, only in the municipality of São Paulo or inter-municipal, analyzing the dispersion throughout the state of São Paulo [3,4,6,7,19]. Our study was the first to analyze the spatiotemporal dynamics of the COVID-19 pandemic on a more detailed scale of the MRSP, WA, which is an accessible scale of spatial analysis for the entire Brazilian territory. This method pointed to a dynamic of virus dispersion that appears to be associated with an urban dynamic of regional circulation axes that involve the capital and certain neighboring municipalities.

The high-incidence spatial-temporal cluster identified between April and June 2020 in the area where Guarulhos International Airport is located, in a city neighboring São Paulo, corroborates studies that demonstrate the influence of mobility on the spread of the SARS-CoV-2 virus [3,4,8]. São Paulo/Guarulhos International Airport is the largest airport in Brazil and the second busiest in Latin America in terms of the number of passengers transported and the transportation of goods [20]. The other incidence clusters identified are in locations with a high density of people using public transportation [21].

The public transportation system in the SPMR follows a highly radial model, structured to transport passengers from the outskirts to the center, or from the neighborhoods to the radial transportation axes [21]. The areas with the highest mobility rates are located in the central region of the capital, while the areas with the highest immobility rates are located in the outskirts and in neighboring municipalities [22]. These peripheral areas of the capital and neighboring municipalities are home to the majority of the population that still needed to use public transportation to attend essential services that continued to operate in person even during the implementation of control and social distancing measures [22].

Only essential services such as food, supplies, health, banking, cleaning, and security services continued to operate in person during the pandemic [16,17]. Because of this, there was a reduction in the number of public transportation services to avoid economic losses for the companies providing these services [23]. This measure was adopted to respond to the drop in the number of passengers, which accompanied the migration of activities to remote work. However, for those whose work did not allow them to stay at home, the reduction in the number of vehicles increased waiting times for trips and, at times, increased crowding, which may have favored the transmission of the novel coronavirus [23,24].

Our study indicates that social determinants related to income and race influenced the incidence and mortality rates of the disease and need to be considered in the continuation of studies on the relationship between the territorial process of spread of the COVID-19 pandemic and urban mobility. Social behaviors, often managed by economic subsistence needs, were decisive for the pattern of virus transmission [25]. In the present analysis of the spatiotemporal clusters of disease incidence, there were statistically significant differences in the socioeconomic variables per capita income and percentage of BBIP among the three spatiotemporal clusters with the highest risk of incidence that were detected in the same period. This reflects a concentration of areas with a high risk of disease incidence also in an area of lower social vulnerability.

Despite the limitations of the analysis that considered the average of an area with widely fragmented social and economic conditions, this result expresses the need for analyses that seek to deepen the understanding of how complex socioeconomic dynamics intertwine with territorial dynamics and interfere in the spread of diseases. We understand that analyses of the relationship between health and social vulnerabilities need to be carried out spatially, as this relationship does not materialize homogeneously throughout the territory. Regarding COVID-19, although there were other municipalities that were equally or more vulnerable in the metropolitan region, they were not as affected as those where daily interaction with the capital was more frequent. In our study, clusters were identified in densely occupied areas and point to a pattern of disease spread that is related to income and ethnicity, as well as to the circulation dynamics of a metropolitan region.

Regarding COVID-19 mortality, our study reveals low- and high-mortality clusters at different times during the pandemic in the MRSP, in addition to significant differences in income and ethnicity between these clusters. It was shown that low-risk mortality clusters had a higher average per capita income, a lower BBIP percentage, and fewer people per household. The capital of São Paulo is very segregated along ethnic–racial lines. Although 36% of the capital population is Black, some high-income districts are almost 95% white [26]. Generally speaking, there has been a consolidation of the districts, places, and positions of the white social classes in the most developed, rich, and urbanized areas in the center southwest quadrant of the city, while in the distant outskirts, in the favelas and in low-income housing complexes, the Black population has become increasingly concentrated [27].

Studies show a correlation between COVID-19 mortality and socioeconomic indicators, suggesting that living conditions directly affected vulnerability to the disease, as evidenced by the impact on impoverished and Black populations [28,29]. In Brazil, despite the cash transfer policy adopted, called “Emergency Aid”, studies show that mortality rates increased as formal remuneration decreased, highlighting the differentiated impact of the pandemic [30]. This situation may be a reflection of limited access to quality health services in impoverished areas [28].

In our study, we observed that there were no low-incidence clusters in the center southwest region of the city of São Paulo, but there was a low-mortality cluster, in an area with a concentration of the highest incomes in the MRSP. In contrast, high-mortality clusters were observed in the most peripheral region of the capital São Paulo, as well as in neighboring municipalities. By integrating epidemiological models with georeferenced data and socioeconomic indicators, we analyzed how the virus spread in a complex urban environment, characterizing significant territorial disparities in incidence and mortality risk. The method employed in the present study has been widely used to detect statistically significant spatiotemporal clusters of diseases, as well as to calculate relative risks, contributing to the real-time geographic surveillance of diseases and early detection of epidemics and retrospective analysis [3,7,8,25]. We highlight the role of social inequalities interwoven with the spatial dynamics of COVID-19, detailing mortality risks in the 633 weighted areas of the MRSP. Additionally, we provide insights into how urban mobility and specific variables contributed to the spread of coronavirus infection.

The limitations of this study include the use of aggregated data from the corresponding areas, without controlling for individual patient conditions, such as chronic disease conditions. There are also inherent biases in the dataset used due to differentiated access to healthcare, as there was no mass testing to track the disease and, at times, the tests were not accessible throughout the outpatient network, which led to underreporting of cases [31]. Another limitation was the use of information on per capita income, BBIP, and people per household from the 2010 census. Data from the 2022 census have not yet been made available on an WA scale and, therefore, our study probably underestimates changes in the socioeconomic structure even if no such changes occurred in urban aspects in the territories analyzed during this period.

The choice of analysis period influences SaTScan’s ability to detect clusters [32]. The choice of parameters, such as the maximum temporal window for cluster detection, the Poisson distribution, and the shape of the cluster (circular or elliptical), can have a greater or lesser impact on the sensitivity of the model to detect clusters [14]. In the COVID-19 pandemic, the temporal distribution of cases was very heterogeneous and there were outbreaks concentrated in certain months during 2020 and 2021. Our study was intentionally carried out in two periods, Year 1 and Year 2, because clusters that are evident in shorter periods may be diluted in longer periods of analysis. The incidence of COVID-19 decreased significantly in the second half of 2021, with lower rates and a more homogeneous distribution, and the software can interpret that the data are closer to what was expected, reducing the chance of identifying significant clusters.

Even though health actions are strongly associated with medical measures, the spread of the pandemic exposed the need for a territorialized reading of health problems in order to design public policies. The use of geotechnologies during the COVID-19 pandemic, both in academic publications and on information panels of health institutions, highlights the importance of such analyses in public health management. However, access to maps as an efficient means of communicating about the spread of diseases is still a challenge due to the difficulty in incorporating urban complexity and limitations in access to data and qualified labor. This study sought an epidemiological investigation model accessible to public health surveillance management and a subsequent statistical analysis of social variables to contribute to the prioritization of policies and actions to mitigate the spread and impacts of diseases.

6. Conclusions

This study detected temporal and spatiotemporal clusters of WA in the MRSP in the first two years of the COVID-19 pandemic, as well as the respective RR calculated by the incidence within the cluster in relation to the incidence outside the cluster. The method employed can facilitate additional targeted interventions at more detailed spatial scales. In addition, carrying out tests for the set of areas that form the cluster, detecting areas with low and high rates within the cluster, reduces the possibility of detecting a cluster with only one unit of analysis.

Since public health policies to contain the spread of COVID-19 at the beginning of the pandemic relied mainly on non-pharmacological measures, a better understanding of the effectiveness of these measures through spatial analysis would allow for targeted interventions that could have mitigated the effects of the pandemic in the urban environment. However, the analyses performed here may also contribute to better outcomes in controlling future outbreaks. This is because this approach has proven useful for infectious disease surveillance and identifying statistically significant clusters of cases and deaths. However, it needs to be strengthened by adjusting efforts to analyze relevant covariates, such as poverty levels and access to healthcare. Even so, mapping spatiotemporal clusters of different outcomes can support public health policies and contribute to the advancement of epidemiological studies with ecological designs, also allowing for analyses of the relationship between health risks and social and economic indicators.

Author Contributions

Conceptualization, Keila Valente de Souza de Santana, Pedro Henrique Rezende Mendonça, Pedro Henrique Barbosa Muniz Lima, Aluízio Marino and Raquel Rolnik; methodology, Keila Valente de Souza de Santana, Pedro Henrique Rezende Mendonça, Gabriela Rosa Martins and Pedro Henrique Barbosa Muniz Lima; software, Keila Valente de Souza de Santana, Pedro Henrique Rezende Mendonça, Gabriela Rosa Martins and Pedro Henrique Barbosa Muniz Lima; formal analysis, Keila Valente de Souza de Santana; resources, Raquel Rolnik; writing—original draft preparation, Keila Valente de Souza de Santana; writing—review and editing, Pedro Henrique Rezende Mendonça and Raquel Rolnik; visualization, Pedro Henrique Rezende Mendonça, Pedro Henrique Barbosa Muniz Lima, Gabriela Rosa Martins, Aluízio Marino and Raquel Rolnik; supervision, Raquel Rolnik; project administration, Aluízio Marino and Raquel Rolnik; funding acquisition, Raquel Rolnik All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by São Paulo State Research Support Foundation (Fapesp). Funding number: 2023/03355-6, and 2021/08276-1.

Data Availability Statement

Data are contained within this article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

World Health Organization. Coronavirus Disease 2019 (COVID-19): Situation Report, 73. 2020. Available online: https://www.who.int/emergencies/diseases/novel-coronavirus-2019 (accessed on 8 January 2024).
UNASUS. Coronavírus: Brasil Confirma Primeiro Caso da Doença. Available online: https://www.unasus.gov.br/noticia/coronavirus-brasil-confirma-primeiro-caso-da-doenca (accessed on 28 January 2024).
Palasio, R.G.S.; Lorenz, C.; Lucas, P.C.; Nielsen, L.; Masuda, E.T.; Trevisan, C.M. Spatial, spatio-temporal, and origin-destination flow analyses of patients with severe acute respiratory syndrome hospitalized for COVID-19 in Southeastern Brazil, 2020–2021. Rev. Do Inst. De Med. Trop. De São Paulo 2023, 65, e6. [Google Scholar]
Ferreira, R.V.; Martines, M.R.; Toppa, R.H.; Assunção, L.M.; Desjardins, M.R.; Delmelle, E. Utilizing prospective space-time scan statistics to discover the dynamics of coronavirus disease 2019 clusters in the State of São Paulo, Brazil. Rev. Da Soc. Bras. De Med. Trop. 2022, 55, e0607. [Google Scholar]
São Paulo. Plano de Desenvolvimento Urbano Integrado da Região Metropolitana de São Paulo. Available online: https://rmsp.pdui.sp.gov.br/ (accessed on 8 May 2024).
Barrozo, L.V. Desigualdades na mortalidade infantil no Município de São Paulo: Em busca do melhor indicador. Confins 2018. [Google Scholar] [CrossRef]
Gomes, D.S.; Andrade, L.A.; Ribeiro, C.J.N. Risk clusters of COVID-19 transmission in northeastern Brazil: Prospective space-time modeling. Epidemiol. Infect. 2020, 148, e188. [Google Scholar] [CrossRef] [PubMed]
Martines, M.R.; Ferreira, R.V.; Toppa, R.H.; Assunção, L.M.; Desjardins, M.R.; Delmelle, E.M. Detecting space–time clusters of COVID-19 in Brazil: Mortality, inequality, socioeconomic vulnerability, and the relative risk of the disease in Brazilian municipalities. J. Geogr. Syst. 2021, 23, 7–36. [Google Scholar] [CrossRef] [PubMed]
Gaspar, R.C.; Aparício, C.A.; Bessa, V. A metrópole de São Paulo: Desenvolvimento econômico recente e configuração interna. In São Paulo: Transformações na Ordem Urbana; Bógus, L., Pasternak, S., Eds.; Eds.; Letra Capital: Rio de Janeiro, Brazil, 2015. [Google Scholar]
IBGE. Censo Populacional. Available online: https://cidades.ibge.gov.br/brasil/sp/araraquara/panorama (accessed on 8 February 2024).
Metrópole Cded. Base Cartográfica Digital Georreferenciada de Logradouros da Região Metropolitana de São Paulo—Edicão. 2020. Available online: https://centrodametropole.fflch.usp.br/pt-br/node/9838 (accessed on 8 March 2024).
Research Google. Open Buildings: A Dataset of Buildings Footprint to Support Social Good Applications. Available online: https://sites.research.google/gr/open-buildings/ (accessed on 1 February 2024).
Petrov, A. One hundred years of dasymetric mapping: Back to the origin. Cartogr. J. 2012, 49, 256–264. [Google Scholar] [CrossRef]
Kulldorff, M. SaTScanTM User Guide. 2018. Available online: https://www.satscan.org/SaTScan_TM_Manual_do_Usu%C3%A1rio_Portugues.pdf (accessed on 1 March 2024).
Kulldorff, M. A spatial scan statistic. Commun. Stat.-Theory Methods 1997, 26, 1481–1496. [Google Scholar] [CrossRef]
São Paulo. Governo de SP Reforça Controle de Pandemia e Põe Estado na Fase Amarela. São Paulo: Governo do Estado 2020. Available online: https://www.saopaulo.sp.gov.br/noticias-coronavirus/governo-de-sp-reforca-controle-de-pandemia-e-poe-estado-na-fase-amarela-2/ (accessed on 8 July 2024).
Brasil. Ministério da Saúde. Boletim Epidemiológico Especial: Doença Pelo Novo Coronavírus—COVID-19. Semana epidemiológica 39. Brasília, DF: Ministério da Saúde. 2021. Available online: https://www.gov.br/saude/pt-br/centrais-de-conteudo/publicacoes/boletins/epidemiologicos/covid-19/2021/boletim_epidemiologico_covid_83.pdf (accessed on 3 October 2024).
São Paulo (Município). Secretaria Municipal da Saúde. Instrutivos Para priorização de doses da Vacina de COVID-19 no Município de São Paulo e Documentos Técnicos. São Paulo: Secretaria Municipal da Saúde. 2024. Available online: https://capital.sp.gov.br/web/saude/w/vigilancia_em_saude/doencas_e_agravos/coronavirus/315208 (accessed on 3 October 2024).
Demenech, L.M.; Dumith, S.d.C.; Vieira, M.E.C.D.; Silva, L.N. Desigualdade econômica e risco de infecção e morte por COVID-19 no Brasil. Rev. Bras. De Epidemiol. 2020, 23, e200095. [Google Scholar] [CrossRef] [PubMed]
OGLOBO. Entre os Maiores do Mundo, Aeroporto de Guarulhos é o Segundo Mais Pontual. 2017. Available online: https://oglobo.globo.com/boa-viagem/entre-os-maiores-do-mundo-aeroporto-de-guarulhos-o-segundo-mais-pontual-20728827 (accessed on 6 June 2024).
Rolnik, R.; Klintowitz, D. Mobilidade na cidade de São Paulo. Estud. Avançados 2011, 25, 89–108. [Google Scholar] [CrossRef]
Pilotto, A.S.; Novaski, M.A.M. Indicadores de mobilidade urbana na RMSP a partir da pesquisa OD-Metrô. Cad. Metrópole 2022, 25, 229–254. [Google Scholar] [CrossRef]
Mendonça, P.H.R.; Rolnik, R.; Yeuw, T.T.; Marino, A. Mobilidade na Cidade de São Paulo: Lições das Transformações Durante a Pandemia de COVID-19; Enanpur: Belém, Brazil, 2023. [Google Scholar]
Silva, R.B. Vidas no sufoco nos transportes na pandemia: Um App de mapeamento colaborativo para alerta de lotação na Região Metropolitana de São Paulo (RMSP). Confin. Rev. Fr.-Brésilienne De Géographie/Rev. Fr.-Bras. De Geogr. 2023, 58, 1–24. [Google Scholar]
Barrozo, L.V.; Serafim, M.B.; Moraes, S.L.; Mansur, G. Monitoramento espaço-temporal das áreas de alto risco de COVID-19 nos municípios do Brasil. Hygeia Rev. Bras. De Geogr. Médica E Da Saúde 2020, 16, 417. [Google Scholar] [CrossRef]
IBGE. Instituto Brasileiro de Geografia e Estatística. Censo Demográfico. 2022. Available online: https://www.ibge.gov.br/estatisticas/sociais/trabalho/22827-censo-demografico-2022.html (accessed on 8 May 2024).
Oliveira, R.J.; Oliveira, R.M.S. São Paulo cidade negra no século XXI. Rev. Da ABPN 2020, 12, 489–515. [Google Scholar] [CrossRef]
Oliveira, R.; Cunha, A.; Santos Gadelha, A.; Carpio, C.; Oliveira, R.; Corrêa, R. Desigualdades raciais e a morte como horizonte: Considerações sobre a COVID-19 e o racismo estrutural. Cad. De Saúde Pública 2020, 36, e00150120. [Google Scholar] [CrossRef] [PubMed]
Barreto, W.L.; Pereira, F.H.; Perez, Y.; Schimit, P.H.T. Spatial dynamics of COVID-19 in São Paulo: A cellular automata and GIS approach. Spat. Spatio-Temporal Epidemiol. 2024, 50, 100674. [Google Scholar] [CrossRef] [PubMed]
Rede Nossa São Paulo. A COVID-19 e as Desigualdades:O Que os Dados nos Contam Após um ano de Pandemia. 2021. Available online: https://www.nossasaopaulo.org.br/wp-content/uploads/2021/09/Mapa-da-Desigualdade-Especial-Covid-2021.pdf (accessed on 8 October 2024).
FIOCRUZ. MonitoraCOVID-19 Avalia Cobertura dos Testes em Massa no Controle da Epidemia no Brasil. MonitoraCOVID-19, Fundação Oswaldo Cruz (Fiocruz). 2021. Available online: https://portal.fiocruz.br/noticia/monitoracovid-19-avalia-cobertura-dos-testes-em-massa-no-controle-da-epidemia-no-brasil (accessed on 8 October 2024).
Levin-Rector, A.; Kulldorff, M.; Peterson, E.R.; Hostovich, S.; Greene, S.K. Prospective Spatiotemporal Cluster Detection Using SaTScan: Tutorial for Designing and Fine-Tuning a System to Detect Reportable Communicable Disease Outbreaks. JMIR Public Health Surveill 2024, 10, e50653. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Map showing the municipalities of the MRSP. Source: IBGE.

Figure 2. Distribution of COVID-19 cases in the MRSP. (A) Seasonal analysis showing the months of low and high risk in Year 1 (March 2020 to February 2021). (B) Seasonal analysis showing the months of low and high risk in Year 2 (March 2021 to February 2022).

Figure 3. (A) Space–time clusters of COVID-19 incidence in Year 1 of the pandemic, from March 2020 to February 2021. (B) Space–time clusters of COVID-19 incidence in Year 2 of the pandemic, from March 2021 to February 2022.

Figure 4. Distribution of deaths due to COVID-19 in the MRSP. (A) Seasonal analysis showing the months of the low- and high-mortality clusters in Year 1 (March 2020 to February 2021). (B) Seasonal analysis showing the months of low- and high-mortality clusters in Year 2 (March 2021 to February 2022).

Figure 5. (A) Space–time clusters of COVID-19 deaths in Year 1 of the pandemic, from March 2020 to February 2021. * Year: 2021. (B) Space–time clusters of COVID-19 deaths in Year 2 of the pandemic, from March 2021 to February 2022.

Table 1. Median and interquartile range of socioeconomic indicators of the space–time clusters of COVID-19 incidence in Year 1 (March 2020 to February 2021) and Year 2 (March 2021 to February 2022). Method: Kruskal–Wallis. Median (25⁰, 75⁰ percentile). * p Value < 0.001.

Variable	High-Incidence Cluster in Year 1			High-Incidence Cluster in Year 2
Variable	1	2	3	1	2	3
Per capita income	551 (461–881)	559 (442–719)	989 (611–1918) *	567 (469–892)	661 (513–867)	739 (521–1179) *
BBIP (%)	52 (40–56)	45 (35–54)	29 (18–42) *	50 (37–56)	40 (30–49.3)	43 (28.3–50) *
People per household	3.4 (3.2–3.5)	3.4 (3.3–3.5)	3.1 (2.8–3.4) *	3.3 (3.2–3.4)	3.3 (3.2–3.3)	3.3 (3.1–3.4)
Population density	328 (253–405)	297 (260–346)	324 (238–411)	334 (255–407)	283 (251–329)	318 (220–379) *

Table 2. Median and interquartile range of socioeconomic indicators of the space–time clusters of COVID-19 deaths in Year 1 (March 2021 to February 2022) and Year 2 (March 2021 to February 2022). Method: Kruskal–Wallis. Median (25⁰, 75⁰ percentile). * p Value < 0.001.

Variable	COVID-19 Death Clusters in Year 1		COVID-19 Death Clusters in Year 2
Variable	High-Mortality	Low-Mortality	High-Mortality	Low-Mortality
Per capita income	548 (452–745)	1943 (989–3490) *	558 (454–718)	3362 (2223–3956) *
BBIP (%)	49 (38–54)	18 (9–20) *	47 (38–45)	12 (9–20) *
People per household	3.4 (3.3–3.5)	2.6 (2.4–3.1) *	3.4 (3.3–3.5)	2.5 (2.3–2.7) *
Population density	314 (243–396)	349 (268–424)	300 (238–361)	355 (256–440) *

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Published by MDPI on behalf of the International Society for Photogrammetry and Remote Sensing. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Santana, K.V.d.S.d.; Marino, A.; Martins, G.R.; Lima, P.H.B.M.; Mendonça, P.H.R.; Rolnik, R. Space–Time Analysis of the COVID-19 Pandemic and Its Relationship with Socioeconomic and Demographic Variables in the Metropolitan Region of São Paulo, Brazil. ISPRS Int. J. Geo-Inf. 2024, 13, 397. https://doi.org/10.3390/ijgi13110397

AMA Style

Santana KVdSd, Marino A, Martins GR, Lima PHBM, Mendonça PHR, Rolnik R. Space–Time Analysis of the COVID-19 Pandemic and Its Relationship with Socioeconomic and Demographic Variables in the Metropolitan Region of São Paulo, Brazil. ISPRS International Journal of Geo-Information. 2024; 13(11):397. https://doi.org/10.3390/ijgi13110397

Chicago/Turabian Style

Santana, Keila Valente de Souza de, Aluízio Marino, Gabriela Rosa Martins, Pedro Henrique Barbosa Muniz Lima, Pedro Henrique Rezende Mendonça, and Raquel Rolnik. 2024. "Space–Time Analysis of the COVID-19 Pandemic and Its Relationship with Socioeconomic and Demographic Variables in the Metropolitan Region of São Paulo, Brazil" ISPRS International Journal of Geo-Information 13, no. 11: 397. https://doi.org/10.3390/ijgi13110397

APA Style

Santana, K. V. d. S. d., Marino, A., Martins, G. R., Lima, P. H. B. M., Mendonça, P. H. R., & Rolnik, R. (2024). Space–Time Analysis of the COVID-19 Pandemic and Its Relationship with Socioeconomic and Demographic Variables in the Metropolitan Region of São Paulo, Brazil. ISPRS International Journal of Geo-Information, 13(11), 397. https://doi.org/10.3390/ijgi13110397

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Space–Time Analysis of the COVID-19 Pandemic and Its Relationship with Socioeconomic and Demographic Variables in the Metropolitan Region of São Paulo, Brazil

Abstract

1. Introduction

2. Study Site

3. Materials and Methods

4. Results

4.1. Incidence Clusters

4.2. Mortality Clusters

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI