Next Article in Journal
E-Learning Behavior Categories and Influencing Factors of STEM Courses: A Case Study of the Open University Learning Analysis Dataset (OULAD)
Next Article in Special Issue
Microplastic Pollution and Monitoring in Seawater and Harbor Environments: A Meta-Analysis and Review
Previous Article in Journal
Accurate Detection of Concrete Pavement Thickness Based on Ultrasonic Array
Previous Article in Special Issue
Quality Assessment of Sludge from Filter Backwash Water in Swimming Pool Facilities
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Neighborhood-Level Particle Pollution Assessment during the COVID-19 Pandemic via a Novel IoT Solution

by
Christos Spandonidis
,
Dimitrios Paraskevopoulos
* and
Christina Saravanos
Prisma Electronics S.A., Research, and Development, 87 Democratias Avenue, 68132 Alexandroupolis, Greece
*
Author to whom correspondence should be addressed.
Sustainability 2023, 15(10), 8233; https://doi.org/10.3390/su15108233
Submission received: 21 March 2023 / Revised: 8 May 2023 / Accepted: 15 May 2023 / Published: 18 May 2023
(This article belongs to the Special Issue Environmental Monitoring and Assessment for Sustainability)

Abstract

:
In recent years, the concentration levels of various air pollutants have been constantly increasing, primarily due to the high vehicle flow. In 2020, however, severe lockdowns in Greece were imposed to reduce the spread of the COVID-19 pandemic, which led to a rapid reduction in the concentration levels of air pollutants such as PM2.5 and PM10 in the atmosphere. Initially, this paper seeks to identify the correlation between the concentration levels of PM10 and the traffic flow by acquiring data from low-cost IoT devices which were placed in Thessaloniki, Greece, from March to August 2020. The correlation and the linearity between the two parameters were further investigated by applying descriptive analytics, regression techniques, Pearson correlation, and independent T-testing. The obtained results indicate that the concentration levels of PM10 are strongly correlated to the vehicle flow. Therefore, the results hint that the decrease in the vehicle flow could result in improving the quality of environmental air. Finally, the acquired results point out that the temperature and humidity are weakly correlated with the concentration levels of PM10 present in the atmosphere.

1. Introduction

Nowadays, the quality of environmental air is constantly deteriorating in most urban areas, resulting in poor quality of life. Several studies have indicated that a wide range of health problems are caused by the high concentration of air pollutants such as CO, NO, and O3 [1]. The studies presented in [2,3,4,5,6,7,8,9] suggest that the high concentration of air pollutants in the urban air has severely impacted overall health, particularly of the cardiorespiratory system. The study conducted by [2] indicated that air pollution caused by vehicle traffic is the primary cause of death in most urban cities. In addition, the study by Han et al. [10] reviewed the negative impact of air pollution on the population of developing countries. Vehicle traffic is the primary factor behind the high concentration level of air particles in the atmosphere [10]. In 2020, lockdowns were enforced due to the outbreak of the COVID-19 pandemic. The studies presented in [11,12,13,14,15,16] suggest that vehicle traffic decreased by approximately 50% in several cities in Europe, the U.S., Asia, and South America during the first waves of the imposed lockdowns. Consequently, the concentration of several air pollutants in the atmosphere, e.g., CO, O3, and particulate matter mass (PM), decreased by 30% to 70%.
In recent years, several studies have emerged that aspire to calculate the correlation between the concentration of air pollutants and vehicle flow in several European cities. Rojas et al. [17] conducted a preliminary statistical analysis on the concentration of NO2 and the traffic flow by employing the analysis of variance (ANOVA), which is used to analyze the significant difference between the means of groups. Rojas et al. [17] acquired the measurements before and after the first imposed lockdown and compared the obtained measurements via ANOVA to determine whether the mean value was the same across the groups. Moreover, Sifakis et al. [18] employed several statistical analysis techniques to investigate the changes during the enforced lockdown of 2020 and to calculate the correlation between the concentration of several air pollutants and vehicle flow. The correlation between the two parameters was calculated via several statistical analysis techniques such as ANOVA, the Pearson Correlation Coefficient, and Fischer’s Least Significant Difference. The authors in [19,20] aspired to calculate the correlation between the concentrations of PM10 and PM2.5 by statistically analyzing the data and employing statistical analysis techniques. Rossi et al. [21] investigated the relationship between the concentration of several air pollutants, e.g., NO, NO2, NOx, PM10, and the traffic flow. The relationship between the two parameters was computed via Spearman’s correlation test.
The majority of air quality monitoring applications rely on data collected by sensors, and tests have been conducted to compare the effectiveness of low-cost PM and gas sensors to that of reference equipment. In a recent study, Cichowicz et al. [22] used commercial sensors positioned throughout the area under investigation to collect data over three years (2019, 2020, 2021). Liu et al. [23] collected data over a period of 13 months, using low-cost air quality devices with two inexpensive sensors, each of which was exposed to a variety of pollution sources and concentrations, relative humidity, and temperature at four locations in Australia and China. The daily relative errors of PM2.5 and CO concentrations between the monitors and the reference instruments were analyzed in order to determine the long-term stability of the KOALA monitors used in [23]. Kim et al. [24] created an IoT-Based Particulate Matter Monitoring System for construction sites, and the authors emphasize that adopting an IP65-grade PM sensor and creating a dustproof and waterproof outlet improved the system’s outdoor durability. It is important to note that the impacts of different ambient conditions on low-cost sensor performance are various, including temperature and relative humidity according to [25,26].
The obtained data often contain noise, outliers, missing or inconsistent values, and therefore several preprocessing techniques should be applied to obtain a clean dataset. Most real-world applications aim to remove extreme values which deviate from the given dataset by employing normalization or standardization techniques [20,21]. The noise present in a dataset can be removed by employing data smoothing techniques such as the simple moving average algorithm [21,27].
In this paper, several low-cost IoT devices with embedded measurement devices were placed in buildings in various neighborhoods in Thessaloniki, Greece. Measurements concerning PM10 and PM2.5 were obtained before, during, and after the first imposed lockdown of 2020. The municipality of Thessaloniki provided information regarding the traffic flow of the city. The obtained measurements were, initially, preprocessed by employing outlier and smoothing techniques, and the preprocessed data hint that the levels of PM10, and PM2.5 decreased during the first wave of imposed lockdowns. Therefore, the air quality, considering the PM, improved in the neighborhoods of Thessaloniki, which is the second largest city in Greece.
This paper, additionally, aims to investigate the correlation and linearity between the concentration levels of PM10, traffic flow, temperature, and humidity. Two null hypotheses were employed to determine the desired correlation. One was that the traffic flow does not influence the concentration level of the air pollutants under investigation, while the other was that the temperature and humidity of the station’s location do not impact the levels of air pollutants. The null hypothesis testing was performed via quantitative research-based experiments.
The remainder of this paper is organized as follows: Section 2 elaborates on the measurement devices and the selection of the stations which were employed to acquire the measurements. Section 3 depicts the methodology and techniques which were applied to preprocess the obtained measurements. Section 4 presents the results of the calculated correlation and linearity between the concentration levels of the PM10, traffic flow, temperature, and humidity. Finally, Section 5 draws the final conclusions and future directions.

2. Data Acquisition

This section elaborates on the devices and the scheme which were employed to acquire the concentration levels of PM10 and PM2.5. Initially, the Sympnia IoT Platform was introduced; this is a low-cost Internet of Things (IoT) device which measures the level of air pollution in the atmosphere. Several Sympnia IoT Platforms were placed in Thessaloniki, Greece. This study, however, relied on the measurements acquired from four stations that were placed in the city, so this section elaborates on which criteria had to be fulfilled for a station to be selected.

2.1. The Sympnia IoT Platform

The concentration levels of PM10 and PM2.5 were acquired from the Sympnia IoT Platform, a low-cost IoT platform that measures the level of air pollution on a neighborhood level. Most environmental monitoring systems measure the concentration levels of air pollutants on a city level. The Sympnia IoT Platform relies on the PrismaSense system, introduced by Prisma Electronics to measure the concentration levels of six air pollutants: NO2, NO, O3, CO, PM2.5, and PM10. In [28], the authors describe the smart collector part (hardware and embedded software) of the platform. There is also information about the hardware system testing, including accuracy, stability, data integrity and sensitivity testing [28].
The wireless transmission of the Sympnia IoT Component relies on the Sympnia Smart Collector. The Smart Collector consists of a microcontroller and several interfaces to connect to the sensors. The Smart Collector preprocesses the obtained measurements to calculate several parameters, and the sampling rate ranges from 1 s to 30 min. Two distinct versions of the Sympnia Smart Collector have been developed. The two Smart Collectors differ in the microcontroller used to acquire the concentration levels of air pollutants in the atmosphere and enable wireless communication with the remote server.
The processing module of the Smart Collector relies on an EPS32 microcontroller manufactured by Espressif. The processing module depends on an ultra-low-power mixed signal microcontroller with a dual-core 32-bit RISC CPU. The CPU runs from 160 to 240 MHz. Table 1 describes the main features of the EPS32 microcontroller of the second version and has been presented in detail in [28].
The measurements of PM10 and PM2.5 were acquired from low-cost sensors via a software-embedded device referred to as the Sympnia Smart Collector or Smart Collector. The Smart Collectors are easily programmed via an in-house environment or a third-party IDE. Advanced calibration circuity is embedded into the Smart Collectors to remove any outliers obtained from extreme weather conditions. The Smart Collector is compatible with 5.5 and 3.3 sensors that employ UART, I2C, SPI, and GPIO protocols. Five distinct sensor platforms were implemented, and they are summarized in Table 2 [28]. The calibration of the sensors is the most important factor for selecting a low-cost reliable sensor. In other words, low-cost air quality sensors were selected based on the calibration sheets provided by their manufacturer, the detailed description of the acquisition and transmission of the data, and the impact of the sensors provided by the respective literature. The measurements acquired by the Sympnia IoT platform could be normalized if selected for the temperature and humidity levels of the microcontroller’s environment. Four sensors were embedded in the Smart Collector, which could be adjusted to work simultaneously.
The embedded software of the Sympnia IoT Component has been developed for wireless transmission, acquiring and visualizing the concentration levels of air pollutants. The Sympnia IoT Platform comprises a user-friendly platform that allows its users to access the current and forecast levels of PM10 and PM2.5. The measurements are obtained from over 10,000 air monitoring stations, and they are displayed in the form of an Air Quality Index to be comprehended by everyone, and are accompanied by a color characterization that defines the levels of the current air quality. The Sympnia IoT device provides additional information obtained from every station. By employing the Sympnia IoT platform, the user can visualize the concentration levels of the air pollutants under investigation and download the necessary data for future use [28,29].

2.2. Selecting the Data Stations

Hundreds of Sympnia IoT platform devices were placed in Thessaloniki, Greece, and as a result a dense network of devices was created in the city. However, the measurements for the following analysis were obtained from four stations. The stations were selected based on three factors: the spatial distribution, the socioeconomic status of the location, and the building’s proximity to the road.
Three additional aspects were considered in selecting which sensors should be activated. These aspects coincide with the parameters indicated by the municipality of the Greek city, the efficiency of the sensor, and the calibration sheets provided by the manufacturer. The former was applied to every area in the city.
Figure 1 illustrates the four stations which were selected to obtain the concentration levels of the PM10 and PM2.5 based on the aforementioned factors. Figure 1 shows the locations against the background of Europe, Greece, and only Thessaloniki. Three stations were selected in different neighborhoods across the city (pins 1, 3, 4 in Figure 1), whereas the fourth was distributed in an industrial area (pin 2 in Figure 1). The stations only acquired measurements of air pollutants PM2.5 and PM10.
The statistical analysis was conducted by considering PM10 the dependent variable, while the city’s traffic flow, temperature, and humidity levels were considered the independent parameters. The traffic flow data were provided by the municipality of Thessaloniki.

3. Data Analysis

This section presents the methodology followed to analyze the measurements obtained from the Sympnia IoT Platform. Figure 2 illustrates this scheme. The obtained measurements were preprocessed depending on the quality of the data. The inspection of the quality of the obtained measurements is often referred to as the initial quality control. The initial quality control relied on employing several techniques to remove outliers, gap-fill, and normalize the acquired data. Then, the preprocessed measurements were visualized. Descriptive analytics were additionally applied to portray the main statistics, e.g., the mean, standard deviation, and minimum and maximum value of the acquired concentration levels of air pollutants. The primary statistical values were applied to provide a better understanding of the quality of the accumulated measurements.
Thereupon, according to Figure 2, the impact of other factors on the concentration levels of the air pollutants was estimated by applying inference analytics. The acquired measurements were processed by considering the PM10 as the dependent variable and the temperature, traffic flow, and humidity levels as the independent. The linear relationship between the dependent and independent variables was calculated by applying linear regression, and ANOVA analysis followed. Inference analytics included other techniques, such as Independent Sample T-test and Pearson Correlation.

3.1. Data Cleansing

Data cleansing is significant due to the presence of outliers and noise in the acquired data. The removal of the outliers within the dataset relies on the Median Absolute Deviation (MAD). The outliers were removed by applying the following equation:
i f   o u t l i e r   i s   o u t s i d e   t h e   i n t e r v a l = μ   ± 1.4826 k M A D
where the μ denotes the mean value and k controls how intense the removal is. Additionally, the data were smoothed by applying a moving average window of length L , which is mathematically defined as:
m o v i n g _ a v g = 1 L i = 1 L a i
The a i corresponds to the i t h element in the moving window, and the length of the moving window ( L ) was set equal to 15.

3.2. Independent Sample T-test

An Independent Sample T-test is a statistical test used to determine whether there is a significant difference between the means of two groups, and it relies on the alpha level. The alpha level suggests the level of the threshold of rejecting the null hypothesis. The alpha level sets the probability that a Type 1 error can occur. A Type 1 error involves rejecting the null hypothesis and accepting the alternative one. The p-value is the probability of obtaining results that are at least as extreme as the observed results under the assumption that the null hypothesis is correct. The p-value is linked to the alpha level. If the value of the p-value is less than the alpha level, then there is a statistically significant relationship between the dependent and independent variables. As the value of p-value decreases, the occurrence of a false positive occurring becomes less likely, and therefore the null hypothesis can be rejected [30,31,32].

3.3. Pearson Correlation Coefficients

The Pearson correlation coefficient measures the linear dependence between two variables, and it is defined as the covariance of the two variables divided by the product of their respective standard deviations. The product of the standard deviations of the two results is used as the normalization factor. Mathematically, the Pearson correlation coefficient is defined as:
r x y = x i x ¯ ( y i y ¯ ) x i x ¯ 2   y i y ¯ 2  
where n ,   x i ,   y i ,   x ¯ = 1 n i = 1 n x i and y ¯ = 1 n i = 1 n y i denote the number of acquired measurements, the i t h element of the variables X and Y , and the mean value of them. The value of the coefficient r x y ranges from 1 to + 1 . The Pearson correlation coefficient indicates the strength of the linear relationship between the two variables X and Y . If the two variables X and Y are positively correlated, then the Pearson correlation coefficient is greater than zero. However, if the value of the Pearson correlation coefficient is less than zero, then the two variables are considered negatively correlated. Finally, if the value of the Pearson correlation coefficient is equal to 0, then the variables X and Y are uncorrelated [33].

3.4. Linear Regression

Regression is a technique that is either used to solve forecasting problems or in statistical analysis to determine the linear relationship between dependent and independent variables. The regression models aim to predict the value of the dependent variable Y based on the values of the independent variable X . There are two types of linear regression, simple and multivariate linear regression, depending on the size of the independent variables and the complexity of their relationship with the dependent variable. Regression analysis relies on several assumptions concerning the independent and dependent variables. In short, these assumptions state that: (i) the relationship between two variables must be linear, (ii) the observations provided by a sample are independent of each other, and (iii) the provided samples should not contain many outliers [34,35].

3.5. Analysis of Variance

ANOVA aims to split the observations provided by a dataset into two parts: systematic and random factors. The former has a statistical influence on the dataset, while the latter does not. The calculated significant p-value suggests that there is at least one pair where the mean difference is considered statistically important. The pair is identified by employing pair-wise multiple comparisons or post hoc tests. Post hoc tests rely on identifying the homogeneity of the variances amongst the different groups. Post hoc tests are used to determine the significant pair or pairs after the respective ANOVA was deemed important [30].
The ANOVA (Analysis of Variance) that follows the calculation of a regression table is called an ANOVA table or regression ANOVA, and is a special type of one-way ANOVA. It tests the null hypothesis that all of the regression coefficients are equal to zero, meaning that there is no significant relationship between the dependent variable and the independent variables in the model. The regression ANOVA table summarizes the sources of variation in the regression model and tests the null hypothesis that the regression model does not explain a significant proportion of the variation in the dependent variable. It does this by comparing the variability explained by the regression model to the variability not explained by the model.

4. Results

This section elaborates on the results of the measurements from the four stations located in Thessaloniki, Greece. In the next subsection, the descriptive analysis of the measurements of PM10 and PM2.5 that were obtained via the Sympnia IoT platform are described, and all measurements were preprocessed via the schemes presented in Section 2 and Section 3, respectively. There is also a subsection on statistical analysis, in which two null hypothesis tests were employed. The first suggested that the traffic flow does not impact the levels of PM10 present in the atmosphere. The second null hypothesis claimed that the temperature and humidity would not influence the concentration levels of the pollutants. The results were obtained by assuming that all variables follow normal distributions.

4.1. Preliminary Descriptive Analysis of the Measurements

Figure 3 illustrates the time series of the concentration levels of PM2.5 air pollutants acquired from the four stations. The illustrated measurements were acquired from February to August 2020. The bottom-right figure in Figure 3 suggests that there is a period with missing data for Station 4 from June to July 2020. The high levels of PM2.5 air pollutants present in the top-right figure suggest that Station 2 is located on one of the busiest streets in Thessaloniki.
Figure 4 illustrates the measurements of the concentration levels of PM2.5 and PM10 air pollutants from Station 2. It points out that the concentration levels of both PM2.5 and PM10 were rather low during the period between March and June 2020. This time frame corresponds to the period when the first wave of lockdowns was imposed in Greece. The imposed lockdown reduced the need to move around by vehicles, which led to a reduction in the concentration levels of PM10 and PM2.5. Then, the concentration levels increased again due to the relaxation of the restrictions, according to Figure 4.
Figure 5 depicts the measurements of the concentration levels of PM10 air pollutants from Station 2, while the yellow, blue, and green segments correspond to the measurements which were obtained before, during, and after the first wave of the enforced lockdowns. Figure 5 suggests that both noise and outliers were present in the acquired measurements. This is quite noticeable due to the sudden changes in the values. Figure 6 portrays the same measurements after applying the preprocessing techniques. The time series were preprocessed by smoothing and removing the outliers.
Figure 7 illustrates the correlation between the concentration levels of PM10 and PM2.5 air pollutants acquired by Station 2 and Station 4. The PM2.5 is the independent variable, while the PM10 is the dependent variable. The illustrated results suggest that the concentration levels of the PM10 air pollutant rose as the values of PM2.5 air pollutants increased. In other words, the two air pollutants were highly positively correlated, as expected.

4.2. Statistical Analysis for the Measurements of Station 2

The available measurements were analyzed using various techniques. The measurements of PM2.5 and PM10 were highly correlated, as depicted in Figure 7. Therefore, we decided to include only measurements of PM10 for the statistical analysis. The statistical analysis which follows does not include the measurements of Stations 1 and 3 because there were no available data of traffic flow to calculate the desired correlation. In this subsection, Station 2 is analyzed and the first null hypothesis testing was applied to the measurements of PM10 air pollutants obtained by Station 2. The descriptive statistics and the independent T-test were employed in the preprocessed measurements, and the results are depicted in Table 3 and Table 4. Table 3 indicates the values of mean, standard deviation, standard error, and coefficient of variation for the two groups (before and after traffic restrictions). Observing Table 4, there is an indication that the p-value was rather low, and as a result the null hypothesis, which suggests that the traffic flow does not impact the concentration levels of PM10 air pollutants, can be rejected.
The correlation between PM10 and traffic flow was also calculated via the Pearson correlation coefficients. The high correlation coefficients between the measurements of PM10 air pollutants and the vehicle flow are presented in Table 5, and there is a strong positive correlation between the two variables.
Table 6 and Table 7 portray the results which emerged after applying linear regression and ANOVA. The null hypothesis H0 suggested that the vehicle flow does not impact the concentration levels of PM10, while the H1 represented the alternative hypothesis, which states that there is a linear relationship between the dependent variable (PM10) and the independent variable (number of vehicles). The regression statistics in Table 6 indicate that the RMSE value of the null hypothesis H0 is rather high, meaning that the hypothesis that the traffic flow does not impact the concentration levels is false and should be discarded. On the other hand, the extremely low RMSE and high R values of the alternate hypothesis H1 indicate that there is a perfect linear relationship between the two variables. Table 7 shows the degrees of freedom (df), the sum of squares, the mean square, the F statistic, and the overall significance of the regression model for the alternate hypothesis H1. The F statistic is calculated as the regression mean square/residual mean square. This statistic indicates whether the regression model provides a better fit to the data than the model that contains no independent variables. According to Table 7, the p-value is less than 0.001, so there is sufficient evidence to conclude that the regression model fits the data better than the model with no predictor variable.
Lastly, Table 8 presents the overall significance of the regression model, and the low p-value which corresponds to the null hypothesis H1 suggests that, similar to the results presented in Table 6 and Table 7, the hypothesis that claims that the traffic flow influences the levels of the PM10 is true, and therefore should be accepted.

4.3. Statistical Analysis for the Measurements of Station 4

In this subsection, the measurements of Station 4 are analyzed. While in the case of Station 2 we only had data for the PM10 and vehicle traffic, in the case of Station 4 we also had data for the humidity and temperature of the environment. Based on this fact, we performed the same evaluation testing but this time we evaluated the dependent parameter (PM10) against the three independent parameters (vehicle flow, humidity, and temperature). The measurements for temperature were obtained from May to July 2020, as illustrated in Figure 8. Figure 8 also illustrates the scatter plots created from calculating the correlation between the concentration levels of the PM10 air pollutant and the traffic flow. The illustrated results indicate that the acquired measurements of PM10 air pollutants were strongly correlated with traffic flow and weakly correlated with temperature.
The results in Table 9, also, point out that the concentration levels of PM10 air pollutants and the vehicle flow are strongly correlated according to the high Pearson correlation coefficient. The low coefficient values between the PM10 and temperature, and between the PM10 and humidity levels indicate that the concentration levels of the air pollutant are weakly correlated to the temperature and humidity. Finally, there is an expected negative correlation between humidity and temperature.
Table 10 and Table 11 present the results after employing linear regression and ANOVA for the measurements of Station 4. In this case, the null hypothesis H0 suggests that the independent variables do not impact the concentration levels of the PM10, while H1 represents the alternative hypothesis, which states that there is a linear relationship between the dependent variable and the three independent variables. Table 10, presenting the regression statistics, suggests accepting the alternative hypothesis H1 since the R-value is high and the RMSE value is extremely low. Table 11 depicts the overall significance of the regression model in the case of H1, and the value of the df is 3 since the independent variables are 3, in contrast to Station 2. The low p-value confirms that there is sufficient evidence to conclude that the regression model fits the data better than the model with no predictor variables. If none of the predictor variables in the model are statistically significant, the overall F statistic would also not be statistically significant.
The analysis of each coefficient is important to assess the impact of the three independent variables in the dependent variable, and therefore Table 12 presents the overall significance of the regression model. Observing the table, the value of the significance (p-value) for the temperature and the humidity is high. On the other, the t-value is extremely high and the p-value low enough for the number-of-vehicles variable. As a result, we conclude that only the number of vehicles is linearly correlated with the dependent variable PM10.

5. Conclusions

The concentration levels of air pollutants such as PM10 and PM2.5 are constantly increasing in urban areas, and recent studies indicate negative impacts on our health. This paper sought to investigate the relationship and, in particular, the correlation and linearity between the concentration levels of PM10, traffic flow, temperature and humidity. The measurements were obtained from several IoT devices, placed in stations in Thessaloniki, Greece. The city of Thessaloniki was selected because its infrastructure is similar to that of most European cities. The period of acquisition corresponded to the period before, during, and after the first wave of COVID-19 lockdowns. Based on the plots of the preliminary descriptive analysis of the measurements of PM levels for all stations, it is indicated that:
  • Surrounding conditions affect the concentration levels of PM, since Station 2—which is located in one of the busiest streets—indicated higher levels.
  • PM2.5 and PM10 variables are highly correlated.
  • There is a positive influence of traffic flow on the concentration of PM, since the period of the COVID-19 pandemic affected its levels.
The obtained measurements were further processed prior to further statistical analysis. More specifically, the outliers and noise were removed from the data. The inference analytics of the measurements were described, and the correlation was evaluated by employing two null hypotheses. One suggested that the traffic flow does not impact the concentration of PM10 in the atmosphere, while the other hinted that the factors regarding the stations’ temperature and humidity do not influence the air pollutants’ concentration levels. The statistical analysis of the measurements of PM10 for Station 2 and Station 4 indicated that:
  • PM10 and vehicle flow are strongly correlated variables.
  • Temperature and humidity are weakly correlated with PM10 compared to the correlation mentioned above.
Regarding future directions of research, other preprocessing techniques could be applied to remove any outliers and noise present in the obtained measurements. Additionally, the data from more stations should be utilized to measure the air pollutant concentrations and average the results for an entire city or region. The data should be collected using a network of sensors located throughout the area. It is important to measure other air pollutants, such as O3, NO2, CO, and SO2. Towards the direction of making the Earth more sustainable, additional environmental factors such as atmospheric dispersion conditions, and especially wind speed, the presence of temperature inversion, and the mixing layer height, should be measured to investigate their impact on the concentration levels of various air pollutants.

Author Contributions

Conceptualization, C.S. (Christos Spandonidis); methodology, C.S. (Christos Spandonidis) and D.P.; software, C.S. (Christos Spandonidis); validation, C.S. (Christos Spandonidis), C.S. (Christina Saravanos) and D.P.; formal analysis, C.S. (Christos Spandonidis); investigation, D.P. and C.S. (Christina Saravanos); resources, C.S. (Christos Spandonidis); data curation, D.P. and C.S. (Christos Spandonidis); writing—original draft preparation, C.S. (Christos Spandonidis); writing—review and editing, C.S. (Christina Saravanos) and D.P.; visualization, C.S. (Christina Saravanos) and C.S. (Christos Spandonidis); supervision, C.S. (Christos Spandonidis); project administration, C.S. (Christos Spandonidis). All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data unavailable—part of a private company’s property.

Acknowledgments

This research has been co-financed by the European Union and Greek national funds through the Operational Program Competitiveness, Entrepreneurship and Innovation, under the call RESEARCH—CREATE—INNOVATE (project code: T1EDK-05515).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Guterres, A. The Sustainable Development Goals Report 2020; United Nations Publication Issued by the Department of Economic and Social Affairs: New York, NY, USA, 2020; pp. 1–64. [Google Scholar] [CrossRef]
  2. Künzli, N.; Kaiser, R.; Medina, S.; Studnicka, M.; Chanel, O.; Filliger, P.; Herry, M.; Horak, F., Jr.; Puybonnieux-Texier, V.; Quénel, P.; et al. Public-health impact of outdoor and traffic-related air pollution: A European assessment. Lancet 2000, 356, 795–801. [Google Scholar] [CrossRef] [PubMed]
  3. Bernstein, J.A.; Alexis, N.; Barnes, C.; Bernstein, I.L.; Bernstein, J.A.; Nel, A.; Peden, D.; Diaz-Sanchez, D.; Tarlo, S.M.; Williams, P.B. Health effects of air pollution. J. Allergy Clin. Immunol. 2004, 114, 1116–1123. [Google Scholar] [CrossRef] [PubMed]
  4. Kampa, M.; Castanas, E. Human health effects of air pollution. Environ. Pollut. 2008, 151, 362–367. [Google Scholar] [CrossRef] [PubMed]
  5. Van Der Zee, S.C.; Hoek, G.; Harssema, H.; Brunekreef, B. Characterization of Particulate Air Pollution in Urban and Non-Urban Areas in The Netherlands. Atmos. Environ. 1998, 32, 3717–3729. [Google Scholar] [CrossRef]
  6. Yang, J.; Shi, B.; Shi, Y.; Marvin, S.; Zheng, Y.; Xia, G. Air pollution dispersal in high density urban areas: Research on the triadic relation of wind, air pollution, and urban form. Sustain. Cities Soc. 2020, 54, 101941. [Google Scholar] [CrossRef]
  7. Zhang, M.; Song, Y.; Cai, X. A health-based assessment of particulate air pollution in urban areas of Beijing in 2000–2004. Sci. Total Environ. 2007, 376, 100–108. [Google Scholar] [CrossRef]
  8. Han, X.; Naeher, L.P. A review of traffic-related air pollution exposure assessment studies in the developing world. Environ. Int. 2006, 32, 106–120. [Google Scholar] [CrossRef]
  9. Hudda, N.; Simon, M.C.; Patton, A.P.; Durant, J.L. Reductions in traffic-related black carbon and ultrafine particle number concentrations in an urban neighborhood during the COVID-19 pandemic. Sci. Total Environ. 2020, 742, 140931. [Google Scholar] [CrossRef]
  10. Lipsitt, J.; Chan-Golston, A.M.; Liu, J.; Su, J.; Zhu, Y.; Jerrett, M. Spatial analysis of COVID-19 and traffic-related air pollution in Los Angeles. Environ. Int. 2021, 153, 106531. [Google Scholar] [CrossRef]
  11. Nakada, L.Y.K.; Urban, R.C. COVID-19 pandemic: Impacts on the air quality during the partial lockdown in São Paulo state, Brazil. Sci. Total Environ. 2020, 730, 139087. [Google Scholar] [CrossRef]
  12. Rossi, J.; Bianchini, A.; Guarnieri, P. Circular economy model enhanced by intelligent assets from industry 4.0: The proposition of an innovative tool to analyze case studies. Sustainability 2020, 12, 7147. [Google Scholar] [CrossRef]
  13. Salma, I.; Vörösmarty, M.; Gyöngyösi, A.Z.; Thén, W.; Weidinger, T. What can we learn about urban air quality with regard to the first outbreak of the COVID-19 pandemic? A case study from central Europe. Atmos. Chem. Phys. 2020, 20, 15725–15742. [Google Scholar] [CrossRef]
  14. Xiang, J.; Austin, E.; Gould, T.; Larson, T.; Shirai, J.; Liu, Y.; Marshall, J.; Seto, E. Impacts of the COVID-19 responses on traffic-related air pollution in a Northwestern US city. Sci. Total Environ. 2020, 747, 141325. [Google Scholar] [CrossRef] [PubMed]
  15. Rojas, J.P.; Urdanivia, F.R.; Garay, R.A.; García, A.J.; Enciso, C.; Medina, E.A.; Toro, R.A.; Manzano, C.; Leiva-Guzmán, M.A. Effects of COVID-19 pandemic control measures on air pollution in Lima metropolitan area, Peru in South America. Air Qual. Atmos. Health 2021, 14, 925–933. [Google Scholar] [CrossRef] [PubMed]
  16. Sifakis, N.; Aryblia, M.; Daras, T.; Tournaki, S.; Tsoutsos, T. The impact of COVID-19 pandemic in Mediterranean urban air pollution and mobility. Energy Sources Part A Recovery Util. Environ. Eff. 2021, 1895373. [Google Scholar] [CrossRef]
  17. Rossi, R.; Ceccato, R.; Gastaldi, M. Effect of road traffic on air pollution. Experimental evidence from COVID-19 lockdown. Sustainability 2020, 12, 8984. [Google Scholar] [CrossRef]
  18. Bontempi, E.; Carnevale, C.; Cornelio, A.; Volta, M.; Zanoletti, A. Analysis of the lockdown effects due to the COVID-19 on air pollution in Brescia (Lombardy). Environ. Res. 2022, 212, 113193. [Google Scholar] [CrossRef]
  19. Berman, J.D.; Ebisu, K. Changes in U.S. air pollution during the COVID-19 pandemic. Sci. Total Environ. 2020, 739, 139864. [Google Scholar] [CrossRef]
  20. Ogasawara, E.; Martinez, L.C.; De Oliveira, D.; Zimbrão, G.; Pappa, G.L.; Mattoso, M. Adaptive Normalization: A novel data normalization approach for non-stationary time series. In Proceedings of the 2010 International Joint Conference on Neural Networks, Barcelona, Spain, 18–23 July 2010. [Google Scholar] [CrossRef]
  21. Theodoropoulos, P.; Spandonidis, C.C.; Themelis, N.; Giordamlis, C.; Fassois, S. Evaluation of different deep-learning models for the prediction of a ship’s propulsion power. J. Mar. Sci. Eng. 2021, 9, 116. [Google Scholar] [CrossRef]
  22. Cichowicz, R.; Dobrzański, M. Impact of building types and CHP plants on air quality (2019–2021) in central-eastern European monocentric agglomeration. Sci. Total Environ. 2023, 878, 163126. [Google Scholar] [CrossRef]
  23. Liu, X.; Jayaratne, R.; Thai, P.; Kuhn, T.; Zing, I.; Christensen, B.; Lamont, R.; Dunbabin, M.; Zhu, S.; Gao, J.; et al. Low-cost sensors as an alternative for long-term air quality monitoring. Environ. Res. 2020, 185, 109438. [Google Scholar] [CrossRef] [PubMed]
  24. Kim, H.; Tae, S.; Zheng, P.; Kang, G.; Lee, H. Development of IoT-based particulate matter monitoring system for construction sites. Int. J. Environ. Res. Public Health 2021, 18, 11510. [Google Scholar] [CrossRef] [PubMed]
  25. Feinberg, S.; Williams, R.; Hagler, G.S.; Rickard, J.; Brown, R.; Garver, D.; Harshfield, G.; Stauffer, P.; Mattson, E.; Judge, R.; et al. Long-term evaluation of air sensor technology under ambient conditions in Denver, Colorado. Atmos. Meas. Tech. 2018, 11, 4605–4615. [Google Scholar] [CrossRef] [PubMed]
  26. Liu, H.Y.; Schneider, P.; Haugen, R.; Vogt, M. Performance assessment of a low-cost PM2.5 sensor for a near four-month period in Oslo, Norway. Atmosphere 2019, 10, 41. [Google Scholar] [CrossRef]
  27. Svetunkov, I.; Petropoulos, F. Old dog, new tricks: A modelling view of simple moving averages. Int. J. Prod. Res. 2018, 56, 6034–6047. [Google Scholar] [CrossRef]
  28. Spandonidis, C.; Tsantilas, S.; Giannopoulos, F.; Giordamlis, C.; Zyrichidou, I.; Syropoulou, P. Design and Development of a New Cost-Effective Internet of Things Sensor Platform for Air Quality Measurements. J. Eng. Sci. Technol. Rev. 2020, 13, 81–91. [Google Scholar] [CrossRef]
  29. Spandonidis, C.; Tsantilas, S.; Sedikos, E.; Galiatsatos, N.; Giannopoulos, F.; Papadopoulos, P.; Demagos, N.; Reppas, D.; Giordamlis, C. A compact, modular and low-cost Internet of Things (IoT) platform for air quality monitoring in urban areas. J. Phys. Conf. Ser. 2020, 1710, 012004. [Google Scholar] [CrossRef]
  30. Kim, T.K. T Test as a Statistic and Probability. Korean J. Anesthesiol. 2015, 68, 540–546. [Google Scholar] [CrossRef]
  31. Mishra, P.; Singh, U.; Pandey, C.; Mishra, P.; Pandey, G. Application of student’s t-test, analysis of variance, and covariance. Ann. Card. Anaesth. 2019, 22, 407. [Google Scholar] [CrossRef]
  32. Ross, A.; Willson, V.L. Basic and Advanced Statistical Tests: Writing Results Sections and Creating Tables and Figures; Springer: Berlin/Heidelberg, Germany, 2018. [Google Scholar]
  33. Zhou, H.; Deng, Z.; Xia, Y.; Fu, M. A new sampling method in particle filter based on Pearson correlation coefficient. Neurocomputing 2016, 216, 208–215. [Google Scholar] [CrossRef]
  34. Aggarwal, R.; Ranganathan, P. Common pitfalls in statistical analysis: Linear regression analysis. Perspect. Clin. Res. 2017, 8, 100–102. [Google Scholar] [CrossRef] [PubMed]
  35. Maulud, D.; Abdulazeez, A.M. A Review on Linear Regression Comprehensive in Machine Learning. J. Appl. Sci. Technol. Trends 2020, 1, 140–147. [Google Scholar] [CrossRef]
Figure 1. The distribution of the stations in the city of Thessaloniki, with scale and spatial orientation.
Figure 1. The distribution of the stations in the city of Thessaloniki, with scale and spatial orientation.
Sustainability 15 08233 g001
Figure 2. Data Analysis Scheme.
Figure 2. Data Analysis Scheme.
Sustainability 15 08233 g002
Figure 3. The PM2.5 measurements of the four stations.
Figure 3. The PM2.5 measurements of the four stations.
Sustainability 15 08233 g003
Figure 4. The PM2.5 and PM10 measurements of the second station.
Figure 4. The PM2.5 and PM10 measurements of the second station.
Sustainability 15 08233 g004
Figure 5. The raw time series of PM10 at the second station.
Figure 5. The raw time series of PM10 at the second station.
Sustainability 15 08233 g005
Figure 6. The preprocessed time series of PM10 at the second station.
Figure 6. The preprocessed time series of PM10 at the second station.
Sustainability 15 08233 g006
Figure 7. The correlation between the measurements of PM2.5 and PM10 air pollutants.
Figure 7. The correlation between the measurements of PM2.5 and PM10 air pollutants.
Sustainability 15 08233 g007
Figure 8. The plots for Station 4.
Figure 8. The plots for Station 4.
Sustainability 15 08233 g008
Table 1. The technical specifications of the second version of the Sympnia Smart Collector [28].
Table 1. The technical specifications of the second version of the Sympnia Smart Collector [28].
A/ATechnical Specifications
1Processor Dual Core Xtensa ® 32-bit LX6/160 Mhz to 240 Mhz
2Memory (RAM) 512 ΚΒ
3Primary Wireless Communication WiFi 802.11 b/g/n
4Primary Wireless Communication BLE
5Sensor communication 3× UART, 2× I2C
6Power 5V DC (micro-USB)/battery
7Max Current 1.5 A
Table 2. The technical specifications of the sensors [28].
Table 2. The technical specifications of the sensors [28].
No.
Type
Sensor Specifications
Measurement/Unit
1Honeywell HPM 32322550 (PM2.5 and PM10: 0–1000 μg/m3, 15% accuracy)
2Euro Gas 4-NO-250 NO (NO: 0–250 ppm, 15% accuracy)
3SPEC DGS-CO 968-042 (CO: 0–1000 ppm, 15% accuracy) temperature (°C), relative humidity
4SPEC DGS-NO2 968-043 CO2 (NO2: 0–5 ppm, 15% accuracy), temperature (°C), relative humidity
5SPEC DGS-O3 968-042 O3 (O3: 0–5 ppm, 15% accuracy), temperature (°C), relative humidity
Table 3. Descriptive statistical analysis of PM10 air pollutants.
Table 3. Descriptive statistical analysis of PM10 air pollutants.
GroupSamplesMeanStandard DeviationStandard ErrorCoefficient of Variation
141050.0257.1170.3520.142
253926.42417.5920.7580.666
Table 4. Results obtained after the Independent T-test.
Table 4. Results obtained after the Independent T-test.
tdfp
PM1025.810947<0.001
Table 5. Results obtained after calculating the Pearson Correlation for Station 2.
Table 5. Results obtained after calculating the Pearson Correlation for Station 2.
Variable PM10Vehicles
PM10Pears. Coef.-------
p-value-------
Vehicles Pears. Coef.1.000-------
p-value<0.001-------
Table 6. Regression statistics for the measurements of Station 2.
Table 6. Regression statistics for the measurements of Station 2.
ModelRR2Adjusted R2 RMSE
H00.0000.0000.00020.060
H11.0001.0001.0001.043 × 10−12
Table 7. ANOVA for the measurements of Station 2.
Table 7. ANOVA for the measurements of Station 2.
Model Sum of Squaresdf Mean SquareFp
H1Regression1.651 × 10611.651 × 1061.519 × 1030<0.001
Residual4.458 × 10−2141011.087 × 10−24
Total1.651 × 1064102
Table 8. The overall significance of the regression model for the measurements of Station 2.
Table 8. The overall significance of the regression model for the measurements of Station 2.
Model UnstandardizedStandard
Error
Standardizedtp
H0(Intercept)38.1530.313 121.831<0.001
H1(Intercept) −6.389 × 10−143.498 × 10−14 −18.267<0.001
Vehicles0.7075.733 × 10−161.0001.232 × 1015<0.001
Table 9. Results obtained after calculating the Pearson Correlation for Station 4.
Table 9. Results obtained after calculating the Pearson Correlation for Station 4.
Variable PM10Vehicles TemperatureHumidity
PM10Pears. Coef.-------
p-value-------
Vehicles Pears. Coef.1.000------
p-value<0.001------
TemperaturePears. Coef.0.256 ------
p-value<0.001 ------
HumidityPears. Coef.−0.215 −0.744------
p-value<0.001 <0.001------
Table 10. Regression statistics for the measurements of Station 4.
Table 10. Regression statistics for the measurements of Station 4.
ModelRR2Adjusted R2 RMSE
H00.0000.0000.00011.481
H11.0001.0001.0001.704 × 10−13
Table 11. ANOVA for the measurements of Station 4.
Table 11. ANOVA for the measurements of Station 4.
Model Sum of Squaresdf Mean SquareFp
H1Regression199,685.462366,561.8212.294 × 1030<0.001
Residual4.388 × 10−2315122.902 × 10−26
Total199,685.4621515
Table 12. The overall significance of the regression model for the measurements of Station 4.
Table 12. The overall significance of the regression model for the measurements of Station 4.
Model UnstandardizedStandard
Error
Standardizedtp
H0(Intercept)19.1370.295 64.902<0.001
H1(Intercept) 1.869 × 10−133.686 × 10−14 5.070<0.001
Number of
Vehicles
0.4591.816 × 10−161.0002.529 × 1015<0.001
Temperature−4.367 × 10−168.730 × 10−16−2.929 × 10−16−0.5000.617
Humidity−2.733 × 10−164.343 × 10−16−3.601 × 10−16−0.6290.529
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Spandonidis, C.; Paraskevopoulos, D.; Saravanos, C. Neighborhood-Level Particle Pollution Assessment during the COVID-19 Pandemic via a Novel IoT Solution. Sustainability 2023, 15, 8233. https://doi.org/10.3390/su15108233

AMA Style

Spandonidis C, Paraskevopoulos D, Saravanos C. Neighborhood-Level Particle Pollution Assessment during the COVID-19 Pandemic via a Novel IoT Solution. Sustainability. 2023; 15(10):8233. https://doi.org/10.3390/su15108233

Chicago/Turabian Style

Spandonidis, Christos, Dimitrios Paraskevopoulos, and Christina Saravanos. 2023. "Neighborhood-Level Particle Pollution Assessment during the COVID-19 Pandemic via a Novel IoT Solution" Sustainability 15, no. 10: 8233. https://doi.org/10.3390/su15108233

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop