Next Article in Journal
Reservoir Neural Network Computing for Time Series Forecasting in Aerospace: Potential Applications to Predictive Maintenance
Previous Article in Journal
Evaluation of the University of Lagos Waste Generation Trend
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Modelling the Daily Concentration of Airborne Particles Using 1D Convolutional Neural Networks †

1
Faculty of Electrical Engineering, Computer Science and Information Technology Osijek, Josip Juraj Strossmayer University of Osijek, 31000 Osijek, Croatia
2
Centre for Bioanthropology, Institute for Anthropological Research, 10000 Zagreb, Croatia
*
Author to whom correspondence should be addressed.
Presented at the 10th International Conference on Time Series and Forecasting, Gran Canaria, Spain, 15–17 July 2024.
Eng. Proc. 2024, 68(1), 16; https://doi.org/10.3390/engproc2024068016
Published: 4 July 2024
(This article belongs to the Proceedings of The 10th International Conference on Time Series and Forecasting)

Abstract

:
This paper focuses on improving the prediction of the daily concentration of the pollutants, PM10 and nitrogen oxides (NO, NO2) in the air at urban monitoring sites using 1D convolutional neural networks (CNN). The results show that the 1D CNN model outperforms the other machine learning models (LSTM and Random Forest) in terms of the coefficients of determination and absolute errors.

1. Introduction

Air pollution is a major environmental problem with significant impacts on human health and the planet’s ecosystem. Air pollutants can be of natural or anthropogenic origin. One of the most commonly assessed air pollutants is particulate matter (PM), which can vary in size and composition from coarse to fine and from organic to inorganic constituents. Of the commonly measured airborne particles that pollute the air, PM2.5 (diameter below 2.5 µm fine airborne particles) and PM10 (diameter below 10 µm, coarse airborne particles) are smaller and thus more worrisome for human health [1]. PM2.5 particles are very small and can penetrate deep into the lungs and cause respiratory problems and cardiovascular disease, while PM10 particles are slightly larger and cause irritation to the eyes, nose and throat. In addition to the suspended particles mentioned above, the other parameters of air pollution include nitrogen oxides (NO, NO2), sulphur dioxide (SO2), carbon monoxide (CO), and ozone (O3). All of these particles are classified as Group 1 carcinogens by the World Health Organisation (WHO) and can have a significant impact on human health [2], the environment, and climate change.
To reduce the negative effects of air pollution and thus protect the health of affected people and the environment, effective measures must be taken to control air quality. Advances in machine learning (ML), particularly in convolutional neural networks (hereafter, CNNs), have made it possible to model and predict the concentration of particles in the air. Such tools can help in (a) understanding the contributors and (b) understanding trends and seasonalities. Both can help in their control and countermeasures. In this work, a 1D CNN model is developed to predict the concentration of PM10, NO, and NO2 in the air. The data come from public sources of the Austrian government [3] and include measurements of the above variables in the period from 1 January 2014 to 17 March 2022.
Three ML models were created and trained on the data: a 1D CNN model, a Long-Short Term Memory (LSTM) model, and a Random Forest (RF) model. The coefficient of determination (R2) and mean absolute error (MAE) were used to compare and evaluate the models. The preliminary results indicate that the 1D CNN model outperforms the other machine learning models (LSTM and RF) in terms of the R2 and MAE.

2. Related Work

The use of ML methods in air quality prediction is an active research field. Examples of such methods include Least Absolute Shrinkage and Selection Operator (LASSO) Regression [4,5], Support Vector Machines (SVM) [6,7,8], and Random Forest (RF) [9].
Six different ML models were compared to predict the PM10 particle concentration in Ankara, Turkey, namely: LASSO, SVM, RF, kNN (k-Nearest Neighbours), xGBoost (eXtreme Gradient Boosting), and ANN (Artificial Neural Network) [10]. The spatial distribution of PM10 was analysed taking into account the land use characteristics of the region, namely the traffic, industrial density, population density, natural gas use, and changes in income. Data for the years 2009–2017 of six stations in Ankara, Turkey, were used as input, and the PM10 concentrations of the seventh station for the year 2018 were predicted. This procedure was repeated for each of the stations: the data of each station were predicted for the year 2018, by using models trained using 2009–2017 data from the other six stations. The best results were obtained using ANN.
In the investigation of the change in air pollutant concentrations during the COVID-19 lockdown in Graz, Austria, machine learning, in particular RF, was used to analyse various predictions and actual pollutant levels [11]. The models showed good generalization, with the predicted PM10 and NO2 levels exceeding those measured during the closure, while the O3 was underestimated, which was related to lower NOx emissions due to lower traffic volumes. Other examples of PM10 and NO2 modelling and prediction using machine learning models can be found in [12,13].
Deep learning (DL) (a subfield of machine learning) models are also being used in air quality modelling and prediction. Deep neural networks, i.e., neural networks (NN) with multiple hidden layers, are able extract intrinsic features and patterns in large datasets. An overview of such applications can be found in [14]. For example, the authors in [15] used a convolutional neural network to estimate PM2.5 particle concentrations using 2011 data from the United States of America. Data on aerosol optical depth (AOD), meteorological fields, and land use were integrated into this model. The model was tested and evaluated using overall, spatially separated, and temporally separated cross-validation methods to ensure the reliability of the results. It was found that the proposed CNN-based model outperformed all the benchmark models in estimating the daily 24 h averaged ground-level PM2.5 concentration. In addition, a novel metric for the importance of predictions was developed based on the Layerwise Relevance Propagation (LRP) method. The authors noted that the estimation accuracy of the model is boosted by exploiting the spatial correlation of nearby predictors. Other examples of DL models used in air pollution prediction include an LSTM [16,17], a combination of LSTM and convolutional neural networks (CNN) [18], and gated recurrent units (GRU) [19].

3. Materials and Methods

3.1. Study Area and Data

Long term environmental, pollution, and weather data from 5 measuring stations from the Austrian city of Graz namely Sud, Nord, West, Ost, and Don Bosco were analysed. The measurements, taken hourly, covered the time period from January 2014 to May 2020. Graz is a medium-sized European city, which has much in common with respect to the size and layout with many other European urban areas. The Ost and Don Bosco measurement sites are situated on arterial roads with high traffic volumes, especially during morning and evening rush hours. The most polluted measurement site of Graz is Don Bosco, which struggles every year to comply with the NO2 and PM10 regulatory limits of the EU-Council directive 96/62/EC. This is primarily because of traffic-related emissions but also because of the emissions from a nearby steel and iron mill. Although Graz East is located at a heavily frequented commuter-arterial road, the mean pollutant concentrations are lower than at Don Bosco. A more detailed description of the monitoring sites, pictures (Figure 1), and a historical overview, as well as an overview of the dataset can be found can be found in [11,20]. NO, NO2, and PM10 are measured at all stations, with O3 being measured at stations Nord and Sud as well. The processed data used in this paper consisted of 71,377 measurements, with a total of 64 input time variables and 17 output variables of the particle concentrations in the air. However, the focus of this paper is on the aforementioned 3 output variables: NO, NO2, and PM10. For the training of the ML models, the first 80% of the data was used as a training set (measurements from 1 January 2014 to 7 July 2020), and the remaining 20% of the data (from 8 July 2020 to 17 March 2022) was used as the testing set.

3.2. Machine Learning Models

As mentioned earlier, three ML models were created and trained: a 1D CNN model, an LSTM model, and an RF model. One-dimensional CNNs are a type of NN inspired by the well-known 2D CNNs used in image recognition. However, unlike classical CNNs, which process 2D grids of data (like pixels in an image), they are suitable for processing one-dimensional sequences, making them ideal for tasks involving time-series data or audio signals [21]. Similar to the classic 2D CNNs, 1D CNNs consist of convolutional layers, pooling layers, and fully concatenated layers. The convolutional layers extract features from the data. These layers slide a filter (kernel) along the sequence and capture patterns and local dependencies. The pooling layer, such as the max-pooling layer, reduces the spatial dimensions by selecting the maximum value in each region, while the fully linked layers process the extracted features for classification or regression. Since they deal with simpler data, 1D CNNs are generally faster and require less computing power than 2D CNNs. However, similar to 2D CNNs, hyperparameter tuning is required to achieve optimal performance.
LSTMs are specialized recurrent neural networks (RNNs) designed to deal with long-term dependencies in sequential data and are widely used in various applications, specifically in tasks such as language modelling, machine translation, sentiment analysis, and time series prediction [16,17,18]. Conventional RNNs have difficulties with long-term dependencies. Information far back in a sequence can fade (vanishing gradient) or explode, making it difficult to learn long-term relationships. LSTMs introduce a clever gating mechanism that controls the flow of information within the network. This allows them to remember for longer; i.e., LSTMs have a cell state that can store relevant information for longer periods of time. LSTMs also have selective learning, i.e., gates regulate which information is added to the cell, which is retained, and which is forgotten. LSTMs consist of memory units (or LSTM cells) that enable the flow of information over multiple time steps. The most important components of an LSTM cell are
  • Input Gate—controls the flow of new information into the cell;
  • Forget Gate—determines which information from the previous state of the cell should be retained or forgotten;
  • Output Gate—controls the output of the cell;
  • Cell State—represents the memory content;
  • Hidden state—represents the output of the cell.
LSTMs learn to update and maintain the cell state over time, allowing them to capture long-range dependencies.
RF is a versatile ML algorithm that uses the collective decision making of multiple decision trees to create a more accurate and stable prediction model. It generates an ensemble of decision trees, which are inherently simple models that, when used alone, are prone to overfitting and can be sensitive to noise in the data. For classification problems, the mode of the classes of the individual trees is chosen as the output, while for regression problems, the mean prediction of the individual trees is used as the output. The algorithm introduces randomness into the tree building process, which ensures that the high variance of the individual trees is balanced in the final model. By training each tree on a different subset of the data, the trees are less correlated and therefore, on average, more robust to overfitting. Due to this ensemble approach, RF models are also able to handle complex data sets with high dimensionality. Although an RF model is not as easy to interpret as a single decision tree, the use of feature importance scores can provide insight into which variables have the greatest influence on the model’s predictions. In this paper, the RF was chosen due to its good generalization shown in previous research [22,23,24].
The coefficient of determination (R2) and mean absolute error (MAE) were used to compare and evaluate the models in this work.

4. Results and Discussion

Based on the input data, all the models were trained to predict the NO, NO2, and PM10 values at all five measuring stations; hence, each model can be considered to have 64 input or predictive variables and 15 output or target variables. Hyperparameter tuning was performed for all models, and the summary of the results of the best models on the test dataset are shown in Table 1 below.
In Table 1, the maximum R2 values and minimum MAE values of each target value across all model types are shown in bold. When analysing the results, it can be seen that the 1D-CNN model performed the best among the three models tested when considering both metrics, with the exception of Ost-PM10, where the 1D-CNN model was the best model when considering the R2, and the LSTM model was the best model when considering the MAE. The time series plot of the overall best model (Nord-NO2) considering both metrics is shown in Figure 2 below.

5. Conclusions

Over a period from January 2014 to May 2020, hourly measurements of long-term environmental, pollution, and weather data from five measuring stations in the Austrian city of Graz, namely Sud, Nord, West, Ost, and Don Bosco, were analysed. Three machine learning models, namely 1D CNN, LSTM, and RF models, were used to predict NO, NO2, and PM10 levels. The models were compared using R2 and MAE. The 1D CNN model showed the best overall results followed by the LSTM model. For future research, it would be very useful to further investigate the architecture as well as perform extra hyperparameter optimization of the 1D CNN and LSTM models to achieve higher accuracy and reliability.

Author Contributions

Conceptualization, M.L. and E.K.N.; methodology, M.L. and E.K.N.; software, I.G.; validation, I.G., M.L. and E.K.N.; formal analysis, I.G. and M.L.; investigation, I.G. and M.L.; data curation, I.G. and M.L.; writing—original draft preparation, I.G.; writing—review and editing, M.L. and E.K.N.; visualization, E.KN.; supervision, M.L. and E.K.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original data presented in the study are openly available in https://doi.org/10.5281/zenodo.3982670 (accessed 12 February 2024).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Chow, J.C.; Watson, J.G.; Lu, Z.; Lowenthal, D.H.; Frazier, C.A.; Solomon, P.A.; Thuillier, R.H.; Magliano, K. Descriptive Analysis of PM2.5 and PM10 at Regionally Representative Locations during SJVAQS/AUSPEX. Atmospheric Environ. 1996, 30, 2079–2112. [Google Scholar] [CrossRef]
  2. Zhao, S.; Yu, Y.; Yin, D.; He, J.; Liu, N.; Qu, J.; Xiao, J. Annual and Diurnal Variations of Gaseous and Particulate Pollutants in 31 Provincial Capital Cities Based on In Situ Air Quality Monitoring Data from China National Environmental Monitoring Center. Environ. Int. 2016, 86, 92–106. [Google Scholar] [CrossRef] [PubMed]
  3. Austrian Government Data. Available online: https://www.umwelt.steiermark.at/cms/ziel/2060750/DE/ (accessed on 12 July 2023).
  4. Chu, H.; Wei, J.; Wu, W. Streamflow prediction using LASSO-FCM-DBN approach based on hydro-meteorological condition classification. J. Hydrol. 2019, 580, 124253. [Google Scholar] [CrossRef]
  5. Xu, G.; Ren, X.; Xiong, K.; Li, L.; Bi, X.; Wu, Q. Analysis of the driving factors of PM2.5 concentration in the air: A case study of the Yangtze River Delta, China. Ecol. Indic. 2020, 110, 105889. [Google Scholar] [CrossRef]
  6. García Nieto, P.J.; Sánchez Lasheras, F.; García-Gonzalo, E.; de Cos Juez, F.J. PM10 concentration forecasting in the metropolitan area of Oviedo (Northern Spain) using models based on SVM, MLP, VARMA and ARIMA: A case study. Sci. Total Environ. 2018, 621, 753–761. [Google Scholar] [CrossRef] [PubMed]
  7. Fan, W.; Si, F.; Ren, S.; Yu, C.; Cui, Y.; Wang, P. Integration of continuous restricted Boltzmann machine and SVR in NOx emissions prediction of a tangential firing boiler. Chemom. Intell. Lab. Syst. 2019, 195, 103870. [Google Scholar] [CrossRef]
  8. Murillo-Escobar, J.; Sepulveda-Suescun, J.; Correa, M.; Orrego-Metaute, D. Forecasting concentrations of air pollutants using support vector regression improved with particle swarm optimization: Case study in Aburrá Valley, Colombia. Urban Clim. 2019, 29, 100473. [Google Scholar] [CrossRef]
  9. Wang, Y.; Du, Y.; Wang, J.; Li, T. Calibration of a low-cost PM2.5 monitor using a random forest model. Environ. Int. 2019, 133, 105161. [Google Scholar] [CrossRef] [PubMed]
  10. Bozdağ, A.; Dokuz, Y.; Gökçek, B. Spatial prediction of PM10 concentration using machine learning algorithms in Ankara, Turkey. Environ. Pollut. 2020, 263 Pt A, 114635. [Google Scholar] [CrossRef]
  11. Lovrić, M.; Pavlović, K.; Vuković, M.; Grange, S.K.; Haberl, M.; Kern, R. Understanding the True Effects of the COVID-19 Lockdown on Air Pollution by Means of Machine Learning. Environ. Pollut. 2021, 274, 115900. [Google Scholar] [CrossRef] [PubMed]
  12. Choubin, B.; Abdolshahnejad, M.; Moradi, E.; Querol, X.; Mosavi, A.; Shamshirband, S.; Ghamisi, P. Spatial hazard assessment of the PM10 using machine learning models in Barcelona, Spain. Sci. Total. Environ. 2020, 701, 134474. [Google Scholar] [CrossRef] [PubMed]
  13. Stafoggia, M.; Bellander, T.; Bucci, S.; Davoli, M.; de Hoogh, K.; Donato, F.D.; Gariazzo, C.; Lyapustin, A.; Michelozzi, P.; Renzi, M.; et al. Estimation of daily PM10 and PM2.5 concentrations in Italy, 2013–2015, using a spatiotemporal land-use random-forest model. Environ. Int. 2019, 124, 170–179. [Google Scholar] [CrossRef] [PubMed]
  14. Liao, Q.; Zhu, M.; Wu, L.; Pan, X.; Tang, X.; Wang, Z. Deep Learning for Air Quality Forecasts: A Review. Curr. Pollut. Rep. 2020, 6, 399–409. [Google Scholar] [CrossRef]
  15. Park, Y.; Kwon, B.; Heo, J.; Hu, X.; Liu, Y.; Moon, T. Estimating PM2.5 Concentration of the Conterminous United States via Interpretable Convolutional Neural Networks. Environ. Pollut. 2020, 256, 113395. [Google Scholar] [CrossRef] [PubMed]
  16. Drewil, G.I.; Al-Bahadili, R.J. Air pollution prediction using LSTM deep learning and metaheuristics algorithms. Meas. Sensors 2022, 24, 100546. [Google Scholar] [CrossRef]
  17. Diqi, M.; Hamz, I.H.; Leon, A.A.; Adinda, N. Enhancing Weather Prediction Using Stacked Long Short-Term Memory Networks. J. Tek. Inform. Dan Sist. Inf. 2023, 10, 519–530. [Google Scholar]
  18. Gilik, A.; Ogrenci, A.S.; Ozmen, A. Air quality prediction using CNN+LSTM-based hybrid deep learning architecture. Environ. Sci. Pollut. Res. 2021, 29, 11920–11938. [Google Scholar] [CrossRef]
  19. Huang, G.; Li, X.; Zhang, B.; Ren, J. PM2.5 Concentration Forecasting at Surface Monitoring Sites Using GRU Neural Network Based on Empirical Mode Decomposition. Sci. Total. Environ. 2021, 768, 144516. [Google Scholar] [CrossRef] [PubMed]
  20. Lovrić, M.; Pavlović, K.; Kern, R.; Grange, S.K.; Vuković, M.; Haberl, M. Air Pollution 01.2014–05.2020 (Including COVID-19 Lockdown) Data from Graz, Austria. Available online: https://doi.org/10.5281/zenodo.3982670 (accessed on 12 July 2023).
  21. Kiranyaz, S.; Avci, O.; Abdeljaber, O.; Ince, T.; Gabbouj, M.; Inman, D.J. 1D convolutional neural networks and applications: A survey. Mech. Syst. Signal Process. 2021, 151, 107398. [Google Scholar] [CrossRef]
  22. Grange, S.K.; Carslaw, D.C.; Lewis, A.C.; Boleti, E.; Hueglin, C. Random forest meteorological normalisation models for Swiss PM10 trend analysis. Atmos. Chem. Phys. 2018, 18, 6223–6239. [Google Scholar] [CrossRef]
  23. Grange, S.K.; Carslaw, D.C. Using meteorological normalisation to detect interventions in air quality time series. Sci. Total. Environ. 2018, 653, 578–588. [Google Scholar] [CrossRef] [PubMed]
  24. Regier, P.; Duggan, M.; Myers-Pigg, A.; Ward, N. Effects of Random Forest Modeling Decisions on Biogeochemical Time Series Predictions. Limnol. Oceanogr. Methods 2023, 21, 40–52. [Google Scholar] [CrossRef]
Figure 1. A city map of Graz with the five measurement sites marked [11].
Figure 1. A city map of Graz with the five measurement sites marked [11].
Engproc 68 00016 g001
Figure 2. Test data NO2 concentration time series plots for Graz Nord. The plots present a 7-day moving average (for better visibility). (a) Actual values and the values predicted by the 1D CNN model; (b) 1D CNN model prediction error.
Figure 2. Test data NO2 concentration time series plots for Graz Nord. The plots present a 7-day moving average (for better visibility). (a) Actual values and the values predicted by the 1D CNN model; (b) 1D CNN model prediction error.
Engproc 68 00016 g002
Table 1. Results of the machine learning models on the test dataset.
Table 1. Results of the machine learning models on the test dataset.
StationsParticlesR2MAE
1D CNNLSTMRF1D CNNLSTMRF
OstNO0.90080.88350.76646.21006.712710.2912
NO20.90640.88030.79053.59894.13275.8821
PM100.81820.80710.73665.51305.39056.5191
WestNO0.91370.90440.76854.43234.68568.0933
NO20.91440.89470.81713.19273.61735.2387
PM100.82220.79400.69764.37164.80006.2122
NordNO0.89130.87160.72343.34973.51625.7668
NO20.91910.89320.81222.69853.30644.7665
PM100.70940.65270.61825.10695.52356.2155
SudNO0.91030.89720.76857.27577.796913.3983
NO20.87370.86200.77414.56804.75306.6125
PM100.87230.85270.75454.35424.88116.5107
Don BoscoeNO0.91590.90370.783210.508411.067317.2618
NO20.90620.87270.77794.01314.87826.7749
PM100.81580.80740.72615.12605.26866.4453
Maximum R2 values and minimum MAE values of each target value across all model types are shown in bold.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gudelj, I.; Lovrić, M.; Nyarko, E.K. Modelling the Daily Concentration of Airborne Particles Using 1D Convolutional Neural Networks. Eng. Proc. 2024, 68, 16. https://doi.org/10.3390/engproc2024068016

AMA Style

Gudelj I, Lovrić M, Nyarko EK. Modelling the Daily Concentration of Airborne Particles Using 1D Convolutional Neural Networks. Engineering Proceedings. 2024; 68(1):16. https://doi.org/10.3390/engproc2024068016

Chicago/Turabian Style

Gudelj, Ivan, Mario Lovrić, and Emmanuel Karlo Nyarko. 2024. "Modelling the Daily Concentration of Airborne Particles Using 1D Convolutional Neural Networks" Engineering Proceedings 68, no. 1: 16. https://doi.org/10.3390/engproc2024068016

Article Metrics

Back to TopTop