Next Article in Journal
Direction-Based Hybrid Strategy Combining Pushing and Hitting for Fast Object Singulation
Next Article in Special Issue
Gradient Boosting Machine to Assess the Public Protest Impact on Urban Air Quality
Previous Article in Journal
The Breakage of Shape-Anisotropic Particles under Normal Contact with Different Particle Shape Parameters
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluation of Machine Learning Models for Estimating PM2.5 Concentrations across Malaysia

by
Nurul Amalin Fatihah Kamarul Zaman
1,
Kasturi Devi Kanniah
1,2,*,
Dimitris G. Kaskaoutis
3,4,* and
Mohd Talib Latif
5
1
Tropical Map Research Group, Faculty of Built Environment & Surveying, Universiti Teknologi Malaysia, Skudai 81310, Johor, Malaysia
2
Centre for Environmental Sustainability and Water Security (IPASA), Research Institute for Sustainable Environment, Universiti Teknologi Malaysia, Skudai 81310 UTM, Johor, Malaysia
3
Institute for Environmental Research and Sustainable Development, National Observatory of Athens, 15236 Athens, Greece
4
Environmental Chemical Processes Laboratory, Department of Chemistry, University of Crete, 71003 Crete, Greece
5
Department of Earth Sciences and Environment, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
*
Authors to whom correspondence should be addressed.
Appl. Sci. 2021, 11(16), 7326; https://doi.org/10.3390/app11167326
Submission received: 25 June 2021 / Revised: 29 July 2021 / Accepted: 5 August 2021 / Published: 9 August 2021
(This article belongs to the Special Issue Air Quality Prediction Based on Machine Learning Algorithms II)

Abstract

Southeast Asia (SEA) is a hotspot region for atmospheric pollution and haze conditions, due to extensive forest, agricultural and peat fires. This study aims to estimate the PM2.5 concentrations across Malaysia using machine-learning (ML) models like Random Forest (RF) and Support Vector Regression (SVR), based on satellite AOD (aerosol optical depth) observations, ground measured air pollutants (NO2, SO2, CO, O3) and meteorological parameters (air temperature, relative humidity, wind speed and direction). The estimated PM2.5 concentrations for a two-year period (2018–2019) are evaluated against measurements performed at 65 air-quality monitoring stations located at urban, industrial, suburban and rural sites. PM2.5 concentrations varied widely between the stations, with higher values (mean of 24.2 ± 21.6 µg m−3) at urban/industrial stations and lower (mean of 21.3 ± 18.4 µg m−3) at suburban/rural sites. Furthermore, pronounced seasonal variability in PM2.5 is recorded across Malaysia, with highest concentrations during the dry season (June–September). Seven models were developed for PM2.5 predictions, i.e., separately for urban/industrial and suburban/rural sites, for the four dominant seasons (dry, wet and two inter-monsoon), and an overall model, which displayed accuracies in the order of R2 = 0.46–0.76. The validation analysis reveals that the RF model (R2 = 0.53–0.76) exhibits slightly better performance than SVR, except for the overall model. This is the first study conducted in Malaysia for PM2.5 estimations at a national scale combining satellite aerosol retrievals with ground-based pollutants, meteorological factors and ML techniques. The satisfactory prediction of PM2.5 concentrations across Malaysia allows a continuous monitoring of the pollution levels at remote areas with absence of measurement networks.
Keywords: PM2.5; Himawari-8; random forest; support vector regression; air pollution; Malaysia PM2.5; Himawari-8; random forest; support vector regression; air pollution; Malaysia

Share and Cite

MDPI and ACS Style

Zaman, N.A.F.K.; Kanniah, K.D.; Kaskaoutis, D.G.; Latif, M.T. Evaluation of Machine Learning Models for Estimating PM2.5 Concentrations across Malaysia. Appl. Sci. 2021, 11, 7326. https://doi.org/10.3390/app11167326

AMA Style

Zaman NAFK, Kanniah KD, Kaskaoutis DG, Latif MT. Evaluation of Machine Learning Models for Estimating PM2.5 Concentrations across Malaysia. Applied Sciences. 2021; 11(16):7326. https://doi.org/10.3390/app11167326

Chicago/Turabian Style

Zaman, Nurul Amalin Fatihah Kamarul, Kasturi Devi Kanniah, Dimitris G. Kaskaoutis, and Mohd Talib Latif. 2021. "Evaluation of Machine Learning Models for Estimating PM2.5 Concentrations across Malaysia" Applied Sciences 11, no. 16: 7326. https://doi.org/10.3390/app11167326

APA Style

Zaman, N. A. F. K., Kanniah, K. D., Kaskaoutis, D. G., & Latif, M. T. (2021). Evaluation of Machine Learning Models for Estimating PM2.5 Concentrations across Malaysia. Applied Sciences, 11(16), 7326. https://doi.org/10.3390/app11167326

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop