Unveiling the Transparency of Prediction Models for Spatial PM2.5 over Singapore: Comparison of Different Machine Learning Approaches with eXplainable Artificial Intelligence
Abstract
:1. Introduction
- (i)
- Predicting the spatial PM2.5 values over Singapore and validating the outcomes using machine learning models.
- (ii)
- Investigating the fidelity of the model outcomes with XAI.
2. Study Area and Data Description
3. Methodology
3.1. The Machine Learning Models
3.1.1. Random Forest (RF) Regression
3.1.2. Gradient Boosting (GB) Regression
3.1.3. Extreme Gradient Boosting Regression (XGBoost)
3.1.4. Tree-Based Pipeline Optimization Tool (TP) Optimization Algorithm
3.2. Error Metrics
3.3. XAI Methods
4. Results and Discussion
4.1. Performance Comparison with RF and GB Models
4.2. Comparative Analysis Using TP (AutoML) Meta-Heuristic Approach Using Genetic Algorithm
4.3. Global Interpretability and Local Interpretability Using SHAP Model
4.4. Spatio-Temporal Interpretation Using RF and GF Prediction
4.5. Insights, Strengths and Limitations
- The structure of the ML algorithm: Depending on the structure of ML algorithms (RF, GB, and TP), the prediction mechanism and the spatial estimation of the outcome undergo changes. For instance, as an ensemble method, RF combines multiple decision trees to make predictions; GB builds an ensemble of decision trees sequentially, where each tree corrects the errors of the previous one; TP is an optimization algorithm used for hyperparameter tuning and model/feature selections. These predictions can capture complex relationships between the datasets.
- Integrating external factors for prediction: NDVI, temperature, wind speed, and humidity can impact the dispersion, transformation, and accumulation of PM particles in the air. Thus, these factors contribute more to the temporal and spatial dynamics of PM concentration.
- Need for spatial and time series prediction with data-driven analysis: Best ML prediction provides insights regarding how well PM2.5 concentrations across different locations in the study area are predicted spatially. Time series and spatial prediction help to understand the yearly-based monthly patterns of PM2.5 concentration included with meteorological variables’ effects.
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Chae, S.; Shin, J.; Kwon, S.; Lee, S.; Kang, S.; Lee, D. PM10 and PM2.5 Real-Time Prediction Models Using an Interpolated Convolutional Neural Network. Sci. Rep. 2021, 11, 11952. [Google Scholar] [CrossRef] [PubMed]
- Jat, R.; Gurjar, B.R. Contribution of Different Source Sectors and Source Regions of Indo-Gangetic Plain in India to PM2.5 Pollution and Its Short-Term Health Impacts during Peak Polluted Winter. Atmos. Pollut. Res. 2021, 12, 89–100. [Google Scholar] [CrossRef]
- Naghan, D.J.; Neisi, A.; Goudarzi, G.; Dastoorpoor, M.; Fadaei, A.; Angali, K.A. Estimation of the Effects PM2.5, NO2, O3 Pollutants on the Health of Shahrekord Residents Based on AirQ+ Software during (2012–2018). Toxicol. Rep. 2022, 9, 842–847. [Google Scholar] [CrossRef] [PubMed]
- Bai, H.; Shi, Y.; Seong, M.; Gao, W.; Li, Y. Influence of Spatial Resolution on Satellite-Based PM2.5 Estimation: Implications for Health Assessment. Remote Sens. 2022, 14, 2933. [Google Scholar] [CrossRef]
- Balasubramanian, R. Comprehensive Characterization of PM2.5 Aerosols in Singapore. J. Geophys. Res. 2003, 108, 4523. [Google Scholar] [CrossRef]
- Li, Y.; Zhu, Y.; Tan, J.Y.K.; Teo, H.C.; Law, A.; Qu, D.; Luo, W. The Impact of COVID-19 on NO2 and PM2.5 Levels and Their Associations with Human Mobility Patterns in Singapore. Ann. GIS 2022, 28, 515–531. [Google Scholar] [CrossRef]
- Fang, G.; Wu, Y.; Huang, S.; Rau, J. Review of Atmospheric Metallic Elements in Asia during 2000–2004. Atmos. Environ. 2005, 39, 3003–3013. [Google Scholar] [CrossRef]
- Lelieveld, J.; Evans, J.S.; Fnais, M.; Giannadaki, D.; Pozzer, A. The Contribution of Outdoor Air Pollution Sources to Premature Mortality on a Global Scale. Nature 2015, 525, 367–371. [Google Scholar] [CrossRef]
- Burnett, R.T.; Arden Pope, C.; Ezzati, M.; Olives, C.; Lim, S.S.; Mehta, S.; Shin, H.H.; Singh, G.; Hubbell, B.; Brauer, M.; et al. An Integrated Risk Function for Estimating the Global Burden of Disease Attributable to Ambient Fine Particulate Matter Exposure. Environ. Health Perspect. 2014, 122, 397–403. [Google Scholar] [CrossRef]
- Guo, H.; Kota, S.H.; Chen, K.; Sahu, S.K.; Hu, J.; Ying, Q.; Wang, Y.; Zhang, H. Source Contributions and Potential Reductions to Health Effects of Particulate Matter in India. Atmos. Chem. Phys. 2018, 18, 15219–15229. [Google Scholar] [CrossRef]
- Zhu, W.; Wang, M.; Zhang, B. The Effects of Urbanization on PM2.5 Concentrations in China’s Yangtze River Economic Belt: New Evidence from Spatial Econometric Analysis. J. Clean. Prod. 2019, 239, 118065. [Google Scholar] [CrossRef]
- Chen, G.; Li, S.; Knibbs, L.D.; Hamm, N.A.S.; Cao, W.; Li, T.; Guo, J.; Ren, H.; Abramson, M.J.; Guo, Y. A Machine Learning Method to Estimate PM2.5 Concentrations across China with Remote Sensing, Meteorological and Land Use Information. Sci. Total Environ. 2018, 636, 52–60. [Google Scholar] [CrossRef] [PubMed]
- Chen, Y.-C.; Li, D.-C. Selection of Key Features for PM2.5 Prediction Using a Wavelet Model and RBF-LSTM. Appl. Intell. 2021, 51, 2534–2555. [Google Scholar] [CrossRef]
- Ma, Z.; Dey, S.; Christopher, S.; Liu, R.; Bi, J.; Balyan, P.; Liu, Y. A Review of Statistical Methods Used for Developing Large-Scale and Long-Term PM2.5 Models from Satellite Data. Remote Sens. Environ. 2022, 269, 112827. [Google Scholar] [CrossRef]
- Pu, Q.; Yoo, E.H. Ground PM2.5 Prediction Using Imputed MAIAC AOD with Uncertainty Quantification. Environ. Pollut. 2021, 274, 116574. [Google Scholar] [CrossRef] [PubMed]
- Mabasa, B.; Lysko, M.D.; Moloi, S.J. Validating Hourly Satellite Based and Reanalysis Based Global Horizontal Irradiance Datasets over South Africa. Geomatics 2021, 1, 429–449. [Google Scholar] [CrossRef]
- Gupta, P.; Levy, R.C.; Mattoo, S.; Remer, L.A.; Munchak, L.A. A Surface Reflectance Scheme for Retrieving Aerosol Optical Depth over Urban Surfaces in MODIS Dark Target Retrieval Algorithm. Atmos. Meas. Tech. 2016, 9, 3293–3308. [Google Scholar] [CrossRef]
- Sekertekin, A.; Inyurt, S.; Yaprak, S. Pre-Seismic Ionospheric Anomalies and Spatio-Temporal Analyses of MODIS Land Surface Temperature and Aerosols Associated with Sep, 24 2013 Pakistan Earthquake. J. Atmos. Sol.-Terr. Phys. 2020, 200, 105218. [Google Scholar] [CrossRef]
- Xiang, Y.; Ye, Y.; Peng, C.; Teng, M.; Zhou, Z. Seasonal Variations for Combined Effects of Landscape Metrics on Land Surface Temperature (LST) and Aerosol Optical Depth (AOD). Ecol. Indic. 2022, 138, 108810. [Google Scholar] [CrossRef]
- Shukla, K.; Kumar, P.; Mann, G.S.; Khare, M. Mapping Spatial Distribution of Particulate Matter Using Kriging and Inverse Distance Weighting at Supersites of Megacity Delhi. Sustain. Cities Soc. 2020, 54, 101997. [Google Scholar] [CrossRef]
- Feng, X.; Li, Q.; Zhu, Y.; Hou, J.; Jin, L.; Wang, J. Artificial Neural Networks Forecasting of PM2.5 Pollution Using Air Mass Trajectory Based Geographic Model and Wavelet Transformation. Atmos. Environ. 2015, 107, 118–128. [Google Scholar] [CrossRef]
- Beucler, T.; Ebert-Uphoff, I.; Rasp, S.; Pritchard, M.; Gentine, P. Machine Learning for Clouds and Climate. In Clouds and Climate; Cambridge University Press: Cambridge, UK, 2020. [Google Scholar]
- Ghorbanzadeh, O.; Blaschke, T.; Gholamnia, K.; Meena, S.R.; Tiede, D.; Aryal, J. Evaluation of Different Machine Learning Methods and Deep-Learning Convolutional Neural Networks for Landslide Detection. Remote Sens. 2019, 11, 196. [Google Scholar] [CrossRef]
- Ma, J.; Ding, Y.; Gan, V.J.L.; Lin, C.; Wan, Z. Spatiotemporal Prediction of PM2.5 Concentrations at Different Time Granularities Using IDW-BLSTM. IEEE Access 2019, 7, 107897–107907. [Google Scholar] [CrossRef]
- Nazar, M.; Alam, M.M.; Yafi, E.; Su’ud, M.M. A Systematic Review of Human–Computer Interaction and Explainable Artificial Intelligence in Healthcare With Artificial Intelligence Techniques. IEEE Access 2021, 9, 153316–153348. [Google Scholar] [CrossRef]
- Martinez-Seras, A.; Del Ser, J.; Garcia-Bringas, P. Can Post-Hoc Explanations Effectively Detect Out-of-Distribution Samples? In Proceedings of the IEEE International Conference on Fuzzy Systems, Padua, Italy, 18–23 July 2022. [Google Scholar]
- Huang, F.; Shangguan, W.; Li, Q.; Li, L.; Zhang, Y. Beyond Prediction: An Integrated Post–Hoc Approach to Interpret Complex Model in Hydrometeorology. Environ. Model. Softw. 2022, 167, 105762. [Google Scholar] [CrossRef]
- Kakogeorgiou, I.; Karantzalos, K. Evaluating Explainable Artificial Intelligence Methods for Multi-Label Deep Learning Classification Tasks in Remote Sensing. Int. J. Appl. Earth Obs. Geoinf. 2021, 103, 102520. [Google Scholar] [CrossRef]
- Singh, H.; Roy, A.; Setia, R.K.; Pateriya, B. Estimation of Nitrogen Content in Wheat from Proximal Hyperspectral Data Using Machine Learning and Explainable Artificial Intelligence (XAI) Approach. Model. Earth Syst. Environ. 2022, 8, 2505–2511. [Google Scholar] [CrossRef]
- Stadtler, S.; Betancourt, C.; Roscher, R. Explainable Machine Learning Reveals Capabilities, Redundancy, and Limitations of a Geospatial Air Quality Benchmark Dataset. Mach. Learn. Knowl. Extr. 2022, 4, 150–171. [Google Scholar] [CrossRef]
- Betancourt, C.; Stomberg, T.T.; Edrich, A.K.; Patnala, A.; Schultz, M.G.; Roscher, R.; Kowalski, J.; Stadtler, S. Global, High-Resolution Mapping of Tropospheric Ozone-Explainable Machine Learning and Impact of Uncertainties. Geosci. Model Dev. 2022, 15, 4331–4354. [Google Scholar] [CrossRef]
- Stirnberg, R.; Cermak, J.; Kotthaus, S.; Haeffelin, M.; Andersen, H.; Fuchs, J.; Kim, M.; Petit, J.E.; Favez, O. Meteorology-Driven Variability of Air Pollution (PM1) Revealed with Explainable Machine Learning. Atmos. Chem. Phys. 2021, 21, 3919–3948. [Google Scholar] [CrossRef]
- Gu, Y.; Li, B.; Meng, Q. Hybrid Interpretable Predictive Machine Learning Model for Air Pollution Prediction. Neurocomputing 2022, 468, 123–136. [Google Scholar] [CrossRef]
- Ji, Y.; Zhi, X.; Wu, Y.; Zhang, Y.; Yang, Y.; Peng, T.; Ji, L. Regression Analysis of Air Pollution and Pediatric Respiratory Diseases Based on Interpretable Machine Learning. Front. Earth Sci. 2023, 11, 1105140. [Google Scholar] [CrossRef]
- Tan, S.T.; Mohamed, N.; Ng, L.C.; Aik, J. Air Quality in Underground Metro Station Commuter Platforms in Singapore: A Cross-Sectional Analysis of Factors Influencing Commuter Exposure Levels. Atmos. Environ. 2022, 273, 118962. [Google Scholar] [CrossRef]
- Land Transport Authority Public Transport Ridership. Available online: https://www.lta.gov.sg/content/dam/ltagov/who_we_are/statistics_and_publications/statistics/pdf/PT_Ridership_2015_2019.pdf (accessed on 3 August 2021).
- Government of Singapore Total Land Area of Singapore. Available online: https://data.gov.sg/dataset/total-land-area-of-singapore (accessed on 3 August 2021).
- Barudgar, A.; Singh, J.; Tyagi, B. Variability of Fine Particulate Matter (PM2.5) and Its Association with Health and Vehicular Emissions Over an Urban Tropical Coastal Station Mumbai, India. Thalassas 2022, 38, 1067–1080. [Google Scholar] [CrossRef]
- Sahu, S.K.; Tyagi, B.; Pradhan, C.; Beig, G. Evaluating the Variability, Transport and Periodicity of Particulate Matter over Smart City Bhubaneswar, a Tropical Coastal Station of Eastern India. SN Appl. Sci. 2019, 1, 383. [Google Scholar] [CrossRef]
- Gogikar, P.; Tyagi, B.; Gorai, A.K. Seasonal Prediction of Particulate Matter over the Steel City of India Using Neural Network Models. Model. Earth Syst. Environ. 2019, 5, 227–243. [Google Scholar] [CrossRef]
- Hari, M.; Tyagi, B. India’s Greening Trend Seems to Slow Down. What Does Aerosol Have to Do with It? Land 2022, 11, 538. [Google Scholar] [CrossRef]
- Sahu, R.K.; Hari, M.; Tyagi, B. Forest Fire Induced Air Pollution over Eastern India during March 2021. Aerosol Air Qual. Res. 2022, 22, 220084. [Google Scholar] [CrossRef]
- Gogikar, P.; Tyagi, B.; Padhan, R.R.; Mahaling, M. Particulate Matter Assessment Using In Situ Observations from 2009 to 2014 over an Industrial Region of Eastern India. Earth Syst. Environ. 2018, 2, 305–322. [Google Scholar] [CrossRef]
- Gogikar, P.; Tyagi, B. Assessment of Particulate Matter Variation during 2011–2015 over a Tropical Station Agra, India. Atmos. Environ. 2016, 147, 11–21. [Google Scholar] [CrossRef]
- Peng-in, B.; Sanitluea, P.; Monjatturat, P.; Boonkerd, P.; Phosri, A. Estimating Ground-Level PM2.5 over Bangkok Metropolitan Region in Thailand Using Aerosol Optical Depth Retrieved by MODIS. Air Qual. Atmos. Health 2022, 15, 2091–2102. [Google Scholar] [CrossRef]
- Sethi, J.K.; Mittal, M. Monitoring the Impact of Air Quality on the COVID-19 Fatalities in Delhi, India: Using Machine Learning Techniques. Disaster Med. Public Health Prep. 2020, 16, 604–611. [Google Scholar] [CrossRef] [PubMed]
- Mustakim, N.A.; Ul-Saufie, A.Z.; Shaziayani, W.N.; Noor, N.M.; Mutalib, S. Prediction of Daily Air Pollutants Concentration and Air Pollutant Index Using Machine Learning Approach. Pertanika J. Sci. Technol. 2023, 31, 123–135. [Google Scholar] [CrossRef]
- Gautam, S.; Patra, A.K.; Brema, J.; Raj, P.V.; Raimond, K.; Abraham, S.S.; Chudugudu, K.R. Prediction of Various Sizes of Particles in Deep Opencast Copper Mine Using Recurrent Neural Network: A Machine Learning Approach. J. Inst. Eng. Ser. A 2022, 103, 283–294. [Google Scholar] [CrossRef]
- Adong, P.; Bainomugisha, E.; Okure, D.; Sserunjogi, R. Applying Machine Learning for Large Scale Field Calibration of Low-cost PM2.5 and PM10 Air Pollution Sensors. Appl. AI Lett. 2022, 3, e76. [Google Scholar] [CrossRef]
- Ha, N.T.; Manley-Harris, M.; Pham, T.D.; Hawes, I. The Use of Radar and Optical Satellite Imagery Combined with Advanced Machine Learning and Metaheuristic Optimization Techniques to Detect and Quantify above Ground Biomass of Intertidal Seagrass in a New Zealand Estuary. Int. J. Remote Sens. 2021, 42, 4712–4738. [Google Scholar] [CrossRef]
- Bergstra, J.; Bengio, Y. Random Search for Hyper-Parameter Optimization. J. Mach. Learn. Res. 2012, 13, 281–305. [Google Scholar]
- Eiben, A.E. Introduction to Evolutionary Computing. Assem. Autom. 2004, 24, 324. [Google Scholar] [CrossRef]
- Nunnari, G. Modelling Air Pollution Time-Series by Using Wavelet Functions and Genetic Algorithms. Soft Comput. 2004, 8, 173–178. [Google Scholar] [CrossRef]
- Saini, J.; Dutta, M.; Marques, G. A Novel Application of Fuzzy Inference System Optimized with Particle Swarm Optimization and Genetic Algorithm for PM10 Prediction. Soft Comput. 2022, 26, 9573–9586. [Google Scholar] [CrossRef]
- Garouani, M.; Ahmad, A.; Bouneffa, M.; Hamlich, M.; Bourguin, G.; Lewandowski, A. Using Meta-Learning for Automated Algorithms Selection and Configuration: An Experimental Framework for Industrial Big Data. J. Big Data 2022, 9, 57. [Google Scholar] [CrossRef]
- Le, T.T.; Fu, W.; Moore, J.H. Scaling Tree-Based Automated Machine Learning to Biomedical Big Data with a Feature Set Selector. Bioinformatics 2020, 36, 250–256. [Google Scholar] [CrossRef] [PubMed]
- Olson, R.S.; Bartley, N.; Urbanowicz, R.J.; Moore, J.H. Evaluation of a Tree-Based Pipeline Optimization Tool for Automating Data Science. In Proceedings of the GECCO 2016—2016 Genetic and Evolutionary Computation Conference, Denver, CO, USA, 20–24 July 2016. [Google Scholar]
- Delavar, M.R.; Gholami, A.; Shiran, G.R.; Rashidi, Y.; Nakhaeizadeh, G.R.; Fedra, K.; Afshar, S.H. A Novel Method for Improving Air Pollution Prediction Based on Machine Learning Approaches: A Case Study Applied to the Capital City of Tehran. ISPRS Int. J. Geo-Inf. 2019, 8, 99. [Google Scholar] [CrossRef]
- Srivastava, C.; Singh, S.; Singh, A.P. Estimation of Air Pollution in Delhi Using Machine Learning Techniques. In Proceedings of the 2018 International Conference on Computing, Power and Communication Technologies, GUCON 2018, Greater Noida, India, 28–29 September 2018. [Google Scholar]
- Khan, M.A.; Kim, H.C.; Park, H. Leveraging Machine Learning for Fault-Tolerant Air Pollutants Monitoring for a Smart City Design. Electronics 2022, 11, 3122. [Google Scholar] [CrossRef]
- Arun, G.; Rathi, S. Real Time Air Quality Evaluation Model Using Machine Learning Approach. J. Inf. Technol. Digit. World 2022, 4, 23–33. [Google Scholar] [CrossRef]
- Zhu, B.; Feng, Y.; Gong, D.; Jiang, S.; Zhao, L.; Cui, N. Hybrid Particle Swarm Optimization with Extreme Learning Machine for Daily Reference Evapotranspiration Prediction from Limited Climatic Data. Comput. Electron. Agric. 2020, 173, 105430. [Google Scholar] [CrossRef]
- Aas, K.; Jullum, M.; Løland, A. Explaining Individual Predictions When Features Are Dependent: More Accurate Approximations to Shapley Values. Artif. Intell. 2021, 298, 103502. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Lee, S.; Lundberg, S.M.; Lee, S.I. A Unified Approach to Interpreting Model Predictions. In Advances in Neural Information Processing Systems; NIPS 2017; The MIT Press: Cambridge, MA, USA, 2017; pp. 4765–4774. [Google Scholar]
- Jiang, H.; Senge, E. On Two XAI Cultures: A Case Study of Non-Technical Explanations in Deployed AI System. arXiv 2021, arXiv:2112.01016. [Google Scholar]
- Conibear, L.; Reddington, C.L.; Silver, B.J.; Chen, Y.; Knote, C.; Arnold, S.R.; Spracklen, D.V. Statistical Emulation of Winter Ambient Fine Particulate Matter Concentrations From Emission Changes in China. GeoHealth 2021, 5, e2021GH000391. [Google Scholar] [CrossRef]
- Chen, Z.-Y.; Jin, J.-Q.; Zhang, R.; Zhang, T.-H.; Chen, J.-J.; Yang, J.; Ou, C.-Q.; Guo, Y. Comparison of Different Missing-Imputation Methods for MAIAC (Multiangle Implementation of Atmospheric Correction) AOD in Estimating Daily PM2.5 Levels. Remote Sens. 2020, 12, 3008. [Google Scholar] [CrossRef]
- Hu, X.; Belle, J.H.; Meng, X.; Wildani, A.; Waller, L.A.; Strickland, M.J.; Liu, Y. Estimating PM2.5 Concentrations in the Conterminous United States Using the Random Forest Approach. Environ. Sci. Technol. 2017, 51, 6936–6944. [Google Scholar] [CrossRef]
- Yang, H.; Liu, Z.; Li, G. A New Hybrid Optimization Prediction Model for PM2.5 Concentration Considering Other Air Pollutants and Meteorological Conditions. Chemosphere 2022, 307, 135798. [Google Scholar] [CrossRef]
- Bai, Y.; Zeng, B.; Li, C.; Zhang, J. An Ensemble Long Short-Term Memory Neural Network for Hourly PM2.5 Concentration Forecasting. Chemosphere 2019, 222, 286–294. [Google Scholar] [CrossRef]
Parameters | Random Forest | Gradient Boosting |
---|---|---|
n_estimator | 10 | 1200 |
criterion | mse | friedman_mse |
max_depth | None | 4 |
min_sample_split | 1 | 2 |
min_samples_leaf | 1 | 1 |
min_density | 0.1 | None |
learning_rate | None | 0.01 |
random_state | None | 3 |
subsample | None | 0.5 |
Out of Bag(OOB)_score | bool | None |
n_jobs | 1 | None |
RF | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
MSE | RMSE | ||||||||||||||
Training | Test | Training | Test | Training | Test | Training | Test | Training | Test | Training | Test | Training | Test | Training | Test |
June | July | August | September | June | July | August | September | ||||||||
0.04 | 0.2 | 0.02 | 0.07 | 0.02 | 0.05 | 0.03 | 0.11 | 0.2 | 0.49 | 0.15 | 0.28 | 0.16 | 0.24 | 0.17 | 0.33 |
0.1 | 0.04 | 0.08 | 0.16 | 0.25 | 0.05 | 0.07 | 0.63 | 0.2 | 0.31 | 0.29 | 0.4 | 0.23 | 0.5 | 0.27 | 0.79 |
0.05 | 0.07 | 0.05 | 0.07 | 0.09 | 0.49 | 0.05 | 0.03 | 0.22 | 0.27 | 0.23 | 0.27 | 0.31 | 0.7 | 0.22 | 0.18 |
0.04 | 0.04 | 0.01 | 0.02 | 0.02 | 0.05 | 0.06 | 0.08 | 0.21 | 0.22 | 0.14 | 0.15 | 0.16 | 0.22 | 0.25 | 0.29 |
0.03 | 0.19 | 0.04 | 0.3 | 0.03 | 0.14 | 0.01 | 0.06 | 0.18 | 0.44 | 0.2 | 0.54 | 0.17 | 0.37 | 0.1 | 0.24 |
0.004 | 0.01 | 0.001 | 0.01 | 0.003 | 0.01 | 0.02 | 0.12 | 0.07 | 0.12 | 0.04 | 0.12 | 0.06 | 0.13 | 0.15 | 0.34 |
GB | |||||||||||||||
4.48 | 0.32 | 2.77 | 0.09 | 0.0002 | 0.06 | 5.65 | 0.18 | 0.56 | 0.006 | 0.005 | 0.3 | 0.01 | 0.25 | 0.007 | 0.42 |
0.21 | 0.0002 | 0.002 | 0.58 | 0.0006 | 0.2 | 0.0005 | 0.51 | 0.46 | 0.01 | 0.05 | 0.76 | 0.02 | 0.45 | 0.02 | 0.71 |
0.04 | 4.006 | 5.02 | 0.27 | 0.001 | 0.8 | 0.0007 | 0.08 | 0.22 | 0.006 | 0.52 | 0.007 | 0.89 | 0.04 | 0.02 | 0.28 |
0.11 | 0.0002 | 1.57 | 0.04 | 0.0002 | 0.13 | 0.0002 | 0.17 | 0.33 | 0.01 | 0.003 | 0.21 | 0.01 | 0.37 | 0.01 | 0.41 |
0.01 | 4.65 | 0.003 | 0.33 | 0.0001 | 0.1 | 0.0001 | 0.08 | 0.382 | 0.006 | 0.06 | 0.58 | 0.01 | 0.32 | 0.01 | 0.29 |
1.75 | 0.02 | 5.43 | 0.01 | 9.73 | 0.01 | 0.006 | 0.11 | 0.004 | 0.16 | 0.002 | 0.1 | 0.009 | 0.14 | 0.07 | 0.34 |
RF | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
R2 | MAE | ||||||||||||||
Training | Test | Training | Test | Training | Test | Training | Test | Training | Test | Training | Test | Training | Test | Training | Test |
June | July | August | September | June | July | August | September | ||||||||
0.95 | 0.65 | 0.95 | 0.82 | 0.96 | 0.89 | 0.95 | 0.78 | 0.1 | 0.24 | 0.08 | 0.14 | 0.08 | 0.13 | 0.1 | 0.21 |
0.95 | 0.86 | 0.93 | 0.81 | 0.94 | 0.62 | 0.94 | 0.34 | 0.07 | 0.17 | 0.11 | 0.22 | 0.1 | 0.27 | 0.12 | 0.36 |
0.93 | 0.8 | 0.94 | 0.88 | 0.93 | 0.54 | 0.92 | 0.87 | 0.09 | 0.14 | 0.08 | 0.15 | 0.13 | 0.33 | 0.08 | 0.11 |
0.86 | 0.65 | 0.86 | 0.54 | 0.88 | 0.55 | 0.87 | 0.78 | 0.05 | 0.1 | 0.03 | 0.06 | 0.04 | 0.1 | 0.07 | 0.15 |
0.91 | 0.28 | 0.92 | 0.6 | 0.94 | 0.74 | 0.95 | 0.66 | 0.07 | 0.18 | 0.09 | 0.26 | 0.08 | 0.19 | 0.04 | 0.14 |
0.92 | 0.35 | 0.94 | 0.26 | 0.96 | 0.84 | 0.95 | 0.74 | 0.03 | 0.07 | 0.01 | 0.05 | 0.03 | 0.06 | 0.06 | 0.17 |
GB | |||||||||||||||
0.99 | 0.54 | 0.99 | 0.79 | 0.99 | 0.89 | 0.99 | 0.65 | 0.07 | 0.24 | 0.06 | 0.17 | 0.06 | 0.15 | 0.08 | 0.21 |
0.99 | 0.7 | 0.99 | 0.34 | 0.99 | 0.7 | 0.99 | 0.46 | 0.06 | 0.18 | 0.08 | 0.29 | 0.08 | 0.27 | 0.09 | 0.37 |
0.99 | 0.87 | 0.99 | 0.54 | 0.99 | 0.25 | 0.99 | 0.68 | 0.06 | 0.16 | 0.07 | 0.18 | 0.1 | 0.35 | 0.06 | 0.14 |
0.99 | 0.225 | 0.99 | 0.11 | 0.99 | −0.18 | 0.99 | 0.57 | 0.03 | 0.12 | 0.02 | 0.07 | 0.04 | 0.12 | 0.05 | 0.15 |
0.99 | 0.45 | 0.99 | 0.55 | 0.99 | 0.8 | 0.99 | 0.52 | 0.05 | 0.2 | 0.08 | 0.28 | 0.05 | 0.2 | 0.03 | 0.13 |
0.99 | −0.01 | 0.99 | 0.53 | 0.99 | 0.82 | 0.99 | 0.75 | 0.02 | 0.08 | 0.01 | 0.04 | 0.02 | 0.07 | 0.04 | 0.17 |
RF | |||||||
---|---|---|---|---|---|---|---|
MAPE | |||||||
Training | Test | Training | Test | Training | Test | Training | Test |
June | July | August | September | ||||
0.002 | 0.0047 | 0.0014 | 0.0026 | 0.0015 | 0.002 | 0.0012 | 0.0026 |
0.0016 | 0.0036 | 0.0023 | 0.0048 | 0.0015 | 0.004 | 0.0009 | 0.0026 |
0.0017 | 0.0028 | 0.0016 | 0.003 | 0.0022 | 0.0057 | 0.0014 | 0.002 |
0.001 | 0.0023 | 0.0007 | 0.0013 | 0.001 | 0.0022 | 0.0013 | 0.0028 |
0.0014 | 0.0036 | 0.0015 | 0.0044 | 0.0014 | 0.0032 | 0.0007 | 0.0022 |
0.0007 | 0.0017 | 0.0003 | 0.0009 | 0.0004 | 0.001 | 0.0006 | 0.0017 |
GB | |||||||
0.0014 | 0.0046 | 0.0011 | 0.003 | 0.0011 | 0.0029 | 0.001 | 0.0027 |
0.0013 | 0.0038 | 0.0017 | 0.0061 | 0.0012 | 0.0039 | 0.0006 | 0.0027 |
0.0013 | 0.003 | 0.0013 | 0.0036 | 0.0018 | 0.0061 | 0.001 | 0.0025 |
0.0008 | 0.0028 | 0.0005 | 0.0017 | 0.0008 | 0.0027 | 0.0009 | 0.0028 |
0.0011 | 0.0039 | 0.0014 | 0.0047 | 0.0009 | 0.0033 | 0.0005 | 0.0022 |
0.0005 | 0.0019 | 0.0002 | 0.0008 | 0.0003 | 0.0011 | 0.0004 | 0.0017 |
Regression Models | TP | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Metrics | MSE | RMSE | ||||||||||||||
Case | Training | Test | Training | Test | Training | Test | Training | Test | Training | Test | Training | Test | Training | Test | Training | Test |
Year/Months | June | July | August | September | June | July | August | September | ||||||||
2014 | 0.27 | 0.30 | 0.21 | 0.30 | 0.14 | 0.28 | 0.15 | 0.38 | 0.52 | 0.55 | 0.46 | 0.55 | 0.38 | 0.53 | 0.38 | 0.62 |
2015 | 0.20 | 0.38 | 0.57 | 0.58 | 0.51 | 0.32 | 0.68 | 0.54 | 0.45 | 0.61 | 0.76 | 0.76 | 0.72 | 0.57 | 0.82 | 0.74 |
2016 | 0.40 | 0.21 | 0.26 | 0.51 | 0.99 | 1.57 | 0.34 | 0.27 | 0.63 | 0.46 | 0.51 | 0.72 | 1.00 | 1.25 | 0.58 | 0.52 |
2017 | 0.17 | 0.07 | 0.10 | 0.04 | 0.12 | 0.08 | 0.35 | 0.38 | 0.41 | 0.27 | 0.31 | 0.19 | 0.35 | 0.28 | 0.60 | 0.61 |
2018 | 0.29 | 0.45 | 0.32 | 0.59 | 0.28 | 0.28 | 0.10 | 0.11 | 0.54 | 0.67 | 0.57 | 0.77 | 0.53 | 0.53 | 0.31 | 0.34 |
2019 | 0.03 | 0.03 | 0.02 | 0.03 | 0.05 | 0.09 | 0.21 | 0.24 | 0.18 | 0.18 | 0.14 | 0.17 | 0.23 | 0.30 | 0.46 | 0.49 |
R2 | MAE | |||||||||||||||
2014 | 0.68 | 0.52 | 0.59 | 0.23 | 0.79 | 0.52 | 0.78 | 0.27 | 0.27 | 0.35 | 0.25 | 0.37 | 0.21 | 0.32 | 0.24 | 0.42 |
2015 | 0.80 | 0.45 | 0.59 | 0.31 | 0.54 | 0.49 | 0.62 | 0.41 | 0.19 | 0.45 | 0.38 | 0.56 | 0.32 | 0.38 | 0.31 | 0.40 |
2016 | 0.56 | 0.40 | 0.79 | 0.03 | 0.37 | −0.59 | 0.53 | −0.16 | 0.26 | 0.30 | 0.19 | 0.57 | 0.48 | 0.88 | 0.24 | 0.31 |
2017 | 0.63 | 0.42 | 0.49 | 0.18 | 0.60 | 0.24 | 0.14 | 0.04 | 0.13 | 0.15 | 0.11 | 0.11 | 0.13 | 0.16 | 0.28 | 0.41 |
2018 | 0.34 | −1.05 | 0.52 | 0.09 | 0.52 | 0.49 | 0.60 | 0.18 | 0.23 | 0.33 | 0.26 | 0.42 | 0.27 | 0.36 | 0.15 | 0.20 |
2019 | 0.55 | −0.25 | 0.30 | −0.48 | 0.53 | 0.10 | 0.60 | 0.42 | 0.09 | 0.11 | 0.05 | 0.08 | 0.12 | 0.20 | 0.19 | 0.26 |
MAPE | ||||||||||||||||
2014 | 0.00529 | 0.0068 | 0.0045 | 0.0066 | 0.0039 | 0.006 | 0.003 | 0.0053 | ||||||||
2015 | 0.004 | 0.0095 | 0.0078 | 0.0116 | 0.0046 | 0.0054 | 0.0023 | 0.0029 | ||||||||
2016 | 0.0047 | 0.0055 | 0.0036 | 0.0113 | 0.008 | 0.0148 | 0.0042 | 0.0054 | ||||||||
2017 | 0.0028 | 0.0032 | 0.0022 | 0.0023 | 0.0029 | 0.0036 | 0.0052 | 0.0074 | ||||||||
2018 | 0.0045 | 0.0064 | 0.0044 | 0.007 | 0.0044 | 0.0059 | 0.0024 | 0.0031 | ||||||||
2019 | 0.0021 | 0.0027 | 0.001 | 0.0014 | 0.0017 | 0.003 | 0.0018 | 0.0025 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Sunder, M.S.S.; Tikkiwal, V.A.; Kumar, A.; Tyagi, B. Unveiling the Transparency of Prediction Models for Spatial PM2.5 over Singapore: Comparison of Different Machine Learning Approaches with eXplainable Artificial Intelligence. AI 2023, 4, 787-811. https://doi.org/10.3390/ai4040040
Sunder MSS, Tikkiwal VA, Kumar A, Tyagi B. Unveiling the Transparency of Prediction Models for Spatial PM2.5 over Singapore: Comparison of Different Machine Learning Approaches with eXplainable Artificial Intelligence. AI. 2023; 4(4):787-811. https://doi.org/10.3390/ai4040040
Chicago/Turabian StyleSunder, M. S. Shyam, Vinay Anand Tikkiwal, Arun Kumar, and Bhishma Tyagi. 2023. "Unveiling the Transparency of Prediction Models for Spatial PM2.5 over Singapore: Comparison of Different Machine Learning Approaches with eXplainable Artificial Intelligence" AI 4, no. 4: 787-811. https://doi.org/10.3390/ai4040040