Prediction of Air Quality Combining Wavelet Transform, DCCA Correlation Analysis and LSTM Model
Abstract
:Featured Application
Abstract
1. Introduction
2. Materials and Methods
2.1. Materials
2.2. Air Pollution Index AQI Calculation
- IAQIp—Air quality pollutant subindex, rounded to the nearest whole number.
- Cp—Concentrations of contaminants.
- BPHi, BPLo—Similar to CP high and low values of pollutant concentrations.
- IAQIHi, IAQILo—The subindex of air quality equivalent to BPHi and BPLo
2.3. Wavelet Transform Noise Reduction
2.4. DCCA Intercorrelation Analysis
- Calculate separately the cumulative signal for sequences X and Y:
- 2.
- The cumulative signal is partitioned into N-n overlapping windows containing n + 1 values, with values ranging from i to i + n.
- 3.
- Using least squares to fit the local signal patterns inside a local window: and .
- 4.
- By deleting the local trend and calculating its covariance, the residual series within the window is obtained.
- 5.
- For various window lengths, the associated detrended covariance is obtained.
- 6.
- The introduction of coefficient ρDCCA to measure the interseries connection.
2.5. Long Short-Term Memory Neural Network
- Determine the informational state of each neural unit. The forgetting gate receives the ht−1 and xt information and outputs a value between 0 and 1 for each element in cell state Ct−1, with 1 indicating complete information retention and 0 indicating complete information discard, where Ct−1 is the value of the cell stored at t − 1. The forgetting gate’s equation is:
- 2.
- Determine the information that needs updating via the input gate. The calculating formula for the input gate is as follows:
- 3.
- The calculation for the output gates is as follows:
- 4.
- Calculate the memory gate cell, select the data to be stored in memory, and then calculate the temporary cell state (the value of the candidate memory cell).
- 5.
- The formula for calculating the value of a memory cell is as follows:
- 6.
- The output at time t of the LSTM structural cell.
3. Results
3.1. Meteorological Series Sym8 Wavelet Four Layer Transform Noise Reduction
3.2. DCCA Coefficients on Each Scale and Their Relationship
3.2.1. AQI Correlation with Temperature
3.2.2. AQI Correlation with Humidity
3.2.3. AQI Correlation with Pressure
3.2.4. AQI Correlation with Wind
3.3. DCCA-LSTM Predictive Analysis at Short-Medium and Long Scales
3.3.1. Short-Scale Application of the DCCA-LSTM Model
3.3.2. Medium-Scale and Long-Scale Application of the DCCA-LSTM Model
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Lu, X.; Zhang, S.; Xing, J.; Wang, Y.; Chen, W.; Ding, D.; Wu, Y.; Wang, S.; Duan, L.; Hao, J. Progress of air pollution control in china and its challenges and opportunities in the ecological civilization era. Engineering 2020, 6, 1423–1431. [Google Scholar] [CrossRef]
- Haq, M.A. Smotednn: A novel model for air pollution forecasting and aqi classification. Comput. Mater. Contin. 2022, 71, 1403–1425. [Google Scholar]
- Xu, K.; Cui, K.; Young, L.-H.; Wang, Y.-F.; Hsieh, Y.-K.; Wan, S.; Zhang, J. Air quality index, indicatory air pollutants and impact of COVID-19 event on the air quality near central China. Aerosol Air Qual. Res. 2020, 20, 1204–1221. [Google Scholar] [CrossRef] [Green Version]
- Yates, E.F.; Zhang, K.; Naus, A.; Forbes, C.; Wu, X.; Dey, T. A Review on the Biological, Epidemiological, and Statistical Relevance of COVID-19 Paired with Air Pollution. Environ. Adv. 2022, 8, 100250. [Google Scholar] [CrossRef]
- Huang, X.L.; Hu, S.Y.; Chen, J.X.; Feng, W.Q. Air quality analysis of Sichuan province based on complex network and CSP algorithm. Int. J. Mod. Phys. C 2021, 33, 2250007. [Google Scholar] [CrossRef]
- Xu, H.; Croot, P.; Zhang, C. Exploration of the spatially varying relationships between lead and aluminium concentrations in the topsoil of northern half of Ireland using Geographically Weighted Pearson Correlation Coefficient. Geoderma 2022, 409, 115640. [Google Scholar] [CrossRef]
- Zhang, J.; Li, Y.; Liu, C.; Wu, B.; Shi, K. A study of cross-correlations between PM2.5 and O3 based on Copula and Multifractal methods. Phys. A Stat. Mech. Its Appl. 2022, 589, 126651. [Google Scholar] [CrossRef]
- Kojić, M.; Schlüter, S.; Mitić, P.; Hanić, A. Economy-environment nexus in developed European countries: Evidence from multifractal and wavelet analysis. Chaos Solitons Fractals 2022, 160, 112189. [Google Scholar] [CrossRef]
- Méndez-Figueroa, H.; Colorado-Garrido, D.; Hernández-Pérez, M.; Galván-Martínez, R.; Cruz, R.O. Neural networks and correlation analysis to improve the corrosion prediction of SiO2-nanostructured patinated bronze in marine atmospheres. J. Electroanal. Chem. 2022, 917, 116396. [Google Scholar] [CrossRef]
- Du, Z.; Lawrence, W.R.; Zhang, W.; Zhang, D.; Yu, S.; Hao, Y. Interactions between climate factors and air pollution on daily HFMD cases: A time series study in Guangdong, China. Sci. Total. Environ. 2019, 656, 1358–1364. [Google Scholar] [CrossRef]
- Shen, C.-H.; Li, C.-L.; Si, Y.-L. A detrended cross-correlation analysis of meteorological and API data in Nanjing, China. Phys. A: Stat. Mech. Its Appl. 2015, 419, 417–428. [Google Scholar] [CrossRef]
- Zebende, G.F.; Brito, A.A.; Castro, A.P. DCCA cross-correlation analysis in time-series with removed parts. Phys. A Stat. Mech. Its Appl. 2020, 545, 123472. [Google Scholar] [CrossRef]
- Pavlov, A.; Pavlova, O.; Koronovskii, A.; Guyo, G. Extended detrended cross-correlation analysis of nonstationary processes. Chaos Solitons Fractals 2022, 157, 111972. [Google Scholar] [CrossRef]
- Ben-Salha, O.; Mokni, K. Detrended cross-correlation analysis in quantiles between oil price and the US stock market. Energy 2022, 242, 122918. [Google Scholar] [CrossRef]
- Chen, J.; Shi, Z.; Li, W.; Guo, Y. Bidirectional lstm-crf attention-based model for chinese word segmentation. arXiv 2021, arXiv:2105.09681. [Google Scholar]
- Atila, O.; Şengür, A. Attention guided 3D CNN-LSTM model for accurate speech based emotion recognition. Appl. Acoust. 2021, 182, 108260. [Google Scholar] [CrossRef]
- Pathak, B.; Mittal, S.; Shinde, K.; Pawar, P. Comparison Between LSTM and RNN Algorithm for Speech-to-Speech Translator. Proceedings of International Conference on Communication, Circuits, and Systems, Washington, DC, USA, 6–9 June 2021; Springer: Singapore, 2021. [Google Scholar]
- Aslan, S.N.; Özalp, R.; Uçar, A.; Güzeliş, C. New CNN and hybrid CNN-LSTM models for learning object manipulation of humanoid robots from demonstration. Clust. Comput. 2022, 25, 1575–1590. [Google Scholar] [CrossRef]
- Ouma, Y.O.; Cheruyot, R.; Wachera, A.N. Rainfall and runoff time-series trend analysis using LSTM recurrent neural network and wavelet neural network with satellite-based meteorological data: Case study of Nzoia hydrologic basin. Complex Intell. Syst. 2022, 8, 213–236. [Google Scholar] [CrossRef]
- Shastri, S.; Singh, K.; Kumar, S.; Kour, P.; Mansotra, V. Deep-LSTM ensemble framework to forecast COVID-19: An insight to the global pandemic. Int. J. Inf. Technol. 2021, 13, 1291–1301. [Google Scholar] [CrossRef]
- Shen, Z.; Fan, X.; Zhang, L.; Yu, H. Wind speed prediction of unmanned sailboat based on CNN and LSTM hybrid neural network. Ocean Eng. 2022, 254, 111352. [Google Scholar] [CrossRef]
- Xiang, L.; Liu, J.; Yang, X.; Hu, A.; Su, H. Ultra-short term wind power prediction applying a novel model named SATCN-LSTM. Energy Convers. Manag. 2022, 252, 115036. [Google Scholar] [CrossRef]
- Haq, M.A. CDLSTM: A Novel Model for Climate Change Forecasting. Comput. Mater. Contin. 2022, 71, 2363–2381. [Google Scholar] [CrossRef]
- He, J.; Cui, J.; Zhang, G.; Xue, M.; Chu, D.; Zhao, Y. Spatial–temporal seizure detection with graph attention network and bi-directional LSTM architecture. Biomed. Signal Process. Control. 2022, 78, 103908. [Google Scholar] [CrossRef]
- Panja, P.; Jia, W.; McPherson, B. Prediction of Well Performance in SACROC Field Using Stacked Long Short-Term Memory (LSTM) Network. Expert Syst. Appl. 2022, 205, 117670. [Google Scholar] [CrossRef]
- Haq, M.A.; Jilani, A.K.; Prabu, P. Deep learning based modeling of groundwater storage change. Comput. Mater. Contin. 2021, 70, 4599–4617. [Google Scholar] [CrossRef]
- Rhif, M.; Ben Abbes, A.; Farah, I.R.; Martínez, B.; Sang, Y. Wavelet transform application for/in non-stationary time-series analysis: A review. Appl. Sci. 2019, 9, 1345. [Google Scholar] [CrossRef] [Green Version]
- Shen, C.-H.; Li, C.-L. An analysis of the intrinsic cross-correlations between API and meteorological elements using DPCCA. Phys. A Stat. Mech. Its Appl. 2016, 446, 100–109. [Google Scholar] [CrossRef]
- Yu, Y.; Si, X.; Hu, C.; Zhang, J. A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. 2019, 31, 1235–1270. [Google Scholar] [CrossRef]
Features | SNR | RMSE |
---|---|---|
Temperature | 30.087368 | 0.802346 |
Humidity | 29.952903 | 1.256670 |
Pressure | 62.939602 | 0.720441 |
Wind | 10.224503 | 0.452248 |
Window-Size | A-T | A-H | A-P | A-W |
---|---|---|---|---|
24 | 0.638617 | −0.600959 | −0.240827 | −0.125029 |
48 | 0.519856 | −0.396133 | −0.170982 | −0.281052 |
168 | 0.365726 | −0.24658 | −0.095371 | −0.503246 |
Window | Model | Train Loss | Valid Loss | MSE | Time(s) |
---|---|---|---|---|---|
24 | LSTM | 0.052742 | 0.049323 | 0.08974096 | 17.0574 |
DCCA-LSTM | 0.049790 | 0.047054 | 0.08095255 | 16.0810 | |
48 | LSTM | 0.034793 | 0.032198 | 0.13867431 | 21.3187 |
DCCA-LSTM | 0.033351 | 0.030843 | 0.12241591 | 19.4778 | |
168 | LSTM | 0.018447 | 0.015060 | 0.15315998 | 41.0941 |
DCCA-LSTM | 0.017658 | 0.014200 | 0.13412123 | 37.6885 |
Window | Model | Train Loss | Valid Loss | MSE | Time(s) |
---|---|---|---|---|---|
5000 | LSTM | 0.009910 | 0.006814 | 0.2568583 | 662.7335 |
DCCA-LSTM | 0.008748 | 0.005752 | 0.2008781 | 640.0185 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhang, Z.; Chen, H.; Huang, X. Prediction of Air Quality Combining Wavelet Transform, DCCA Correlation Analysis and LSTM Model. Appl. Sci. 2023, 13, 2796. https://doi.org/10.3390/app13052796
Zhang Z, Chen H, Huang X. Prediction of Air Quality Combining Wavelet Transform, DCCA Correlation Analysis and LSTM Model. Applied Sciences. 2023; 13(5):2796. https://doi.org/10.3390/app13052796
Chicago/Turabian StyleZhang, Zheng, Haibo Chen, and Xiaoli Huang. 2023. "Prediction of Air Quality Combining Wavelet Transform, DCCA Correlation Analysis and LSTM Model" Applied Sciences 13, no. 5: 2796. https://doi.org/10.3390/app13052796
APA StyleZhang, Z., Chen, H., & Huang, X. (2023). Prediction of Air Quality Combining Wavelet Transform, DCCA Correlation Analysis and LSTM Model. Applied Sciences, 13(5), 2796. https://doi.org/10.3390/app13052796