Deep-Learning-Based Natural Ventilation Rate Prediction with Auxiliary Data in Mismeasurement Sensing Environments
Abstract
:1. Introduction
- In the context of deep-learning-based natural ventilation prediction using various IoT sensing data, we address the issue of realistic mismeasurement by generating auxiliary data that utilize the rapidly changing or slowly changing characteristics of sensing data, which can improve the reliability of observation data.
- After constructing three models to apply the characteristics of the reliable features that affect the auxiliary data (i.e., predicted and important features), an ensemble model was designed to improve the generalization performance of the deep learning model for predicting natural ventilation.
2. Background
3. Materials and Methods
3.1. Problem Definition
3.2. Experimental Environments
3.3. Overview of the Proposed Method
3.4. Foundational NVR Prediction Model Based on Deep Learning
3.5. ReLU (Rectified Linear Unit)
3.6. Standard Normalization
3.7. Auxiliary Data Generation
3.7.1. Periodic and Nonperiodic Sensing Data
3.7.2. Generating Auxiliary Data Based on SSL
- In this study, we propose a method for generating auxiliary data based on SSL (semisupervised learning) to address the issue of mismeasurement in sensing data. SSL is a type of ML (machine learning) technology that aims to overcome the limitations of both supervised and unsupervised learning. Supervised learning requires a large amount of training data to classify test data, while unsupervised learning does not require labeled data but struggles to accurately cluster data. To overcome these challenges, SSL learns and labels data with a small amount of training data [25]. By iteratively learning and adding predicted features from a deep learning model to the training data, SSL achieves improved accuracy compared with general ML approaches [26]. Generating auxiliary data based on SSL helps prevent the deterioration of reliability in mismeasurement data.
- Figure 8 shows the process of generating auxiliary data based on SSL. To generate SSL-based auxiliary data, a model trained on observation features, which represent all the features of environmental data, predicts a specific feature, called Prediction Fi. Subsequently, the predicted feature, Prediction Fi, is added to the training data to generate more refined auxiliary data. To ensure accurate generation of auxiliary data, the method distinguishes between data that changes rapidly and gradually over time. It applies specific auxiliary data generation techniques for each type of data. Data with gradual changes and identifiable patterns are used to generate prediction regular features through LSTM-based prediction, while rapidly changing data generate prediction irregular features through prediction using the FC-DNN model. By combining these two types of auxiliary data, we ultimately create generated auxiliary features that complement the mismeasurement data.
3.7.3. Auxiliary Data with Important Features
- Along with the generation of sensing data, this study selects important features after analyzing the correlation between the NVR and environmental variables. We finally use them as auxiliary data. Considering the correlation between target data and input data in the prediction model is one of the ways to improve the model’s predictive performance [26]. Therefore, we improve the prediction accuracy by using the main features that have a relatively greater impact on NVR through prior research results and correlation analysis as auxiliary data. In prior research on NVR prediction, it has been reported that the factors affecting NVR are four elements near the building (Pd, Wd, Ws, and Td) [27]. We use the heatmap method to compare the correlation of the prior research results with the sensing data and analyze the impact between each feature, as shown in Figure 9 in the heatmap. The relationship between the four important factors highlighted in the previous research and NVR are all positive correlations, appearing as 0.84, 0.13, 0.36, and 0.38, respectively. Even excluding these four factors, Tout and Tin are high correlations because they are related to Td. Based on the results of previous research, we reidentified how much wind-related variables and pressure differences could be affected to variations in NVR based on correlation analysis. As shown in the results, in/outdoor temperature differences were significantly influential to changes in NVR, because they are highly related to making pressure differences changeable. Therefore, we select Td as the important feature and exclude Tin and Tout. Also, the environmental data assumptions vary depending on the building’s measurement conditions (e.g., direction, building scale, and location). In this study, considering the differences in measurement conditions, we select four factors as the importance features based on the results of previous research and use them as auxiliary data.
3.8. NVR Prediction Model Scenarios
3.9. Proposal Algorithm
Algorithm 1. Proposed NVR Prediction Methods with Auxiliary Data | |
Input: Observation Train Data: TrainX Observation Test Data: TestX Observation Data Columns: C Output: Natural Ventilation Rate: NVR | |
Step 1 | DNNS1.fit(TrainX) PREDS1 = DNNS1.predict(X) |
Step 2 | For i in C: If Xi ! = Periodicity: ModelXi = DNN else: ModelXi = LSTM ModelXi.fit(TrainX - TrainXi + PREDS1) PREDxi = Modelxi.predict(TestX - TestXi) PREDX + = PREDxi ImportantX = X[[Important]] |
Step 3 | TrainPREDx, TestPREDx = train_test_split(PREDX) TrainImportantx, TestImportantx = train_test_split(ImportantX) DNNS2.fit(TrainPREDx) DNNS3.fit(TrainImportantx) |
Step 4 | Enet = Ensemble_create(DNNS1, DNNS2, DNNS3) Enet.fit(TrainX, TrainPREDx, TrainImportantx) TestX’, TestPREDx’, TestImportantx’ = insert_outlier(TestX, TestPREDx, TestImportantx) PREDE = Enet.predict(TestX’, TestPREDx’, TestImportantx’) return PREDE |
4. Experimental Results
4.1. Evaluation Metrics
4.2. Outlier Dataset
4.3. Evaluation Metrics of Generated Auxiliary Data
4.4. Analyzing NVR Prediction Accuracy
4.5. Analyzing the Impact of Observations and Auxiliary Data on the NVR Prediction Model
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Fathi, S.; Srinivasan, R.; Fenner, A.; Fathi, S. Machine learning applications in urban building energy performance forecasting: A systematic review. Renew. Sustain. Energy Rev. 2020, 133, 110287. [Google Scholar] [CrossRef]
- Khoukhi, M.; Yoshino, H.; Liu, J. The effect of the wind speed velocity on the stack pressure in medium-rise buildings in cold region of China. Build. Environ. 2007, 42, 1081–1088. [Google Scholar] [CrossRef]
- Yan, D.; Song, F.; Yang, X.; Jiang, Y.; Zhao, B.; Zhang, X.; Liu, X.; Wang, X.; Xu, F.; Wu, P.; et al. An integrated modeling tool for simultaneous analysis of thermal performance and indoor air quality in buildings. Build. Environ. 2008, 43, 287–293. [Google Scholar] [CrossRef]
- Dehghan, A.A.; Esfeh, M.K.; Manshadi, M.D. Natural ventilation characteristics of one-sided wind catchers: Experimental and analytical evaluation. Energy Build. 2013, 61, 366–377. [Google Scholar] [CrossRef]
- Wang, J.; Wang, S.; Zhang, T.; Battaglia, F. Assessment of single-sided natural ventilation driven by buoyancy forces through variable window configurations. Energy Build. 2017, 139, 762–779. [Google Scholar] [CrossRef]
- Elshafei, G.; Negm, A.; Bady, M.; Suzuki, M.; Ibrahim, M.G. Numerical and experimental investigations of the impacts of window parameters on indoor natural ventilation in a residential building. Energy Build. 2017, 141, 321–332. [Google Scholar] [CrossRef]
- Han, D.H.; Kim, S.; Choi, J.H.; Kim, Y.S.; Chung, H.; Jeong, H.; Watjanatepin, N.; Ruangpattanawiwat, C.; Choi, S.-H. Experimental study on thermal buoyancy-induced natural ventilation. Energy Build. 2018, 177, 1–11. [Google Scholar] [CrossRef]
- Heracleous, C.; Michael, A. Experimental assessment of the impact of natural ventilation on indoor air quality and thermal comfort conditions of educational buildings in the Eastern Mediterranean region during the heating period. J. Build. Eng. 2019, 26, 100917. [Google Scholar] [CrossRef]
- Calautit, J.K.; Tien, P.W.; Wei, S.; Calautit, K.; Hughes, B. Numerical and experimental investigation of the indoor air quality and thermal comfort performance of a low energy cooling windcatcher with heat pipes and extended surfaces. Renew. Energy 2020, 145, 744–756. [Google Scholar] [CrossRef]
- Muhsin, F.; Yusoff, W.F.M.; Mohamed, M.F.; Sapian, A.R. CFD modeling of natural ventilation in a void connected to the living units of multi-storey housing for thermal comfort. Energy Build. 2017, 144, 1–16. [Google Scholar] [CrossRef]
- Villagran, E.A.; Romero, E.J.B.; Bojacá, C.R. Transient CFD analysis of the natural ventilation of three types of greenhouses used for agricultural production in a tropical mountain climate. Biosyst. Eng. 2019, 188, 288–304. [Google Scholar] [CrossRef]
- Calautit, J.K.; O’Connor, D.; Tien, P.W.; Wei, S.; Pantua, C.A.J.; Hughes, B. Development of a natural ventilation windcatcher with passive heat recovery wheel for mild-cold climates: CFD and experimental analysis. Renew. Energy 2020, 160, 465–482. [Google Scholar] [CrossRef]
- Rodrigues Marques Sakiyama, N.; Frick, J.; Bejat, T.; Garrecht, H. Using CFD to evaluate natural ventilation through a 3D parametric modeling approach. Energies 2021, 14, 2197. [Google Scholar] [CrossRef]
- Park, H.; Park, D.Y. Comparative analysis on predictability of natural ventilation rate based on machine learning algorithms. Build. Environ. 2021, 195, 107744. [Google Scholar] [CrossRef]
- Taheri, S.; Razban, A. Learning-based CO2 concentration prediction: Application to indoor air quality control using de-mand-controlled ventilation. Build. Environ. 2021, 205, 108164. [Google Scholar] [CrossRef]
- Hiyama, K.; Takeuchi, K.; Omodaka, Y.; Srisamranrungruang, T. Operation strategy for engineered natural ventilation using machine learning under sparse data conditions. Jpn. Archit. Rev. 2022, 5, 119–126. [Google Scholar] [CrossRef]
- Wu, W.; Norford, L.K.; Li, N.; Malkawi, A. Model Predictive Control of Single-Sided Natural Ventilation in a Smart Building Using Machine Learning Algorithms. IEEE Trans. Autom. Sci. Eng. 2018, 15, 636–646. [Google Scholar]
- Shakibjoo, A.D.; Moradzadeh, M.; Din, S.U.; Mohammadzadeh, A.; Mosavi, A.H.; Vandevelde, L. Optimized type-2 fuzzy frequency control for multi-area power systems. IEEE Access 2021, 10, 6989–7002. [Google Scholar] [CrossRef]
- Koutsoukas, A.; Monaghan, K.J.; Li, X.; Huan, J. Deep-learning: Investigating deep neural networks hyper-parameters and comparison of performance to shallow methods for modeling bioactivity data. J. Cheminform. 2017, 9, 1. [Google Scholar] [CrossRef] [Green Version]
- Delalleau, O.; Bengio, Y. Shallow vs. deep sum-product networks. Adv. Neural Inf. Process. Syst. 2011, 24, 1–9. [Google Scholar]
- Hinton, G.E.; Osindero, S.; Teh, Y.W. A fast learning algorithm for deep belief nets. Neural Comput. 2006, 18, 1527–1554. [Google Scholar] [CrossRef] [PubMed]
- Bengio, Y. Learning deep architectures for AI. Found. Trends Mach. Learn. 2009, 2, 1–127. [Google Scholar] [CrossRef]
- Agarap, F. Deep learning using rectified linear units (ReLU). arXiv 2018, arXiv:1803.08375. [Google Scholar]
- Liu, Z. A method of SVM with normalization in intrusion detection. Procedia Environ. Sci. 2011, 11, 256–262. [Google Scholar] [CrossRef] [Green Version]
- Ahfock, D.; McLachlan, G.J. Semi-Supervised Learning of Classifiers from a Statistical Perspective: A Brief Review. In Econometrics and Statistics; Elsevier: Amsterdam, The Netherlands, 2022. [Google Scholar]
- Sajun, A.; Zualkernan, I. Survey on Implementations of Generative Adversarial Networks for Semi-Supervised Learning. Appl. Sci. 2022, 12, 1718. [Google Scholar] [CrossRef]
- Larsen, T.S.; Heiselberg, P. Single-sided natural ventilation driven by wind pressure and temperature difference. Energy Build. 2008, 40, 1031–1040. [Google Scholar] [CrossRef]
- Gaikwad, D.P.; Thool, R.C. Intrusion detection system using bagging ensemble method of machine learning. In Proceedings of the 2015 International Conference on Computing Communication Control and Automation, Pune, India, 26–27 February 2015; IEEE: Toulouse, France, 2015; pp. 291–295. [Google Scholar]
- Joseph, R.V.; Mohanty, A.; Tyagi, S.; Mishra, S.; Satapathy, S.K.; Mohanty, S.N. A hybrid deep learning framework with CNN and Bi-directional LSTM for store item demand forecasting. Comput. Electr. Eng. 2022, 103, 108358. [Google Scholar] [CrossRef]
- Rustam, F.; Reshi, A.A.; Mehmood, A.; Ullah, S.; On, B.W.; Aslam, W.; Choi, G.S. COVID-19 future forecasting using supervised machine learning models. IEEE Access 2020, 8, 101489–101499. [Google Scholar] [CrossRef]
- Berhich, A.; Belouadha, F.Z.; Kabbaj, M.I. An attention-based LSTM network for large earthquake prediction. Soil Dyn. Earthq. Eng. 2023, 165, 107663. [Google Scholar] [CrossRef]
- Vafaei, S.; Soosani, J.; Adeli, K.; Fadaei, H.; Naghavi, H.; Pham, T.D.; Tien Bui, D. Improving accuracy estimation of Forest Aboveground Biomass based on incorporation of ALOS-2 PALSAR-2 and Sentinel-2A imagery and machine learning: A case study of the Hyrcanian forest area (Iran). Remote Sens. 2018, 10, 172. [Google Scholar] [CrossRef] [Green Version]
Year | Simplistic/Complicated | Characteristic of Methods | Major Subject | Consideration of Mismeasurement Sensing Features |
---|---|---|---|---|
2 (2007) 3 (2008) 4 (2013) 5 (2017) | Simplistic | Analytical model through basic equations | Airflow | No |
6 (2017) 7 (2018) 8 (2019) 9 (2020) | Complicated | Combining empirical models for small and large scale | Airflow | No |
10 (2017) 11 (2019) 12 (2020) 13 (2021) | Complicated | Analysis simulation based on CFD for complex data | Airflow | No |
14 (2021) 15 (2021) 16 (2022) 17 (2022) | Complicated | Prediction based on machine learning | Ventilation rate | No |
Proposed | Complicated | Prediction considering mismeasurement data | Ventilation rate | Yes |
Type of Variables | Variable Feature | Symbols | Units | Range | Influence |
---|---|---|---|---|---|
Input (Periodic) | Solar radiation | Sr | Mj/m2 | 0~0.015 | Medium |
Indoor air temperature | Tin | °C | 15.5~27.7 | Medium | |
Outdoor air temperature | Tout | °C | 1.7~28.1 | Medium | |
Difference of in/outdoor air temperature | Td | °C | −2.1~16.4 | High | |
Indoor relative humidity | RHin | % | 16~71 | Medium | |
Outdoor relative humidity | RHout | % | 26~93 | Medium | |
Difference of in/outdoor relative humidity | RHd | % | −53~12 | Medium | |
Input (Nonperiodic) | Pressure difference | Pd | mbar | 0~0.36 | High |
Wind direction | Wd | True-north-based azimuth divided in 16 angles | 0~359 | High | |
Wind speed | Ws | m/s | 0~5.31 | High | |
Target | Natural ventilation rate | NVR | m3/m | 0~3.96 | - |
Wind Direction | Grade | Degree | Building Direction |
0 | 271°~359° 0°~89° | Inside building | |
1 | 90°~270° | Outside building |
Wind Speed | Grade | m/s | Kind of Wind |
0 | 0~0.2 | Calm | |
1 | 0.3~1.5 | Light air | |
2 | 1.6~3.3 | Light breeze | |
3 | 3.4~5.4 | Gentle breeze |
Scenario | Title | Features |
---|---|---|
S1 | Observed features | X |
S2 | Auxiliary features | PREDX |
S3 | Important features | Pd, Wd, Ws, and Td |
S4 | Ensemble of S1, S2, and S3 | |
X: Sr, Tin, Tout, Td, RHin, RHout, RHd, Pd, Wd, and Ws |
Sr | Tin | Tout | Td | RHin | RHout | RHd | Pd | Wd | Ws | |
---|---|---|---|---|---|---|---|---|---|---|
R2score | 0.933 | 0.996 | 0.992 | 0.986 | 0.995 | 0.981 | 0.974 | 0.938 | - | - |
MMscore | 0.916 | 0.997 | 0.987 | 0.964 | 0.992 | 0.981 | 0.937 | 0.806 | - | - |
ACCscore | 0.891 | 0.871 | 0.814 | 0.978 | 0.945 | 0.912 | 0.728 | 0.917 | 0.626 | 0.615 |
Metrics | Train Rates | Scenarios | Outlier Rates | |||
---|---|---|---|---|---|---|
0 | 0.1 | 0.2 | 0.3 | |||
R2score | 0.2 | S1 | 0.684 | 0.603 | 0.681 | 0.628 |
S4 | 0.931 | 0.901 | 0.898 | 0.874 | ||
0.4 | S1 | 0.794 | 0.744 | 0.757 | 0.724 | |
S4 | 0.941 | 0.927 | 0.914 | 0.877 | ||
0.5 | S1 | 0.830 | 0.756 | 0.760 | 0.736 | |
S4 | 0.943 | 0.933 | 0.914 | 0.905 | ||
0.6 | S1 | 0.845 | 0.786 | 0.768 | 0.743 | |
S4 | 0.946 | 0.934 | 0.922 | 0.908 | ||
MMscore | 0.2 | S1 | 0.750 | 0.744 | 0.740 | 0.724 |
S4 | 0.860 | 0.851 | 0.847 | 0.842 | ||
0.4 | S1 | 0.814 | 0.808 | 0.786 | 0.780 | |
S4 | 0.918 | 0.911 | 0.898 | 0.871 | ||
0.5 | S1 | 0.833 | 0.802 | 0.802 | 0.795 | |
S4 | 0.922 | 0.905 | 0.898 | 0.890 | ||
0.6 | S1 | 0.840 | 0.805 | 0.799 | 0.774 | |
S4 | 0.920 | 0.896 | 0.888 | 0.870 | ||
ACCscore | 0.2 | S1 | 0.580 | 0.567 | 0.540 | 0.517 |
S4 | 0.867 | 0.844 | 0.826 | 0.807 | ||
0.4 | S1 | 0.659 | 0.632 | 0.624 | 0.613 | |
S4 | 0.886 | 0.872 | 0.858 | 0.845 | ||
0.5 | S1 | 0.717 | 0.654 | 0.633 | 0.621 | |
S4 | 0.901 | 0.886 | 0.879 | 0.854 | ||
0.6 | S1 | 0.745 | 0.656 | 0.644 | 0.637 | |
S4 | 0.928 | 0.895 | 0.894 | 0.868 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yang, S.; Kim, M.; Lee, S. Deep-Learning-Based Natural Ventilation Rate Prediction with Auxiliary Data in Mismeasurement Sensing Environments. Electronics 2023, 12, 3294. https://doi.org/10.3390/electronics12153294
Yang S, Kim M, Lee S. Deep-Learning-Based Natural Ventilation Rate Prediction with Auxiliary Data in Mismeasurement Sensing Environments. Electronics. 2023; 12(15):3294. https://doi.org/10.3390/electronics12153294
Chicago/Turabian StyleYang, Subhin, Mintai Kim, and Sungju Lee. 2023. "Deep-Learning-Based Natural Ventilation Rate Prediction with Auxiliary Data in Mismeasurement Sensing Environments" Electronics 12, no. 15: 3294. https://doi.org/10.3390/electronics12153294
APA StyleYang, S., Kim, M., & Lee, S. (2023). Deep-Learning-Based Natural Ventilation Rate Prediction with Auxiliary Data in Mismeasurement Sensing Environments. Electronics, 12(15), 3294. https://doi.org/10.3390/electronics12153294