Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (33)

Search Parameters:
Keywords = recursive gated convolution

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
29 pages, 2790 KB  
Article
A New Hybrid Adaptive Self-Loading Filter and GRU-Net for Active Noise Control in a Right-Angle Bending Pipe of an Air Conditioner
by Wenzhao Zhu, Zezheng Gu, Xiaoling Chen, Ping Xie, Lei Luo and Zonglong Bai
Sensors 2025, 25(20), 6293; https://doi.org/10.3390/s25206293 - 10 Oct 2025
Viewed by 390
Abstract
The air-conditioner noise in a rehabilitation room can seriously affect the mental state of patients. However, the existing single-layer active noise control (ANC) filters may fail to attenuate the complicated harmonic noise, and the deep recursive ANC method may fail to work in [...] Read more.
The air-conditioner noise in a rehabilitation room can seriously affect the mental state of patients. However, the existing single-layer active noise control (ANC) filters may fail to attenuate the complicated harmonic noise, and the deep recursive ANC method may fail to work in real time. To solve the problem, in a bending-pipe model, a new hybrid adaptive self-loading filtered-x least-mean-square (ASL-FxLMS) and convolutional neural network-gate recurrent unit (CNN-GRU) network is proposed. At first, based on the recursive GRU translation core, an improved CNN-GRU network with multi-head attention layers is proposed. Especially for complicated harmonic noises with more or fewer frequencies than harmonic models, the attenuation performance will be improved. In addition, its structure is optimized to decrease the computing load. In addition, an improved time-delay estimator is applied to improve the real-time ANC performance of CNN-GRU. Meanwhile, an adaptive self-loading FxLMS algorithm has been developed to deal with the uncertain components of complicated harmonic noise. Moreover, to achieve balance attenuation, robustness, and tracking performance, the ASL-FxLMS and CNN-GRU are connected by a convex combination structure. Furthermore, theoretical analysis and simulations are also conducted to show the effectiveness of the proposed method. Full article
(This article belongs to the Section Sensor Networks)
Show Figures

Figure 1

27 pages, 10646 KB  
Article
Deep Learning-Based Hybrid Model with Multi-Head Attention for Multi-Horizon Stock Price Prediction
by Rajesh Kumar Ghosh, Bhupendra Kumar Gupta, Ajit Kumar Nayak and Samit Kumar Ghosh
J. Risk Financial Manag. 2025, 18(10), 551; https://doi.org/10.3390/jrfm18100551 - 1 Oct 2025
Viewed by 613
Abstract
The prediction of stock prices is challenging due to their volatility, irregular patterns, and complex time-series structure. Reliably forecasting stock market data plays a crucial role in minimizing financial risk and optimizing investment strategies. However, traditional models often struggle to capture temporal dependencies [...] Read more.
The prediction of stock prices is challenging due to their volatility, irregular patterns, and complex time-series structure. Reliably forecasting stock market data plays a crucial role in minimizing financial risk and optimizing investment strategies. However, traditional models often struggle to capture temporal dependencies and extract relevant features from noisy inputs, which limits their predictive performance. To improve this, we developed an enhanced recursive feature elimination (RFE) method that blends the importance of impurity-based features from random forest and gradient boosting models with Kendall tau correlation analysis, and we applied SHapley Additive exPlanations (SHAP) analysis to externally validate the reliability of the selected features. This approach leads to more consistent and reliable feature selection for short-term stock prediction over 1-, 3-, and 7-day intervals. The proposed deep learning (DL) architecture integrates a temporal convolutional network (TCN) for long-term pattern recognition, a gated recurrent unit (GRU) for sequence capture, and multi-head attention (MHA) for focusing on critical information, thereby achieving superior predictive performance. We evaluate the proposed approach using daily stock price data from three leading companies—HDFC Bank, Tata Consultancy Services (TCS), and Tesla—and two major stock indices: Nifty 50 and S&P 500. The performance of our model is compared against five benchmark models: temporal convolutional network (TCN), long short-term memory (LSTM), GRU, Bidirectional GRU, and a hybrid TCN–GRU model. Our method consistently shows lower error rates and higher predictive accuracy across all datasets, as measured by four commonly used performance metrics. Full article
(This article belongs to the Section Financial Markets)
Show Figures

Figure 1

30 pages, 9222 KB  
Article
Using Deep Learning in Forecasting the Production of Electricity from Photovoltaic and Wind Farms
by Michał Pikus, Jarosław Wąs and Agata Kozina
Energies 2025, 18(15), 3913; https://doi.org/10.3390/en18153913 - 23 Jul 2025
Viewed by 783
Abstract
Accurate forecasting of electricity production is crucial for the stability of the entire energy sector. However, predicting future renewable energy production and its value is difficult due to the complex processes that affect production using renewable energy sources. In this article, we examine [...] Read more.
Accurate forecasting of electricity production is crucial for the stability of the entire energy sector. However, predicting future renewable energy production and its value is difficult due to the complex processes that affect production using renewable energy sources. In this article, we examine the performance of basic deep learning models for electricity forecasting. We designed deep learning models, including recursive neural networks (RNNs), which are mainly based on long short-term memory (LSTM) networks; gated recurrent units (GRUs), convolutional neural networks (CNNs), temporal fusion transforms (TFTs), and combined architectures. In order to achieve this goal, we have created our benchmarks and used tools that automatically select network architectures and parameters. Data were obtained as part of the NCBR grant (the National Center for Research and Development, Poland). These data contain daily records of all the recorded parameters from individual solar and wind farms over the past three years. The experimental results indicate that the LSTM models significantly outperformed the other models in terms of forecasting. In this paper, multilayer deep neural network (DNN) architectures are described, and the results are provided for all the methods. This publication is based on the results obtained within the framework of the research and development project “POIR.01.01.01-00-0506/21”, realized in the years 2022–2023. The project was co-financed by the European Union under the Smart Growth Operational Programme 2014–2020. Full article
Show Figures

Figure 1

20 pages, 6319 KB  
Article
Spatiotemporal Deformation Prediction Model for Retaining Structures Integrating ConvGRU and Cross-Attention Mechanism
by Yanyong Gao, Zhaoyun Xiao, Zhiqun Gong, Shanjing Huang and Haojie Zhu
Buildings 2025, 15(14), 2537; https://doi.org/10.3390/buildings15142537 - 18 Jul 2025
Viewed by 433
Abstract
With the exponential growth of engineering monitoring data, data-driven neural networks have gained widespread application in predicting retaining structure deformation in foundation pit engineering. However, existing models often overlook the spatial deflection correlations among monitoring points. Therefore, this study proposes a novel deep [...] Read more.
With the exponential growth of engineering monitoring data, data-driven neural networks have gained widespread application in predicting retaining structure deformation in foundation pit engineering. However, existing models often overlook the spatial deflection correlations among monitoring points. Therefore, this study proposes a novel deep learning framework, CGCA (Convolutional Gated Recurrent Unit with Cross-Attention), which integrates ConvGRU and cross-attention mechanisms. The model achieves spatio-temporal feature extraction and deformation prediction via an encoder–decoder architecture. Specifically, the convolutional structure captures spatial dependencies between monitoring points, while the recurrent unit extracts time-series characteristics of deformation. A cross-attention mechanism is introduced to dynamically weight the interactions between spatial and temporal data. Additionally, the model incorporates multi-dimensional inputs, including full-depth inclinometer measurements, construction parameters, and tube burial depths. The optimization strategy combines AdamW and Lookahead to enhance training stability and generalization capability in geotechnical engineering scenarios. Case studies of foundation pit engineering demonstrate that the CGCA model exhibits superior performance and robust generalization capabilities. When validated against standard section (CX1) and complex working condition (CX2) datasets involving adjacent bridge structures, the model achieves determination coefficients (R2) of 0.996 and 0.994, respectively. The root mean square error (RMSE) remains below 0.44 mm, while the mean absolute error (MAE) is less than 0.36 mm. Comparative experiments confirm the effectiveness of the proposed model architecture and the optimization strategy. This framework offers an efficient and reliable technical solution for deformation early warning and dynamic decision-making in foundation pit engineering. Full article
(This article belongs to the Special Issue Research on Intelligent Geotechnical Engineering)
Show Figures

Figure 1

19 pages, 4561 KB  
Article
Enhanced Rolling Bearing Fault Diagnosis Using Multimodal Deep Learning and Singular Spectrum Analysis
by Yunhang Wang, Hongwei Wang, Ruoyang Bai, Yuxin Shi, Xicong Chen and Qingang Xu
Appl. Sci. 2025, 15(9), 4828; https://doi.org/10.3390/app15094828 - 27 Apr 2025
Cited by 1 | Viewed by 2125
Abstract
A decision-level multimodal fusion deep learning strategy is proposed for the effective fault detection of rolling bearings based on long-term fault signals collected from multiple sensors. First, key features are extracted from the multimodal signal set using singular spectrum analysis (SSA), and these [...] Read more.
A decision-level multimodal fusion deep learning strategy is proposed for the effective fault detection of rolling bearings based on long-term fault signals collected from multiple sensors. First, key features are extracted from the multimodal signal set using singular spectrum analysis (SSA), and these features are transformed into a composite dataset that combines short-time Fourier transform (STFT) images and time series data. Based on this, a recursive gated convolutional neural network (RGCNN) is designed to process the STFT image data, while a 1D convolutional neural network (1DCNN) is specifically optimized for training with time series data. Furthermore, decision-level multimodal feature fusion is achieved by applying a weighted average method to integrate the features from different deep learning models, aiming to obtain more comprehensive fault prediction results. The proposed method, multimodal fusion fault detection (MFFD), is validated on the Paderborn and Ottawa rolling bearing datasets, which include various typical faults. Experimental results demonstrate the effectiveness of the proposed approach. Compared to traditional single-modality deep learning models, the proposed method shows significant improvements in fault diagnosis accuracy and generalization capability. Full article
Show Figures

Figure 1

19 pages, 5313 KB  
Article
A New Custom Deep Learning Model Coupled with a Flood Index for Multi-Step-Ahead Flood Forecasting
by Jianming Shen, Moyuan Yang, Juan Zhang, Nan Chen and Binghua Li
Hydrology 2025, 12(5), 104; https://doi.org/10.3390/hydrology12050104 - 26 Apr 2025
Cited by 1 | Viewed by 1001
Abstract
Accurate and prompt flood forecasting is essential for effective decision making in flood control to help minimize or prevent flood damage. We propose a new custom deep learning model, IF-CNN-GRU, for multi-step-ahead flood forecasting that incorporates the flood index (IF) [...] Read more.
Accurate and prompt flood forecasting is essential for effective decision making in flood control to help minimize or prevent flood damage. We propose a new custom deep learning model, IF-CNN-GRU, for multi-step-ahead flood forecasting that incorporates the flood index (IF) to improve the prediction accuracy. The model integrates convolutional neural networks (CNNs) and gated recurrent neural networks (GRUs) to analyze the spatiotemporal characteristics of hydrological data, while using a custom recursive neural network that adjusts the neural unit output at each moment based on the flood index. The IF-CNN-GRU model was applied to forecast floods with a lead time of 1–5 d at the Baihe hydrological station in the middle reaches of the Han River, China, accompanied by an in-depth investigation of model uncertainty. The results showed that incorporating the flood index IF improved the forecast precision by up to 20%. The analysis of uncertainty revealed that the contributions of modeling factors, such as the datasets, model structures, and their interactions, varied across the forecast periods. The interaction factors contributed 17–36% of the uncertainty, while the contribution of the datasets increased with the forecast period (32–53%) and that of the model structure decreased (32–28%). The experiment also demonstrated that data samples play a critical role in improving the flood forecasting accuracy, offering actionable insights to reduce the predictive uncertainty and providing a scientific basis for flood early warning systems and water resource management. Full article
Show Figures

Figure 1

16 pages, 1633 KB  
Article
Advancing Rice Grain Impurity Segmentation with an Enhanced SegFormer and Multi-Scale Feature Integration
by Xiulin Qiu, Hongzhi Yao, Qinghua Liu, Hongrui Liu, Haozhi Zhang and Mengdi Zhao
Entropy 2025, 27(1), 70; https://doi.org/10.3390/e27010070 - 15 Jan 2025
Viewed by 1466
Abstract
During the rice harvesting process, severe occlusion and adhesion exist among multiple targets, such as rice, straw, and leaves, making it difficult to accurately distinguish between rice grains and impurities. To address the current challenges, a lightweight semantic segmentation algorithm for impurities based [...] Read more.
During the rice harvesting process, severe occlusion and adhesion exist among multiple targets, such as rice, straw, and leaves, making it difficult to accurately distinguish between rice grains and impurities. To address the current challenges, a lightweight semantic segmentation algorithm for impurities based on an improved SegFormer network is proposed. To make full use of the extracted features, the decoder was redesigned. First, the Feature Pyramid Network (FPN) was introduced to optimize the structure, selectively fusing the high-level semantic features and low-level texture features generated by the encoder. Secondly, a Part Large Kernel Attention (Part-LKA) module was designed and introduced after feature fusion to help the model focus on key regions, simplifying the model and accelerating computation. Finally, to compensate for the lack of spatial interaction capabilities, Bottleneck Recursive Gated Convolution (B-gnConv) was introduced to achieve effective segmentation of rice grains and impurities. Compared with the original model, the improved model’s pixel accuracy (PA) and F1 score increased by 1.6% and 3.1%, respectively. This provides a valuable algorithmic reference for designing a real-time impurity rate monitoring system for rice combine harvesters. Full article
Show Figures

Figure 1

19 pages, 3759 KB  
Article
Fusing Events and Frames with Coordinate Attention Gated Recurrent Unit for Monocular Depth Estimation
by Huimei Duan, Chenggang Guo and Yuan Ou
Sensors 2024, 24(23), 7752; https://doi.org/10.3390/s24237752 - 4 Dec 2024
Cited by 1 | Viewed by 1473
Abstract
Monocular depth estimation is a central problem in computer vision and robot vision, aiming at obtaining the depth information of a scene from a single image. In some extreme environments such as dynamics or drastic lighting changes, monocular depth estimation methods based on [...] Read more.
Monocular depth estimation is a central problem in computer vision and robot vision, aiming at obtaining the depth information of a scene from a single image. In some extreme environments such as dynamics or drastic lighting changes, monocular depth estimation methods based on conventional cameras often perform poorly. Event cameras are able to capture brightness changes asynchronously but are not able to acquire color and absolute brightness information. Thus, it is an ideal choice to make full use of the complementary advantages of event cameras and conventional cameras. However, how to effectively fuse event data and frames to improve the accuracy and robustness of monocular depth estimation remains an urgent problem. To overcome these challenges, a novel Coordinate Attention Gated Recurrent Unit (CAGRU) is proposed in this paper. Unlike the conventional ConvGRUs, our CAGRU abandons the conventional practice of using convolutional layers for all the gates and innovatively designs the coordinate attention as an attention gate and combines it with the convolutional gate. Coordinate attention explicitly models inter-channel dependencies and coordinate information in space. The coordinate attention gate in conjunction with the convolutional gate enable the network to model feature information spatially, temporally, and internally across channels. Based on this, the CAGRU can enhance the information density of the sparse events in the spatial domain in the recursive process of temporal information, thereby achieving more effective feature screening and fusion. It can effectively integrate feature information from event cameras and standard cameras, further improving the accuracy and robustness of monocular depth estimation. The experimental results show that the method proposed in this paper achieves significant performance improvements on different public datasets. Full article
(This article belongs to the Special Issue Event-Driven Vision Sensor Architectures and Application Scenarios)
Show Figures

Figure 1

22 pages, 13858 KB  
Article
Large-Scale Origin–Destination Prediction for Urban Rail Transit Network Based on Graph Convolutional Neural Network
by Xuemei Wang, Yunlong Zhang and Jinlei Zhang
Sustainability 2024, 16(23), 10190; https://doi.org/10.3390/su162310190 - 21 Nov 2024
Cited by 3 | Viewed by 1592
Abstract
Due to data sparsity, insufficient spatial relationships, and the complex spatial and temporal characteristics of passenger flow, it is very challenging to achieve a high prediction accuracy on Origin–Destination (OD) in a large urban rail transit network. This paper proposes a two-stage prediction [...] Read more.
Due to data sparsity, insufficient spatial relationships, and the complex spatial and temporal characteristics of passenger flow, it is very challenging to achieve a high prediction accuracy on Origin–Destination (OD) in a large urban rail transit network. This paper proposes a two-stage prediction network GCN-GRU, using a Graph Convolutional Network (GCN) with a Gated Recursive Unit (GRU). The GCN can obtain the adjacency relationship between different stations by selecting the adjacent neighborhoods and interacting neighborhoods of a station and capturing the spatial characteristics of the OD passenger flow. Then, an advanced weighted aggregator is employed to gather important information from the two above-mentioned types of neighborhoods to capture the spatial relationship of the network OD passenger flow and to perceive the sparsity and range of the OD data. On the other hand, the GRU can extract the temporal relationship, such as periodicity and other time-varying trends. The effectiveness of GCN-GRU is tested with a real-world urban rail transit dataset. The experimental results show that whether it is the OD passenger flow matrix of each period (one hour) on weekdays and weekends or the single-pair OD passenger flow between stations, the proposed GCN-GRU models perform better than the benchmark models. This study provides an important theoretical basis and practical applications for operators, thus promoting the sustainable development of urban rail transit systems. Full article
Show Figures

Figure 1

19 pages, 9044 KB  
Article
Advanced Short-Term Load Forecasting with XGBoost-RF Feature Selection and CNN-GRU
by Jingping Cui, Wei Kuang, Kai Geng, Aiying Bi, Fengjiao Bi, Xiaogang Zheng and Chuan Lin
Processes 2024, 12(11), 2466; https://doi.org/10.3390/pr12112466 - 7 Nov 2024
Cited by 18 | Viewed by 4647
Abstract
Accurate and efficient short-term load forecasting (STLF) is essential for optimizing power system operations. This study proposes a novel hybrid forecasting model that integrates XGBoost-RF feature selection with a CNN-GRU neural network to enhance prediction performance while reducing model complexity. The XGBoost-RF approach [...] Read more.
Accurate and efficient short-term load forecasting (STLF) is essential for optimizing power system operations. This study proposes a novel hybrid forecasting model that integrates XGBoost-RF feature selection with a CNN-GRU neural network to enhance prediction performance while reducing model complexity. The XGBoost-RF approach is first applied to select the most predictive features from historical load data, weather conditions, and time-based variables. A convolutional neural network (CNN) is then employed to extract spatial features, while a gated recurrent unit (GRU) captures temporal dependencies for load forecasting. By leveraging a dual-channel structure that combines long- and short-term historical load trends, the proposed model significantly mitigates cumulative errors from recursive predictions. Experimental results demonstrate that the model achieves superior performance with an average root mean square error (RMSE) of 53.29 and mean absolute percentage error (MAPE) of 3.56% on the test set. Compared to traditional models, the prediction accuracy improves by 28.140% to 110.146%. Additionally, the model exhibits strong robustness across different climatic conditions. This research validates the efficacy of integrating XGBoost-RF feature selection with CNN-GRU for STLF, offering reliable decision support for power system management. Full article
(This article belongs to the Section Automation Control Systems)
Show Figures

Figure 1

24 pages, 2131 KB  
Article
Improving Text Classification in Agricultural Expert Systems with a Bidirectional Encoder Recurrent Convolutional Neural Network
by Xiaojuan Guo, Jianping Wang, Guohong Gao, Li Li, Junming Zhou and Yancui Li
Electronics 2024, 13(20), 4054; https://doi.org/10.3390/electronics13204054 - 15 Oct 2024
Cited by 2 | Viewed by 1700
Abstract
With the rapid development of internet and AI technologies, Agricultural Expert Systems (AESs) have become crucial for delivering technical support and decision-making in agricultural management. However, traditional natural language processing methods often struggle with specialized terminology and context, and they lack the adaptability [...] Read more.
With the rapid development of internet and AI technologies, Agricultural Expert Systems (AESs) have become crucial for delivering technical support and decision-making in agricultural management. However, traditional natural language processing methods often struggle with specialized terminology and context, and they lack the adaptability to handle complex text classifications. The diversity and evolving nature of agricultural texts make deep semantic understanding and integration of contextual knowledge especially challenging. To tackle these challenges, this paper introduces a Bidirectional Encoder Recurrent Convolutional Neural Network (AES-BERCNN) tailored for short-text classification in agricultural expert systems. We designed an Agricultural Text Encoder (ATE) with a six-layer transformer architecture to capture both preceding and following word information. A recursive convolutional neural network based on Gated Recurrent Units (GRUs) was also developed to merge contextual information and learn complex semantic features, which are then combined with the ATE output and refined through max-pooling to form the final feature representation. The AES-BERCNN model was tested on a self-constructed agricultural dataset, achieving an accuracy of 99.63% in text classification. Its generalization ability was further verified on the Tsinghua News dataset. Compared to other models such as TextCNN, DPCNN, BiLSTM, and BERT-based models, the AES-BERCNN shows clear advantages in agricultural text classification. This work provides precise and timely technical support for intelligent agricultural expert systems. Full article
Show Figures

Figure 1

20 pages, 4983 KB  
Article
Parking Lot Traffic Prediction Based on Fusion of Multifaceted Spatio-Temporal Features
by Lechuan Zhang, Bin Wang, Qian Zhang, Sulei Zhu and Yan Ma
Sensors 2024, 24(15), 4971; https://doi.org/10.3390/s24154971 - 31 Jul 2024
Cited by 1 | Viewed by 2418
Abstract
With the rapid growth of population and vehicles, issues such as traffic congestion are becoming increasingly apparent. Parking guidance and information (PGI) systems are becoming more critical, with one of the most important tasks being the prediction of traffic flow in parking lots. [...] Read more.
With the rapid growth of population and vehicles, issues such as traffic congestion are becoming increasingly apparent. Parking guidance and information (PGI) systems are becoming more critical, with one of the most important tasks being the prediction of traffic flow in parking lots. Predicting parking traffic can effectively improve parking efficiency and alleviate traffic congestion, traffic accidents, and other problems. However, due to the complex characteristics of parking spatio-temporal data, high levels of noise, and the intricate influence of external factors, there are three challenges to predicting parking traffic in a city effectively: (1) how to better model the nonlinear, asymmetric, and complex spatial relationships among parking lots; (2) how to model the temporal autocorrelation of parking flow more accurately for each parking lot, whether periodic or aperiodic; and (3) how to model the correlation between external influences, such as holiday weekends, POIs (points of interest), and weather factors. In this context, this paper proposes a parking lot traffic prediction model based on the fusion of multifaceted spatio-temporal features (MFF-STGCN). The model consists of a feature embedding module, a spatio-temporal attention mechanism module, and a spatio-temporal convolution module. The feature embedding module embeds external features such as weekend holidays, geographic POIs, and weather features into the time series, the spatio-temporal attention mechanism module captures the dynamic spatio-temporal correlation of parking traffic, and the spatio-temporal convolution module captures the spatio-temporal features by using graph convolution and gated recursion units. Finally, the outputs of adjacent time series, daily series, and weekly series are weighted and fused to obtain the final prediction results, thus predicting the parking lot traffic flow more accurately and effectively. Results on real datasets demonstrate that the proposed model enhances prediction performance. Full article
(This article belongs to the Section Vehicular Sensing)
Show Figures

Figure 1

18 pages, 6924 KB  
Article
Dynamic Spatio-Temporal Adaptive Graph Convolutional Recurrent Networks for Vacant Parking Space Prediction
by Liangpeng Gao, Wenli Fan and Wenliang Jian
Appl. Sci. 2024, 14(13), 5927; https://doi.org/10.3390/app14135927 - 7 Jul 2024
Viewed by 1623
Abstract
The prediction of vacant parking spaces (VPSs) can reduce the time drivers spend searching for parking, thus alleviating traffic congestion. However, previous studies have mostly focused on modeling the temporal features of VPSs using historical data, neglecting the complex and extensive spatial characteristics [...] Read more.
The prediction of vacant parking spaces (VPSs) can reduce the time drivers spend searching for parking, thus alleviating traffic congestion. However, previous studies have mostly focused on modeling the temporal features of VPSs using historical data, neglecting the complex and extensive spatial characteristics of different parking lots within the transportation network. This is mainly due to the lack of direct physical connections between parking lots, making it challenging to quantify the spatio-temporal features among them. To address this issue, we propose a dynamic spatio-temporal adaptive graph convolutional recursive network (DSTAGCRN) for VPS prediction. Specifically, DSTAGCRN divides VPS data into seasonal and periodic trend components and combines daily and weekly information with node embeddings using the dynamic parameter-learning module (DPLM) to generate dynamic graphs. Then, by integrating gated recurrent units (GRUs) with the parameter-learning graph convolutional recursive module (PLGCRM) of DPLM, we infer the spatio-temporal dependencies for each time step. Furthermore, we introduce a multihead attention mechanism to effectively capture and fuse the spatio-temporal dependencies and dynamic changes in the VPS data, thereby enhancing the prediction performance. Finally, we evaluate the proposed DSTAGCRN on three real parking datasets. Extensive experiments and analyses demonstrate that the DSTAGCRN model proposed in this study not only improves the prediction accuracy but can also better extract the dynamic spatio-temporal characteristics of available parking space data in multiple parking lots. Full article
(This article belongs to the Special Issue Intelligent Transportation System in Smart City)
Show Figures

Figure 1

40 pages, 3950 KB  
Review
A Review on Machine/Deep Learning Techniques Applied to Building Energy Simulation, Optimization and Management
by Francesca Villano, Gerardo Maria Mauro and Alessia Pedace
Thermo 2024, 4(1), 100-139; https://doi.org/10.3390/thermo4010008 - 6 Mar 2024
Cited by 30 | Viewed by 8339
Abstract
Given the climate change in recent decades and the ever-increasing energy consumption in the building sector, research is widely focused on the green revolution and ecological transition of buildings. In this regard, artificial intelligence can be a precious tool to simulate and optimize [...] Read more.
Given the climate change in recent decades and the ever-increasing energy consumption in the building sector, research is widely focused on the green revolution and ecological transition of buildings. In this regard, artificial intelligence can be a precious tool to simulate and optimize building energy performance, as shown by a plethora of recent studies. Accordingly, this paper provides a review of more than 70 articles from recent years, i.e., mostly from 2018 to 2023, about the applications of machine/deep learning (ML/DL) in forecasting the energy performance of buildings and their simulation/control/optimization. This review was conducted using the SCOPUS database with the keywords “buildings”, “energy”, “machine learning” and “deep learning” and by selecting recent papers addressing the following applications: energy design/retrofit optimization, prediction, control/management of heating/cooling systems and of renewable source systems, and/or fault detection. Notably, this paper discusses the main differences between ML and DL techniques, showing examples of their use in building energy simulation/control/optimization. The main aim is to group the most frequent ML/DL techniques used in the field of building energy performance, highlighting the potentiality and limitations of each one, both fundamental aspects for future studies. The ML approaches considered are decision trees/random forest, naive Bayes, support vector machines, the Kriging method and artificial neural networks. The DL techniques investigated are convolutional and recursive neural networks, long short-term memory and gated recurrent units. Firstly, various ML/DL techniques are explained and divided based on their methodology. Secondly, grouping by the aforementioned applications occurs. It emerges that ML is mostly used in energy efficiency issues while DL in the management of renewable source systems. Full article
(This article belongs to the Special Issue Innovative Technologies to Optimize Building Energy Performance)
Show Figures

Figure 1

11 pages, 3743 KB  
Article
Minimalist Deployment of Neural Network Equalizers in a Bandwidth-Limited Optical Wireless Communication System with Knowledge Distillation
by Yiming Zhu, Yuan Wei, Chaoxu Chen, Nan Chi and Jianyang Shi
Sensors 2024, 24(5), 1612; https://doi.org/10.3390/s24051612 - 1 Mar 2024
Cited by 1 | Viewed by 2197
Abstract
An equalizer based on a recurrent neural network (RNN), especially with a bidirectional gated recurrent unit (biGRU) structure, is a good choice to deal with nonlinear damage and inter-symbol interference (ISI) in optical communication systems because of its excellent performance in processing time [...] Read more.
An equalizer based on a recurrent neural network (RNN), especially with a bidirectional gated recurrent unit (biGRU) structure, is a good choice to deal with nonlinear damage and inter-symbol interference (ISI) in optical communication systems because of its excellent performance in processing time series information. However, its recursive structure prevents the parallelization of the computation, resulting in a low equalization rate. In order to improve the speed without compromising the equalization performance, we propose a minimalist 1D convolutional neural network (CNN) equalizer, which is reconverted from a biGRU with knowledge distillation (KD). In this work, we applied KD to regression problems and explain how KD helps students learn from teachers in solving regression problems. In addition, we compared the biGRU, 1D-CNN after KD and 1D-CNN without KD in terms of Q-factor and equalization velocity. The experimental data showed that the Q-factor of the 1D-CNN increased by 1 dB after KD learning from the biGRU, and KD increased the RoP sensitivity of the 1D-CNN by 0.89 dB with the HD-FEC threshold of 1 × 10−3. At the same time, compared with the biGRU, the proposed 1D-CNN equalizer reduced the computational time consumption by 97% and the number of trainable parameters by 99.3%, with only a 0.5 dB Q-factor penalty. The results demonstrate that the proposed minimalist 1D-CNN equalizer holds significant promise for future practical deployments in optical wireless communication systems. Full article
(This article belongs to the Special Issue Novel Technology in Optical Communications)
Show Figures

Figure 1

Back to TopTop