Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (1,683)

Search Parameters:
Keywords = gated recurrent units

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
23 pages, 6046 KB  
Article
Research on In-Cylinder Pressure Monitoring Method of Diesel Engine Based on LSTM
by Yi Zhang, Liangyu Li, Yanzhe Liu, Shiliang Yao and Run Zou
Appl. Sci. 2025, 15(22), 11979; https://doi.org/10.3390/app152211979 - 11 Nov 2025
Abstract
The variation in in-cylinder pressure of diesel engine directly determines its working performance; therefore, the real-time monitoring technology for in-cylinder pressure of diesel engine is of great significance for monitoring the operation status of diesel engine. However, restricted by factors such as technology [...] Read more.
The variation in in-cylinder pressure of diesel engine directly determines its working performance; therefore, the real-time monitoring technology for in-cylinder pressure of diesel engine is of great significance for monitoring the operation status of diesel engine. However, restricted by factors such as technology and cost, it is often impossible to monitor the in-cylinder pressure data of diesel engine during actual operation. To solve the above problem, this paper proposes an in-cylinder pressure curve monitoring model for diesel engine based on GRU (Gated Recurrent Unit), using easily collectible data signals such as crankshaft torque, crankshaft angle, and displacement of each cylinder as the basis. The correctness and accuracy of the above monitoring model are trained and verified using data obtained from bench tests, and a comparison is made with the commonly used in-cylinder pressure monitoring method based on simulation model. The results show that the diesel engine in-cylinder pressure monitoring model proposed in this paper has advantages such as high monitoring accuracy and fast calculation speed. Full article
(This article belongs to the Section Applied Physics General)
Show Figures

Figure 1

15 pages, 1998 KB  
Article
A Hybrid GRU-MHSAM-ResNet Model for Short-Term Power Load Forecasting
by Xin Yang, Fan Zhou, Ran Xu, Yiwen Jiang and Hejun Yang
Processes 2025, 13(11), 3646; https://doi.org/10.3390/pr13113646 - 11 Nov 2025
Abstract
Reliable load forecasting is crucial for ensuring optimal dispatch, grid security, and cost efficiency. To address limitations in prediction accuracy and generalization, this paper proposes a hybrid model, GRU-MHSAM-ResNet, which integrates a gated recurrent unit (GRU), multi-head self-attention (MHSAM), and a residual network [...] Read more.
Reliable load forecasting is crucial for ensuring optimal dispatch, grid security, and cost efficiency. To address limitations in prediction accuracy and generalization, this paper proposes a hybrid model, GRU-MHSAM-ResNet, which integrates a gated recurrent unit (GRU), multi-head self-attention (MHSAM), and a residual network (ResNet)block. Firstly, GRU is employed as a deep temporal encoder to extract features from historical load data, offering a simpler structure than long short-term memory (LSTM). Then, the MHSAM is used to generate adaptive representations by weighting input features, thereby strengthening the key features. Finally, the features are processed by fully connected layers, while a ResNet block is added to mitigate gradient vanishing and explosion, thus improving prediction accuracy. The experimental results on actual load datasets from systems in China, Australia, and Malaysia demonstrate that the proposed GRU-MHSAM-ResNet model exhibits superior predictive accuracy to compared models, including the CBR model and the LSTM-ResNet model. On the three datasets, the proposed model achieved a mean absolute percentage error (MAPE) of 1.65% (China), 5.52% (Australia), and 1.57% (Malaysia), representing a significant improvement over the other models. Furthermore, in five repeated experiments on the Malaysian dataset, it exhibited lower error fluctuation and greater result stability compared to the benchmark LSTM-ResNet model. Therefore, the proposed model provides a new forecasting method for power system dispatch, exhibiting high accuracy and generalization ability. Full article
Show Figures

Figure 1

21 pages, 2326 KB  
Article
Highway Accident Hotspot Identification Based on the Fusion of Remote Sensing Imagery and Traffic Flow Information
by Jun Jing, Wentong Guo, Congcong Bai and Sheng Jin
Big Data Cogn. Comput. 2025, 9(11), 283; https://doi.org/10.3390/bdcc9110283 - 10 Nov 2025
Abstract
Traffic safety is a critical issue in highway operation management, where accurate identification of accident hotspots enables proactive risk prevention and facility optimization. Traditional methods relying on historical statistics often fail to capture macro-level environmental patterns and micro-level dynamic variations. To address this [...] Read more.
Traffic safety is a critical issue in highway operation management, where accurate identification of accident hotspots enables proactive risk prevention and facility optimization. Traditional methods relying on historical statistics often fail to capture macro-level environmental patterns and micro-level dynamic variations. To address this challenge, we propose a Dual-Branch Feature Adaptive Gated Fusion Network (DFAGF-Net) that integrates satellite remote sensing imagery with traffic flow time-series data. The framework consists of three components: the Global Contextual Aggregation Network (GCA-Net) for capturing macro spatial layouts from remote sensing imagery, a Sequential Gated Recurrent Unit Attention Network (Seq-GRUAttNet) for modeling dynamic traffic flow with temporal attention, and a Hybrid Feature Adaptive Module (HFA-Module) for adaptive cross-modal feature fusion. Experimental results demonstrate that the DFAGF-Net achieves superior performance in accident hotspot recognition. Specifically, GCA-Net achieves an accuracy of 84.59% on satellite imagery, while Seq-GRUAttNet achieves an accuracy of 82.51% on traffic flow data. With the incorporation of the HFA-Module, the overall performance is further improved, reaching an accuracy of 90.21% and an F1-score of 0.92, which is significantly better than traditional concatenation or additive fusion methods. Ablation studies confirm the effectiveness of each component, while comparisons with state-of-the-art models demonstrate superior classification accuracy and generalization. Furthermore, model interpretability analysis reveals that curved highway alignments, roadside greenery, and varying traffic conditions across time are major contributors to accident hotspot formation. By accurately locating high-risk segments, DFAGF-Net provides valuable decision support for proactive traffic safety management and targeted infrastructure optimization. Full article
(This article belongs to the Special Issue Application of Artificial Intelligence in Traffic Management)
Show Figures

Figure 1

14 pages, 1384 KB  
Article
Training Recurrent Neural Networks for BrdU Detection with Oxford Nanopore Sequencing: Guidance and Lessons Learned
by Haibo Liu, William Flavahan and Lihua Julie Zhu
Genes 2025, 16(11), 1356; https://doi.org/10.3390/genes16111356 - 10 Nov 2025
Abstract
Background/Objectives: BrdU (5′-bromo-2′-deoxyuridine), a synthetic thymidine (T) analog, is widely used to study cell proliferation and DNA synthesis. To precisely identify where and when DNA replication starts and terminates, it is essential to determine the BrdU incorporation rate and sites at a [...] Read more.
Background/Objectives: BrdU (5′-bromo-2′-deoxyuridine), a synthetic thymidine (T) analog, is widely used to study cell proliferation and DNA synthesis. To precisely identify where and when DNA replication starts and terminates, it is essential to determine the BrdU incorporation rate and sites at a single-nucleotide resolution. Although several deep learning-based methods have been developed for detecting BrdU using Oxford nanopore sequencing data, there is a lack of accessible, easy-to-follow tutorials to guide researchers in preparing training data and implementing deep learning approaches as the nanopore sequencing technologies continue to evolve. Methods: Due to the lack of ground truth BrdU-positive data generated on the latest R10 flow cells, we prepared model training data from legacy R9 flow cells, consistent with existing tools. We processed publicly available synthetic and real nanopore DNA sequencing datasets, with and without BrdU incorporation, using a combination of open-source and custom software tools. Subsequently, we trained bidirectional gated recurrent unit (BiGRU)-based recurrent neural networks (RNNs) for BrdU detection using the TensorFlow library on the Google Colab platform. Results: We trained BiGRU-based RNNs for BrdU detection with a high specificity (>94%) but a moderate sensitivity due to limited BrdU-positive data. We detail the setup, training, testing, and fine-tuning of the model using both synthetic and real DNA sequencing data. Conclusions: Though the models were trained with data generated on legacy flow cells, we believe that this detailed protocol, covering both data preparation and model development, can be readily extended to R10 flow cells and basecallers for other base modifications. This work will facilitate the broader adoption of deep learning neural networks in biological research, particularly RNNs, which are well suited for modeling sequential and time-series data. Full article
(This article belongs to the Section Bioinformatics)
Show Figures

Figure 1

29 pages, 5218 KB  
Article
Hybrid Deep Learning Framework for Forecasting Ground-Level Ozone in a North Texas Urban Region
by Jithin Kanayankottupoyil, Abdul Azeem Mohammed and Kuruvilla John
Appl. Sci. 2025, 15(22), 11923; https://doi.org/10.3390/app152211923 - 10 Nov 2025
Viewed by 68
Abstract
Ground-level ozone is a critical secondary air pollutant and greenhouse gas, especially in urban oil and gas regions, where it poses severe public health and environmental risks. Urban areas in North Texas have experienced persistently elevated ozone levels over the past two decades [...] Read more.
Ground-level ozone is a critical secondary air pollutant and greenhouse gas, especially in urban oil and gas regions, where it poses severe public health and environmental risks. Urban areas in North Texas have experienced persistently elevated ozone levels over the past two decades despite emission control efforts, highlighting the need for advanced forecasting tools. This study presents a hybrid recurrent neural network (RNN) model that combines Gated Recurrent Unit (GRU) and Long Short-Term Memory (LSTM) architectures to predict 8 h average ground-level ozone concentrations over a full annual cycle. The model leverages one-hour lagged ozone precursor pollutants (VOC and NOx) and seven meteorological variables, using a novel framework designed to capture complex temporal dependencies and spatiotemporal variability in environmental data. Trained and validated on multi-year datasets from two distinctly different urban air quality monitoring sites, the model achieved high predictive accuracy (R2 ≈ 0.97, IoA > 0.96), outperforming standalone LSTM and Random Forest models by 6–12%. Beyond statistical performance, the model incorporates Shapley Additive exPlanation (SHAP) analysis to provide mechanistic interpretability, revealing the dominant roles of relative humidity, temperature, solar radiation, and precursor concentrations in modulating ozone levels. These findings demonstrate the model’s effectiveness in learning the nonlinear dynamics of ozone formation, outperforming traditional statistical models, and offering a reliable tool for long-term ozone forecasting and regional air quality management. Full article
(This article belongs to the Special Issue Air Quality Monitoring, Analysis and Modeling)
Show Figures

Figure 1

26 pages, 6224 KB  
Article
GAT-BiGRU-TPA City Pair 4D Trajectory Prediction Model Based on Spatio-Temporal Graph Neural Network
by Haibo Cao, Yinfeng Li, Xueyu Mi and Qi Gao
Aerospace 2025, 12(11), 999; https://doi.org/10.3390/aerospace12110999 - 8 Nov 2025
Viewed by 221
Abstract
With the rapid expansion of the civil aviation industry, the surge in flight numbers has led to increasingly pronounced issues of air route congestion and flight conflicts. 4D trajectory prediction, by dynamically adjusting aircraft paths in real time, can prevent air route collisions, [...] Read more.
With the rapid expansion of the civil aviation industry, the surge in flight numbers has led to increasingly pronounced issues of air route congestion and flight conflicts. 4D trajectory prediction, by dynamically adjusting aircraft paths in real time, can prevent air route collisions, alleviate air traffic pressure, and ensure flight safety. Therefore, this paper proposes a combined model—GAT-BiGRU-TPA—based on the Spatio-Temporal Graph Neural Network (STGNN) framework to achieve refined 4D trajectory prediction. This model integrates Graph Attention Networks (GAT) to extract multidimensional spatial features, Bidirectional Gated Recurrent Units (BiGRU) to capture temporal dependencies, and incorporates a Temporal Pattern Attention (TPA) mechanism to emphasize learning critical temporal patterns. This enables the extraction of key information and the deep fusion of spatio-temporal features. Experiments were conducted using real trajectory data, employing a grid search to optimize the observation window size and label length. Results demonstrate that under optimal model parameters (observation window: 30, labels: 4), the proposed model achieves a 45.72% reduction in mean Root Mean Square Error (RMSE) and a 43.40% decrease in Mean Absolute Error (MAE) across longitude, latitude, and altitude compared to the optimal baseline BiLSTM model. Prediction accuracy significantly outperforms multiple mainstream benchmark models. In summary, the proposed GAT-BiGRU-TPA model demonstrates superior accuracy in 4D trajectory prediction, providing an effective approach for refined trajectory management in complex airspace environments. Full article
(This article belongs to the Section Air Traffic and Transportation)
Show Figures

Figure 1

22 pages, 5166 KB  
Article
A Study on Exploiting Temporal Patterns in Semester Records for Efficient Student Dropout Prediction
by Jungjo Na, Kwan Woo Kim and Hyeon Gyu Kim
Electronics 2025, 14(22), 4356; https://doi.org/10.3390/electronics14224356 - 7 Nov 2025
Viewed by 225
Abstract
Academic achievement data are essential in building a model to predict student dropout. When an attribute of the data has multiple values, each representing a student’s achievement earned over a semester, existing methods typically calculate a mean from those values and use it [...] Read more.
Academic achievement data are essential in building a model to predict student dropout. When an attribute of the data has multiple values, each representing a student’s achievement earned over a semester, existing methods typically calculate a mean from those values and use it to build learning data. Such a summary-based approach has been widely used because it can simplify learning processes, including feature extraction. However, model performance can be further improved if patterns in multiple semester values can be properly extracted and used for learning, instead of using summaries. Despite its potential, this problem has not been investigated in previous studies. In this paper, we demonstrate that recurrent neural networks (RNNs) can effectively be used to exploit the patterns in students’ academic records stored by semester. To identify patterns in the data and find solutions suitable for it, various neural network algorithms were compared. Attention was also adopted to improve model performance. Experiments conducted on real student records showed that the gate recurrent unit (GRU) model with multi-head attention achieved an F1 score of 0.9416, which was approximately 5% higher than the existing summary-based approaches. This demonstrates that the semester records exhibit temporal patterns and RNNs can effectively be used to exploit these patterns. Full article
Show Figures

Figure 1

28 pages, 6285 KB  
Article
Prediction of Construction-Induced Ground Vibrations Using Field Measurements and Bidirectional Gated Recurrent Unit Neural Network
by Reza Rafiee-Dehkharghani, Kamran Esmaeili and Meysam Najari
Vibration 2025, 8(4), 70; https://doi.org/10.3390/vibration8040070 - 6 Nov 2025
Viewed by 171
Abstract
This paper proposes a sequential bidirectional gated recurrent unit (BGRU) model to predict construction-induced ground vibrations. The ground vibration time histories for twelve real construction projects in Toronto, Canada, are collected and used to develop the BGRU model. A single time-step method is [...] Read more.
This paper proposes a sequential bidirectional gated recurrent unit (BGRU) model to predict construction-induced ground vibrations. The ground vibration time histories for twelve real construction projects in Toronto, Canada, are collected and used to develop the BGRU model. A single time-step method is used to predict the vibrations, and the time window is swept continuously over the whole training data. In addition to the BGRU method, and for comparison, two other methods, autoregressive integrated moving average (ARIMA) and random forest (RF), are used to predict the ground vibrations. The results show that the BGRU method performs much better than ARIMA and RF methods in forecasting construction-induced ground vibrations. The BGRU method captures the construction-induced and background vibrations very well, and this method remains accurate when the training data includes both background and construction vibrations. Therefore, this method can be used to predict ground vibrations in real projects where there is always a potential for missing some parts of the ground vibration data due to the malfunction of the vibration recording units. Full article
Show Figures

Figure 1

15 pages, 4045 KB  
Article
A Low-Complexity Receiver-Side Lookup Table Equalization Method for High-Speed Short-Reach IM/DD Transmission Systems
by Junde Lu, Yu Sun, Jun Qin, Changhao Han, Jie Shi, Lanling Chen, Jianyu Shi, Jiaxin Zheng, Shuo Jiang, Chi Zhang, Yang Yang, Yueqin Li, Jian Sun and Guo-Wei Lu
Photonics 2025, 12(11), 1091; https://doi.org/10.3390/photonics12111091 - 6 Nov 2025
Viewed by 209
Abstract
In this paper, we demonstrate a receiver-side lookup table (Rx-side LUT) equalization method for high-speed short-reach intensity modulation and direct detection (IM/DD) transmission systems, which alleviates the computational complexity of neural network-based equalization algorithms. Compared to conventional pre-equalization techniques applied at the transmitter [...] Read more.
In this paper, we demonstrate a receiver-side lookup table (Rx-side LUT) equalization method for high-speed short-reach intensity modulation and direct detection (IM/DD) transmission systems, which alleviates the computational complexity of neural network-based equalization algorithms. Compared to conventional pre-equalization techniques applied at the transmitter side, which utilize distortion correction values stored in LUTs derived from the transmitted symbols and their corresponding recovered counterparts, the Rx-side LUT relies solely on receiver-side data. The received data to be equalized serves as the index of the LUT, with a nearest-neighbor algorithm employed to find the element closest to the index and then return the corresponding equalization result from the table. With a lightweight lookup process, the proposed method releases the computation complexity of neural network-based equalization algorithms by replacing the computationally intensive operations performed during the inference phase. Experimental results indicate that compared to baseline fully connected neural network (FCNN) and gated recurrent unit (GRU) equalization, the Rx-side LUT could decrease the algorithm execution time by 25.5% and 34.6% for 100 GBaud and 22.8% and 36.9% for 112 GBaud PAM4 signals, respectively, while maintaining comparable system performance. The proposed scheme provides a low-complexity solution for high-speed, low-cost IM/DD optical interconnects. Full article
Show Figures

Figure 1

22 pages, 3487 KB  
Article
Research and Optimization of Ultra-Short-Term Photovoltaic Power Prediction Model Based on Symmetric Parallel TCN-TST-BiGRU Architecture
by Tengjie Wang, Zian Gong, Zhiyuan Wang, Yuxi Liu, Yahong Ma, Feng Wang and Jing Li
Symmetry 2025, 17(11), 1855; https://doi.org/10.3390/sym17111855 - 3 Nov 2025
Viewed by 244
Abstract
(1) Background: Ultra-short-term photovoltaic (PV) power prediction is crucial for optimizing grid scheduling and enhancing energy utilization efficiency. Existing prediction methods face challenges of missing data, noise interference, and insufficient accuracy. (2) Methods: This study proposes a single-step hybrid neural network model integrating [...] Read more.
(1) Background: Ultra-short-term photovoltaic (PV) power prediction is crucial for optimizing grid scheduling and enhancing energy utilization efficiency. Existing prediction methods face challenges of missing data, noise interference, and insufficient accuracy. (2) Methods: This study proposes a single-step hybrid neural network model integrating Temporal Convolutional Network (TCN), Temporal Shift Transformer (TST), and Bidirectional Gated Recurrent Unit (BiGRU) to achieve high-precision 15-minute-ahead PV power prediction, with a design aligned with symmetry principles. Data preprocessing uses Variational Mode Decomposition (VMD) and random forest interpolation to suppress noise and repair missing values. A symmetric parallel dual-branch feature extraction module is built: TCN-TST extracts local dynamics and long-term dependencies, while BiGRU captures global features. This symmetric structure matches the intra-day periodic symmetry of PV power (e.g., symmetric irradiance patterns around noon) and avoids bias from single-branch models. Tensor concatenation and an adaptive attention mechanism realize feature fusion and dynamic weighted output. (3) Results: Experiments on real data from a Xinjiang PV power station, with hyperparameter optimization (BiGRU units, activation function, TCN kernels, TST parameters), show that the model outperforms comparative models in MAE and R2—e.g., the MAE is 26.53% and 18.41% lower than that of TCN and Transforme. (4) Conclusions: The proposed method achieves a balance between accuracy and computational efficiency. It provides references for PV station operation, system scheduling, and grid stability. Full article
(This article belongs to the Section Engineering and Materials)
Show Figures

Figure 1

25 pages, 6753 KB  
Article
Short-Term Power Load Forecasting Under Multiple Weather Scenarios Based on Dual-Channel Feature Extraction (DCFE)
by Xiaojun Pu and Mingrui Zhang
Appl. Sci. 2025, 15(21), 11733; https://doi.org/10.3390/app152111733 - 3 Nov 2025
Viewed by 275
Abstract
Grid security and system dispatch can be compromised by pronounced volatility in power load under extreme meteorological conditions. However, the dynamic and nonlinear interactions between power load and meteorological variables across diverse weather scenarios are not well captured by existing methods, resulting in [...] Read more.
Grid security and system dispatch can be compromised by pronounced volatility in power load under extreme meteorological conditions. However, the dynamic and nonlinear interactions between power load and meteorological variables across diverse weather scenarios are not well captured by existing methods, resulting in limited accuracy and robustness. To address this gap, a short-term power load forecasting model with a dual-channel architecture is proposed. Features are extracted in parallel via dual-channel feature extraction (DCFE): the first channel employs an improved Cascaded Multiscale 2D Convolutional Network (CMCNN) to model local fluctuations and global periodicity in the load time series. The second channel derives scenario-aware variable weights using the Maximal Information Coefficient (MIC); meteorological variables are then gated and weighted before being processed by a multi-layer self-attention network to learn global dependencies. Subsequently, dynamic feature-level fusion is achieved through cross-attention, strengthening key interactions between power load and meteorological factors. The fused representation is fed into an Attention-Enhanced Bidirectional Gated Recurrent Unit (AE-BiGRU) to precisely model temporal dependencies across multiple weather scenarios. Experiments on five years of power load and meteorological data from a region in Australia indicate that the proposed method outperforms the best baseline across multiple weather conditions: RMSE, MAE, MAPE, and sMAPE decrease on average by 32.44%, 31.42%, 30.73%, and 31.05%, respectively, while R2 increases by 0.034 on average, demonstrating strong adaptability and robustness. Full article
Show Figures

Figure 1

15 pages, 2638 KB  
Article
Research on Energy Storage Configuration Optimization Method for Wind Farm Substations Based on Wind Power Fluctuation Prediction Integrating Chaotic Features and Bidirectional Gated Recurrent Units
by Fei Wang, Zikai Fan, Yifei Fan, Jiayi Ren, Yan Li, Leiming Suo and Jinrui Tang
Algorithms 2025, 18(11), 698; https://doi.org/10.3390/a18110698 - 3 Nov 2025
Viewed by 205
Abstract
To address wind power fluctuations causing curtailment and high costs, this study proposes an integrated method combining wind power forecasting with substation optimization. An enhanced Bidirectional Gated Recurrent Unit (BiGRU) model is developed by incorporating chaotic features (maximum Lyapunov exponent) and sliding-window statistical [...] Read more.
To address wind power fluctuations causing curtailment and high costs, this study proposes an integrated method combining wind power forecasting with substation optimization. An enhanced Bidirectional Gated Recurrent Unit (BiGRU) model is developed by incorporating chaotic features (maximum Lyapunov exponent) and sliding-window statistical features (mean, standard deviation), significantly improving short-term prediction accuracy. Based on these high-precision forecasts, a dynamic transformer switching optimization model is established to maximize the wind farm’s net profit. This model finely balances power generation revenue, wind curtailment penalties, and transformer losses (no-load and load) at a 15 min timescale. Experimental results from a wind farm in Xinjiang demonstrate that the proposed method effectively enhances the economic efficiency of wind farm operations. The study provides a valuable framework for optimizing energy storage configuration and improving profitability by leveraging accurate forecasting. Full article
Show Figures

Figure 1

21 pages, 1053 KB  
Article
Machine Learning Systems Tuned by Bayesian Optimization to Forecast Electricity Demand and Production
by Zhen Wang, Salim Lahmiri and Stelios Bekiros
Algorithms 2025, 18(11), 695; https://doi.org/10.3390/a18110695 - 3 Nov 2025
Viewed by 354
Abstract
Given the critical importance of accurate energy demand and production forecasting in managing power grids and integrating renewable energy sources, this study explores the application of advanced machine learning techniques to forecast electricity load and wind generation data in Austria, Germany, and the [...] Read more.
Given the critical importance of accurate energy demand and production forecasting in managing power grids and integrating renewable energy sources, this study explores the application of advanced machine learning techniques to forecast electricity load and wind generation data in Austria, Germany, and the Netherlands at different sampling frequencies: 15 min and 60 min. Specifically, we assess the performance of the convolutional neural networks (CNNs), temporal CNN (TCNN), Long Short-Term Memory (LSTM), bidirectional LSTM (BiLSTM), Gated Recurrent Unit (GRU), bidirectional GRU (BiGRU), and the deep neural network (DNN). In addition, the standard machine learning models, namely the k-nearest neighbors (kNN) algorithm and decision trees (DTs), are adopted as baseline predictive models. Bayesian optimization is applied for hyperparameter tuning across multiple models. In total, 54 experimental tasks were performed. For the electricity load at 15 min intervals, the DT shows exceptional performance, while for the electricity load at 60 min intervals, DNN performs the best, in general. For wind generation at 15 min intervals, DT is the best performer, while for wind generation at 60 min intervals, both DT and TCNN provide good results, in general. The insights derived from this study not only advance the field of energy forecasting but also offer practical implications for energy policymakers and stakeholders in optimizing grid performance and renewable energy integration. Full article
Show Figures

Figure 1

22 pages, 1208 KB  
Article
Geo-MRC: Dynamic Boundary Inference in Machine Reading Comprehension for Nested Geographic Named Entity Recognition
by Yuting Zhang, Jingzhong Li, Pengpeng Li, Tao Liu, Ping Du and Xuan Hao
ISPRS Int. J. Geo-Inf. 2025, 14(11), 431; https://doi.org/10.3390/ijgi14110431 - 2 Nov 2025
Viewed by 375
Abstract
Geographic Named Entity Recognition (Geo-NER) is a crucial task for extracting geography-related entities from unstructured text, and it plays an essential role in geographic information extraction and spatial semantic understanding. Traditional approaches typically treat Geo-NER as a sequence labeling problem, where each token [...] Read more.
Geographic Named Entity Recognition (Geo-NER) is a crucial task for extracting geography-related entities from unstructured text, and it plays an essential role in geographic information extraction and spatial semantic understanding. Traditional approaches typically treat Geo-NER as a sequence labeling problem, where each token is assigned a single label. However, this formulation struggles to handle nested entities effectively. To overcome this limitation, we propose Geo-MRC, an improved model based on a Machine Reading Comprehension (MRC) framework that reformulates Geo-NER as a question-answering task. The model identifies entities by predicting their start positions, end positions, and lengths, enabling precise detection of overlapping and nested entities. Specifically, it constructs a unified input sequence by concatenating a type-specific question (e.g., “What are the location names in the text?”) with the context. This sequence is encoded using BERT, followed by feature extraction and fusion through Gated Recurrent Units (GRU) and multi-scale 1D convolutions, which improve the model’s sensitivity to both multi-level semantics and local contextual information. Finally, a feed-forward neural network (FFN) predicts whether each token corresponds to the start or end of an entity and estimates the span length, allowing for dynamic inference of entity boundaries. Experimental results on multiple public datasets demonstrate that Geo-MRC consistently outperforms strong baselines, with particularly significant gains on datasets containing nested entities. Full article
Show Figures

Figure 1

26 pages, 7289 KB  
Article
Spatiotemporal Modeling of Surface Water–Groundwater Interactions via Multi-Task Transformer-Based Learning
by Hao Jing, Yong Tian and Chunmiao Zheng
Hydrology 2025, 12(11), 291; https://doi.org/10.3390/hydrology12110291 - 2 Nov 2025
Viewed by 506
Abstract
A spatiotemporal, multi-task learning (MTL) model for simulating surface water–groundwater (SW-GW) dynamics is developed and applied to the Heihe River Basin, Northwest China. The Transformer-based model (MT-TFT) jointly forecasts surface runoff and groundwater levels, outperforming MTL models built on gated recurrent unit (GRU) [...] Read more.
A spatiotemporal, multi-task learning (MTL) model for simulating surface water–groundwater (SW-GW) dynamics is developed and applied to the Heihe River Basin, Northwest China. The Transformer-based model (MT-TFT) jointly forecasts surface runoff and groundwater levels, outperforming MTL models built on gated recurrent unit (GRU) and long short-term memory (LSTM) architectures. Compared with single-task learning, adding a coupled groundwater-level task markedly improves surface runoff prediction, achieving a Nash–Sutcliffe efficiency (NSE) of 0.73 and a coefficient of determination (R2) of 0.75. Attention-based interpretability shows that the model assigns the highest weights to time steps with elevated precipitation; as lead time shortens, attention further concentrates on these periods, improving the accuracy of near-term, multi-step forecasts. These results highlight the value of inductive transfer across hydrologic targets and demonstrate that MT-TFT provides an effective, interpretable framework for SW–GW coupling. Full article
(This article belongs to the Topic Advances in Groundwater Science and Engineering)
Show Figures

Figure 1

Back to TopTop