1. Introduction
As environmental pollution and the energy crisis worsen, focus on renewable energy is growing. Governments worldwide are prioritizing and supporting dependable and financially sustainable electric power systems. The International Energy Agency’s (IEA) 2023 Electricity Market Report reveals a nearly 11% growth in global renewable energy capacity in 2022, with solar capacity surging almost 18%. However, despite the growth in solar energy’s contribution to renewable energy, its penetration into the mainstream energy market remains modest [
1]. This is primarily attributed to the inherent intermittency and instability caused by the fluctuating nature of solar radiation. Such volatility in solar photovoltaic (PV) systems significantly impacts the stability and reliability of the power supply, underscoring the need for enhanced predictability and security, while accurate forecasting of solar irradiance faces challenges in data acquisition and the influence of multiple variables plays a vital role in addressing these issues. Employing real-time electricity dispatch based on precise predictions of solar irradiance can enhance energy efficiency, stabilize the power supply, and reduce the costs of electricity [
2].
In recent years, numerous models for predicting solar radiation have emerged, including physical models [
3], statistical models [
4], machine learning models, deep learning models [
5], and hybrid models [
6,
7]. Physical models use solar radiation transfer equations and atmospheric principles to forecast solar radiation with meteorological data. For example, Angstrom [
8] estimated total solar radiation at specific locations by analyzing the relationship between global solar radiation and sunshine duration. Meanwhile, Whillier [
9] assumed constant atmospheric transmittance throughout the day and estimated hourly radiation values from daily radiation values through data analysis and theoretical derivation. However, these physical models encounter computational difficulties and have limited accuracy under diverse climatic conditions. In contrast, statistical models like ARIMA [
10], SARIMA [
11], VAR, and VARMA [
12] use historical solar radiation data for forecasting. They handle time series data effectively, capturing trends and periodicity, as evidenced in previous work by Belmahdi et al. [
13] and Shadab et al. [
14]. The accuracy of these statistical models heavily depends on dataset quality and parameter selection, often overlooking the influence of external factors on solar radiation.
With the growing volume of data and advancements in machine learning, researchers have increasingly used neural network models to predict solar radiation. These methods mainly include both traditional machine learning and deep learning approaches. Among conventional machine learning techniques, commonly used methods include the support vector machine (SVM) [
15,
16], decision tree (DT) [
17,
18], artificial neural network (ANN) [
19,
20], and random forest (RF) [
21,
22]. However, these individual machine learning models often struggle to capture the complex patterns and nonlinear relationships in solar radiation time series data. This leads to suboptimal information utilization, incorrect feature selection, and significant prediction biases. To overcome these issues, researchers have combined optimization algorithms with machine learning models [
23,
24] for sequence forecasting. For example, Natgunanathan et al. [
25] utilized RF algorithm and digital twin technology to predict and optimize power generation within the renewable energy microgrid project at the solar farm. Even though using optimization algorithms improves the feature selection capacity of individual machine learning models, they still face challenges with high-dimensional, large-scale, and complex-structured data. Furthermore, deep learning models have gained attention for their ability to automatically extract features, strong generalization capabilities, and expertise in handling large-scale data. For example, Kazem et al. [
26] predicted the current and power of the grid-connected photovoltaic system (GCPV) using a full recurrent neural network (FRNN) combined with principal component analysis (PCA). In a similar vein, models like recurrent neural networks (RNN) [
27,
28], long short-term memory (LSTM) networks [
29,
30], transformer [
31], and time convolutional networks (TCN) [
32] are utilized in solar radiation time series research. Shekar et al. [
33] conducted multi-step forecasting of solar radiation using the LSTM model. In a similar vein, Gao et al. [
34] introduced the use of gated recurrent unit (GRU) for solar irradiance prediction. LSTM and GRU models, with their gating mechanisms and memory units, improve conventional RNNs’ ability to manage long sequences and dependencies. However, this complexity adds to computational demands, slowing down processing with intricate data such as solar radiation and possibly hindering the capture of continuous long-term trends.
Hybrid Models typically combine two or more different types of models [
35,
36]. The hybrid models primarily combine CNN, RNN, LSTM, and the attention mechanism, leveraging their unique advantages to enhance prediction accuracy. For example, Kumari et al. [
37] demonstrated the effectiveness of an LSTM–CNN hybrid model for short-term global horizontal irradiance (GHI) prediction. Similarly, Zang et al. [
38] introduced a cascaded CNN–LSTM structure for spatiotemporal correlation, using CNN for spatial feature extraction and LSTM for temporal dependencies. The cascaded structure improved understanding of spatial relationships and solar radiation time series data. To address the intermittency and instability of solar radiation, Gao et al. [
39] utilized a combined CEEMDAN–CNN–LSTM model, achieving significant improvements in prediction performance, while the application of hybrid models is pivotal for solar radiation forecasting and shows promise, they also introduce complexity. Such complexity can result in slower processing speeds and challenges in capturing continuous long-term trends, especially with intricate data like solar irradiance.
Solar radiation’s dynamic interplay with meteorological variables results in unique traits. One notable characteristic is the overlapping of cycles caused by seasonal oscillations. This complexity imposes higher requirements on prediction models. Specifically, these models need to deal with intricate interrelations among multiple temporal intervals or spatial positions. Moreover, these models need to distribute attention across different features at various levels, thereby enhancing their capability to understand and identify complex relationships and patterns. The Attention Mechanism, which allows for the adaptive focus on different parts of a sequence, has been successfully integrated with deep learning models such as RNN and LSTM. This combination has shown the value of attention-based models in addressing these complexities, providing improved accuracy and efficiency. For example, Qin et al. [
40] developed an approach that blends the time attention mechanism with RNN to capture the long-term temporal dependencies in time series data. Similarly, Pan et al. [
41] integrated the attention mechanism with LSTM for the prediction of photovoltaic power generation. The integration of attention mechanisms augmented the prowess of the LSTM model for long-term sequence prediction. Aslam et al. [
42] proposed a method that combines attention with LSTM, bolstering the model’s performance by assigning disparate weights to the importance of features. Zhang et al. [
43] employed attention mechanisms with GRU for feature extraction, enabling a comparison of data before and after faults and capturing changes in different fault locations. The integration of attention mechanisms has helped to overcome some limitations of RNN and LSTM in feature modeling. However, the design of LSTM and GRU with gate structures introduces a new challenge. During sequence data processing, this model structure can lead to information loss. Consequently, critical information may be omitted, hindering the overall performance.
To address the aforementioned issues, this study propose a deep-learning-based solar radiation forecasting model, integrating three key components: a residual attention time convolution block (RACB), a dual-path information fusion module (DIFM), and a twin self-attention module (TSAM). Specifically, time series data typically contain multiple features, each with a potential impact on target value prediction. Although a correlation analysis is performed on the input data and some low-correlation features are removed, the existing feature selection method still falls short in accurately determining the significance of each feature. To tackle this issue, a feature attention is incorporated into RACB, enabling the network to adaptively discern the importance of each channel during the feature extraction process. By allocating different weights to different channels, the model effectively concentrates on the features most pertinent to the prediction target. Second, the DIFM comprises two parallel time series convolutional networks: the local feature extraction network (LFEN) and the dilated feature extraction network (DFEN). The LFEN is employed to extract local features from the time series, while the DFEN captures a wider context information by expanding the receptive field. By integrating features from both paths, the model is designed to boost its feature representation ability and strengthen its robustness against input data variations. Finally, seasonal fluctuations often lead to overlapping signals from different periods, complicating the model’s ability to extract effective feature representations. To effectively manage this situation, the model needs to learn long-term dependencies in the data. To address this issue, the TSAM is proposed, which encompasses channel self-attention and sequence self-attention. This module aims to globally model the features produced by the DIFM from both a sequence and a channel perspective, thereby enhancing the predictive capabilities of the model.
In summary, the contributions of this study encompass the following three aspects:
A novel solar radiation forecasting model is proposed that innovatively incorporates a residual attention time convolution block (RACB). This design enables the model to selectively enhance significant features for irradiance prediction while diminishing the importance of irrelevant ones;
A dual-path information fusion module (DIFM) is proposed, adeptly integrating both local features and broader contextual information. Through the consolidation of features from distinct scales, the module enhances the model’s representational capacity, thereby bolstering its robustness against variations in input data;
A twin self-attention module (TSAM) is designed. By modeling long-distance dependencies in channels and sequences dimensions, the predictive capability of the model is improved. Experimental results on several public datasets demonstrate the effectiveness of the methods proposed in this study.
The remainder of this study is structured as follows:
Section 2 describes the detailed structure of the model. In
Section 3, the performance of the proposed model is evaluated through experiments on several publicly available datasets. Finally, in
Section 4, a summary and discussion of the entire study is provided.
4. Conclusions and Discussion
This study aims to develop a multi-step sequence prediction model for solar irradiance forecasting. To achieve this, a deep learning-based dual-path information fusion and twin attention-driven solar irradiance forecasting model is proposed. This model comprises three components: the RACB, the DIFM, and the TSAM. The RACB is designed to enable the network to adaptively learn important features while suppressing irrelevant ones. Following that, the DIFM is introduced to reinforce the model’s robustness against input data variations and integrate multi-scale features. Finally, the TSAM is introduced to extract long-term temporal dependencies within the sequence, thereby enhancing multi-step prediction capabilities.
The experimental results indicate that the model proposed in this study can accurately predict irradiance data. In both one-step and multi-step predictions, it significantly outperforms other models. Furthermore, when compared to other models such as TCN, LSTM, LSTM-Attention, CNN-LSTM, and Transformer, the proposed model exhibits superior and more robust performance across datasets from four different regions. For instance, for one-step predictions, the model reports RMSE values of 2.192 /, 2.195 /, 2.508 /, and 2.238 / across the four datasets. When compared to algorithms such as TCN, LSTM, LSTM-Attention, CNN-LSTM, and Transformer, the proposed model achieves average RMSE reductions of 0.381 /, 0.212 /, 0.256 /, 1.052 /, and 1.855 /, underscoring its consistent performance across various datasets. For other multi-step predictions, the proposed model also surpasses other models, with the performance gap becoming more noticeable as prediction steps increase. Additionally, scatter plots and curve diagrams provide a more visual representation of the precision of the proposed method. Lastly, ablation studies were conducted to validate the effectiveness of the DFEN, the LFEN, the RACB, and TSAM. The results highlight that, when removing these components, there is a substantial decline in performance compared to the full model. For example, when LFEN is removed, RMSE increases from 2.19 / to 2.540 /.
The prediction methods employed in this study have been summarized, and several conclusions are drawn. Firstly, solar irradiance is influenced by a multitude of factors, and the RACB effectively extracts essential features from this complex interplay. Secondly, these influencing factors exhibit distinct characteristics across different time scales. The DIFM captures these features across multiple scales, enhancing the model’s robustness and enabling precise predictions. Lastly, for extended-range forecasting, the TSAM addresses long-term dependencies in the data sequence, facilitating more accurate forecasting of future irradiance data. In contrast, other models like TCN, LSTM, and RNN do not consider the impacts of diverse factors and the importance of features across various scales. The model proposed in this study provides valuable support for photovoltaic power generation systems, a pivotal step toward the development of intelligent grid systems.
While the model excels in forecasting solar irradiance over multiple steps, there is room for refining its precision. Its primary reliance on past irradiance data might pose challenges in anticipating abrupt irradiance shifts. For even greater accuracy, future studies might consider broadening data sources and optimizing the neural network’s design. Addressing the intricate dynamics of irradiance patterns could demand meticulous feature engineering and model tweaks. Given the notable weather variations across different regions and seasons, it seems prudent to design models tailored to specific locales and times of the year. Such bespoke modeling, factoring in the unique climatic nuances of each area and season, stands to boost predictive accuracy.