1. Introduction
The continuous rise in global energy demand, coupled with resource depletion and climate change, has posed significant challenges for humanity. By 2050, the share of renewable energy in the total primary energy supply is anticipated to rise from 14% in 2015 to 63% [
1,
2]. Currently, distributed resources like wind and solar power are spreading worldwide and playing a vital role in energy systems. As technology progresses, the integration of solar photovoltaics (PV) into smart grids is becoming increasingly common. Accurate and reliable PV forecasting can provide significant benefits to contemporary smart grid management. However, the PV system is influenced by the environment, weather, and solar radiation, making its output power unstable and fluctuating [
3]. Hence, minimizing the randomness and intermittency of solar power generation is crucial to enhancing the accuracy of PV power prediction.
Recently, researchers have conducted extensive studies on PV power generation forecasting, employing a wide range of methods to build models, e.g., physical methods, statistical and probability methods, artificial methods, and mixed methods. However, physical methods are susceptible to environmental changes, leading to reduced prediction accuracy [
4]. Moreover, the complex and multidimensional nonlinear relationship between input and output data makes it challenging for existing statistical and probabilistic methods to accurately capture and model this complexity, resulting in inaccurate fitting results [
5,
6].
Artificial Intelligence (AI) methods have become integral to the current photovoltaic power prediction framework. Their robust nonlinear mapping and feature extraction capabilities, coupled with rapid advancements in computer and data mining technologies, have made them indispensable [
7]. Furthermore, the rapid development of computer technology has resulted in the extensive use of single neural network prediction models in photovoltaic power forecasting. Lee and Kim [
8] developed three PV prediction models based on neural networks, i.e., Artificial Neural Networks (ANNs), Deep Neural Networks (DNNs), and Long Short-Term Memory (LSTM) networks, and validated their performance. The results show that all three models surpass non-AI methods in accuracy and better accommodate the nonlinear characteristics of photovoltaic power generation. Sarmas et al. [
9] studied the enhancement of short-term PV power prediction by integrating meta-learning and LSTM models independent of numerical weather prediction. Nonetheless, the traditional LSTM model only captures forward temporal information, neglecting backward information [
10]. To solve this problem, Graver and Schmidhuber [
11] proposed the Bidirectional Long Short-Term Memory Network (BiLSTM), which captures time series with higher predictive accuracy than LSTM. Ma et al. [
12] proposed an integrated framework that uses a Gated Recurrent Unit (GRU) combined with an improved sine and cosine algorithm to optimize prediction accuracy. Despite the high prediction accuracy achieved by recurrent neural network (RNN)-based models and their variants, their training processes are often time-consuming and memory-intensive, and they frequently encounter issues with gradient explosion or vanishing. Bai et al. [
13] introduced the Temporal Convolutional Network (TCN) to address lengthy input sequences in their entirety, allowing for faster data processing and robust parallel computing capabilities. Notably, by utilizing residual blocks and extended causal convolution, the TCN maintains strong predictive performance for long time series, surpassing LSTM in certain cases [
14]. Wang et al. [
15] proposed an efficient contract TCN model, which used an improved Deep Residual Shrinkage Network (DRSN) to replace the original TCN residuals to improve the accuracy of PV power prediction. Zhang et al. [
16] proposed the Attention-based Multivariate Temporal Convolutional Network (AMTCN), integrating dilated convolutions with multi-head attention to enhance the accuracy and robustness of electricity consumption forecasting.
Due to the inherent randomness, volatility, and instability of photovoltaic electricity, a single forecast model frequently fails to meet engineering practice standards [
17,
18]. In order to solve this problem, Convolutional Neural Networks (CNNs) combined with deep learning techniques can provide significant advantages, especially in processing time-series data and extracting features. Li et al. [
19] proposed a hybrid PV prediction framework based on a TCN, designed to improve utility-scale PV forecasting several hours ahead. Qu et al. [
20] proposed a CNN-LSTM neural network combined with an attention mechanism for day-by-hour photovoltaic power prediction. By embedding a variety of prediction models of relevant and target variables, the method employs an attention mechanism to improve the ability of the model to capture important features, thereby improving the prediction accuracy. Lai et al. [
21] introduced a Long- and Short-Term Net (LSTNet) model to integrate CNN, LSTM, and AR models, enabling the effective analysis and forecasting of PV data and power. Zhen et al. [
22] developed a hybrid model that merges BiLSTM with a genetic algorithm to improve forecasting accuracy. Abdel-Basset et al. [
23] introduced a novel data-driven PV-Net for short-term PV power forecasting, which redesigns the gates of the GRU with convolutional layers. Yu et al. [
24] proposed a short-term PV power prediction method that combines double-layer decomposition, the whale optimization algorithm, and a Bidirectional Long Short-Term Memory network. Empirical studies on weather datasets showed that their proposed method could significantly improve the accuracy and reliability of the prediction. Limouni et al. [
25] developed a hybrid PV prediction model that combines weather factors and a TCN-LSTM network, demonstrating superior performance to that of the TCN or LSTM alone. Pu et al. [
26] introduced an interactive behavior-learning method based on the TCN-GRU model, which exhibited excellent coupling ability in scenarios with incomplete information. Fu et al. [
27] proposed a hybrid framework combining improved Variational Mode Decomposition (VMD), a TCN, a GRU, and multi-head attention (MA), optimized by the Sparrow Search Algorithm, to enhance the accuracy and robustness of photovoltaic power forecasting under nonlinear and fluctuating conditions.
While TCNs have broad applications in time-series analysis, prior research has primarily utilized one-way TCNs, which only extract forward information and neglect the impact of backward information on predictions [
28]. Notably, redundant secondary feature information can interfere with the TCN’s feature extraction, affecting final predictions. Furthermore, the receptive field size in the TCN depends on the network parameters, including the number of layers and convolution kernel size, which directly influences feature extraction performance, prediction accuracy, and memory usage. Traditional convolution operations, especially when using large steps, reduce the time dimension and lead to the loss of time information. To address the limitations and unidirectivity of TCNs, this paper proposes a combination model based on a BiTCN and BiGRU, which improves the residual block of the BiTCN model with a temporal bottleneck structure. Additionally, this model also introduces a multi-head attention mechanism and a linear AR model to improve the model’s linear extraction capabilities and prediction accuracy. The temporal bottleneck structure enhances feature extraction by reducing redundancy and focusing on critical temporal information, while the bidirectional networks, including the BiTCN and BiGRU, comprehensively capture both past and future dependencies in the sequence. These innovations are designed to mitigate the shortcomings of traditional methods, enabling more accurate and robust PV power forecasting. This paper makes the following specific contributions:
(1) To enhance the information extraction capability of a single network, a new hybrid model, BiTCN–Multi-Head Attention–BiGRU, called the Temporal Bottleneck-enhanced Bidirectional Temporal Convolutional Network with Multi-Head Attention and Autoregressive (TB-BTCGA) model, is developed. The BiTCN extracts the implicit relationships between PV power and meteorological factors, such as solar irradiance and temperature, which are then fed into the BiGRU for prediction. The multi-head attention mechanism amplifies the influence of crucial information on BiGRU output, thereby improving prediction accuracy.
(2) Considering the various meteorological inputs for photovoltaic power generation prediction, the multi-head attention mechanism enables each attention unit to calculate its weight in parallel for each time step. After aggregation, it selects the most relevant time step for prediction.
(3) The residual blocks in the BiTCN have been enhanced to address the issue of gradient disappearance or attenuation when the number of network layers increases significantly, thereby improving feature extraction capabilities. Firstly, the residual block of the TCN is enhanced using a temporal bottleneck structure to improve the model’s time feature extraction ability. Secondly, the DRSN can be used to further refine the residual blocks in the TCN, enhancing the model’s adaptability and interpretability for complex time-series data and reducing redundant features.
(4) Additionally, the TB-BTCGA model effectively captures linear features by incorporating the AR model. Weighting and combining the outputs of other models further enhance the model’s ability to extract linear features and significantly improve its overall prediction accuracy.
Section 2 introduces the primary methods used in the TB-BTCGA model, including the core modules and their functionalities.
Section 3 focuses on the improvements made to the TCN residual block, detailing the structural enhancements and their impact on the model’s performance.
Section 4 discusses the data sources, data analysis, experimental settings, and implementation details. Finally,
Section 5 concludes the paper, summarizing the key findings and outlining directions for future research.
3. Improved TCN Residual Block
To address the issue in the TCN where convolution operations, especially with larger strides, reduce the temporal dimension and cause information loss, a temporal bottleneck residual structure was introduced into the TCN’s residual framework. The temporal bottleneck residual structure, proposed by Choi et al. [
30], incorporates a transposed convolution that enriches temporal information. It specifically includes two components,
and
, along with skip connections. The formula is as follows:
where
is a standard convolution used to reduce the temporal dimension, while
is a transposed convolution function used to restore the temporal dimension.
represents the skip connection, expressed as follows:
where
is a 1D convolution,
is a transpose convolution, stride is the stride size, and
is the size of the convolution kernel.
and
are batch-normalized learnable scaling parameters used to adjust the amplitude of the output.
and
are learnable offset parameters after batch normalization.
is the small-lot mean.
is the small-lot variance.
is a very small constant for numerical stability, preventing division by zero.
,
, and
are the convolution kernel weights.
By incorporating the concept of the temporal bottleneck residual structure, the original residuals in the TCN are improved, as shown in part A of
Figure 6. Only the residual structure of the unidirectional TCN is shown here. The bidirectional structure is similar to that in
Figure 4, with the original structure replaced by the improved structure. In the original TCN residual structure, a transposed convolution is added to create the temporal bottleneck structure. This approach enhances the model’s ability to capture temporal features, ensuring that key temporal information is not lost during feature extraction. By maintaining the integrity of the temporal dimension, the statistical pooling layer can more effectively aggregate temporal information, thereby improving the quality and accuracy of the aggregated features. This improvement significantly enhances the accuracy and stability of photovoltaic power generation predictions.
In addition to the above improvements, this paper also introduces the soft thresholding mechanism from the DRSN, as shown in parts B and C of
Figure 6.
The DRSN is an enhanced algorithm for residual networks that integrates attention mechanisms and soft thresholding methods to support autonomous filter learning. Compared with the traditional wavelet threshold, it is more efficient and accurate, which can avoid the inconvenience and blindness of artificial threshold settings and achieve the purpose of reducing the influence of secondary redundancy features on the network [
31]. The DRSN enhances the original residual network by incorporating a soft thresholding mechanism and an attention mechanism. The soft thresholding mechanism is used for signal noise reduction by setting a threshold value. Features with absolute values beneath this threshold are assigned a value of zero, while other features are shrunk toward 0. The output of the soft thresholding mechanism and its derivatives are as follows:
where
x is the input value,
is the output after the soft threshold, and
is the threshold. This approach efficiently lessens the model’s load in handling redundant information while enhancing its emphasis on significant features. Consequently, it boosts overall prediction accuracy and enhances the model’s robustness. Through this method, the DRSN not only optimizes the feature extraction process but also enhances the adaptability and interpretation ability of the model for complex time-series data. Additionally, the ReLu activation function was replaced with the Gaussian Error Linear Unit (GeLu) activation function, resulting in less information loss and a smoother model, thereby improving the model’s generalization capability.
4. Case Study
4.1. Analyze Dataset
The dataset used in this study consists of one year of data from a power station in Hebei, China, with records from three monitoring stations: Station 0, Station 1, and Station 2, representing the central, northern, and southern regions of Hebei Province, respectively. Each station’s dataset includes power generation, forecasted total irradiance, forecasted direct irradiance, forecasted temperature, humidity, wind speed, and local meteorological data, as detailed in
Table 1. The forecast data were provided by Numerical Weather Prediction (NWP), while the local meteorological data were derived from Local Meteorological Data (LMD). The data cover the period from August 2018 to September 2019, with records collected every 15 min. Taking Station 0 as an example,
Table 2 presents a statistical analysis of the mean, maximum, and minimum values for each data component. The dataset is divided into training, validation, and test sets in a 7:1.5:1.5 ratio.
The three stations differ significantly in terms of their weather patterns. Station 0 (central Hebei) experiences a moderate, temperate climate with relatively balanced seasonal variations, making it suitable for assessing standard operational conditions for PV generation. Station 1 (northern Hebei) faces harsher, colder winters and more extreme temperature variations, often influenced by the Siberian cold air, which can impact the efficiency of PV systems during winter. Station 2 (southern Hebei) enjoys a milder climate with higher humidity and more frequent rainfall in summer, resulting in a different set of challenges, such as higher humidity levels and seasonal storms, which can affect PV performance. These variations in climate across the three stations allow for a comprehensive evaluation of the model’s generalization ability under different meteorological conditions.
4.2. Feature Analysis Method
To minimize the computational complexity of the prediction model, the Spearman Correlation Coefficient (SCC) and the Pearson Correlation Coefficient (PCC) were employed to assess the relationships among input variables [
32,
33]. The PCC (
22) and SCC (
23) are calculated as follows:
where
has the range [−1,1], and the larger the absolute value, the stronger the correlation between
Q and
K.
represents Spearman’s rank correlation coefficient, and
represents the rank difference in the corresponding variable, that is, the position (rank) difference in the paired variables after the two variables are sorted.
n indicates the number of observed objects.
In their study, Gherboudj and Ghedira [
34] demonstrated that effective feature selection for PV power generation models relies on meteorological factors such as irradiance, temperature, and humidity, which significantly influence the solar energy potential and performance. Building on this, the correlation coefficient between meteorological variables and power generation at Station 0 was calculated using the above method, as shown in
Table 3.
Figure 7 visualizes this information in the form of a heat map. The analysis shows that the most relevant meteorological factor for photovoltaic power generation is irradiance, with both forecast and ground-observed data showing a strong correlation between solar irradiance and PV power output. Temperature and humidity exhibit a moderate correlation with photovoltaic power generation, as does wind speed. However, while the wind speed in the forecast data shows a weak correlation, the ground-observed wind speed demonstrates a higher correlation, indicating a moderate relationship with PV power generation. Therefore, the final model also includes ground-observed wind speed and meteorological factors such as irradiance, temperature, and humidity—each showing strong to moderate correlations.
4.3. Performance Evaluation Metrics
Equations (
24)–(
26) define the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and the coefficient of determination (
), which are used to evaluate the prediction model’s accuracy in this experiment [
35]. The smaller the MAE and RMSE values, the more accurate the prediction result. Additionally, an
value closer to 1 indicates higher prediction accuracy.
In this context, stands for the actual PV output power, signifies the predicted PV output power, and N denotes the number of test samples.
4.4. Parameter Setting
The experimental code described in this paper is implemented using the PyTorch 2.1.1 framework. Moreover, selecting hyperparameters is essential for effectively training a model. The chosen hyperparameters in this study are shown in
Table 4. The model’s configuration includes an input dimension of 10 and an output dimension of 1. Additionally, the training process is configured to end after 200 epochs, with a dropout rate set at 0.05. Since PV power generation mainly relies on solar irradiance, the power generation at night is zero. Therefore, a total of 13 h of data, from 6 AM to 7 PM, is selected, and a total of 52 data points are collected. Therefore, the time step is set to 52. The optimizer is Adam.
4.5. Results
This paper evaluates the performance of the proposed model through a single-step prediction test, where the goal is to predict a data point 15 min ahead.
Table 5 summarizes the evaluation metrics for six models in the single-step forecasting test set. The proposed TB-BTCGA model is compared with the TCN, TCN-LSTM, BiTCN, BiTCN-BiGRU, and BiTCN-BiGRU-MA models based on MAE, RMSE, and
. The results in
Table 5 show that the proposed model achieves an MAE of 0.193, RMSE of 0.325, and
of 0.946 for single-step prediction.
By comparing the standalone TCN and BiTCN models, it is evident that the bidirectional TCN outperforms the unidirectional TCN in terms of prediction accuracy. This demonstrates that incorporating a bidirectional structure effectively enhances the model’s predictive capability. Specifically, the bidirectional TCN captures more time-series features by utilizing both past and future contexts, which helps avoid the loss of important information that a unidirectional model might miss.
In the comparison of hybrid models, both TCN-LSTM and BiTCN-BiGRU show significant improvements in . This suggests that combining the TCN with the LSTM network or BiGRU allows the model to effectively integrate local and global time-series features, thereby improving prediction accuracy. In terms of evaluation metrics, the proposed model outperforms the BiTCN-BiGRU-MA model, reducing MAE by 3.2%, decreasing RMSE by 5.1%, and increasing by 1.3%. These results demonstrate that the proposed method improves accuracy compared to the unimproved model, particularly by increasing while effectively reducing the errors.
To further verify the effectiveness of the proposed hybrid photovoltaic power generation prediction model, the six aforementioned models are applied to forecast the data of the region containing the power station in the test set, covering the period from 6:00 to 19:00 on both clear and cloudy days. The outcomes of the experiment are shown in
Figure 8 and
Figure 9. Specifically,
Figure 8 presents the forecasting outcomes for clear days, while
Figure 9 illustrates the prediction outcomes for overcast conditions. In the single-step prediction, the model introduced in this study has a greater advantage in fitting the true values compared to other models and has a better power prediction trend when there are large fluctuations in power generation.
In summary, the TB-BTCGA model proposed in this paper, which integrates the improved BiTCN, BiGRU, multi-head attention mechanism, and AR model, demonstrates superior accuracy in single-step prediction on the Station 0 dataset compared to other models. However, the model’s generalization ability and stability in real-world applications, particularly under conditions of poor data quality or significant data fluctuations, still require further validation. Therefore, the model’s predictive accuracy was further evaluated using the Station 1 and Station 2 datasets.
Figure 10 presents the 3-day PV power prediction results for Station 1 under cold winter conditions in the northern region. Due to the region’s climate characteristics, the PV power generation data remain relatively stable. Consequently, the predicted values in the figure exhibit smooth variations with relatively small errors, highlighting the model’s strong adaptability to stable environmental conditions. The curves in the figure illustrate the differences between actual PV power generation and the predicted values, showcasing the model’s ability to forecast such stable seasonal fluctuations. Despite the limited sunlight during winter, the model effectively predicts the power generation trend and accurately captures short-term variations. This demonstrates that the proposed model is well suited for stable environments, exhibiting both strong adaptability and high prediction accuracy for power generation patterns under typical climatic conditions.
Figure 11 presents the 3-day PV power prediction results for Station 2 under summer rainy conditions. Compared to Station 1, the data from Station 2 are more affected by weather fluctuations, particularly during the rainy season, which increases the volatility of photovoltaic power generation. The instability in light intensity and duration during rainy weather leads to significant short-term fluctuations in power generation, making forecasting more challenging. The prediction curves in the figure illustrate the model’s ability to handle such unstable weather conditions. Despite the increased uncertainty, the model still effectively predicts the short-term power generation trend. Although the prediction error is slightly higher compared to Station 1, the model still delivers relatively stable results. In particular, during frequent rainy conditions, the model successfully captures the variations in features and adjusts for errors, enabling it to track power generation changes accurately and maintain high prediction precision even amid short-term fluctuations.
In addition to single-step prediction, experiments were also conducted on multi-step forecasting, where power generation for 3, 6, and 12 time steps (corresponding to 45 min, 1.5 h, and 3 h into the future) is predicted. The results of these experiments are shown in
Table 6. While the proposed model outperforms all other models in every multi-step prediction, its decreasing performance with increasing time steps warrants further investigation.
As expected, the prediction accuracy tends to degrade as the forecast horizon lengthens, a common challenge in time-series forecasting. The temporal bottleneck structure partially mitigates error propagation in multi-step predictions by emphasizing critical features, resulting in a controlled increase in MAE and RMSE. The main reason for this decline is the accumulation of prediction errors over multiple time steps, which is inherent to sequential forecasting tasks. In particular, the model’s ability to maintain prediction accuracy decreases as the forecast horizon extends, making it more challenging to predict short-term fluctuations accurately. However, despite this decline, the proposed model still maintains a clear edge over other models, indicating that the enhancements—such as the bidirectional TCN, BiGRU, and attention mechanisms—significantly improve the model’s performance in multi-step forecasting. These improvements allow the model to handle the increasing complexity of multi-step predictions better than other methods, although further work is needed to address the challenges associated with long-term forecasting.
4.6. Analysis of Computational Complexity
Table 7 presents the computational complexity of the proposed and benchmark frameworks. The space complexity can be described by floating-point operations (FLOPs). As shown, the computational cost increases progressively with the inclusion of advanced modules such as the BiGRU, multi-head attention, the DRSN, and the temporal bottleneck structure in the TB-BTCGA model. These additions enhance the model’s ability to capture complex temporal dependencies and refine feature extraction, leading to the highest FLOPs (6.88 G) and training time (65.96 s). While the increased complexity demands more computational resources, it significantly improves the model’s predictive accuracy and robustness, justifying the trade-off for better forecasting performance.
4.7. Discussion
The TB-BTCGA model significantly improves PV power prediction, particularly in short-term forecasting. By incorporating the BiTCN, BiGRU, multi-head attention mechanism, and AR components, the model can better capture temporal dependencies in PV power generation data, leading to more accurate forecasts.
However, challenges in long-term forecasting remain, as error accumulation across time steps continues to affect predictive accuracy. The temporal bottleneck structure mitigates this issue by emphasizing critical features across time steps, reducing redundancy and focusing on key patterns. Meanwhile, bidirectional networks address the limitation of unidirectional models by capturing both historical and future dependencies. Despite these advancements, further improvements are needed, such as exploring ensemble learning or Transformer-based long-term memory structures, to better handle multi-step forecasting challenges.
A major strength of the TB-BTCGA model lies in its adaptability to varying environmental conditions. The temporal bottleneck ensures efficient feature extraction, which benefits both stable climates, such as cold winters (Station 1), and volatile climates, like summer rainy conditions (Station 2). Additionally, the bidirectional networks allow the model to dynamically adjust to diverse climatic scenarios, improving robustness and accuracy.
Nevertheless, the model’s performance under extreme weather conditions, such as sudden storms or rapid weather changes, remains untested. Future research should focus on these scenarios to ensure the model’s reliability in real-world applications, where extreme conditions are increasingly common due to climate change.
The TB-BTCGA model demonstrates significant potential for real-world applications, particularly in smart grids and PV power management systems. By minimizing redundant computations, the temporal bottleneck structure enhances computational efficiency, enabling the real-time processing of data from multiple PV systems. This feature is critical for real-time energy management, as it allows grid operators to make informed decisions under fluctuating energy production conditions. The bidirectional networks further improve decision-making by capturing both historical trends and future dependencies, ensuring accurate and dynamic predictions even in rapidly changing grid environments.
For practical implementation, the model must seamlessly integrate with existing grid management systems, accommodating inputs such as grid load, energy storage, and demand fluctuations. For example, accurate short-term predictions can optimize energy storage operations by ensuring that storage systems charge during periods of surplus energy and discharge during peak demand. Furthermore, during extreme weather events, the model’s ability to process real-time data from Internet of Things (IoT) devices, such as weather sensors and grid monitors, enables it to provide timely and reliable forecasts. This adaptability is crucial for maintaining grid stability and supporting intelligent energy management.
In summary, while the TB-BTCGA model achieves strong short-term forecasting performance and demonstrates adaptability to diverse conditions, several areas require further exploration. These include the following:
(1) Improving multi-step forecasting: Future work should focus on reducing error accumulation through advanced methods like long-term memory networks or error correction mechanisms.
(2) Handling data uncertainty: Techniques such as data imputation and anomaly detection can improve the model’s robustness when faced with noisy or incomplete data.
(3) Cross-regional validation: Testing on diverse datasets will help evaluate the model’s generalizability to different regions and environmental conditions.
(4) Enhancing computational efficiency: Optimizing the model for large-scale deployment without compromising predictive accuracy will be critical for its real-time applications.
5. Conclusions
This paper proposes a novel TB-BTCGA model for PV power prediction that integrates an improved BiTCN, a multi-focus mechanism, and a BiGRU. To enhance the model’s feature extraction capacity and mitigate the interference of redundant features, the DRSN is incorporated to improve the residual module of the BiTCN. Additionally, a temporal bottleneck structure is introduced to optimize the original TCN residual block. Notably, the inclusion of multiple attention mechanisms facilitates more effective extraction of critical features. Furthermore, the BiGRU network captures long-term dependencies within the time series, while AR models contribute to refining the model’s ability to extract linear features.
Experiments conducted on various datasets demonstrate that the proposed TB-BTCGA model outperforms several state-of-the-art models in terms of PV power prediction accuracy. Specifically, compared to the single TCN model, the proposed model significantly reduces the MAE and RMSE by 12.1% and 16.1%, respectively, while the improves by 6.2%. These results highlight the model’s robust performance across different environmental conditions and its ability to generalize effectively, suggesting its potential for practical applications in diverse scenarios.
However, the model’s performance in forecasting during extreme weather events, such as rapid temperature changes or cloud cover, still needs improvement. These delays in prediction accuracy could impact the usability of forecast results in real-time applications. Future work should focus on addressing these issues, possibly through enhanced weather condition modeling or real-time data integration. While this study demonstrates the model’s effectiveness in PV power prediction under typical environmental conditions, its generalizability across a wider range of climates and geographical areas requires further investigation. Case studies in other regions with varying weather patterns should be conducted to better understand the model’s scalability and robustness. Additionally, future research should explore the model’s performance under different operational scenarios, such as varying levels of solar panel efficiency or grid management requirements. By addressing these areas, the TB-BTCGA model could provide an even more effective solution for photovoltaic power management in smart grids.