1. Introduction
In a modern power system, utilities and transmission systems operators face large-scale uncertainty due to the high penetration and intermittent nature of renewable energy systems, such as solar and wind energy resources, and their integration into the electricity grid. The available generation capacity from conventional energy resources, such as coal-fired, oil-fired, gas-fired, nuclear power plants, and renewable resources, should be more significant than the peak load and the required reserve margin [
1]. An optimization model is needed for economic dispatch (ED) and optimal power flow (OPF) problems to reduce fuel costs and obtain the most economical solution for dispatching generators. The objective function of the ED problem is to minimize total generation costs. To obtain a better ED solution, consider an accurate power network. This network model should represent a set of power flow equations mathematically. Applying load flow equations as a whole network model while considering ED problem constraints will enable an OPF model to be found [
1,
2].
Recently, utility companies and systems operators have collected more valuable electricity data from distributed renewable energy (RE) resources, generators, and loads. Historical power system operation data give insights into the electricity grid’s current conditions and can be used to make modern power systems more stable, efficient, reliable, and secure.
Generally, the OPF problem can be classified into the direct current optimal power flow (DCOPF) and alternating current optimal power flow (ACOPF) subtypes. Indeed, the security constraint optimal power flow (SCOPF) problem is an expansion of the standard ACOPF/DCOPF problem, where binary variables are included to incorporate the switching of transmission lines, which enables more efficient and secure solutions [
3]. Due to the ACOPF problem being computationally inefficient, using the Jacobian gradient for Newton–Raphson or Hessian matrices for the interior point method requires significant time to solve the problem. The system’s bus lines’ flows, voltages, and angles, which can represent grid state variables, are unknown in the OPF problem. Researchers have applied convex relaxation techniques to linearize and simplify the non-linear ACOPF problem to the DCOPF problem [
4,
5,
6,
7]. However, the solution to the DCOPF problem made a few assumptions to converge faster to the solution; for example, it was assumed that, for each bus, the voltage magnitude is one per unit, and the phase angle difference between buses is trivial. Therefore, the solution to the DCOPF problem did not capture all the current state conditions of the grid. On the other hand, the ACOPF is a more accurate representation of power system operations, although it is demanding and complex. Recently, numerous data-driven OPF solution techniques have been applied to solve this optimization problem more efficiently and effectively. Some techniques have been used to search machine learning (ML) and deep learning (DL) in OPF problems, namely the direct mapping of OPF variables, predicting active or chance constraints, using physics-informed neural network (PINN) models, and predicting warm start points.
The direct mapping technique aims to match the system’s state variables to its control variables without repeatedly solving optimization problems [
8,
9,
10,
11]. Reference [
10] utilized a neural network (NN) to map system loads to optimal outputs without actually solving the OPF to enable efficient solutions to be found by avoiding complex approximations, distributed techniques, or computationally intensive platforms, thus ensuring the ability to find feasible solutions through a rapid iterative process. The authors of [
12] explored a two-step approach to training a deep neural network (DNN) for security-constrained DCOPF, initially focusing on load and generation mapping, followed by phase angle reconstruction, effectively streamlining the process and reducing the DNN’s size and training requirements. The authors of [
11] utilized a DNN approach based on a preventive framework, calibrating generation and transmission line limits in DNN training to anticipate approximation errors and ensure the feasibility of the predicted solutions. Generally, in the direct mapping method, the input parameters of the OPF problem are generated from simulations or collected from historical data. Then, these input parameters are introduced to the OPF problem, and the solution is captured at each sample point. Lastly, the model is trained on data using input and output pairs, learning the mapping from input parameters to the optimal OPF solution. Then, a new batch of data feeds into the previously trained model. This approach also allows for sequential learning, where the model can be updated as new batches of data become available, enhancing its accuracy and adaptability over time. This model approximates OPF solutions in real-time operation conditions; no further optimization is necessary for this model. The direct mapping method provides more rapid solutions than the traditional iterative solution-based OPF approach, particularly for large-scale power systems.
Predicting active and chance sets, rather than determining scenario-based heavy computation deterministic conditions for the ACOPF problem, utilizes approximate bounds on the joint chance constraints [
13,
14,
15,
16,
17,
18,
19]. These studies have explored how to handle uncertainty for chance-constrained OPF problems. Reference [
18] forecasted errors as random variables in an SCOPF problem and used chance constraints to limit technical violations, thereby requiring reformulation. Another study [
19] proposed a generalized framework for chance-constrained OPF problems under Gaussian uncertainties, emphasizing the effectiveness of affine feedback policies across various distribution types. The authors of reference [
15] proposed joint chance constraints for the OPF problem, addressing multiple constraints simultaneously with a specified probability and creating a less conservative set of single chance constraints for more reliable solutions. In reference [
13], the authors presented a streaming process design to learn active sets from training samples, including input parameters and optimal solutions across diverse problem types and structures, unrestricted by the probability distribution of the inputs. When given the system’s current or future conditions, predicting active and chance sets relies on determining which constraints are binding in the OPF problem. Predicting active constraints helps to evaluate active remaining constraints throughout the OPF process, significantly supporting the reduction in the solution time. The OPF model solver depends on previously predicted active constraints. On the other hand, chance constraints determine and account for uncertainties while also satisfying particular probability conditions. Classification models are suitable for predicting the probability of active constraints for the system’s state. The process of predicting active and chance constraints is similar to the direct mapping technique, except that it includes the active constraints. In this model, the accuracy of the predicted active and chance constraints is crucial because the OPF solvers rely on predicted constraints to speed up the process.
Another data-driven OPF approach is the PINN, which combines the NN model and incorporates physical constraints for the system to satisfy those constraints. PINN ensures that predicted power flow solutions are consistent with network topology and power system operation limits by introducing additional loss functions in the NN [
20,
21,
22,
23]. This survey [
23] provided a comprehensive overview of PINN in the power systems domain, summarizing various PINN paradigms such as PI loss functions, PI initialization, PI architectural design, and hybrid physics–deep learning models. The authors in this study [
24] introduced a framework for PINN in power systems and optimized training parameters using physical laws. Their approach simplified the network’s structures and improved the accuracy of PINN, specifically in rapidly determining dynamic states and uncertain parameters such as inertia and damping for single-machine infinite bus systems. This study [
20] presented an approach using the PINN scheme. It utilized stacked extreme learning machine (SELM) frameworks. Their approach simplified the OPF model by decomposing it into three stages to reduce complexity and bias. The method included a classification strategy for better feature extraction and is adaptable to various systems due to SELM minimal hyperparameter tuning. The approach in [
21] integrated the AC power flow equations in NN during the training phase, thus reducing worst-case constraint violations and maintaining prediction optimality. The PINN is particularly important when the RE outputs are variable and difficult to predict. Therefore, PINN can ensure that predicted power flow solutions are feasible and within operational conditions. PINN accuracy tends to be higher than the previously introduced data-driven OPF models. However, implementing PINN requires developing the set of physical constraints into NN, a challenging task. While there have been successful implementations of PINNs in certain network topologies [
20], generalizing this approach across a wide range of topologies remains a significant challenge. This complexity arises during the training phase, where researchers might encounter issues due to limited, accurate datasets representing system states and conditions.
Predicting warm start points for ACOPF, instead of solving the OPF from scratch, the NN predicts the optimal decision for continuous and binary variables. Power flow solvers utilize those predicted variables later [
15,
25,
26,
27]. The author [
25] demonstrated an NN model to emulate ACOPF solvers without enforcing specific training set restrictions. The model was trained using data from previous ACOPF runs. The model operated iteratively, making small steps toward the optimum instead of directly predicting the ACOPF solution. This work [
28] introduced an ML framework called Smart-PGSim that accelerated power system simulations using multitask-learning neural networks to predict initial values. Their work enhanced problem-solving efficiency and maintained solution accuracy. In this study, ref. [
29] proposed a method for rapidly learning warm start points using a multiple-target binary decision tree with post-pruning to speed up solving processes. Due to the nonlinear and non-convex problem, the ACOPF problem can result in falling into local optimal points throughout the process. A good starting point (warm start) dramatically influences the quality of the result, reducing convergence speed and avoiding local minima optima points. Therefore, the warm start method enhances and reduces computational complexity and efficiently gives near-real optimal solutions for the OPF problem.
While solving data-driven OPF models, each related methodology offers a unique strength. The best approach depends on specific application and system challenges. At the same time, the methods mentioned earlier are generally primary for solving the data-driven OPF approach. Several other methodologies, which include machine learning (ML) and DL, are notably applied to execute OPF problems. It is essential to recognize that advanced DL algorithms, known for their higher accuracy and lower error rates, enhance the capability to tackle complex problems in various fields. This survey [
30] explored ML and DL methodologies in energy systems. There are several methods applied for ML and DL, such as DNN [
12], convolution neural network (CNN) [
27,
31,
32], reinforcement learning (RF) [
33], Gaussian process (GP) [
34], graph neural network (GNN) [
31,
35,
36], Lagrangian-based approaches [
37], compact learning and principal component analysis (PCA) [
38], meta-learning [
39], and the learning-aided OPF approach [
40]. A hybrid or combined model reunites physics-based and statistical methods or two or more individual methods [
41,
42,
43]. The combined model utilizes the individual models’ strengths and produces more accurate results for the data-driven problem. The combined approach leverages an effective relationship of various features and structures of the data-driven models and optimization framework, which is not commonly utilized by individual methods [
37]. This integration increases the depth and scope of information extraction and efficient exploration while adding robustness against the potential risks of relying on a single methodology. To this end, the significant contributions of this study are as follows:
A two-stage combined convolution and deep feed-forward neural network (CNN-DFFNN) DL technique is proposed for the data-driven ACOPF problem. This methodology enhances prediction accuracy compared to individual CNN, deep feed-forward neural network (DFFNN), hybrid CNN-DFFNN, and transfer learning (TL)-based methods. The approach leverages the spatial feature extraction capabilities of CNN for analyzing grid topologies and subsequently utilizes the regression capabilities of DFFNN.
The proposed multiple-input, multiple-output (MIMO) methodology aims to utilize the traditional OPF solver mechanism. The grid parameters take inputs for loads and produce all the generator’s set points and voltages/angles of each bus in a single attempt. The applied approach is designed for a comprehensive computation in one execution, streamlining the process of optimization outputs by providing a complete set of control and state variables.
Integration of high-penetration-level wind power (WP) that is non-parametric in nature into the IEEE 39 bus system was utilized from the physical WP model. This approach was chosen to ensure that the variability and intermittency inherent in wind energy were accurately captured, reflecting a realistic simulation environment.
The organization of the rest of this paper is as follows. The OPF formulation is presented in
Section 2. Data and procedures are covered in
Section 3. The DL methodology for the ACOPF problem is demonstrated in
Section 4. Results and discussion are presented in
Section 5. The final section,
Section 6, provides information about the conclusion.
5. Results and Discussion
This section summarizes the findings of six studied deep learning methods for the ACOPF problem. The initial two models in our analysis are traditional deep learning methods DFFNN and CNN. These models serve as a foundation of performance baseline for hybrid models. Following the models, we introduce two hybrid models that combine the CNN and DFFNN architectures, with each configuration as depicted in
Figure 5 and
Figure 6.
The first hybrid model, labeled as CNN-DFFNN (a), represents the proposed combined CNN-DFFNN architecture for generators (a), as depicted in
Figure 5. The second hybrid model, labeled as CNN-DFFNN (b), represents the hybrid CNN-DFFNN architecture for generators (b), shown in
Figure 6. These hybrid models demonstrate variations in performance evaluation due to their distinct architectural approaches.
The model structures for the TL approaches, TL (a) and TL (b), are built upon the architectures previously detailed in
Figure 5 and
Figure 6 of the paper. The critical distinction in the TL models lies in their foundation on the pre-trained CNN models. This approach demonstrates the effectiveness of using pre-trained models without extensive fine-tuning.
The study concentrated on MIMO DL techniques to predict the generator’s active and reactive power and the voltage and phase angle for each bus result. The proposed model utilizes all 39 buses’ load active and reactive powers, voltage, and phase angles as an input and predicts all generators’ active and reactive power as an output. This section is divided into three case studies for each DL model accuracy and efficient, and results are presented in
Table 5,
Table 6,
Table 7,
Table 8,
Table 9 and
Table 10.
5.1. MIMO Forecasting for Bus Voltages and Angles
The first case study focuses on utilizing the active and reactive power loads as input to predict the voltages and angles for each bus.
X represents the input load feature vector in the given dataset starting from bus 1 to bus
n. In this network model,
n is equal to 39. The letter
L represents load, and
n represents the bus numbers.
Y demonstrates each bus’s associated voltages and angles. The primary objective is to accurately match the input feature vectors to the respective output vectors. This mapping ensures that voltage and angles are satisfied while adhering to the system constraints. The goal is to minimize the loss function, between the predicted (, ) and the actual values (, ).
To investigate the efficacy of combined DL approaches, two primary performance metrics, root mean squared error (RMSE) and mean absolute error (MAE), are considered for the study. These performance evaluation measures are defined by Equations (
14) and (
15).
and indicate the observed and forecasted generators’ active and reactive power results, respectively. The symbol n represents individual simulation samples and M denotes the total number of simulation datasets.
Predictions were made for all bus voltages and phase angles in a single attempt.
Figure 7,
Figure 8 and
Figure 9 represent the DFFNN, the CNN, and the proposed model visualization results in terms of voltage magnitude and phase angle results, respectively.
The worst-performing model was the DFFNN, as shown in
Table 5, and it was not able to capture the voltage magnitudes because the predicted voltage magnitudes reached as high as 1.4 p.u and dropped as low as 0.77 p.u. Significant predicted voltage deviations could lead to model inefficiency for the data-driven OPF problem.
On the other hand, the proposed method, as depicted in
Figure 9, demonstrated its capability to predict more accurate results regarding voltage magnitude within the range of 1.11 and 0.85 p.u and phase angles. Although the proposed model was not the best-performing model, the CNN model was the best-performing model.
Table 5 shows the results of RMSE and MAE errors while considering each bus voltage and angles as outputs when taking each load active and reactive power to serve as inputs.
Table 5.
Training and validation dataset results.
Table 5.
Training and validation dataset results.
DL Model | Train RMSE | Train MAE | Val. RMSE | Val. MAE |
---|
DFFNN | 0.230 | 0.135 | 0.614 | 0.356 |
CNN | 0.121 | 0.079 | 0.191 | 0.118 |
CNN-DFFNN (a) | 0.135 | 0.081 | 0.233 | 0.135 |
CNN-DFFNN (b) | 0.256 | 0.162 | 0.313 | 0.197 |
TL (a) | 0.319 | 0.172 | 0.389 | 0.207 |
TL (b) | 0.406 | 0.225 | 0.458 | 0.255 |
The results in
Table 5 demonstrate that the CNN was the best-performing model, although results were indistinguishable in terms of the lowest RMSE and MAE values on both training and validation datasets when comparing the proposed model. On the other hand, DFFNN was the least effective model.
When comparing the hybrid and the proposed models, the combined model outperformed the traditional one. Similarly, when considering the TL model with different configurations, the TL model with CNN-DFFNN and flatten configuration performed better than the CNN, flatten, and DFFNN models.
Table 6 provides a comparative analysis of various models, focusing on their size, training duration, and speed of prediction to assess their overall performance efficiency and the associated computational expenses.
Table 6.
DL model efficiency performance.
Table 6.
DL model efficiency performance.
DL Model | Trainable Param. | Non-Trainable Param. | Training Time | Prediction Time |
---|
DFFNN | 216,862 | 0 | 1 min 21 s | 135 ms |
CNN | 441,150 | 0 | 1 min 40 s | 171 ms |
CNN-DFFNN (a) | 624,942 | 0 | 2 min 37 s | 365 ms |
CNN-DFFNN (b) | 1,117,966 | 0 | 1 min 54 s | 213 ms |
TL (a) | 273,726 | 351,216 | 2 min 17 s | 337 ms |
TL (b) | 766,750 | 351,216 | 1 min 37 s | 230 ms |
The DFFNN model had the shortest training and prediction times, indicating high efficiency. The CNN model required more training time than the DFFNN and had a slightly longer prediction time. The CNN-DFFNN (a) took the longest to train and had the longest prediction time, suggesting a trade-off for its complexity.
The CNN-DFFNN (b) and TL (b) models were more efficient in training and prediction times than their counterparts. However, they had a more significant number of parameters. The comparative additional results of the model can be found in the
Appendix A, as referenced in
Appendix A.1.
5.1.1. Section Key Findings
The DFFNN model was the least effective, especially in capturing voltage magnitudes.
The proposed method demonstrated better accuracy in predicting voltage magnitudes and phase angles.
The CNN model appeared to be the best performer.
The proposed model demonstrated superior performance over the traditional hybrid model.
The DFFNN model had the highest efficiency with the shortest training and prediction times.
The CNN-DFFNN (a) model had the longest training duration and prediction time.
5.2. MIMO Forecasting for Generators’ Active and Reactive Powers from Loads
In the second case study, the focus is on utilizing the active and reactive power loads as input parameters and predicting the generators’ active and reactive power outputs.
In this second case, the input feature vector remains consistent with that of case one. However, the distinction lies in the output feature vector for the generator’s optimal dispatch setting. The primary objective in this scenario is for the model to predict all generators’ output values accurately.
Figure 10,
Figure 11 and
Figure 12 demonstrate predicted and actual results for one of the generators, the poorest, the intermediate, and the proposed model results in the system.
The predicted MW and MVAR outputs of generators have a more significant margin than the actual generator outputs for the DFFNN model, and the CNN model visualization results are indistinguishable from the proposed model.
Table 7 denotes the performance metric errors for the training and validation datasets.
The results indicate that the proposed combination of CNN-DFFNN outperformed the validation dataset RMSE and MAE by 17.89 and 13.13, respectively, compared to other DL models. The results in
Table 7 represent that the CNN model remained the second best model, and the training model performance was superior to the other models. However, on the validation datasets, the proposed model outperformed the rest. The DFFNN model is still the poorest model for this case study.
Table 7.
Training and validation dataset results.
Table 7.
Training and validation dataset results.
DL Model | Train RMSE | Train MAE | Val. RMSE | Val. MAE |
---|
DFFNN | 18.122 | 12.629 | 43.417 | 31.027 |
CNN | 11.310 | 8.325 | 18.074 | 13.371 |
CNN-DFFNN (a) | 13.864 | 9.837 | 17.898 | 13.131 |
CNN-DFFNN (b) | 16.878 | 11.355 | 21.012 | 15.728 |
TL (a) | 20.937 | 14.747 | 23.315 | 16.516 |
TL (b) | 27.911 | 19.978 | 32.048 | 24.401 |
The hybrid model, integrating both CNN and DFFNN models, performed close to the proposed model and standalone CNN model. The TL model with CNN-DFFNN and flatten configuration continued to outperform the CNN, flatten, and DFFNN models.
Table 8 presents a comparison of different methods, detailing their model size, training time, and prediction speed to evaluate performance efficiency and computational cost.
Table 8.
DL model efficiency performance.
Table 8.
DL model efficiency performance.
DL Model | Trainable Param. | Non-Trainable Param. | Training Time | Prediction Time |
---|
DFFNN | 215,876 | 0 | 1 min 22 s | 175 ms |
CNN | 374,276 | 0 | 1 min 38 s | 367 ms |
CNN-DFFNN (a) | 558,068 | 0 | 2 min 38 s | 336 ms |
CNN-DFFNN (b) | 1,116,980 | 0 | 1 min 57 s | 219 ms |
TL (a) | 206,852 | 351,216 | 2 min 15 s | 353 ms |
TL (b) | 765,764 | 351,216 | 1 min 31 s | 235 ms |
The DFFNN model stands out for its training speed and swift prediction capability. On the other hand, the slowest model was the CNN in this case study. When comparing combined and hybrid models, CNN-DFFNN (a) and (b), the hybrid model offered a more efficient model than CNN-DFFNN (a). For the TL models, the TL (b) hybrid model configurations were more outstanding than the combined model in terms of model efficiency. The comparative additional results of the model can be found in the
Appendix A, as referenced in
Appendix A.2.
5.2.1. Section Key Findings
The proposed CNN-DFFNN (a) combination outperformed other DL models.
The CNN models were identified as the second-best performers.
The hybrid showed performance close to both the proposed and the standalone CNN.
The DFFNN still outperformed the rest in training speed and prediction capabilities.
The CNN model was the slowest in this case study.
The hybrid configuration was outstanding and more efficient than CNN-DFFNN (a).
5.3. MIMO Forecasting for Generators’ Active and Reactive Powers from Loads and Buses
In the final case study, the active and reactive powers of each bus load, along with the voltages and angles for each bus, serve as the input to predict the generators’ active and reactive power outputs.
X represents the input sample space vector containing both the load’s active and reactive powers and bus voltages and angles.
Y denotes the predicting generator’s optimal dispatch settings for active and reactive powers.
Figure 13,
Figure 14 and
Figure 15 illustrate the worst-performing model, the CNN, and the proposed model showcases both active and reactive power prediction and actual results of one of the generators in the system, respectively.
Table 9 demonstrates that the CNN-DFFNN outperformed others on both the training and validation datasets. The rest of the deep learning models remained consistent with the observed performance in case studies
Section 5.1 and
Section 5.2. It can be seen that both case study
Section 5.2 and case study
Section 5.3 aim to predict generator outputs with additional inputs, taking into account voltage magnitude and phase angle features for case
Section 5.3 when compared to case
Section 5.2. The target outputs remained the same, and all of the generator’s active and reactive power predictions were based on the proposed method. It can also observed that case
Section 5.3’s generator’s output accuracy increased and enabled more accurate predictions when including additional input features.
Table 10 provides a comparative analysis of various models’ overall performance efficiency and the associated computational cost.
Table 9.
Training and validation dataset results.
Table 9.
Training and validation dataset results.
DL Model | Train RMSE | Train MAE | Val. RMSE | Val. MAE |
---|
DFFNN | 15.107 | 10.089 | 39.290 | 24.648 |
CNN | 5.831 | 3.791 | 8.952 | 6.223 |
CNN-DFFNN (a) | 4.821 | 3.164 | 8.449 | 5.833 |
CNN-DFFNN (b) | 14.273 | 10.378 | 15.213 | 11.124 |
TL (a) | 14.781 | 10.331 | 16.481 | 11.395 |
TL (b) | 20.876 | 14.451 | 23.311 | 16.682 |
Table 10.
DL model efficiency performance.
Table 10.
DL model efficiency performance.
DL Model | Trainable Param. | Non-Trainable Param. | Training Time | Prediction Time |
---|
DFFNN | 255,812 | 0 | 1 min 21 s | 136 ms |
CNN | 399,236 | 0 | 1 min 48 s | 394 ms |
CNN-DFFNN (a) | 583,028 | 0 | 2 min 48 s | 350 ms |
CNN-DFFNN (b) | 1,755,956 | 0 | 2 min 1 s | 221 ms |
TL (a) | 231,812 | 351,216 | 2 min 17 s | 371 ms |
TL (b) | 1,404,740 | 351,216 | 1 min 40 s | 224 ms |
In case study
Section 5.3, model efficiency and computation costs remained consistent compared to the results from case study
Section 5.2. The comparative analysis of RMSE and MAE performance metrics for active and reactive power outputs is illustrated in
Figure 16,
Figure 17 and
Figure 18.
The proposed model achieves significantly lower errors, with RMSE and MAE for active power peaking at 11 MW and 8 MW, respectively. In contrast, the MAE remained under 5 MVAR for most generators for reactive power. On the other hand, the DFFNN model exhibited higher errors, with RMSE increasing to 140 MW for active power and 125 MVAR for reactive power and the corresponding MAE reaching 110 MW and 102 MVAR. The CNN model peaked an RMSE of around 15 MW and 12 MVAR for reactive powers. These results indicate that the proposed model enhanced predictive accuracy and robustness, significantly outperforming the DFFNN model and slightly outperforming the CNN model.
The error distribution for the comparison between the worst model, the second-best, and the best model is shown in
Figure 19,
Figure 20 and
Figure 21 for all ten generators’ active and reactive power outputs within the system. The DFFNN model in
Figure 19 operates within the error range of
, displaying a broader variability in predictive accuracy.
Figure 20 illustrates the error variation of the CNN model across all generators. The results of the CNN model are similar to the proposed model, yet the proposed model demonstrates superior performance. On the other hand, the proposed model in
Figure 21 significantly improves the prediction accuracy and operates in an error range of
. Despite the difference in error magnitudes, the median errors for the models are closely aligned with zero, indicating no significant systematic bias in either model. The median errors are highlighted in red, and circles demonstrate outliers in the boxplot figures.
The scatter plots in
Figure 22,
Figure 23 and
Figure 24 illustrate the predicted and actual values for active and reactive power from the DFFNN model, the CNN model, and the proposed model. In
Figure 22, the data points marked in orange, representing active power, are widely distributed from the line of perfect prediction along the blue line. The figure also shows that the model predictions for active power are less closely clustered around the actual values, which implies that the first model had lower accuracy. For reactive power in the figure, the points are closer to the line. However, there is still a noticeable spread from the blue line circles.
In the second and third plots for
Figure 23 and
Figure 24, the proposed model and the CNN model predictions for both active and reactive power are more closely aligned with the actual values. In addition, the range of predicted values for P and Q does not extend as far as in the first plot, demonstrating that the proposed model and the CNN are more consistent and reliable for prediction. Therefore, the third plot illustrates a superior model accuracy improvement between predicted and actual values, implying that the proposed model has a better fit and predictive performance.
Section Key Findings
The proposed model still outperformed on validation datasets.
When additional input features were included, the model accuracy increased for generator outputs, leading to more precise predictions.
Model efficiency and computation costs remained consistent when evaluating the case study results in
Section 5.2.1.
The proposed model enhanced predictive accuracy and robustness, significantly outperforming the DFFNN model and slightly outperforming the CNN model.
6. Conclusions
This study addresses the ACOPF problem using several data-driven OPF prediction approaches. The load data generation process relies on Monte Carlo simulations of load variations in all buses. Additionally, the study utilizes a Weibull distribution for wind speed, integrating the physical wind turbine model into six buses of the IEEE 39 test system to simulate corresponding wind power plants. The problem formulation utilizes MIMO learning workflow. We analyzed three different case studies to demonstrate the discussion and results.
The first case study relies on load active and reactive power to predict each bus’s voltages and phase angles. The second case study utilizes load active and reactive power to predict the generator’s active and reactive power. The last case study focused on load active and reactive power and each bus’s voltages and phase angles to be considered as input and predict the generator’s active and reactive power outputs.
Based on the results of this study, the studied combination of the CNN-DFFNN model outperformed the other individual CNN and DFFNN models, as well as hybrid and TL-based methods. The results showed that the CNN model outperformed the predicted bus voltage and phase angle case, and the remaining case studies demonstrated that the performance of the combination CNN-DFFNN model was more outstanding than other models on the validation datasets.
Additionally, the DFFNN model was the most efficient in terms of computational resources. However, the DFFNN model was the poorest model in the study regarding model accuracy. Despite the proposed model being the least efficient, it delivered more accurate and robust results, showcasing its effectiveness in prediction tasks. The CNN model found a middle ground, performing very closely to the proposed model regarding accuracy and robustness. At the same time, its computational efficiency was moderate, positioned between the DFFNN model and the proposed model. The results highlighted a trade-off between efficiency and predictive performance across the different models.
It is worth mentioning that while the study was conducted on and tackled the need for more accurate forecasts in DL-supported ACOPF, issues related to ensuring the feasibility of the resulting predictions, finding global optimality approaches, and varying topology for the end-to-end prediction remain for future study. Future studies will expand this research to larger electric network models and explore probabilistic and time-series DL approaches. We intend to expand our methodology by incorporating Bayesian ML and advanced data preprocessing approaches.
In addition, we plan to evaluate the capabilities of more sophisticated neural network architectures, such as Residual Networks (ResNet) and Visual Geometry Group Networks (VGG), for their potential to further enhance the accuracy and robustness of our models. In addition, further stages of the study will investigate and incorporate spatial and temporal DL methods while considering atmospheric weather conditions in specific locations in the Connecticut and New England regions.