1. Introduction
Gas turbines offer advantages such as lower emissions by using mainly natural gas [
1], rapid start-up and acceleration [
2], high efficiency with higher combustion temperature [
3], and fuel flexibility with varied fuel options while maintaining performance [
4]. These benefits have led to widespread industrial applications. For instance, gas turbines have become one of the main power sources in various applications, including thrust for aircraft jet engines, as described by Pang et al. [
5], mechanism power for driving compressors or ship propulsion, and electricity generation through generators, as introduced by Tahan et al. [
6]. With increasing renewable energy integration, the fast start-stop and wide load characteristics of gas turbines make them essential for peak shaving in power systems [
7]. To ensure operational efficiency under diverse conditions, accurate predictive modeling of gas turbine power output is crucial [
8].
Significant attention has been devoted to the research of gas turbine modeling methods, which can be categorized into two main approaches: mechanism models and data-driven models. Mechanism models describe the material and energy conversion processes of gas turbines, establishing equations based on principles such as energy conservation, material conservation, and thermodynamic properties. An important type of mechanism model is the component-level model, which relies heavily on accurate component characteristic maps that are difficult to obtain [
9]. To solve this problem, Li et al. [
10] proposed a design-point performance adaptation approach using the Newton–Raphson method. Yang et al. [
11] proposed a new generation method to solve the problem of lacking data, in which the initial map was obtained according to a set of steady operating data, and the coefficients were tuned through sets of transient data. Pang et al. [
12] proposed a segmentation-based joint steady-state and transient performance adaptation technique, which takes both the idle point and the design point to scale the performance maps. Yan et al. [
13] developed an improved analytical approach for components and employed sensitivity analysis to determine the weight coefficients of the tuning factors, which aimed to enhance the performance adaptation and diagnostics of gas turbines. Plis et al. [
14] developed an adaptive model for a PG 9171E gas turbine unit to realize the performance calculation in a shorter time. Manasis et al. [
15] tested several Kalman Filtering techniques to obtain improved temperature forecasts, which were then used to obtain output power predictions. An open-cycle gas turbine was used to demonstrate the applicability of the proposed method. Zhang et al. [
16] performed CFD simulations of the aerodynamic and thermal processes of gas turbine blades. To optimize blade designs for enhanced gas turbine performance, they designed and implemented a nested optimization workflow incorporating an Artificial Neural Network (ANN). The mechanism model reveals the internal operating mechanism of the gas turbine and possesses strong interpretability. However, the mechanism model inevitably introduces simplifying assumptions to varying degrees, leading to deviations from actual operating conditions, or else it would suffer a significant computational burden.
With the continuous development of industrial digitization, the operational data stored in the Distributed Control System (DCS) of power plants has rapidly increased. Data-driven models can directly learn and identify operational patterns from existing operational data, which helps to understand the underlying laws of the electricity production system and improve the reliability and economy of the equipment. With the successful application of machine learning in various fields, there is an increasing number of studies focusing on data-driven modeling of gas turbines. These studies showed the potential application of machine learning algorithms in gas turbine modeling, and the data-driven gas turbine model proved to have high modeling accuracy, fast computation speed, and great application potential, as shown by the following references. Fast et al. [
17] utilized an ANN algorithm to identify anti-icing or normal operating modes of gas turbines based on input local environmental conditions (pressure, temperature, and relative humidity). They then predicted various operational and performance parameters of the gas turbines. This model can be used for both online monitoring and offline estimation of expected performance under different local environmental conditions. Subsequently, Fast et al. [
18] also applied the ANN model to the computer system of a power plant to achieve real-time monitoring and economic evaluation of the unit, which shows the great application potential of data-driven model of gas turbine. Maciej et al. [
19] developed an ANN-based prediction model for a fourth generation LM2500. The model is trained by the real-time data extracted from industrial installations and shows less than 1% mean absolute percentage error. Mathavan et al. [
20] focused on the prediction of power produced by a 747 MW Combined Cycle Power Plant. They demonstrated that predictive models are accurate and that such data science techniques can be used as a substitute for extensive thermodynamic calculations when using a Back Propagation Neural Network. Piotr et al. [
21] made use of an Artificial Neural Network model to predict heat demand in a district heating network. Elkhawad et al. [
22] applied an ANN model to establish a regression model for a combined cycle unit based on four input variables (ambient temperature, ambient pressure, exhaust pressure, and relative humidity). They also discussed the impact of training dataset size, the number of input variables, and training functions on the behavior of the regression neural. The statistical study on the errors shows the reliability of the model. Liu et al. [
23] used both ANN and high dimensional model representation (HDMR) to predict the operating characteristics of compressors and turbines. Four models were established separately to capture the part-load and full-load performance of gas turbines. The models for the compressors and turbines were then embedded into a gas turbine simulation program, and the prediction results achieved a high accuracy with average and maximum errors of less than 2.0% and 4.3%, respectively. Among these, ANN models have lower complexity and higher accuracy. Subsequently, the ANN model for predicting full-load performance is used to construct gas turbine performance modification curves, which shows high accuracy and can offer an excellent basis for continuous health monitoring and fault diagnosis. Afzal et al. [
24] made a comparative analysis and employed Ridge, linear regression (LR), and support vector regression (SVR) to model the combined cycle power plant. Subsequently, various evaluation metrics were utilized for model comparison, including mean absolute error (MAE), R-squared (R
2, also called coefficient of determination), median absolute error, mean absolute percentage error (MAPE), and mean Poisson deviance. Among the algorithms, SVR was deemed the most suitable, achieving an R
2 of 0.98, while all others were 0.9 to 0.92. Pachauri et al. [
25] used a generalized additive model (GAM) to predict the electrical power of a combined cycle unit. Furthermore, predictive models based on LR, gaussian process regression (GPR), multilayer perceptron neural network (MLP), SVR, decision tree (DT), and bootstrap-aggregated tree (BBT) were also designed for comparison. The results confirmed the effectiveness of GAM. Shao et al. [
26] employed computational fluid dynamics (CFD) results of a gas turbine combustion chamber to train a fast prediction ANN model, which significantly reduces CFD calculation time while maintaining high prediction accuracy. Sabzehali et al. [
27] employed a deep fully connected neural network to establish a mapping relationship between the state parameters of the PW100 engine and its thrust, fuel consumption rate, and exergy efficiency, which achieve high accuracy and can be used to optimize the energy and exergy performances of a F135 PW100 engine. It can be observed that the data-driven models are widely applicable in gas turbine modeling and can achieve high prediction accuracy.
During the operation of gas turbines, due to factors such as load fluctuations and ambient temperature changes, the operating conditions and equipment characteristics will also undergo corresponding changes. Data-driven models based on a single basic model often struggle to maintain high accuracy across all operating conditions. Because the data quantity is not evenly distributed under different operating conditions, some conditions characteristics with few data points cannot be captured by basic models. The basic model faces challenges when seeking to accurately capture performance characteristics under all operating conditions. To enhance the predictive accuracy of the data-driven models under different operating conditions, some studies have considered using clustering algorithms to establish gas turbine models. Benyounes et al. [
28] represented the control parameters of a gas turbine using the fuzzy clustering method based on Gustafson–Kessel algorithms, demonstrating the potential application of clustering algorithms in modeling gas turbines. Hou et al. [
29] proposed a fuzzy modeling strategy that combines an entropy-based clustering algorithm with the subspace identification (SID) strategy to address the non-linearity of gas turbines under various conditions, resulting in a highly accurate model.
Considering the rapid and accurate prediction abilities of data models, this paper establishes a gas turbine power prediction model based on machine learning algorithms. Recognizing the non-linearity of gas turbines and the limitations of a single model, a combined modeling approach is proposed. First, a basic model for gas turbine power prediction is conducted, providing an overall description of the data. Second, considering the performance variations of gas turbines under different operating conditions, a clustering algorithm is employed to categorize the data into distinct working modes. For each operation condition, a modification model is constructed, forming a set of such models to enhance local prediction accuracy. The main contributions of this paper are as follows:
- (1)
The integration of two-stage SVR and clustering introduces a composite framework that adapts dynamically to diverse operating conditions. First, a basic model is established to capture overall data characteristics. Then the operational data are categorized into different conditions through clustering. Finally, another SVR model is developed for each condition for model modification.
- (2)
The finding that 70% of annual operational data are sufficient for optimal model performance provides valuable guidance for industrial implementation. This threshold represents a balance between data collection requirements and model accuracy, enabling faster deployment while maintaining prediction reliability.
- (3)
This study establishes a combined model for a power prediction of an E-class gas turbine. To validate the performance of the combined model, the predictive performance of the basic model and the combined model are compared on the test set across multiple aspects, including Mean Absolute Percentage Error (MAPE), Maximum Absolute Percentage Error (MaxAPE), R-squared (R2), and error distribution. Specifically, the MAPE of the combined model is 0.2346%, compared to 0.3491% for the basic model, representing an improvement of 32.66%. The R2 of the combined model is 0.9982, higher than the 0.9966 of the basic model, demonstrating a better fit to the data. The error distribution of the combined model is also more concentrated. These results demonstrate the effectiveness of the combined model in improving prediction accuracy.
This paper is organized as follows:
Section 2 initially presents an E-class gas turbine object under study, describing the input and output for subsequent modeling, and introduces the proposed method.
Section 3 establishes a basic model using the SVR approach according to the operational data. Afterward, the data are categorized into different operating conditions through the k-means clustering algorithm, and modification models tailored to each condition are developed. Then, the performance of the combined model on the dataset is analyzed and compared with the basic model. Ultimately, several analyses, including cross validation and impact of training size, are conducted.
Section 4 provides a summary of this paper.
3. Results and Discussion
3.1. Data Preprocessing
Operational data from an E-class gas turbine power plant, spanning June 2022 to May 2023, were collected at a one-minute sampling frequency. The sample data of power are shown in
Figure 3.
As can be seen from
Figure 3, the gas turbine starts and stops frequently. Since the steady-state process of gas turbines plays a crucial role in operational monitoring and performance evaluation, this study focuses on power prediction modeling for the steady-state process of gas turbines. Thus, data from dynamic processes like start-up, shutdown, and load changes should be excluded. The filtering method for steady-state data is described below in step (3) “Steady-state discrimination”. After completing the data preprocessing process, the model training can be carried out. In this study, the preprocessing of data primarily includes sensor measurement data processing, unit conversion, steady-state discrimination, noise reduction, and outlier removal. The data processing is primarily based on Python, utilizing the numpy and pandas libraries to compute the mean, standard deviation, and variance of the data.
- (1)
Sensor measurement data processing: In this process, median values are calculated from multiple sensors, and relative pressure measurements are converted to absolute pressure values according to the control logic of TCS.
- (2)
Unit conversion: This process focuses on unifying units for the physical variables, such as standardizing pressure to Pascal (Pa) and temperature to degree Kelvin (K).
- (3)
Steady-state discrimination: Steady-state processes are essential in assessing turbine performance. The modeling in this study is targeted at steady-state processes. Thus, data from dynamic processes like start-up, shutdown, and load changes should be excluded. Assuming that the task is to determine whether point Pt at time t is in a steady state, a data window of 10 min after time t would be taken, and the mean value and standard deviation of measurement variables (e.g., IGV angles, compressor inlet temperature, power) during this period would be calculated. If the standard deviation exceeds 2% of the mean , point Pt is deemed a non-steady state. Furthermore, if the measurement variables’ value of point Pt deviate from the corresponding mean by more than twice the standard deviation (falling outside the 95% confidence interval), Pt is also considered as a non-steady point.
- (4)
Noise reduction: In this process, the mean value of data points within a 10-min window following a steady-state data point Pt is taken as the filtered value of point Pt.
- (5)
Outliers removal: Outliers are identified and removed based on the distribution of power values, excluding those that significantly differ from the main dataset.
After data preprocessing, the dataset covers a wide range of steady-state operating conditions with varying parameters, such as , , IGV angles ∈ [56.98°, 85.05°], .
As can be seen from
Figure 3, the preprocessed data have excluded the start-stop process data and reduced the noise fluctuations in the power signal, resulting in a relatively smoother trend. After data preprocessing, a total of 15,848 valid data points were obtained, with 80% designated as the training set and 20% as the test set. These processed data form the foundation for subsequent model development and validation.
After data processing, Pearson correlation analysis is conducted according to Equation (1), as depicted in
Figure 4. The value of Pearson correlation coefficient
P ranges from −1 to 1, with a larger absolute value indicating a stronger correlation between two variables. The corrcoef function in the Python library numpy is used here to calculate the correlation coefficient, with the calculation formula shown in Equation (1). As shown in
Figure 4, parameters such as
T1,
p1,
Td,
p4,
Tf, IGV angles, and
mf exhibit high correlations with power
Pw, confirming their significant influence on prediction accuracy. In contrast,
pf and speed
n exhibit low correlations (
P is less than 0.3 [
31]) with power
Pw. This may be attributed to the fact that
Pf is controlled at a constant value during operation to regulate fuel flow, while fuel pressure has a minor impact on the energy content of the fuel, thus resulting in a weak correlation. Additionally, speed n remains essentially constant at 3000 after the gas turbine is connected to the grid, which also minimally affects power. Consequently, these two input variables will be excluded from subsequent modeling.
3.2. Construction of Combined Model
In this section, a basic model is first established using a machine learning algorithm, with the choice of algorithm being dependent on the specific application. In this study, the SVR algorithm is employed to build both the basic model and the modification models, followed by a discussion on the performance of the prediction model. The SVR algorithm has been widely applied in gas turbine modeling and has achieved good prediction results. The SVR model is established using the SVR function from Python’s sklearn library, with its computational principles referenced in Equation (4), with the radial basis function serving as the kernel function. The basic model established by SVR achieves a MAPE of 0.3491%, a MaxAPE of 2.393%, and an R2 of 0.9966 on the test set. To further enhance the modeling accuracy, a clustering algorithm is subsequently used to categorize the dataset, and modification models are then built for each operational condition dataset.
To establish a set of modification models for different operating conditions, the k-means clustering algorithm is utilized to classify the training set. This paper implements clustering using the Kmeans function from sklearn, with the parameter ‘k-means++’ for generating initial centroids. The algorithm principles can be referenced in the description of the k-means clustering algorithm in
Section 2.2.3.
Figure 5 illustrates the results obtained by clustering the data into 5 and 10 clusters. Due to the dimensional constraints in graphical representation, it is not feasible to display the relationships between all input variables and labels. Thus, the focus here is on the correlation between clustering labels, inlet temperature, and fuel flowrate. Consequently, there may be some data overlap in different clusters. Overall, the clustering algorithm effectively segments the operational data into distinct load and temperature zones. It is observed that fuel consumption tends to decrease with higher temperatures, which may be attributed to the reduction in air density caused by temperature increases, subsequently weakening the unit’s power generation capacity and thus necessitating less fuel. The subsequent section will develop modification models for each cluster, which, together with the basic model, will form the final combined model.
Based on the clustering results from the previous section, training of modification models for each operational condition dataset is conducted, which targets the residuals of the basic model. Similarly, the SVR algorithm is employed to establish the modification models, with parameters identical to those of the previous basic model.
3.3. Performance of Combined Model
In this section, the performance of the proposed combined model is evaluated through multiple aspects, including basic metrics comparison, error distribution analysis, and cross validation. Taking ten modification models as an example, following the training of these ten modification models, an accuracy comparison between the basic and combined models is conducted on the test set, as detailed in
Table 1.
Table 1 presents a comparison of accuracy between the basic model and the combined model across different clustered operational conditions and the overall dataset. The first ten rows compare the accuracy of the basic and combined models for each cluster, showing improvements in MAPE and R
2 for all clusters. In terms of MaxAPE, most categories exhibited a decrease in maximum error, while a few categories showed an increase. The last column summarizes changes in metrics across the entire dataset, revealing a 32.66% reduction in MAPE and an increase in R
2. Due to the inherent randomness in maximum errors, the combined model experiences an increase in MaxAPE. However, the error distribution, as shown in the violin diagram of the absolute percentage error predicted by both models in
Figure 6, demonstrates that the combined model improves the prediction accuracy of the basic model. This figure illustrates that the combined model reduces errors overall and confines them more to the lower error regions. Specifically, the combined model controls the error at 75% of prediction points to 0.2946%, while the basic model’s corresponding value is 0.4647%, representing a 36.56% reduction. This demonstrates the enhanced accuracy of the combined model compared to the basic model. The improvement of these metrics indicates an advancement in the model’s ability to predict gas turbine power output, thereby enhancing its reliability and practicality in real-world applications. Accurate prediction data are essential in various scenarios, including power dispatch, equipment performance monitoring, and aiding operators in making more precise decisions.
Figure 7 compares the predicted values of the combined model and basic model with the actual values. For data clarity, the points are sorted according to the actual output values.
From
Figure 7a, it is observed that both the basic model and the combined model follow the actual value changes. The red line representing the basic model’s predicted values essentially envelops the blue line of the combined model’s predicted values, indicating that the combined model achieves closer approximations to the actual values at the majority of data points.
Figure 7b illustrates the absolute errors between predicted and actual power values for both the basic and combined models. It is evident that the combined model exhibits lower absolute errors than the basic model in most instances. Statistical analysis of these absolute errors reveals that the basic model has a mean absolute error of 0.38 MW and a median of 0.27 MW, whereas the combined model achieves a mean absolute error of 0.26 MW and a median of 0.17 MW. These findings demonstrate that the combined model exhibits a 32.70% improvement in mean absolute error and a 35.49% improvement in median absolute error compared to the basic model.
Figure 8 illustrates the absolute percentage error distribution of the basic and combined models on the test set, and
Table 2 presents the statistical distribution of the absolute percentage error.
Figure 8 and
Table 2 reveal that the combined model outperforms the basic model in overall accuracy distribution. Specifically, the proportion of data points with errors ranging from 0–0.25% increased from 50.32% in the basic model to 68.08% in the combined model. Additionally, there was a reduction in the number of high-error data points, with those exceeding 1% error decreasing from 156 to 73, and their percentage dropping from 4.921% to 2.303%. The test set comprises a total of 3170 data points. The number of data points with prediction errors less than 0.25% increased from 1595 in the basic model to 2158 in the combined model. Conversely, the number of data points with errors exceeding 0.25% decreased from 1575 in the basic model to 1012 in the combined model. While observing the data points within the 0.25–0.5% error range in
Table 2, it can be seen that the absolute count decreased from 856 in the basic model to 691 in the combined model, which seems modest at first glance. However, considering the relative proportion of these data points within the overall error range exceeding 0.25%, it is found that the basic model accounted for 54.35%, while the combined model improved to 68.28%. This indicates that the combined model not only reduced the number of high-error points but also shifted more of the data points originally falling within the 0.25–0.5% error range to a lower error interval. It can be observed that the combined modeling approach improves the error distribution, increasing the proportion of data points with small prediction errors, thus enhancing the model’s stability and enabling it to provide more reliable power predictions for real-world applications. This indicates the effectiveness of the combined model in enhancing prediction accuracy.
Next, we employ cross-validation techniques to assess the stability and predictive capability of the combined model. The original dataset consisting of 15,848 data points is partitioned into five mutually exclusive subsets. For each round, the union of four subsets serves as the training set, while the remaining subset acts as the test set, generating five sets of training and test data for five rounds of training and testing. The results are shown in
Table 3.
It can be observed that the combined model consistently demonstrates predictive advantages across various metrics in each of the five testing sets. Referencing the results from the five rounds, the combined model achieves an average MAPE of 0.2336%, a MaxAPE of 2.839%, and an R2 of 0.9983, all of which outperform the basic model. This indicates that the combined model enhances the predictive accuracy of the basic model, better capturing the complex relationships and trends of the data. The results of multiple cross-validation rounds suggest that the combined model exhibits high stability in its predictions across different datasets.
3.4. Comparison with Other Modeling Methods
To further evaluate the effectiveness of the proposed combined modeling approach, this section includes a comparison with other advanced modeling methods. Two comparison algorithms chosen are gradient boosted decision trees (GBDT) and random forests (RF), which are two renowned ensemble modeling techniques. RF enhances model accuracy and stability by constructing and aggregating predictions from multiple decision trees. In contrast, GBDT predicts outcomes by combining multiple decision trees, where each new tree is sequentially trained to correct the prediction errors of the previous tree. Implementations of RF and GBDT were carried out using the GradientBoostingRegressor and RandomForestRegressor functions from the sklearn library, respectively. For RF, the number of estimators was 700 and the maximum depth of the trees was 20. For GBDT, the number of estimators was 1200. A comparison of the prediction results between the combined model and these two models is presented in
Table 4.
It can be observed that the combined model still maintains a certain predictive advantage over these two ensemble modeling methods. While the RF model achieves a relatively low MAPE, the presence of overfitting in some trees may have led to a larger MaxAPE. GBDT demonstrates good control over maximum error. However, as the improvement in prediction errors by GBDT is applied to the entire dataset, it does not outperform the combined model. The combined model approach considers the concept of bias fitting from the GBDT algorithm while also considering the varying operating conditions of gas turbines. This allows the model to make more fine-grained adjustments for each specific operating condition, resulting in more accurate model outcomes in this study.
3.5. Performance of the Combined Model to Changes in Training Data Size
In this section, the impact of training size on the performance of the combined model is conducted. The training set is sorted in chronological order, and the first 10% (i.e., the first 1268 points), 20% (2526 points), 30% (3803 points), 40% (5071 points), 50% (6339 points), 60% (7607 points), 70% (8875 points), 80% (10,142 points), 90% (11,410 points), and 100% (12,678 points) are selected as training sets to establish power prediction models. The clustering number of these combined model is 10. The error distribution of the prediction models is illustrated in
Figure 9, and the corresponding model prediction errors are presented in
Table 5.
As observed from
Table 5 and
Figure 9, with the increase in the number of samples, the median prediction error and MAPE of the model gradually decrease, indicating an overall improvement in prediction accuracy. Simultaneously, as the number of training set samples increases, the maximum error also exhibits a downward trend. The slight fluctuations observed within the 10–30% range may be attributed to the inherent randomness of the maximum values. The top 75% errors also show a decreasing trend, demonstrating that the model’s prediction errors are gradually being controlled as the number of sampling points increases. However, there are also fluctuations within the 10–30% range, which could be because this range mostly covers operating conditions within the same season, resulting in less significant improvements in prediction accuracy for other seasons. Inaccuracies in predictions for other seasons often lead to the occurrence of high error points. When the sampling quantity reaches 70%, the model’s prediction accuracy is significantly enhanced, with an average error of 0.6677% and a maximum error controlled within 5%, specifically 4.035%. Additionally, 75% of the prediction points have errors less than 0.6306%. This may be because, at 70% samples, the training set basically covers typical operating conditions for three seasons. Therefore, to ensure controllable prediction errors in the model, it is advisable to collect data that comprises at least 70% of the annual data volume as the training set.
3.6. Sensor Accuracy Analysis
Given that field data are acquired through measurement instruments, instrument accuracy inherently influences model accuracy. This section analyzes the model’s error distribution in the presence of measurement noise. Based on gas turbine instrumentation literature [
34], the measurement accuracy specifications for each variable are presented in
Table 6.
Based on
Table 6, random errors within their respective maximum allowable ranges were added to the measurement data of the seven input variables required for modeling in the test set. This generated a new set of input variable data containing random noise compared to the original test set input variables. These data were then input into the combined model to generate power prediction values. The error distribution between these predicted power values and actual measured values is shown in
Figure 10.
Figure 10 shows the error distribution of power predictions at different power points. Taking the 95 MW power point as an example, the prediction deviation of the combined model falls within ±0.40 MW with 95% probability. For other power points, the prediction deviations are: ±0.53 MW at 100 MW, ±0.46 MW at 105 MW, ±0.53 MW at 110 MW, ±1.04 MW at 115 MW, ±1.54 MW at 120 MW, ±0.53 MW at 125 MW, and ±0.28 MW at 130 MW.
The relatively larger deviations at 115 MW and 120 MW may be attributed to these power levels primarily occurring during spring and autumn seasons and occasional winter conditions, resulting in fewer data points. Additionally, the greater fluctuations in environmental conditions like temperature and humidity during spring and autumn increase prediction difficulty.
Overall, even with sensor noise introduced, the combined model maintains high prediction accuracy and demonstrates stable performance within the measurement equipment’s accuracy limits.
3.7. Validation on Different Gas Turbine
In this section, we selected one year of operational data from a 9 FB heavy-duty gas turbine at a power plant. The data were sampled at 1-h intervals, and after removing start-up and shutdown data, 6504 sampling points were obtained. These points were randomly split into 80% training set and 20% test set. Based on the training set, we established both basic and combined models to predict power output for the test set. The prediction results are shown in
Figure 11.
Figure 11a shows the comparison between predicted power from both basic and combined models against actual power values. The combined model demonstrates better alignment with actual power values at most data points.
Figure 11b illustrates the distribution of absolute percentage errors, showing that the combined model reduces overall prediction errors and shifts more prediction points into lower error regions. Using common model evaluation metrics, the Mean Absolute Percentage Error (MAPE) for the basic model is 0.7341% while the combined model achieves 0.5079%, representing a 30.81% improvement. In terms of mean absolute error, the basic model shows 1.75 MW while the combined model achieves 1.23 MW, indicating a 29.98% improvement in accuracy. For the median absolute error, the basic model shows 1.51 MW while the combined model achieves 0.91 MW, representing a 39.39% improvement over the basic model. These results demonstrate that compared to using a single basic model, the combined model achieves higher prediction accuracy on this new turbine model, indicating the generalizability of the combined modeling approach.