Photovoltaic Power Output Prediction Based on TabNet for Regional Distributed Photovoltaic Stations Group

Ma, Dengchang; Xie, Rongyi; Pan, Guobing; Zuo, Zongxu; Chu, Lidong; Ouyang, Jing

doi:10.3390/en16155649

Open AccessArticle

Photovoltaic Power Output Prediction Based on TabNet for Regional Distributed Photovoltaic Stations Group

by

Dengchang Ma

^1,2

,

Rongyi Xie

³,

Guobing Pan

^1,*,

Zongxu Zuo

¹,

Lidong Chu

¹ and

Jing Ouyang

¹

Key Laboratory of E & M, Ministry of Education & Zhejiang Province, Zhejiang University of Technology, Hangzhou 310012, China

²

The College of Electrical Engineering, Zhejiang University of Water Resources and Electric Power, Hangzhou 310020, China

³

Guangxi Xijiang Group Investment Corp., Nanning 530000, China

^*

Author to whom correspondence should be addressed.

Energies 2023, 16(15), 5649; https://doi.org/10.3390/en16155649

Submission received: 1 July 2023 / Revised: 19 July 2023 / Accepted: 24 July 2023 / Published: 27 July 2023

(This article belongs to the Section A2: Solar Energy and Photovoltaic Systems)

Download

Browse Figures

Versions Notes

Abstract

:

With the increasing proportion of distributed photovoltaic (DPV) installations in county-level power grids, to improve the centralized operation and maintenance of the stations and to meet the needs of power grid dispatching, the output of the county-level regional DPV stations group needs to be predicted. In this paper, the weather prediction information is used to predict the output based on the model input average strategy. To eliminate the effect of the selected non-optimal training sample collection period on the prediction accuracy, an ensemble prediction method based on the minimum redundancy maximum relevance criterion and TabNet model is carried out. To reduce the influence of weather prediction errors on the power output prediction, a modified model based on error prediction is proposed. The ensemble prediction model is used to predict the day-ahead output, and a combination prediction model based on the proposed ensemble prediction model and the proposed modified model is established to predict the hour-ahead output. The experimental results verify the effectiveness of the proposed models. Compared with the corresponding reference models, the proposed ensemble prediction method reduces the normalized mean absolute errors (nMAEs) and the normalized root mean square errors (nRMSEs) of the day-ahead output prediction results by 2.86% and 5.51%, respectively. The combination prediction model reduces the nMAE and nRMSE of the hour-ahead output prediction results by 3.05% and 3.05%, respectively. Therefore, the prediction accuracy can be improved by the proposed models.

Keywords:

distributed photovoltaic; regional power output prediction; minimum redundancy maximum relevance criterion; TabNet; ensemble prediction

1. Introduction

As one of the most potential new energy utilization technologies, photovoltaic (PV) power generation has attracted the attention of most countries in the world [1]. PV systems are available in two forms: Centralized and distributed. Compared with the centralized, distributed PV (DPV) systems are constructed close to loads, and the system output can be absorbed locally, which helps to overcome the common defects of mismatch between actual distribution and application demand of PV resources. Therefore, in recent years, the Chinese DPV industry has been vigorously developed along with the whole county advance.

However, with the increasing proportion of DPV installations in county-level regions, the power grid operation is greatly affected. Considering the system construction and operation costs, the traditional DPV system operation and maintenance are extensive, resulting in low system benefits. The centralized operation and maintenance management of DPV systems in a county can improve the overall efficiency of the systems. To meet the needs of power grid dispatching and the centralized operation and maintenance of DPV stations, the output of the regional DPV stations group needs to be predicted. Therefore, we take DPV systems in a county as a whole to study the regional DPV output prediction.

Regional PV prediction models can be divided into different types. Liu et al. [2] reviewed the research on regional PV prediction methods based on multiple time scales. Kim and Kim [3] divided the models into two categories: Type 1 is about the public utility-scale systems [4,5,6,7,8,9,10,11,12,13], Type 2 is about the system behind the meter [14,15,16,17]. Pierro et al. [9] proposed an interesting classification method based on prediction strategies:

(1) Bottom-up strategy. In this strategy, firstly, the output of each PV system in the region needs to be predicted, and then the regional output can be achieved by accumulating these predicted values.

(2) Upscaling strategy. The strategy can be further divided into models output average strategy and model input average strategy. Model output average strategy is based on the selection and output prediction of a subset of regional PV systems, which is taken as the representation of the whole systems. Then the predicted subset power output is rescaled based on the subset capacity and total capacity to predict the regional power output. In the model output average strategy, the PV output in the predicted area is taken as the output of a virtual PV system. Then, the regional PV power output is directly predicted.

For the model output average strategy, Shaker et al. [18] proposed a data-driven method to estimate the power output of invisible PV systems based on the measured values of a little of representative systems. The representative sites were selected using the proposed data dimension reduction model based on K-mean clustering and principal component analysis, and then regional PV power generation was obtained according to the mapping function. In addition to power generation estimation, Shaker et al. [17] also proposed a probabilistic prediction model based on a Fuzzy Arithmetic Wavelet Neural Network (FAWNN) to predict the power generation of a large number of small PV systems. Bright et al. [19] evaluated the satellite-only and upscaling-only PV output estimate methods, and the authors concluded that the method through combining the two methods is more beneficial. Saint-Drenan et al. [8] analyzed the performance of the upscaling strategy by using measured power data of a set of 366 PV systems. They found that the error decreases with an increasing number of reference systems and a decreasing number of un-metered systems, and the average distance between a reference and the unknown system has a great influence on the performance of a set of reference systems.

For the model input average strategy, Fonseca et al. [5] proposed a method based on principal component analysis, support vector regression, and weather prediction data. One-day ahead regional PV power outputs of the four main regions of Japan in 2009 were predicted with hourly power output data of 453 PV systems. Aillaud et al. [20] proposed a model through a combination of a convolutional neural network (CNN) with a long short-term memory architecture. The day-ahead regional PV power outs of Germany were predicted, and the main result of this study shows that the proposed model is more accurate than the Random Forest model. Moschella et al. [21] attempted to directly predict wind and solar power generation in each Italian region based on the model input average strategy. Based on the same strategy, Pierro [22] et al. conducted a more detailed study on the solar power generation prediction of six regions in Italy by comparing six different prediction models. Yu et al. [23] presented a probabilistic prediction method based on CNN and non-linear quantile regression (QR). The model was used to predict the regional PV power output of PV systems in the Weifang region of China, and the prediction result shows that the improved CNN can effectively process high-dimensional and complex input data and the non-linear QR model can provide quantile prediction information of regional PV power output.

The upscaling prediction strategy is improved in some interesting studies, such as the research of Pierro et al. [9], Wolff et al. [24], Saint-Drenan et al. [12], and Fu [10]. They first clustered the PV systems in the region and then used the upscaling prediction strategy on the clustered subsets, respectively, to predict the PV power output of the whole region.

There are also some studies in which different strategies were compared. Fonseca et al. [6] conducted a comparative study on the four models, and each prediction method assumed a different scenario regarding the data available to make the prediction. In view of the complete availability of regional PV power data, the strategy of direct prediction and then accumulation is adopted. A prediction model based on stratified sampling is proposed for the partial availability of regional PV power data. In light of the availability of regional aggregate PV power, the model input average strategy is adopted. In the case that the power data cannot be obtained, the strategy of indirect prediction and then accumulation is adopted. By comparison, in the region with a variety of weather conditions, the prediction methods based on single systems’ predictions and the one based on stratified sampling provided the best results. Zamo et al. [7] predicted the regional PV power generation in two counties based on the bottom-up strategy and the model input average strategy. By using a reference system to directly predict the regional aggregate PV power, the results can get an RMSE of about 6%, whatever the county and the RMSE can be reduced to about 5.8% by using the bottom-up strategy. Pierro et al. [9] firstly clustered PV systems in a region and then compared two strategies: (1) Calculate the average prediction results of each cluster to obtain the regional PV power (the models output average strategy), (2) the input variables of each cluster center are used to directly predict the regional PV power (the model input average strategy). The results show that the accuracy of the latter is a little better.

Saint-Drenan et al. [25] proposed a new strategy that can be used as an alternative to the upscaling strategy for the scenario where no or few power measurements are available. The strategy uses an average PV model to calculate the power output of the most frequent module orientation angles. The calculated power values are finally weighted according to their probability of occurrence to estimate the real power output. The basic condition of this strategy is that the physical model information of regional PV systems is available. However, in practice, it is usually difficult to meet this condition.

For regional PV output prediction, the bottom-up strategy needs to predict the output of all systems in the predicted area. It is necessary to establish a prediction model for each system and perform a lot of data processing and calculation. When some PV systems in the region lack historical data and cannot apply the data-driven model, it is necessary to adopt the prediction method based on the physical model. However, in reality, it is difficult to obtain the physical models of all PV systems in a region. Therefore, the bottom-up strategy is actually difficult to apply in practice [10]. To reduce the amount of calculation through simplified methods, the research focus of regional PV output prediction mainly focuses on the upscaling strategy [9].

Through the study of the previous research, we found that there are two problems with the county-level regional DPV output prediction. The first is about the available data resources for prediction. Most of the DPV systems in county-level regions are small rooftop PV systems. Considering the construction cost, there is generally no single output prediction device in these systems, which leads to the lack of predicted output data from single systems. For the same reason, there is also no meteorological data acquisition device in these systems, which leads to the lack of locally measured meteorological data for output prediction. Although the easily available weather prediction data can be used to predict the regional power output, the inherent weather prediction errors will affect the output prediction accuracy. The lack of available data resources and the weather prediction errors make it difficult to directly use the previously proposed models to predict the county-level regional DPV output. The second is about the prediction method. Most of the previously proposed deep neural network (DNN) architectures are successfully applied to images, text, and audio but are not well suited for tabular data [26]. Therefore, there are few studies on the regional PV output prediction method based on DNNs. As is the case with other data-driven models, when a new DNN for tabular data, such as TabNet, is applied to predict the county-level regional DPV output, the optimal training sample collection period (TSCP) is dynamically changing, and it is difficult to select this hyper-parameter, so generally a fixed value is used, or all historical samples are taken as the training samples, which will reduce the accuracy of the predicted results.

In this paper, the weather prediction information is used to predict the county-level regional DPV output based on the model input average strategy. To eliminate the effect of the selected non-optimal TSCP on the prediction accuracy, an ensemble prediction method based on the minimum redundancy maximum relevance (mRMR) criterion and the TabNet model is carried out. To reduce the influence of weather prediction errors on the power output prediction, a modified model based on error prediction is proposed. The proposed ensemble prediction method is used to predict the day-ahead output, and a combination prediction model based on the proposed ensemble prediction method and the modified model is established to predict the hour-ahead output. Finally, the performance of the proposed models is verified by error analysis.

2. Theoretical Background

2.1. TabNet Model

TabNet model [26], a novel DNN for tabular data, is introduced in this paper to predict the county-level regional DPV output. As shown in Figure 1, TabNet’s encoding is based on sequential multi-step processing with

N_{s t e p s}

decision steps and can be summarized as follows:

(1)

D

-dimensional features

f \in R^{B \times D}

are normalized by applying batch normalization (BN) and then passed to each decision step, where

B

is the batch size.

(2) Process the normalized features using a feature transformer and then split into two parts,

[d [0], a [0]]

, where

d [0] \in R^{B \times N_{d}}

and

a [0] \in R^{B \times N_{a}}

.

(3) Perform decision Step 1. Take

a [0]

as the input of the attentive transformer to obtain the mask

M [1] \in R^{B \times D}

. Employ

M [1]

to select the salient features and then process the selected features by using the feature transformer. Split the processed features for the decision step output and information for the subsequent decision step,

[d [1], a [1]]

, where

d [1] \in R^{B \times N_{d}}

and

a [1] \in R^{B \times N_{a}}

.

(4) Similarly, perform the other decision steps to obtain the split features

[d [i], a [i]]

, where

d [i] \in R^{B \times N_{d}}

and

a [i] \in R^{B \times N_{a}}

.

(5) Construct the overall decision embedding as

d_{out} = \sum_{i = 1}^{N_{s t e p s}} ReLU (d [i])

and apply a linear mapping

W_{final} d_{out}

to get the output mapping.

(6) Obtain the aggregate feature importance mask

M_{agg - b, j}

by the following formula:

M_{agg - b, j} = \sum_{i = 1}^{N_{s t e p s}} η_{b} [i] M_{b, j} [i] / \sum_{j = 1}^{D} \sum_{i = 1}^{N_{s t e p s}} η_{b} [i] M_{b, j} [i],

(1)

where

M_{b, j} [i]

is the element in the

b^{t h}

row and the

j^{t h}

column of the mask in the

i^{t h}

decision step.

η_{b} [i]

, which denotes the aggregate decision contribution at the

i^{t h}

decision step for the

b^{t h}

sample, is calculated as follows:

η_{b} [i] = \sum_{c = 1}^{N_{d}} R e L U (d_{b, c} [i]),

(2)

where

d_{b, c} [i]

is the the element in the

b^{t h}

row and the

c^{t h}

column of the splited features for the decision step

i

.

According to the structure of the attentive transformer shown in Figure 2, the feature selection mask

M [i] = s p a r s e m a x (P [i - 1] \cdot h_{i} (a [i - 1]))

, where sparsemax is the sparsemax normalization,

a [i - 1]

is the processed features from the preceding decision step, and

h_{i}

is a trainable function with a FC (fully-connected) layer followed by BN.

P [i - 1]

, the prior scale term denoting how much a particular feature has been used previously, is defined as follows:

P [i - 1] = \prod_{j = 1}^{i - 1} (γ - M [j]),

(3)

where

γ

is a relaxation parameter.

P [0]

is initialized as all ones,

1^{B \times D}

.

For parameter-efficient and robust learning with high capacity, a feature transformer should comprise layers that are shared across all decision steps, as well as decision step-dependent layers. Figure 3 shows the implementation as a concatenation of two shared layers and two decision step-dependent layers. Each FC layer is followed by BN and gated linear unit (GLU) nonlinearity [27], eventually connected to a normalized residual connection with normalization. Normalization with

\sqrt{0.5}

helps to stabilize learning by ensuring that the variance throughout the network does not change dramatically [28]. For faster training, the ghost BN [29] form is used in the feature transformer.

2.2. Mutual Information

In information theory, mutual information (MI) is used to represent the degree of dependence between two systems or the correlation between two variables [10]. MI can be represented based on system entropy, which expresses the complexity or uncertainty of a system. Assuming that the probability of system output

Y = y

is

P (y)

, the system entropy

H (Y)

is defined as follows:

H (Y) = - \sum_{y} P_{Y} (y) \log P_{Y} (y),

(4)

When the system input

X = x

is known, the conditional entropy

H (Y | X)

is defined in Equation (5):

H (Y | X) = - \sum_{x} P_{x} (x) (\sum_{y} P_{Y | X} (y | x) \log P_{Y | X} (y | x)),

(5)

where

P_{Y | X} (y | x)

is the conditional probability of

Y

when the system input

X = x

.

The system joint entropy is defined as follows:

H (Y, X) = - \sum_{y, x} P_{Y X} (y, x) \log P_{Y X} (y, x),

(6)

Since the known

X = x

can reduce the uncertainty of the system, the conditional entropy

H (Y | X)

is smaller than the system entropy

H (Y)

. MI is used to quantify the degree of system uncertainty decrease, represented by

I (X, Y)

in Equation (7):

\begin{matrix} I (X, Y) = H (X) + H (Y) - H (Y, X) \\ = H (X) - H (X| Y) \\ = H (Y) - H (Y| X) \\ = I (Y| X), \end{matrix}

(7)

Based on Equations (4), (5), and (7),

I (X, Y)

is calculated as follows:

I (X, Y) = \sum_{x} \sum_{y} P_{X Y} (x, y) \log \frac{P_{X Y} (x, y)}{P_{X} (x) P_{Y} (y)},

(8)

where

P_{X Y} (x, y)

represents the joint probability distribution when

X = x

and

Y = y

. When

X

and

Y

are continuous variables,

I (X, Y)

is calculated by Equation (9):

I (X, Y) = \iint p_{X Y} (x, y) \log \frac{p_{X Y} (x, y)}{p_{X} (x) p_{Y} (y)} d x d y

(9)

2.3. mRMR Criterion

The mRMR criterion [30] is an eigenvalue selection method, and its core idea is to establish the optimal feature subset based on the maximum relevance condition and the minimum redundancy condition [31]. Let the eigenvalue set consisting of

m

eigenvalues be

F_{m}

, from which

n (n \leq m)

eigenvalues are selected to form a subset

S_{n}

, where

F_{m} = {v_{i}, i = 1, 2, \dots, m}

and

S_{n} = {v_{j}, j = 1, 2, \dots, n}

, and

S_{n} \subseteq F_{m}

. According to the maximum relevance condition, the average value of the MI between the characteristic variables in the subset and the target variable is the largest, and the constraints are as follows:

\max D (S_{n}, c), D = \sum_{i = 1}^{n} I (v_{i}, c),

(10)

where

I (v_{i}, c)

is the MI between the

i

-th eigenvalue and the target variable

c

.

Eigenvalues of the subset

S_{n}

obtained by Equation (10) may be highly correlated, which may introduce unnecessary redundant. Therefore, in addition to meeting the maximum relevance condition, the mean value of MI between eigenvalues in the subset

S_{n}

should be minimized. The minimum redundancy condition is as follows:

\min R (S_{n}), R = \frac{1}{C_{n}^{2}} \sum_{i = 1}^{n - 1} \sum_{j = i + 1}^{n} I (v_{i}, v_{j}),

(11)

where

I (v_{i}, v_{j})

is the MI between the eigenvalues of the subset

S_{n}

. Based on Equations (10) and (11), the mRMR criterion is expressed by Equation (12):

\max Φ (D, R), Φ (D, R) = D - R

(12)

3. Proposed Method

3.1. Data Experiment Scheme

As shown in Figure 4, the steps of the data experiment are:

Step 1: Collect the measured output data (sampling period: 15 min). Collect the weather prediction data (sampling period: 1 h), and then obtain the weather prediction data of the same period with the measured output data by linear interpolation. Preprocess and combine the output data and weather data to establish the experimental sample set.

Step 2: Take a test sample.

Step 3: For the test sample extracted in the previous step,

n

fixed TSCPs are randomly generated. Based on different TSCPs,

n

training sample sets are established by extracting qualified training samples from the experimental sample set.

Step 4: Based on the

n

training sample sets established in the previous step, TabNet model is trained to generate

n

prediction models.

Step 5: Taking the

n

prediction models generated in the previous step as base predictors, an ensemble prediction model based on mRMR is established.

Step 6: Based on the test sample taken in Step 2 and the ensemble prediction model established in the previous step, the day-ahead and hour-ahead outputs are predicted, respectively.

Step 7: Based on the hour-ahead output predicted in the previous step and the proposed modified model, the final hour-ahead output is obtained.

Repeat Step 2 to Step 7

m

(test sample size) times to obtain the day-ahead and hour-ahead output prediction series, respectively. Step 8 and Step 9 are prediction error analyses.

The normalized mean absolute error (nMAE) calculated by Equation (13) and the normalized root mean square error (nRMSE) calculated by Equation (14) are imported to present the prediction errors in this paper.

n M A E = \frac{1}{n} \sum_{i = 1}^{n} |p_{i} - p_{i}^{'}|,

(13)

n R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(p_{i} - p_{i}^{'})}^{2}},

(14)

where

n

is the test sample size,

p_{i}

is the normalized predicted regional DPV output, and

p_{i}^{'}

is the normalized measured output.

3.2. Proposed Ensemble Prediction Model

In predicting the regional DPV output based on the model input average strategy, the training sample set affects the prediction performance of the trained TabNet model. The training sample set depends on the TSCP. For the output prediction of a specific period, there is a relatively optimal training sample set, which corresponds to a specific TSCP and the historical samples in the TSCP. With the systems running, the newly generated samples will update the historical sample set in the previous optimal TSCP, which will cause the training sample set in this TSCP is no longer optimal. Therefore, the TSCP corresponding to the optimal training sample set is dynamic. However, it is difficult to select the optimal value of this hyper-parameter. The TSCP selected by traditional methods is often not optimal, which affects the prediction accuracy. To solve this problem, we proposed an ensemble prediction model based on mRMR criterion and TabNet model, and the specific steps of the model are as follows:

Step 1: Randomly generate

n

fixed TSCPs and establish

n

training sample sets for the day before the predicted day. Then, based on the established

n

training sample sets and TabNet model, the regional DPV output series

\{p_{1}, p_{2}, \dots, p_{n}\}

is predicted.

Step 2: Calculate each

I (p_{i}, p_{j})

, the MI between the

n

regional DPV output prediction series in the previous step and calculate each

I (p_{i}, c)

, the MI between the

n

predicted output series and the measured output series

c

.

Step 3: Let

F = \{p_{1}, p_{2}, \dots, p_{n}\}

, select the

p_{i}

with the largest

I (p_{i}, c)

, let

S = {p_{i}}

, and then update

F

as follows:

F = F - {p_{i}}

;

Step 4: Select

p_{k}

which meet the conditions expressed in Equation (12) from

F

, then update

S

and

F

as follows:

S = S + {p_{k}}

,

F = F - {p_{k}}

;

Step 5: Repeat Step 4 for a total of

m - 1

times to select the

m

output prediction series according to the mRMR criterion from the

n

output prediction series in Step 1 and constitute a set

S_{m} = \{p_{1}, p_{2}, \dots, p_{m}\}

. The

m

fixed TSCPs corresponding to the

m

output prediction series in the set

S_{m}

are extracted to constitute a set

T_{m} = \{t_{1}, t_{2}, \dots, t_{m}\}

. Then calculate the MI between the predicted output series and the measured output series to constitute a set

I (P_{m}, c) = \{I (p_{1}, c), I (p_{2}, c), \dots, I (p_{m}, c)\}

.

Step 6: Calculate the weights by Equation (15) and construct a weight vector

w = {[ω_{1}, ω_{2}, {\dots, ω}_{m}]}^{T}

.

ω_{i} = \frac{I (p_{i}, c)}{\sum_{j = 1}^{m} I (p_{j}, c)},

(15)

Step 7: Predict the regional DPV output series of the predicted day based on the fixed TSCPs in

T_{m}

and TabNet model. Then, construct an output prediction matrix

P = [p_{1}^{'}, p_{2}^{'}, \dots, p_{m}^{'}]

.

Step 8: Calculate the county-level regional DPV output prediction series

p_{ω}

of the predicted day by Equation (16):

p_{ω} = P \times w

(16)

3.3. Proposed Modified Model

The weather prediction information is taken as the input to predict the power output of the regional DPV stations group. Therefore, weather prediction accuracy has a great influence on the prediction accuracy of power output. However, weather prediction errors are similar in adjacent time periods. In this paper, a modified model based on error prediction is established by mining this similarity. Based on the known prediction errors, the unknown prediction errors are predicted, thus the influence of weather prediction errors on the output prediction accuracy is reduced.

To describe the proposed modified model, the concepts of potential test sample (PTS), non-potential test sample (NPTS), and the closest similar sample (CSS) are defined. A test sample is defined as PTS if there are some historical samples with the same weather type on the same day, otherwise it is defined as NPTS. There is some potential that the power output prediction error in a PTS period can be reduced by the proposed modified model. The closest historical sample on the same day and with the same weather type of PTS is defined as the CSS of the PTS. As shown in Figure 5, the steps of the proposed modified model are:

Step 1: Take a test sample and determine whether it is a PTS. If so, proceed to the next step, if not, return to take the next test sample.

Step 2: Extract the historical PTSs with the same weather type of the PTS taken in the previous step and the CSSs corresponding to the historical PTSs.

Step 3: Calculate the prediction errors in the periods of PTSs extracted in Step 2 by Equation (17), and transform the errors by Equation (18):

e = p_{p} - p_{a},

(17)

e_{T} = \frac{e + 1}{2},

(18)

where

e

is the prediction error,

p_{p}

is the normalized predicted output,

p_{a}

is the normalized measured output, and

e_{T}

is the transformed error.

Step 4: Calculate the differences of extraterrestrial solar radiation in the periods of PTSs extracted in Step 2 by Equation (19), and transform the differences by Equation (20):

G_{d} = G_{P} - G_{C},

(19)

G_{T} = \frac{G_{d} + 1}{2},

(20)

where

G_{d}

is the difference of extraterrestrial solar radiation in one of the periods of PTSs extracted in Step 2,

G_{P}

is the normalized extraterrestrial solar radiation in the period of the PTS,

G_{C}

is the normalized extraterrestrial solar radiation in the period of the CSS corresponding to the PTS, and

G_{T}

is the transformed difference of extraterrestrial solar radiation.

Step 5: Establish a training sample set in which a training sample

S_{train} = [G_{T}, e_{C}, e_{T}]

, where

e_{C}

is the transformed error in the period of the CSS corresponding to the training sample.

Step 6: Train a TabNet model based on the training sample set established in the previous step, and then predict the error

e_{p}

.

Step 7: Transform the predicted error according to Equation (21) and modify the predicted power output according to Equation (22).

e_{m} = 2 e_{p} - 1,

(21)

p_{m} = p_{p} - e_{m},

(22)

where

e_{m}

is the transformed predicted error,

p_{m}

is the modified regional DPV output, and

p_{p}

is the output of the proposed ensemble prediction model.

3.4. Experimental Data and DATA Preprocessing

The raw data used in the experiment include measured regional DPV output data and weather prediction data from 7 January 2021 to 30 September 2021. The measured output data (sampling period: 15 min) is collected from 27 DPV systems in Xiaoshan District, Hangzhou City, Zhejiang Province, China. Weather prediction data (sampling period: 1 h) was obtained from Xinzhi weather prediction platform. The weather prediction data (sampling period: 15 min) is obtained based on the original weather prediction data and linear interpolation. The experimental sample set is established by combining multisource data sets according to the time attribute of samples. The attributes of the established experimental sample set include: Extraterrestrial solar radiation, weather type, air temperature, air index, wind speed, and measured regional DPV output.

After establishing the experimental sample set, data preprocessing is carried out. In order to improve the convergence speed and accuracy of DNN models, Max-Min normalization is usually carried out on the input and output features as shown in Equation (23):

x_{nor} = \frac{x - x_{\min}}{x_{\max} - x_{\min}},

(23)

where

x_{nor}

is a normalized feature value,

x

is the original feature value,

x_{\max}

is the maximum feature value, and

x_{\min}

is the minimum feature value.

4. Results and Discussion

4.1. Output Prediction

According to the proposed ensemble prediction model, 16 fixed TSCPs (1, 3, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 180 days) were randomly generated in this paper to establish training sample sets. In particular, the experimental data on 27 September 2021 was selected to verify the proposed models. For output prediction, based on the mRMR criterion, the first 14 optimal output prediction series were selected from the 16 series on the day before the predicted day to calculate weights. The weight calculation results are listed in Table 1.

The regional DPV output series on the predicted day were predicted based on the fixed TSCPs and TabNet model. Then based on these predicted output series and calculated weights, the day-ahead output prediction series was calculated according to Equation (16). As shown in Figure 6, the predicted values of the day-ahead outputs on this day are generally lower than the corresponding measured values, which may be caused by the similarity of the weather prediction errors of the day.

On the basis of the day-ahead prediction results, the hour-ahead output prediction results were obtained through the proposed modified model. As shown in Figure 7, compared with the day-ahead output prediction results, the predicted hour-ahead outputs are closer to the corresponding measured outputs.

4.2. Performance Analysis

In order to further verify the validity of the proposed models, test samples from 1 May 2021 to 30 September 2021 were used. Figure 8 presents the measured regional DPV outputs versus the predicted outputs based on the proposed and reference models. Power values are normalized from 0 to 1 to show a better comparison. The blue points represent the predicted power values and the corresponding measured power values at the same time. The red solid line

y = x

provides a reference when the predicted value is equal to the measured value, so the dispersion degree of the blue points around the red line reflects the error between the measured power and the predicted power. The more intensively the blue points aggregate to the red line, the less prediction error the corresponding model shows.

The proposed ensemble prediction strategy is used to predict the day-ahead output, and the proposed combination prediction strategy is used to predict the hour-ahead output. As shown in Figure 8a,c,e, the prediction errors of the models with the combination prediction strategy are less than that of the hour-ahead output persistence prediction model, and the prediction error of the TabNet-based combination prediction model is smaller than that of the SVM based combination prediction model. Figure 8b,d,f show that the prediction errors of the models with the ensemble prediction strategy are less than that of the day-ahead output persistence prediction model, and the prediction error of the TabNet-based ensemble prediction model is smaller than that of the SVM based ensemble prediction model.

As shown in Figure 8, the distribution of the measured power versus prediction power points reveals an intuitive comparison between different prediction models but is not reliable in quantitative evaluation. Therefore, the prediction errors based on different models are listed in Table 2 and Table 3 and described in Figure 9 and Figure 10. In Figure 9, the numbers 1, 3, …, 180 represent the fixed TSCPs, E represents the proposed ensemble prediction strategy, and C represents the proposed combination prediction strategy. In the subsequent figures of this section, this abbreviation way is also applied.

It can be seen from Figure 9 that, based on different data-driven models, the proposed ensemble prediction strategy can eliminate the influence of the non-optimal sampling period of training samples on the prediction accuracy, and the proposed combination prediction strategy can further improve the hour-ahead prediction accuracy.

Figure 10 presents a comparative analysis of the prediction errors between the proposed models and the reference models. For day-head output prediction, the nMAEs predicted by the TabNet-based ensemble prediction model, SVM-based ensemble prediction model, and day-ahead persistence prediction model are 8.40%, 8.85%, and 11.26% respectively. Meanwhile, the nRMSEs are 11.11%, 11.35, and 16.62%, respectively. For hour-head output prediction, the nMAEs predicted by the TabNet-based combination prediction model, SVM based combination prediction model, and hour-ahead persistence prediction model are 6.90%, 7.71%, and 9.95%, respectively. Meanwhile, the nRMSEs are 9.49%, 9.97%, and 12.54%, respectively. The experiment results verified that the proposed methods are more accurate than the corresponding reference models.

In order to analyze the stabilities of the proposed models, the daily errors are analyzed. Figure 11 and Figure 12 present daily prediction error distributions of the TabNet-based models and the SVM-based models, respectively. It can be seen that the distributions of daily nMAE and nRMSE using the ensemble prediction strategy are lower than that using the fixed TSCPs, and the distributions using the combination prediction strategy are lower than that using the ensemble prediction strategy.

Figure 13 presents a comparative analysis about the distributions of daily day-ahead prdiction errors between the proposed model and the reference models. The mean values of daily nMAE predicted by the TabNet-based ensemble prediction model, SVM-based ensemble prediction model, and day-ahead persistence prediction model are 8.22%, 8.77%, and 11.12%, respectively, and the median values are 7.95%, 8.39%, and 9.90%, respectively. The mean values of daily nRMSE predicted by the three models are 10.19%, 10.65%, and 14.39%, respectively, and the median values are 9.58%, 10.03%, and 12.92%, respectively. Both the boxplots and the statistical indicators verified that the proposed TabNet-based ensemble prediction model is more stable than the reference day-ahead prediction models.

Figure 14 presents a comparative analysis about distributions of daily hour-ahead prediction errors between the proposed model and the reference models. The mean values of daily nMAE predicted by the TabNet based combination prediction model, SVM based combination prediction model, and hour-ahead persistence prediction model are 6.77%, 7.65%, and 9.88% respectively, and the median values are 6.46%, 7.39%, and 10.77% respectively. The mean values of daily nRMSE predicted by the three models are 8.83%, 9.49%, and 11.91%, respectively, and the median values are 8.38%, 9.12%, and 12.78% respectively. Both the boxplots and the statistical indicators verified that the proposed TabNet-based combination prediction model is more stable than the reference hour-ahead prediction models.

In order to analyze the variations in the performance of the proposed models with time, the monthly errors are analyzed. Figure 15 and Figure 16, respectively, present the monthly nMAE and nRMSE of TabNet-based models. The closer the color of the histogram in the figure is to yellow, the greater the error value is, and the closer it is to blue, the smaller the error value is. It can be seen that, compared with the models with fixed TSCPs, the monthly errors by using the ensemble prediction strategy are lower. By using the proposed combination prediction strategy, the monthly errors are further reduced. Figure 17 and Figure 18, respectively, present the monthly nMAE and nRMSE of SVM-based models. The experimental results are similar to those of the TabNet-based models.

Figure 19 presents a comparative analysis of the day-ahead prediction models. In September, the nMAEs of the proposed day-ahead prediction model (TabNet-based ensemble prediction model) and the two reference models (SVM-based ensemble prediction model and day-ahead persistence prediction model) are close to each other, and the nRMSE of the proposed day-ahead prediction model is close to that of the SVM-based ensemble prediction model but significantly higher than that of the day-ahead persistence prediction model. In other months, considering the two error indicators comprehensively, the performance of the proposed day-ahead prediction model is better than that of the reference models.

Figure 20 presents a comparative analysis of the hour-ahead prediction models. Considering the two error indicators comprehensively, the proposed hour-ahead prediction model (TabNet-based combination prediction model) is better than the reference models.

The experimental results verify that the proposed day-ahead and hour-ahead prediction models are more accurate and stable than the corresponding reference models and show robust performance with monthly variations.

5. Conclusions

This paper presents a new prediction method for the output of the county-level regional DPV stations group, which aims to improve the centralized operation and maintenance of the stations and to meet the needs of power grid dispatching. The weather prediction information is used to predict the output based on the model input average strategy. Considering the effect of the selected non-optimal TSCP on the prediction accuracy, an ensemble prediction method based on the mRMR criterion and TabNet model is carried out to predict the day-ahead output. Firstly, multiple fixed TSCPs are randomly generated, and the output prediction series of the day before the predicted period are predicted based on the fixed TSCPs and the TabNet model. The weight vector of the output prediction series of the previous day is calculated according to the mRMR algorithm. Then, based on the fixed TSCPs and the TabNet model, the output vector of the predicted period is predicted. Finally, the output prediction value of the predicted period is obtained by the weighted average method. The nMAEs and nRMSEs of the prediction results based on the fixed TSCPs are in the interval (8.85%, 16.09%) and the interval (12.06%, 23.74%), respectively. The nMAE and nRMSE of the prediction results based on the proposed ensemble prediction model are 8.4% and 11.11%, respectively. Therefore, the effect of the selected non-optimal TSCP on the prediction accuracy can be eliminated by the proposed ensemble prediction model.

Taking into account the influence of weather prediction errors on the power output prediction, a modified model based on error prediction is proposed. Firstly, the functional relationship between the prediction errors of the same weather type on the same day is learned. Then, based on the functional relationship, the prediction error of the predicted period is predicted. Finally, the output prediction result of the proposed ensemble prediction model is modified by the predicted error. The nMAE and nRMSE of the hour-ahead output prediction results obtained by this combination prediction model are 6.9% and 9.49%, respectively, which is less than that of the proposed ensemble prediction model. Thus, the influence of weather prediction errors on the power output prediction is reduced by the proposed modified model.

According to the overall error analysis, compared with the reference day-ahead prediction model, the proposed ensemble prediction model reduces the nMAE by 2.86% and the nRMSE by 5.51%, respectively, and compared with the reference hour-ahead prediction model, the proposed combination prediction model reduces the nMAE by 3.05% and the nRMSE by 3.05%, respectively. Based on the daily error analysis, compared with the reference day-ahead prediction model, the proposed ensemble prediction model reduces the mean value of daily nMAE by 2.9% and daily nRMSE by 4.2%, respectively, and compared with the reference hour-ahead prediction model, the proposed combination prediction model reduces the mean value of daily nMAE by 3.11% and daily nRMSE by 3.08%, respectively. In accordance with the monthly error analysis, compared with the reference day-ahead prediction model, the proposed ensemble prediction model reduces the mean value of monthly nMAE by 2.91% and monthly nRMSE by 5.3%, respectively, and compared with the reference hour-ahead prediction model, the proposed combination prediction model reduces the mean value of monthly nMAE by 3.02% and monthly nRMSE by 3.01%, respectively. Therefore, the proposed day-ahead and hour-ahead prediction models are more accurate and stable than the corresponding reference models and show robust performance with monthly variations.

Author Contributions

Each author contributed extensively to this research. G.P. and J.O. provided the research ideas and guided the revision of the draft; D.M. and R.X. designed the experiment; D.M., Z.Z. and L.C. performed the experiments; D.M., Z.Z. and L.C. analyzed the data; and D.M. wrote the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research is financially supported by Key R&D Program of Zhejiang Province, China (2022C01244); and The National Key Research and Development Program of China (2017YFA0700301).

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to privacy.

Acknowledgments

We wish to thank Yuhang Zhao for providing the experimental data.

Conflicts of Interest

The authors declare no conflict of interest.

References

Fichter, T.; Soria, R.; Szklo, A.; Schaeffer, R.; Lucena, A.F. Assessing the potential role of concentrated solar power (CSP) for the northeast power system of Brazil using a detailed power system model. Energy 2017, 121, 695–715. [Google Scholar] [CrossRef]
Liu, C.; Li, M.; Yu, Y.; Wu, Z.; Gong, H.; Cheng, F. A Review of Multitemporal and Multispatial Scales Photovoltaic Forecasting Methods. IEEE Access 2022, 10, 35073–35093. [Google Scholar] [CrossRef]
Kim, T.; Kim, J. A Regional Day-Ahead Rooftop Photovoltaic Generation Forecasting Model Considering Unauthorized Photovoltaic Installation. Energies 2021, 14, 4256. [Google Scholar] [CrossRef]
Lorenz, E.; Heinemann, D.; Kurz, C. Local and regional photovoltaic power prediction for large scale grid integration: Assessment of a new algorithm for snow detection. Prog. Photovolt. 2012, 20, 760–769. [Google Scholar] [CrossRef]
Fonseca, J.; Oozeki, T.; Ohtake, H.; Shimose, K.; Takashima, T.; Ogimoto, K. Regional forecasts and smoothing effect of photovoltaic power generation in Japan: An approach with principal component analysis. Renew. Energy 2014, 68, 403–413. [Google Scholar]
Fonseca, J.G.D.; Oozeki, T.; Ohtake, H.; Takashima, T.; Ogimoto, K. Regional forecasts of photovoltaic power generation according to different data availability scenarios: A study of four methods. Prog. Photovolt. 2015, 23, 1203–1218. [Google Scholar] [CrossRef]
Zamo, M.; Mestre, O.; Arbogast, P.; Pannekoucke, O. A benchmark of statistical regression methods for short-term forecasting of photovoltaic electricity production, part I: Deterministic forecast of hourly production. Sol. Energy 2014, 105, 792–803. [Google Scholar] [CrossRef]
Saint-Drenan, Y.M.; Good, G.H.; Braun, M.; Freisinger, T. Analysis of the uncertainty in the estimates of regional PV power generation evaluated with the upscaling method. Sol. Energy 2016, 135, 536–550. [Google Scholar] [CrossRef] [Green Version]
Pierro, M.; De Felice, M.; Maggioni, E.; Moser, D.; Perotto, A.; Spada, F.; Cornaro, C. Data-driven upscaling methods for regional photovoltaic power estimation and forecast using satellite and numerical weather prediction data. Sol. Energy 2017, 158, 1026–1038. [Google Scholar] [CrossRef]
Fu, L.; Yang, Y.L.; Yao, X.L.; Jiao, X.F.; Zhu, T.T. A Regional Photovoltaic Output Prediction Method Based on Hierarchical Clustering and the mRMR Criterion. Energies 2019, 12, 3817. [Google Scholar] [CrossRef] [Green Version]
Koster, D.; Minette, F.; Braun, C.; O’Nagy, O. Short-term and regionalized photovoltaic power forecasting, enhanced by reference systems, on the example of Luxembourg. Renew. Energy 2019, 132, 455–470. [Google Scholar] [CrossRef]
Saint-Drenan, Y.M.; Vogt, S.; Killinger, S.; Bright, J.M.; Fritz, R.; Potthast, R. Bayesian parameterisation of a regional photovoltaic model—Application to forecasting. Sol. Energy 2019, 188, 760–774. [Google Scholar] [CrossRef] [Green Version]
Zhang, X.; Li, Y.; Lu, S.; Hamann, H.F.; Hodge, B.-M.; Lehman, B. A Solar Time Based Analog Ensemble Method for Regional Solar Power Forecasting. IEEE Trans. Sustain. Energy 2019, 10, 268–279. [Google Scholar] [CrossRef]
Shaker, H.; Zareipour, H.; Wood, D. Estimating Power Generation of Invisible Solar Sites Using Publicly Available Data. IEEE Trans. Smart Grid 2016, 7, 2456–2465. [Google Scholar] [CrossRef]
Wang, Y.; Zhang, N.; Chen, Q.; Kirschen, D.S.; Li, P.; Xia, Q. Data-Driven Probabilistic Net Load Forecasting With High Penetration of Behind-the-Meter PV. IEEE Trans. Power Syst. 2018, 33, 3255–3264. [Google Scholar] [CrossRef]
Kapourchali, M.H.; Sepehry, M.; Aravinthan, V. Multivariate Spatio-temporal Solar Generation Forecasting: A Unified Approach to Deal with Communication Failure and Invisible Sites. IEEE Syst. J. 2019, 13, 1804–1812. [Google Scholar] [CrossRef]
Shaker, H.; Manfre, D.; Zareipour, H. Forecasting the aggregated output of a large fleet of small behind-the-meter solar photovoltaic sites. Renew. Energy 2020, 147, 1861–1869. [Google Scholar] [CrossRef]
Shaker, H.; Zareipour, H.; Wood, D. A Data-Driven Approach for Estimating the Power Generation of Invisible Solar Sites. IEEE Trans. Smart Grid 2016, 7, 2466–2476. [Google Scholar] [CrossRef]
Bright, J.M.; Killinger, S.; Lingfors, D.; Engerer, N.A. Improved satellite-derived PV power nowcasting using real-time power data from reference PV systems. Sol. Energy 2018, 168, 118–139. [Google Scholar] [CrossRef]
Aillaud, P.; Lequeux, J.; Mathe, J.; Huet, L.; Lallemand, C.; Liandrat, O.; Sebastien, N.; Kurzrock, F.; Schmutz, N. Day-ahead forecasting of regional photovoltaic production using deep learning. In Proceedings of the 47th IEEE Photovoltaic Specialists Conference, PVSC 2020, Calgary, OR, Canada, 15 June–21 August 2020; Institute of Electrical and Electronics Engineers Inc.: Calgary, OR, Canada, 2020; pp. 2688–2691. [Google Scholar]
Moschella, M.; Tucci, M.; Crisostomi, E.; Betti, A. A Machine Learning Model for Long-Term Power Generation Forecasting at Bidding Zone Level. In Proceedings of the 2019 IEEE PES Innovative Smart Grid Technologies Europe (ISGT-Europe), Bucharest, Romania, 29 September–2 October 2019; pp. 1–5. [Google Scholar]
Pierro, M.; Gentili, D.; Liolli, F.R.; Cornaro, C.; Moser, D.; Betti, A.; Moschella, M.; Collino, E.; Ronzio, D.; van der Meer, D. Progress in regional PV power forecasting: A sensitivity analysis on the Italian case study. Renew. Energy 2022, 189, 983–996. [Google Scholar] [CrossRef]
Yu, Y.; Wang, M.; Yan, F.; Yang, M.; Yang, J. Improved convolutional neural network-based quantile regression for regional photovoltaic generation probabilistic forecast. IET Renew. Power Gener. 2020, 14, 2712–2719. [Google Scholar] [CrossRef]
Wolff, B.; Kuhnert, J.; Lorenz, E.; Kramer, O.; Heinemann, D. Comparing support vector regression for PV power forecasting to a physical modeling approach using measurement, numerical weather prediction, and cloud motion data. Sol. Energy 2016, 135, 197–208. [Google Scholar] [CrossRef]
Saint-Drenan, Y.M.; Good, G.H.; Braun, M. A probabilistic approach to the estimation of regional photovoltaic power production. Sol. Energy 2017, 147, 257–276. [Google Scholar] [CrossRef] [Green Version]
Arik, S.O.; Pfister, T.; Assoc Advancement Artificial, I. TabNet: Attentive Interpretable Tabular Learning. In Proceedings of the 35th AAAI Conference on Artificial Intelligence, Electr Network, Online, 2–9 February 2021; pp. 6679–6687. [Google Scholar]
Dauphin, Y.N.; Fan, A.; Auli, M.; Grangier, D. Language modeling with gated convolutional networks. In Proceedings of the International Conference on Machine Learning, Sydney, NSW, Australia, 6–11 August 2017; pp. 933–941. [Google Scholar]
Gehring, J.; Auli, M.; Grangier, D.; Yarats, D.; Dauphin, Y.N. Convolutional sequence to sequence learning. In Proceedings of the 34th International Conference on Machine Learning, Sydney, NSW, Australia, 6–11 August 2017; Volume 70, pp. 1243–1252. [Google Scholar]
Hoffer, E.; Hubara, I.; Soudry, D. Train longer, generalize better: Closing the generalization gap in large batch training of neural networks. In Proceedings of the 31st Annual Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 1729–1739. [Google Scholar]
Peng, H.C.; Long, F.H.; Ding, C. Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 1226–1238. [Google Scholar] [CrossRef] [PubMed]
Li, Y.; Gu, X. Feature selection for transient stability assessment based on improved maximal relevance and minimal redundancy criterion. Zhongguo Dianji Gongcheng Xuebao/Proc. Chin. Soc. Electr. Eng. 2013, 33, 179–186. [Google Scholar]

Figure 1. A TabNet model example.

Figure 2. An attentive transformer block example.

Figure 3. A feature transformer block example.

Figure 4. The data experiment scheme.

Figure 5. The proposed modified model.

Figure 6. Prediction results based on the proposed ensemble prediction model.

Figure 7. Prediction results based on the proposed combination prediction model.

Figure 8. The predicted outputs versus the measured outputs: (a) using the TabNet-based combination prediction model; (b) using the TabNet-based ensemble prediction model; (c) using the SVM-based combination prediction model; (d) using the SVM-based ensemble prediction model; (e) using the hour-ahead output persistence prediction model; (f) using the day-ahead output persistence prediction model.

Figure 9. Prediction errors based on different models. (a) nMAEs predicted by the TabNet models with the fixed TSCPs, TabNet-based ensemble prediction model, and TabNet-based combination prediction model; (b) nRMSEs predicted by the TabNet models with the fixed TSCPs, TabNet based ensemble prediction model, and TabNet-based combination prediction model; (c) nMAEs predicted by the SVM models with the fixed TSCPs, SVM based ensemble prediction model, and SVM based combination prediction model; (d) nRMSEs predicted by the SVM models with the fixed TSCPs, SVM based ensemble prediction model, and SVM based combination prediction model.

Figure 10. Histogram comparison of the prediction errors: (a) Of day-ahead output prediction results; (b) of hour-ahead output prediction results.

Figure 11. Boxplots of daily prediction errors of TabNet-based models. (a) nMAE boxplot; (b) nRMSE boxplot.

Figure 12. Boxplots of daily prediction errors of SVM-based models. (a) nMAE boxplot; (b) nRMSE boxplot.

Figure 13. Boxplots of daily day-ahead prediction errors. (a) nMAE boxplot; (b) nRMSE boxplot.

Figure 14. Boxplots of daily hour-ahead prediction errors. (a) nMAE boxplot; (b) nRMSE boxplot.

Figure 15. The monthly nMAE histogram of the TabNet-based models.

Figure 16. The monthly nRMSE histogram of the TabNet-based models.

Figure 17. The monthly nMAE histogram of the SVM-based models.

Figure 18. The monthly nRMSE histogram of the SVM-based models.

Figure 19. Histogram of monthly errors of the day-ahead prediction models. (a) nMAE histogram; (b) nRMSE histogram.

Figure 20. Histogram of monthly errors of the hour-ahead prediction models. (a) nMAE histogram; (b) nRMSE histogram.

Table 1. Calculation results of MI and weight.

Extraction Order of Output Prediction Series	Fixed TSCP Corresponding to Output Prediction Series	MI	Weight
1	12	4.28	7.69%
2	7	3.87	6.96%
3	16	4.25	7.64%
4	4	3.61	6.50%
5	10	4.13	7.42%
6	14	3.63	6.53%
7	11	4.10	7.37%
8	8	4.19	7.53%
9	5	4.03	7.24%
10	3	3.51	6.31%
11	15	4.12	7.40%
12	13	4.14	7.45%
13	6	3.74	6.72%
14	9	4.01	7.21%

Table 2. Prediction errors for each TabNet-based model.

Model	nMAE	nRMSE
TSCP = 1	16.09%	23.74%
TSCP = 3	14.01%	20.86%
TSCP = 5	12.09%	17.62%
TSCP = 10	11.74%	17.60%
TSCP = 20	10.16%	14.20%
TSCP = 30	9.53%	13.36%
TSCP = 40	9.49%	13.12%
TSCP = 50	9.07%	12.33%
TSCP = 60	9.11%	12.43%
TSCP = 70	8.85%	12.06%
TSCP = 80	9.78%	13.75%
TSCP = 90	9.60%	12.97%
TSCP = 100	8.98%	12.11%
TSCP = 110	9.40%	13.35%
TSCP = 120	9.79%	13.86%
TSCP = 180	9.52%	12.84%
Ensemble prediction model	8.40%	11.11%
Combination prediction model	6.90%	9.49%

Table 3. Prediction errors for each SVM-based model.

Model	nMAE	nRMSE
TSCP = 1	27.79%	34.57%
TSCP = 3	12.16%	16.31%
TSCP = 5	11.49%	15.48%
TSCP = 10	9.60%	12.46%
TSCP = 20	9.68%	12.41%
TSCP = 30	9.30%	11.91%
TSCP = 40	9.24%	11.94%
TSCP = 50	9.12%	11.75%
TSCP = 60	8.99%	11.59%
TSCP = 70	9.02%	11.64%
TSCP = 80	8.97%	11.59%
TSCP = 90	8.97%	11.57%
TSCP = 100	9.00%	11.62%
TSCP = 110	8.98%	11.62%
TSCP = 120	9.01%	11.66%
TSCP = 180	9.01%	11.80%
Ensemble prediction model	8.85%	11.35%
Combination prediction model	7.71%	9.97%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ma, D.; Xie, R.; Pan, G.; Zuo, Z.; Chu, L.; Ouyang, J. Photovoltaic Power Output Prediction Based on TabNet for Regional Distributed Photovoltaic Stations Group. Energies 2023, 16, 5649. https://doi.org/10.3390/en16155649

AMA Style

Ma D, Xie R, Pan G, Zuo Z, Chu L, Ouyang J. Photovoltaic Power Output Prediction Based on TabNet for Regional Distributed Photovoltaic Stations Group. Energies. 2023; 16(15):5649. https://doi.org/10.3390/en16155649

Chicago/Turabian Style

Ma, Dengchang, Rongyi Xie, Guobing Pan, Zongxu Zuo, Lidong Chu, and Jing Ouyang. 2023. "Photovoltaic Power Output Prediction Based on TabNet for Regional Distributed Photovoltaic Stations Group" Energies 16, no. 15: 5649. https://doi.org/10.3390/en16155649

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Photovoltaic Power Output Prediction Based on TabNet for Regional Distributed Photovoltaic Stations Group

Abstract

1. Introduction

2. Theoretical Background

2.1. TabNet Model

2.2. Mutual Information

2.3. mRMR Criterion

3. Proposed Method

3.1. Data Experiment Scheme

3.2. Proposed Ensemble Prediction Model

3.3. Proposed Modified Model

3.4. Experimental Data and DATA Preprocessing

4. Results and Discussion

4.1. Output Prediction

4.2. Performance Analysis

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI