A Very Short-Term Photovoltaic Power Forecasting Model Using Linear Discriminant Analysis Method and Deep Learning Based on Multivariate Weather Datasets

Nahed, Zemouri; Hatem, Mezaache; Aissa, Chouder

doi:10.3390/ASEC2023-15228

Open AccessProceeding Paper

A Very Short-Term Photovoltaic Power Forecasting Model Using Linear Discriminant Analysis Method and Deep Learning Based on Multivariate Weather Datasets^†

by

Zemouri Nahed

^1,*,

Mezaache Hatem

² and

Chouder Aissa

²

¹

Department of Electronics, University of Mohamed Boudiaf M’sila, M’Sila 28000, Algeria

²

Electrical Engineering Laboratory (LGE), University of Mohamed Boudiaf M’sila, M’Sila 28000, Algeria

^*

Author to whom correspondence should be addressed.

^†

Presented at the 4th International Electronic Conference on Applied Sciences, 27 October–10 November 2023; Available online: https://asec2023.sciforum.net/.

Eng. Proc. 2023, 56(1), 1; https://doi.org/10.3390/ASEC2023-15228

Published: 26 October 2023

(This article belongs to the Proceedings of The 4th International Electronic Conference on Applied Sciences)

Download

Browse Figures

Versions Notes

Abstract

:

Photovoltaic (PV)-system-generated solar energy has inconsistent and variable properties, which makes controlling electric power distribution and preserving grid stability extremely difficult. A photovoltaic (PV) system’s performance is profoundly affected by the amount of sunlight that reaches the solar cell, the season of the year, the ambient temperature, and the humidity of the air. Every renewable energy technology, sadly, has its problems. As a result, the system is unable to function at its highest or best level. To combat the unstable and intermittent performance of solar power output, it is essential to achieve a precise PV system output power. This work introduces a new approach to enhancing accuracy and extending the time range of very short-term solar energy forecasting (15 min step ahead) by using multivariate time series inputs in different seasons. First, Linear Discriminant Analysis (LDA) is used to select the relevant factors from the mixed meteorological input data. Secondly, two very short-term deep learning prediction models, CNN and LSTM, are used to predict PV power for a shuffled and reduced database of weather inputs. Finally, the predicted outputs from the two models are combined using a classification strategy. The proposed method is applied to one year of real data collected from a solar power plant located in southern Algeria to demonstrate that this technique can improve the forecasting accuracy compared to other techniques, as determined through the use of statistical analysis involving normalized root mean square error (NRMSE), mean absolute error (MAE), mean bias error (MBE), and determination coefficient. (R²).

Keywords:

very short-term; solar power forecasting; deep learning; aggregation; weather prediction

1. Introduction

Nowadays, solar energy has become a crucial element in electricity production systems worldwide. For the proper functioning and cost-effective integration of solar power, accurate photovoltaic power forecasting is essential [1]. Several studies have been conducted, and different methods have been suggested in the literature. There are three categories of solar energy forecasting methods: physical methods, statistical methods, and machine learning methods [2,3]. In [4], N. Zemouri et al. proposed a novel approach using multi-model statistical ensembles to predict short-term GHI using the classification strategy. In [2], Kejun Wang et al. proposed a hybrid deep learning model LSTM-CNN applied to photovoltaic power prediction. In [5], Elham M. Al-Ali and al presented a hybrid CNN-LSTM-Transformer Model to forecast solar energy production. The rest of this paper is structured as follows: the next section provides a brief description of the proposed approach. To evaluate the proposed approach, we choose the statistical criteria, which are presented in Section 3; the practical experimental results and comparisons for the evaluation of the proposed technique are described in Section 3. Finally, the concluding remarks of this study can be found in Section 4.

2. Methodology

Figure 1 illustrates the operational flow of the proposed approach for executing very short-term forecasting of PV output power using different stage techniques. The prediction system is trained using real weather input data (solar radiation, temperature, relative humidity, pressure and wind speed) to forecast solar PV generation. After the actual methodological data have been subjected to dimensionality reduction to acquire an ideal collection of past variables, this collection serves as an input component for every predictive model trained using deep learning methods. The fusion of these two models enables the ultimate prediction of photovoltaic solar energy output. All these steps are detailed in the following subsection.

2.1. Meteorological Weather Input Parameters

In the present work, different meteorological weather input parameters are used to forecast solar power, which are as follows: temperature (T), relative humidity (RH), wind speed (WS) and solar radiation (SR) because these are uncomplicated to measure. The process of choosing input variables is a significant aspect of time series prediction. Selecting the appropriate input variables is an important step for modeling time series effectively. When there is a set of independent variables, it is necessary to perform variable selection to exclude variables that have minimal impact on the forecast and ensure important variables are not overlooked. The selection of variables is influenced by factors such as data availability, quality, and correlation with the target variable [6]. In order to determine the connection between the meteorological variables collected and the output of a solar power plant, a statistical analysis was conducted. Pearson correlation was utilized as a metric to assess the linear correlation between the two variables, which will be explained in the following subsection.

2.2. Correlation between Weather Inputs Data and Power

The process of choosing input variables is a significant aspect of time series prediction. Selecting the appropriate input variables is critical for modeling time series effectively. When there is a set of independent variables, it is necessary to perform variable selection to exclude variables that have minimal impact on the forecast and ensure important variables are not overlooked. The selection of variables is influenced by factors such as data availability, quality, and correlation with the target variable [7]. In order to determine the connection between the meteorological variables collected and the output of a solar power plant, a statistical analysis was conducted. Pearson correlation is utilized as a metric to assess the linear correlation between the two variables. It calculates the strength and direction of the relationship between the variables, ranging from −1 to +1. A value of +1 indicates a perfect positive linear relationship, 0 indicates no linear relationship, and −1 indicates a perfect negative linear relationship. Equation (1) represents the mathematical expression of the Pearson correlation coefficient.

R = \frac{n \sum X Y - \sum X \sum Y}{\sqrt{(n \sum X^{2} - {(\sum X)}^{2}) . (n \sum Y^{2} - {(\sum Y)}^{2})}}

(1)

Table 1 shows the assessed effectiveness of the correlations between SP (solar power) and the different meteorological components, emphasizing the best results in bold. The impact of these relationships was evaluated using the correlation coefficient (R), which shows the degree to which the two variables are related.

2.3. Deep Learning Application Method to Forecast Power

The following section contains a brief introduction to the standard LSTM and 1D-CNN models that are used to forecast the power output.

2.3.1. LSTM-RNN Predictor Model

LSTM is a new kind of RNN, and it is substantially distinct compared to standard ANNs, which are often used to represent sequential data, such as time series or natural language. LSTMs, however, are able to recall long-term dependencies because they were specifically created to address the vanishing gradient problem [8]. LSTM comprises four basic interconnected layers (three gates and cell state): cell state is used to store information; the forget gate controls how much information is forgotten by the cell memory; the input gate is used to determine which inputs are utilized to update the memory of the cell; finally, the output gate defines which components of the cell memory need to be updated to change the LSTM cell’s hidden state. Figure 2a illustrates the LSTM’s fundamental unit architecture. The output cell

c_{t}

and upward output

h_{t}

are determined using the following equations:

{\tilde{c}}_{t} = \tanh (W_{c} x_{t} + U_{c} h_{t - t} + b_{c})

(2)

c_{t} = f_{t} ⨂ c_{t - 1} + i_{t} ⨂ {\tilde{c}}_{t}

(3)

h_{t} = o_{t} ⨂ \tanh (c_{t})

(4)

where

σ t a n h

represents sigmoid and hyperbolic tangent activation functions, respectively, and

⨂

denotes the operation of matrix multiplication. In [8], a general introduction to the LSTM model is presented. The second deep learning prediction method used in our work is CNN, as mentioned in the following subsection.

2.3.2. CNN Predictor for 1D Signal

In 1989, deep ANNs, types of convolutional neural networks (CNNs), were first used to recognize zip codes [9]. At present, CNN methods have been created specifically to handle one-dimensional data, also called 1D-CNN. It has beeen very recently proposed for use in a number of applications and performed well, particularly for 1D signals, as in our case. The components that comprise a 1D-CNN model are as follows: convolution layers, pooling layers, and fully connected layers or dense layers, as shown in Figure 2b; the tasks of each layer are presented in [10,11,12], and the details of the mathematical equations are outlined in [13].

Figure 2. (a) LSTM unit architecture [14]. (b) 1D-CNNarchitecture.

2.4. Supervised Dimensionality Reduction Based on the LDA Technique

Linear Discriminate Analysis (LDA) is one famous supervised learning technique used in statistics and another field to find a linear combination of features that characterizes or separates two or more classes of data [15]. It can be achieved through three principal steps: (1) calculate the separability between different classes called between-class variance (Sb), (2) calculate the distance between the mean and sample of each class, which is called the within-class variance (SW), and (3) construct the lower dimensional space, which maximizes between-class variance (Sb) and minimizes the within-class variance (SW), as mentioned in Figure 3.

2.5. Classification Aggregation Strategy

A combination strategy combines various techniques in order to enhance the performance of global forecasting. It presents a conscious decision to increase forecasting accuracy by using a practical approach; in the present work, two prediction models were combined using a classification strategy that is based on the hypothesis that several high-performance predictors may be associated with various regions of the input time series variable space [4] the best predictor is the one that minimizes error throughout the whole variable space wee used the RMSE error to suggested our proposed strategy.

3. Results and Discussion

The data that were used in this study are real data collected within 15 min during the period from 1 January 2019, to 31 December 2019, from a photovoltaic solar power plant with a capacity of 20 megawatts located in a city situated in East Algeria in North Africa. The meteorological data included are taken as the proposed model inputs. The collected dataset was divided into three subsets; this division was realized with the aim of obtaining acceptable and accurate forecasting data. The first subset includes a train set, containing 70%, and the second is regarded as a test set, containing 30% of the total data. All of these data are used to forecast solar power and to evaluate our approach proposed in this work; different criteria were used as follows: mean square error (NRMSE), mean absolute percentage error (MAPE), mean bias error (MBE), and coefficient of determination (R²) their equation is mentioned in [4]. The characters of the two methods of prediction used in this work are mentioned in Table 2.

The prediction accuracy of the proposed model is compared in Table 3, which shows that the used strategy based on classification aggregating performs significantly better than the other individual mode with LDA and without LDA in terms of R², with 0.9337, which indicates that the classification-based strategy outperforms the other ones by a large margin. Performance samples from the prediction of solar power with LSTM and CNN after using LDA reduction are compared with their combinations, as shown in Figure 4.

4. Conclusions

An aggregating deep learning technique with a dimensionality reduction framework called the Linear Discriminate Analysis (LDA) method is used to forecast solar power. Its combination based on a classification strategy is proposed in this paper. The suggested approach demonstrates experimental results in terms of forecasting tasks involving solar power, indicating that the combination strategy yields the best results compared with individual models. The good performance of the proposed models demonstrated here should make them more attractive for a large variety of forecasting problems.

Future Research

Under feasible forecasting conditions, the proposed approach has achieved good forecasting accuracy, and it can be further improved. As research work that can be carried out in the future, we can aim to propose other dimensionality reduction methods, such as PCA or MRMR, to add a block of decomposition to the signal of the weather time series using, for example, EMD, CEEMD or VMD, with the objective of improving the precision of the forecasts of the model of the proposed forecast.

Author Contributions

Conceptualization, Z.N., M.H. and C.A.; methodology, Z.N.; software, Z.N.; validation, Z.N., M.H. and C.A.; formal analysis, Z.N.; investigation, Z.N.; resources, Z.N.; data curation, Z.N.; writing—original draft preparation, Z.N.; writing—review and editing, Z.N.; visualization, Z.N.; supervision, Z.N.; project administration, Z.N.; funding acquisition, M.H. All authors have read and agreed to the published version of the manuscript.

Funding

Electrical Engineering Laboratory (LGE), University of Mohamed Boudiaf M’sila, M’Sila, Algeria.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

b_c, c_t	Bias term for generating candidate; output cell
c_t−1, $\tilde{c_{t}}$	Cell state from previous time step; candidate cell state
f_t	Forget gate
h_t, h_t−1	Upward output; previous hidden state
i_t	Input value of weather time series
LDA	Linear Discriminate Analysis
o_t	Output gate
R, R²	Pearson correlation coefficient; coefficient of determination
RH, SR, SP	Relative humidity; solar radiation; solar power
WS, T	Wind speed; temperature
W_c	Weight matrix for generating candidate
x_t	Input value of weather time series

References

Abdel-Nasser, M.; Mahmoud, K. Accurate photovoltaic power forecasting models using deep LSTM-RNN. Neural Comput. Appl. 2019, 31, 2727–2740. [Google Scholar] [CrossRef]
Wang, K.; Qi, X.; Liu, H. Photovoltaic power forecasting based LSTM-Convolutional Network. Energy 2019, 189, 116225. [Google Scholar] [CrossRef]
Antonanzas, J.; Osorio, N.; Escobar, R.; Urraca, R.; Martinez-de-Pison, F.J.; Antonanzas-Torres, F. Review of photovoltaic power forecasting. Sol. Energy 2016, 136, 78–111. [Google Scholar] [CrossRef]
Zemouri, N.; Bouzgou, H.; Gueymard, C.A. Multimodel ensemble approach for hourly global solar irradiation forecasting. Eur. Phys. J. Plus 2019, 134, 594. [Google Scholar] [CrossRef]
Al-Ali, E.M.; Hajji, Y.; Said, Y.; Hleili, M.; Alanzi, A.M.; Laatar, A.H.; Atri, M. Solar Energy Production Forecasting Based on a Hybrid CNN-LSTM-Transformer Model. Mathematics 2023, 11, 676. [Google Scholar] [CrossRef]
Al-Shamisi, M.H.; Assi, A.H.; Hejase, H.A.N. Artificial neural networks for predicting global solar radiation in Al Ain City—UAE. Int. J. Green Energy 2013, 10, 443–456. [Google Scholar] [CrossRef]
Al Shamisi, M.H.; Assi, A.H.; Hejase, H.A. Using MATLAB to Develop Artificial Neural Network Models for Predicting Global Solar Radiation in Al Ain City—UAE. Eng. Educ. Res. Using MATLAB 2011. [Google Scholar] [CrossRef]
Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
Gu, J.; Wang, Z.; Kuen, J.; Ma, L.; Shahroudy, A.; Shuai, B.; Liu, T.; Wang, X.; Wang, G.; Cai, J.; et al. Recent advances in convolutional neural networks. Pattern Recognit. 2018, 77, 354–377. [Google Scholar] [CrossRef]
Ghimire, S.; Nguyen-Huy, T.; Deo, R.C.; Casillas-Pérez, D.; Salcedo-Sanz, S. Efficient daily solar radiation prediction with deep learning 4-phase convolutional neural network, dual stage stacked regression and support vector machine CNN-REGST hybrid model. Sustain. Mater. Technol. 2022, 32, e00429. [Google Scholar] [CrossRef]
Wang, F.; Yu, Y.; Zhang, Z.; Li, J.; Zhen, Z.; Li, K. Wavelet decomposition and convolutional LSTM networks based improved deep learning model for solar irradiance forecasting. Appl. Sci. 2018, 8, 1286. [Google Scholar] [CrossRef]
Jalali, S.M.J.; Ahmadian, S.; Kavousi-Fard, A.; Khosravi, A.; Nahavandi, S. Automated Deep CNN-LSTM Architecture Design for Solar Irradiance Forecasting. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 54–65. [Google Scholar] [CrossRef]
Lemke, C.; Gabrys, B. Meta-learning for time series forecasting and forecast combination. Neurocomputing 2010, 73, 2006–2016. [Google Scholar] [CrossRef]
Bouktif, S.; Fiaz, A.; Ouni, A.; Serhani, M.A. Single and multi-sequence deep learning models for short and medium term electric load forecasting. Energies 2019, 12, 149. [Google Scholar] [CrossRef]
ArchanaH, T.; Sachin, D. Dimensionality Reduction and Classification through PCA and LDA. Int. J. Comput. Appl. 2015, 122, 4–8. [Google Scholar] [CrossRef]

Figure 1. The proposed model to enable solar power forecasting.

Figure 3. Linear Discriminate Analysis principle.

Figure 4. 15-min ahead solar power forecasting for different deep learning strategy methods compared with their combinations.

Table 1. Performance comparison of solar power vs. meteorological data in terms of correlation coefficient (R) Bolded values are the best in the comparison.

Relationship	SP(kw)	T(C°)	SR(w/m²)	RH(%)	WS(m/s)	Pres(HPa)
SP(kw)	1.0000	0.9329	0.8520	−0.5015	0.1032	0.1711
T(C°)	0.9329	1.0000	0.8430	−0.6964	0.1220	0.0562
SR(w/m²)	0.852	0.8430	1.0000	−0.4595	0.0162	0.0702
RH(%)	−0.5015	−0.6964	−0.4595	1.0000	−0.2519	0.3060
WS(m/s)	0.1032	0.1220	0.01620	−0.2519	1.0000	−0.3088
Pres(HPa)	0.1711	0.0562	0.07020	0.3060	−0.3088	1.0000

Table 2. Setting parameters of the prediction models.

Model	Parameters	Value
LSTM	Optimizer, mini batch size, number of hidden units, learning rate	ADAM, 80, 107, 0.0005
CNN	Convolutional layer, max-pooling layer, dropout layer	3, 5, 0.005
CNN	initial learning rate, mini batch size	0.002, 2

Table 3. Prediction accuracy of different proposed methods. The best results are highlighted in bold.

Methods	CNN	CNN-LDA	LSTM	LSTM-LDA	Class
NRMSE	0.1697	0.1480	0.0019	0.0016	0.0014
MAPE	43.6935	35.6423	17.7424	16.055	16.012
MBE	1.5077	1.0137	0.0580	0.0225	0.0183
R²	0.8701	0.8828	0.9234	0.9319	0.9337

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Nahed, Z.; Hatem, M.; Aissa, C. A Very Short-Term Photovoltaic Power Forecasting Model Using Linear Discriminant Analysis Method and Deep Learning Based on Multivariate Weather Datasets. Eng. Proc. 2023, 56, 1. https://doi.org/10.3390/ASEC2023-15228

AMA Style

Nahed Z, Hatem M, Aissa C. A Very Short-Term Photovoltaic Power Forecasting Model Using Linear Discriminant Analysis Method and Deep Learning Based on Multivariate Weather Datasets. Engineering Proceedings. 2023; 56(1):1. https://doi.org/10.3390/ASEC2023-15228

Chicago/Turabian Style

Nahed, Zemouri, Mezaache Hatem, and Chouder Aissa. 2023. "A Very Short-Term Photovoltaic Power Forecasting Model Using Linear Discriminant Analysis Method and Deep Learning Based on Multivariate Weather Datasets" Engineering Proceedings 56, no. 1: 1. https://doi.org/10.3390/ASEC2023-15228

Article Menu

A Very Short-Term Photovoltaic Power Forecasting Model Using Linear Discriminant Analysis Method and Deep Learning Based on Multivariate Weather Datasets^†

Abstract

1. Introduction