Solar Radiation Prediction Based on Convolution Neural Network and Long Short-Term Memory

Zhu, Tingting; Guo, Yiren; Li, Zhenye; Wang, Cong

doi:10.3390/en14248498

Open AccessArticle

Solar Radiation Prediction Based on Convolution Neural Network and Long Short-Term Memory

¹

College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China

²

Key Laboratory of Measurement and Control of Complex Systems of Engineering (Southeast University), Ministry of Education, Nanjing 210096, China

^*

Author to whom correspondence should be addressed.

Energies 2021, 14(24), 8498; https://doi.org/10.3390/en14248498

Submission received: 29 October 2021 / Revised: 8 December 2021 / Accepted: 13 December 2021 / Published: 16 December 2021

Download

Browse Figures

Versions Notes

Abstract

:

Photovoltaic power generation is highly valued and has developed rapidly throughout the world. However, the fluctuation of solar irradiance affects the stability of the photovoltaic power system and endangers the safety of the power grid. Therefore, ultra-short-term solar irradiance predictions are widely used to provide decision support for power dispatching systems. Although a great deal of research has been done, there is still room for improvement regarding the prediction accuracy of solar irradiance including global horizontal irradiance, direct normal irradiance and diffuse irradiance. This study took the direct normal irradiance (DNI) as prediction target and proposed a Siamese convolutional neural network-long short-term memory (SCNN-LSTM) model to predict the inter-hour DNI by combining the time-dependent spatial features of total sky images and historical meteorological observations. First, the features of total sky images were automatically extracted using a Siamese CNN to describe the cloud information. Next, the image features and meteorological observations were fused and then predicted the DNI in 10-min ahead using an LSTM. To verify the validity of the proposed SCNN-LSTM model, several experiments were carried out using two-year historical observation data provided by the National Renewable Energy Laboratory (NREL). The results show that the proposed method achieved nRMSE of 23.47% and forecast skill of 24.51% for the whole year of 2014, and it also did better than some published methods especially under clear sky and rainy days.

Keywords:

solar radiation; inter-hour forecast; Siamese network; convolution neural network; long short-term memory

Graphical Abstract

1. Introduction

In recent years, under the pressure of global warming, deterioration of the human ecological environment, shortages of non-renewable energy resources, and environmental pollution, solar radiation energy has become highly valued worldwide as an inexhaustible clean energy source, and consequently, solar photovoltaic power has developed rapidly [1,2]. However, photovoltaic power generation is volatile and intermittent, and large-scale grid connections negatively impact the stability and security of the power grid, and even cause serious economic losses [3,4]. In order to increase the proportion of photovoltaic power generation in the power system, the key is to implement timely and effective power dispatching, where accurate photovoltaic power generation forecast is an important basis for the power dispatching process. However, the fluctuation of photovoltaic power generation is mainly caused by changes in solar irradiance. Therefore, it is important to accurately predict solar irradiance, the results of which can provide important decision support for power dispatching systems and can effectively reduce the operational costs of the power system [5,6].

There are many methods used for short-term and ultra-short-term solar irradiance predictions. Traditional prediction methods mainly use statistical methods to establish the relationship between the historical value and solar irradiances, such as time series and regression analysis methods [7,8]. However, traditional statistical methods cannot accurately describe the complex nonlinear relationship between various meteorological variables and the solar irradiance, which limits the improvement of the prediction accuracy.

With the rise of machine learning technology, many scholars have applied machine learning methods into solar irradiance prediction and have achieved good results [9]. For example, support vector machine (SVM) [10], extreme learning machine (ELM) [11], and artificial neural network (ANN) methods [12] have all been shown to produce better results than linear regression prediction methods when predicting solar irradiance. What’s more, machine learning methods, especially ANN, combining with Numerical Weather Prediction are also achieved great improvement on the hour-term or medium-term forecast [13,14,15]. Furthermore, compared to traditional machine learning, deep learning, such as recurrent neural network (RNN), has shown the potential to further improve the prediction of solar irradiance [16].

At the same time, with the development of hardware technologies, such as charge-coupled devices (CCDs), and the continuous improvement of digital image processing technology [17], many total cloud-measuring remote sensing instruments have been successfully developed, such as the total sky imager (TSI), which can accurately monitor and collect cloud images over photovoltaic power stations in real time [18]. The images have sufficient information that is more beneficial to the prediction of solar irradiance than historical observation values, such as cloud cover. However, the existing solar irradiance prediction methods based on the TSI have some disadvantages that cannot be ignored. For example, the artificial image feature extraction relies heavily on the experience of researchers and it is often difficult to obtain satisfactory prediction results [19]. Based on this, Feng et al. [20] designed a SolarNet model that can automatically extract the features of a total sky image, but this model only uses one total sky image as the model input, which ignores the cloud motion information and greatly limits the accuracy of the prediction. Zhao et al. [21] designed a three-dimensional convolutional neural network (3D-CNN) model to realize the fusion of multiple images and historical values, and then input the fusion features into a multilayer perceptron (MLP). However, as a traditional neural network structure, an MLP cannot capture the long-term memory of an input time series because the nodes between the hidden layers are not connected. Therefore, an MLP often performs poorly when predicting a time series. The long short-term memory (LSTM) has a complex memory unit, which can remember the previous information and can apply it to the calculation of the current output, that is, the nodes between hidden layers become connected [22]. Therefore, compared to an MLP, LSTM displays better performance when predicting a time series. In particular, long short-term memory (LSTM) networks have been used to predict solar irradiance due to their strong time series-learning ability [23,24].

Based on the shortcomings of the above model, this study developed a Siamese convolutional neural network-long short-term memory (SCNN–LSTM) model. A Siamese CNN can automatically extract the spatial dimension features of multiple continuous total sky images and can retain the temporal dimension features. Then, historical meteorological features and image features are fused using a concatenate layer, and the fused features are input into the LSTM for the prediction of solar irradiance within hours.

Since the direct normal irradiance (DNI) was vital to the concentrated solar thermal power plant and the global horizontal irradiance was important to photovoltaic solar power plant [25], the DNI was taken as research target in this study to evaluate the performance of the proposed model. The two years’ data were corrected from the National Renewable Energy Laboratory (NREL) [26], and several experiments were carried out to verify the effectiveness of the proposed method.

The main contributions of this study include: (1) A Siamese CNN was developed to automatically extract the features of continuous total sky images, where the Siamese structure reduced the model training time by sharing part parameters of the model; (2) SCNN-LSTM was used to effectively fuse the time-series features of images and meteorological data and to improve the DNI prediction accuracy.

The remainder of this paper is organized as follows: Section 1 introduces the three correlation networks, based on which the proposed model was constructed. Section 2 describes the collection and processing of the experimental materials. Section 3 presents a SCNN–LSTM forecasting model of DNI. Section 4 discusses the experimental results and analyzes the performance of the SCNN–LSTM model based on several comparative experiments. Finally, Section 5 summarizes the conclusions.

2. Data Collection and Preprocessing

2.1. Data Collection

All measured data in the daytime were downloaded from an open database, namely, the NREL’s Solar Radiation Research Laboratory (SRRL) [26]. The SRRL station is located at 39.74 °N and 105.18 °W, 1829 m above sea level in Golden City, Colorado, USA, where there are abundant solar resources. The measured meteorological variables of the SRRL, including the DNI, solar zenith angle, relative humidity, and air mass, are obtained with a 1 min sampling frequency, and the details of these variables are listed in Table 1. There are few negative values of DNI which are very close to zero and are corrected as 0 [27].

The total sky images used were RGB images obtained using the TSI (TSI-880), and the image resolution was 352 × 288 pixels. The total sky images of different weather conditions are shown in Figure 1, where the shadow bands in the image move with the Sun to protect the CCD sensor from direct sunlight. The total sky images are obtained with a 10 min sampling frequency. Therefore, the 10 min averages of the meteorological data were used as the samples in this study. Additionally, the total sky images were removed when the solar zenith angle were greater than 80 degrees in order to avoid hazy sky and obstacle presence [28].

2.2. Data Preprocessing

2.2.1. DNI Clear-Sky Index

In order to eliminate the prediction error caused by the change in solar position, the DNI was converted into a DNI clear sky index by a basic clear sky model [29], which considers the impacts of the atmospheric, seasonal and geographical factors on DNI and calculates the clear-sky DNI with only local time, date and location information. The DNI clear sky index (k) was represented as:

k = \frac{I}{I_{c l r}},

(1)

where I is the measured value of DNI at a certain moment when it reaches the Earth’s surface, and

I_{c l r}

is the DNI of theoretical clear sky at the same time when it reaches the Earth’s surface.

I_{c l r}

is represented as:

I_{c l r} = I_{s c} \times ε \times τ_{b},

(2)

where

I_{s c}

is the solar radiation constant 1360.8 W/m²;

ε

is the eccentricity correction factor, which is the correction factor for the deviation caused by the change in distance between the Sun and the Earth;

I_{s c} \times ε

means the incident solar radiation intensity reaching the top of the Earth’s atmosphere at that moment and

τ_{b}

is the atmospheric transparency coefficient of the direct irradiance.

ε

and

τ_{b}

are as:

ε_{0} = 1 + 0.033 \times \cos (\frac{2 π \times D O Y}{365}),

(3)

τ_{b} = {0.7}^{P L^{0.678}},

(4)

where DOY is the day of the year, with counting starting at 1 on New Year’s Day; PL is the ratio of the length of the path of the Sun’s rays through the Earth to the length of the path perpendicular to the zenith angle of the Sun. Its expression is as follows:

P L = \sqrt{{(r + c)}^{2} \cos^{2} (z) + (2 r + 1 + c) (1 - c)} - (r + c) \cos (z),

(5)

where r is the ratio between the Earth’s radius of 6371 km and the effective thickness of the atmosphere of 9 km; c is the ratio between the altitude of the observation site of 1.8288 km and the effective thickness of the atmosphere of 9 km; z is the solar zenith angle which is calculated in radians.

2.2.2. Image Preprocessing

The main task of sky image processing is to extract the region of interest (ROI) out of the RGB image to remove unnecessary pixels. The mask is a binary matrix corresponding to the location of the original foundation cloud map. The pixel value in the mask is either 0 or 1, where 1 corresponds to the location of the sky image in the original foundation cloud map, namely, the ROI, while 0 corresponds to the location of the non-sky image in the original foundation cloud map. The size of the mask image was 352 × 288 and the mask area on it was a circle with a radius of 128 pixels that had the center of the sky as the origin. The original foundation cloud map was multiplied by the mask, and then the extra pixels were cut out to obtain the processed image, as shown in Figure 2c. Finally, a minor gradients algorithm [30] was used to repair the shadow-band pixels as shown in Figure 2d, which was used as the input of the model with a resolution of 256 × 256 pixels.

Finally, the meteorological and image data were collected from 1 January 2013 to 31 December 2014, where a total of 41,139 groups of valid samples were obtained after processing. The data for January and July in the year of 2013 were chosen as the verification set, and the remaining data in the year of 2013 was set as the training set. Meanwhile, the data of the whole year of 2014 were set as the testing set. The details of data segmentation were listed in Table 2.

3. SCNN-LSTM Prediction Model

In this section, a SCNN–LSTM model was designed to predict the 10-min ahead DNI, and the structure of SCNN-LSTM model was shown as Figure 3. The cloud features were firstly extracted from a group of consecutive total sky images in order to make up the missing information of a single image blocked by the shadow-band [30,31]; and then the cloud features and meteorological variables were normalized and fused as inputs of LSTM to predict the clear-sky of DNI in the next 10 min.

3.1. Input Dimension

Bayesian information criterion (BIC) was used to determine the input dimension of the forecasting model using DNI clear-sky index; that is, the DNI at moment t was related to the DNI at the previous n moments [32]. The BIC was used to determine the order of the DNI clear sky index sequence after the first-order difference, and the obtained BIC thermal diagram is shown in Figure 4. The BIC information reached the minimum value when the autoregression coefficient was 1 and the moving average coefficient was 2. The DNI clear sky index sequence went through a difference such that the order of the model was determined to be 3, which means that the information at time t − 2Δt, t − Δt, and t predicted the DNI clear sky index at time t + Δt, where Δt is 10 min.

3.2. Siamese Convolutional Neural Network

The convolutional neural network (CNN) is one of the representative algorithms of deep learning. It has the ability of representation learning and is able to extract high-order features from inputs [33]. The main structure of a traditional CNN includes convolutional layers, pooling layers, and fully connected layers. A Siamese network [34] is a class of neural networks that consist of two or more identical subnetworks, and the subnetworks have the same network structure and configuration, including the network parameters and weights. During the training phase, the parameter updates are mirrored across multiple subnetworks.

The proposed Siamese convolution neural network took the advantages of CNN and Siamese network, and it was used to extract the high-order features from ground-based sky images at different times. The branch of the SCNN structure was improved based on the AlexNet network [35]. Because the images of the three input moments (i.e., t − 2Δt, t − Δt, and t) need to be processed, the SCNN has three improved AlexNet subnetworks, and the structure of the SCNN was shown as Figure 5. The network structure, parameters, and weights of the three subnetworks are the same, and the three inputs determine how the weights are updated.

3.3. Long Short-Term Memory

Long short-term memory (LSTM), a variant of a recurrent neural network (RNN), usually performs better than an RNN when predicting outcomes [36]. In LSTM, every neuron is a memory cell and there are three gates in each cell, namely, the forgetting gate

f_{t}

, the input gate

i_{t}

, and the output gate

o_{t}

:

{\begin{matrix} f_{t} = σ (W_{f} \cdot [h_{t - 1}, x_{t}] + b_{f}) \\ i_{t} = σ (W_{i} \cdot [h_{t - 1}, x_{t}] + b_{i}) \\ {\tilde{C}}_{t} = \tan h (W_{i} \cdot [h_{t - 1}, x_{t}] + b_{i}) \\ C_{t} = f_{t} \times C_{t - 1} + i_{t} \times {\tilde{C}}_{t} \\ o_{t} = σ (W_{o} \cdot [h_{t - 1}, x_{t}] + b_{o}) \\ h_{t} = o_{t} \times \tan h (C_{t}) \end{matrix},

(6)

where

h_{t - 1}

represents the output at the previous moment;

x_{t}

represents the input at the current moment, and it is the fused features of cloud features and meteorological variables;

σ

represents the sigmoid function,

W

represents the weight, and

b

represents the bias. The process of operation of the whole unit structure decides which information should be discarded in the memory cells of the last moment by multiplying the forgetting gate

f_{t}

by the previous cell state

C_{t - 1}

. Then, the new information is obtained by multiplying the input gate

i_{t}

with the alternative content

{\tilde{C}}_{t}

that needs to be updated. According to the above system of equations, the cell state

C_{t}

at the current moment can be obtained by discarding and updating the information. Finally, the

C_{t}

status value is pushed from −1 to 1 through the tanh layer, and the output

h_{t}

at the current moment is obtained by multiplying the tanh layer by the output gate

o_{t}

, which is the DNI clear sky index for the next 10 min; therefore, the DNI at that moment is obtained by multiplying by the predicted clear-sky index by the DNI of the clear sky at the same moment.

3.4. Loss Function

In this study, the greatest difference from a traditional Siamese network is that the SCNN section of the SCNN–LSTM model was designed to find the key features in the total sky images at different times in order to provide the image timing features for the LSTM prediction instead of to compare the similarity between these images. Therefore, there was no need to calculate the Euclidian distance between sample pairs to judge their similarity.

Secondly, the SCNN–LSTM model does not need to use the contrastive loss function to represent the degree of matching between paired samples. Instead, it uses the predicting error (PE) to evaluate the difference between the predicted value and the observed value to train the SCNN–LSTM parameter, as follows:

P E = \frac{1}{2 N} \sum_{i = 1}^{N} {(y'_{i} - y_{i})}^{2},

(7)

where

y'_{i}

is the predicted value of a forecast model,

y_{i}

is the target, and N is the number of training samples.

Just as in a traditional Siamese network, the multiple subnetwork branches of the SCNN in this model also have the same network structure, parameters, and weights. In the implementation of the SCNN structure, only one network structure needs to be built, and then the network structure is mirrored to multiple subnetworks with different inputs. All the subnetworks jointly determine how the weights are updated in the network.

During the process of the model training, the weights of the whole model were adjusted simultaneously. First, the PE was used to evaluate the prediction error of the model; then, the error was propagated back to the fully connected layers, the LSTM layer, and the SCNN in turn. Notably, the weight-sharing between the three branches of the SCNN was realized via mirroring, and the weights of the three branches were uniformly determined using the input of the three branches.

4. Results and Discussion

The experimental platform comprised a server containing an Intel Core I9-9900K CPU and a Nvidia GeForce RTX 2080Ti. The SCNN–LSTM model was implemented using the Keras library and Tensorflow.

4.1. Evaluation Index

The correlation coefficients (r), normalized mean bias error (nMBE), normalized mean absolute error (nMAE), and normalized root mean squared error (nRMSE) were used to evaluate the performance of different forecast models, and they are calculated as follows:

r = \frac{\sum_{i = 1}^{N} (y'_{i} - \bar{y'}) (y_{i} - \bar{y})}{\sqrt{\sum_{i = 1}^{N} {(y'_{i} - \bar{y'})}^{2} \sum_{i = 1}^{N} {(y_{i} - \bar{y})}^{2}}},

(8)

n M B E = \frac{1}{N} \sum_{i = 1}^{N} (y'_{i} - y_{i}) / \bar{y} \times 100 %,

(9)

n M A E = \frac{1}{N} \sum_{i = 1}^{N} | y'_{i} - y_{i} | / \bar{y} \times 100 %,

(10)

n R M S E = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(y'_{i} - y_{i})}^{2}} / \bar{y} \times 100 %,

(11)

where

y'_{i}

is the predicted value of a forecast model,

\bar{y'}

is the mean of all of the predicted values,

y_{i}

is the target,

\bar{y}

is the mean of all of the targets, and N is the number of testing samples.

Meanwhile, the persistent model is usually used as a benchmark for evaluating the performance of different forecast models and is defined as:

I (t + 1) = I (t) .

(12)

The evaluating index based on the persistent model, which is called the forecast skill (Fs), is defined as:

F s = (n R M S E_{p e r} - n R M S E_{f}) / n R M S E_{p e r} \times 100 %,

(13)

where

n R M S E_{p e r}

and

n R M S E_{f}

are the nRMSEs of the persistent model and the forecast model, respectively.

4.2. Performance of the SCNN-LSTM

The image feature extraction section of the SCNN-LSTM structure was improved using AlexNet. Different numbers of fully connected layers affect the feature extraction outcome; therefore, a group of experiments were conducted to compare three different SCNN-LSTM models with different numbers of fully connected layers in order to obtain an appropriate model structure. The results are listed in Table 3, where Model-1 is a fully connected layer containing 10 neurons following the flatten layer (layer 11), as shown in Figure 5. There were two fully connected layers after the flatten layer in Model-2, which contained 512 and 10 neurons, respectively. Model-3 had three fully connected layers, which contained 512, 256, and 10 neurons, respectively. The results show that the performance of Model-2 was the best among the three models, where its Fs was up to 24.51%.

The LSTM layer of the SCNN-LSTM model was used to process the fusion features for the DNI prediction. Different LSTM layers and the number of neurons affect the prediction performance of the model; therefore, a set of experiments were conducted to select the optimal number of LSTM layers and neurons. The results are listed in Table 4, in which Model-A was the LSTM layer after the feature fusion part (layer 14) as shown in Figure 5, where the number of neurons was 30. The number of neurons in the LSTM layer of Model-B was 50, and Model-C had two LSTM layers containing 50 and 30 neurons, respectively. The results show that the SCNN-LSTM model that adopted an LSTM layer with 50 neurons gave the best prediction effect, had the biggest r and Fs, and had the smallest nMBE, nMAE, and nRMSE.

A group of experiments were carried out by missing an input variable to evaluate its importance for the proposed method, and the results are shown as Figure 6. It is clear that the total sky image is the most important variable for the proposed model, which means that the cloud is the most important variable for the solar radiation. The nRMSE of the proposed method without the solar zenith angle (Z) as inputs is greater than those of predicting models without relative humidity or air mess, which means that the position between the Sun and the observation station is also important for predicting accuracy.

4.3. Performance of Different Forecast Models for the Inter-Hour DNI Forecast

Another group of experiments were carried out to evaluate the performance of the proposed model in comparison to another published method, and the results are listed in Table 5 (Table S1 listed the results of GHI prediction in Supplementary Materials). Among them, the MLP with one hidden layer and LSTM with one layer only used the above meteorological data and cloud cover (instead of total sky images) as the model input, SolarNet only used the total sky images as the model input, and the 3D-CNN and SCNN-LSTM models used the historical observation values and the total sky images as the model input.

It can be seen from Table 5 that the prediction performance was better when using the total sky images as the model input instead of the historical meteorological values. Moreover, the combination of total sky images and historical meteorological values as the model input was able to reach an even better performance than when using them independently. Meanwhile, it can be observed that the nRMSE, nMBE, and nMAE of the SCNN-LSTM model’s predictions were 23.72%, 0.35%, and 13.27%, respectively, which were all lower than the other compared models. Additionally, the correlation coefficient was 0.9592, which was higher than the other compared models, indicating that the SCNN-LSTM model had superior performance regarding DNI prediction 10 min in advance.

In order to make a detailed comparison between the SCNN-LSTM model and the other models, test sets were used to construct error and error cumulative frequency graphs of the observed and predicted values of the DNI. Figure 7 shows that the SCNN-LSTM model improved the accuracy of prediction, mainly by increasing the frequency with a small prediction error and reducing the frequency with a large prediction error.

Figure 8 shows a histogram of the prediction skills of the SCNN-LSTM model and the other comparison models. SolarNet’s prediction skills were significantly higher than those of the MLP and LSTM models, mainly because the total sky images contained more diverse information than the historical observation values, and clouds have a more critical impact on the solar irradiance. The prediction skills of the 3D-CNN and SCNN-LSTM models were superior to those of SolarNet, mainly because a total sky image can better reflect the attenuation degree of radiation from the sky to the ground, and the historical observation value can express the trend of the DNI well. Therefore, the meteorological value and image fusion could be used as the model input to achieve the optimal prediction accuracy. In addition, the prediction skills of the SCNN-LSTM model were better than the 3D-CNN model, mainly because the image feature extraction branch of the SCNN-LSTM model uses an improved AlexNet network. Compared to the 3D-CNN model, the AlexNet network has a greater number of layers and a better image feature extraction effect. At the same time, DNI prediction is a time series prediction. The LSTM in the SCNN-LSTM model has a powerful time series-learning ability; compared to the MLP in the 3D-CNN model, LSTM is more conducive to time series prediction.

4.4. Performance of Different Forecast Models under Different Weather Conditions

To further analyze the performance of the proposed model, historical data from ten days for every weather condition were selected, and the predicting results were listed in Table 6. The MLP and LSTM without images have similar prediction performance under different weather conditions, and both do worse than the last three deep learning methods with images. The proposed SCNN-LSTM method does best among these methods especially under clear sky and rainy days, and its forecast skills are all greater than 20% under any weather condition.

Historical data from four days with different weather conditions were selected, where Figure 9 shows the performance of the last three models in Table 6, which were all constructed based on a deep CNN with total sky images. For clear sky and partly cloudy days, the fitting effect of the SolarNet, 3D-CNN, and SCNN–LSTM models with the observed value was similar, but for cloudy and rainy days, the SCNN–LSTM model showed a better fitting performance with the observed value. This indicates that compared to the known public model mentioned in this paper, the SCNN–LSTM model can effectively reduce the DNI prediction error for rainy days, which is consistent with the results in Table 6.

5. Conclusions

In this study, a SCNN-LSTM model was proposed for predicting the DNI 10-min ahead. First, a SCNN was built using three component networks by improving the AlexNet network, which independently extracts features from three total sky images and produces the image characteristics at three moments, and the meteorological integration of historical observations. The fusion feature was implemented after the LSTM, and the two fully connected layers output the DNI clear sky index prediction. To obtain the solar irradiance, the previous result was multiplied by the DNI clear sky predictive value. Using the NREL open dataset of the whole year of 2014 as the testing set, the experimental results show that the nRMSE of the SCNN-LSTM model was 23.47% and the forecast skill was 24.51%. Compared to other models used in this study, the prediction accuracy was improved.

This experiment also provided some inspiration for our future work. For example, DNI data could be classified based on weather conditions or cloud classifications, and then DNI prediction models could be constructed for different weather conditions in order to further improve the prediction accuracy, especially under partly cloudy or cloudy days. In addition, we cloud try to adjust the number of samples under different weather conditions to balance the prediction accuracy of different weather conditions, so as to improve the overall prediction performance.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/en14248498/s1, Table S1: The performance of different models for predicting the global horizontal irradiance (GHI) 10-min in advance using the testing set.

Author Contributions

All authors designed this work; T.Z., Y.G. and Z.L. contributed equally to this work. Conceptualization and methodology, T.Z. and C.W.; software and validation, Z.L. and Y.G.; writing, T.Z.; visualization, C.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Natural Science Program of China, grant number 62006120, and the Key Laboratory of Measurement and Control of Complex Systems of Engineering (Southeast University), Ministry of Education, grant number MCCSE2020A02.

Institutional Review Board Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Yang, Y.; Campana, P.E.; Stridh, B.; Yan, J. Potential analysis of roof-mounted solar photovoltaics in Sweden. Appl. Energy 2020, 279, 115786. [Google Scholar] [CrossRef]
Qiao, Y.H.; Han, S.; Xu, Y.P.; Liu, Y.Q.; Ma, T.D.; Cai, Q. Analysis Method for Complementarity between Wind and Photovoltaic Power Output Based on Weather Classification. Autom. Elect. Power Syst. 2021, 45, 82–88. [Google Scholar]
Moreira, M.; Balestrassi, P.; Paiva, A.; Ribeiro, P.; Bonatto, B. Design of experiments using artificial neural network ensemble for photovoltaic generation forecasting. Renew. Sustain. Energy Rev. 2020, 135, 110450. [Google Scholar] [CrossRef]
Murty, V.V.V.S.N.; Kumar, A. Optimal Energy Management and Techno-economic Analysis in Microgrid with Hybrid Renewable Energy Sources. J. Mod. Power Syst. Clean Energy 2020, 8, 929–940. [Google Scholar] [CrossRef]
Rodríguez, F.; Fleetwood, A.; Galarza, A.; Fontán, L. Predicting solar energy generation through artificial neural networks using weather forecasts for microgrid control. Renew. Energy 2018, 126, 855–864. [Google Scholar] [CrossRef]
Ge, L.; Xian, Y.; Yan, J.; Wang, B.; Wang, Z. A Hybrid Model for Short-term PV Output Forecasting Based on PCA-GWO-GRNN. J. Mod. Power Syst. Clean Energy 2020, 8, 1268–1275. [Google Scholar] [CrossRef]
Ji, W.; Chee, K.C. Prediction of hourly solar radiation using a novel hybrid model of ARMA and TDNN. Sol. Energy 2011, 85, 808–817. [Google Scholar] [CrossRef]
Sun, H.; Yan, D.; Zhao, N.; Zhou, J. Empirical investigation on modeling solar radiation series with ARMA–GARCH models. Energy Convers. Manag. 2015, 92, 385–395. [Google Scholar] [CrossRef]
Alfadda, A.; Rahman, S.; Pipattanasomporn, M. Solar irradiance forecast using aerosols measurements: A data driven approach. Sol. Energy 2018, 170, 924–939. [Google Scholar] [CrossRef]
Lin, J.; Li, H. A Short-Term PV Power Forecasting Method Using a Hybrid Kmeans-GRA-SVR Model under Ideal Weather Condition. J. Comput. Commun. 2020, 8, 102–119. [Google Scholar] [CrossRef]
Wu, X.; Lai, C.S.; Bai, C.; Lai, L.L.; Zhang, Q.; Liu, B. Optimal Kernel ELM and Variational Mode Decomposition for Probabilistic PV Power Prediction. Energies 2020, 13, 3592. [Google Scholar]
Zhu, T.; Zhou, H.; Wei, H.; Zhao, X.; Zhang, K.; Zhang, J. Inter-hour direct normal irradiance forecast with multiple data types and time-series. J. Mod. Power Syst. Clean Energy 2019, 7, 1319–1327. [Google Scholar] [CrossRef] [Green Version]
Fonseca, J.G.D.S.; Uno, F.; Ohtake, H.; Oozeki, T.; Ogimoto, K. Enhancements in Day-Ahead Forecasts of Solar Irradiation with Machine Learning: A Novel Analysis with the Japanese Mesoscale Model. J. Appl. Meteorol. Clim. 2020, 59, 1011–1028. [Google Scholar] [CrossRef]
Ghimire, S.; Deo, R.C.; Downs, N.J.; Raj, N. Global solar radiation prediction by ANN integrated with European Centre for medium range weather forecast fields in solar rich cities of Queensland Australia. J. Clean. Prod. 2019, 216, 288–310. [Google Scholar] [CrossRef]
Pereira, S.; Canhoto, P.; Salgado, R.; Costa, M.J. Development of an ANN based corrective algorithm of the operational ECMWF global horizontal irradiation forecasts. Sol. Energy 2019, 185, 387–405. [Google Scholar] [CrossRef]
Awan, S.M.; Khan, Z.A.; Aslam, M. Solar Generation Forecasting by Recurrent Neural Networks Optimized by Levenberg-Marquardt Algorithm. In Proceedings of the IECON 2018—44th Annual Conference of the IEEE Industrial Electronics Society, Washington, DC, USA, 20–23 October 2018. [Google Scholar]
Yeom, J.-M.; Park, S.; Chae, T.; Kim, J.-Y.; Lee, C.S. Spatial Assessment of Solar Radiation by Machine Learning and Deep Neural Network Models Using Data Provided by the COMS MI Geostationary Satellite: A Case Study in South Korea. Sensors 2019, 19, 2082. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Moncada, A.; Richardson, W., Jr.; Vega-Avila, R. Deep learning to forecast solar irradiance using a six-month UTSA SkyImager dataset. Energies 2018, 11, 1988. [Google Scholar] [CrossRef] [Green Version]
Chu, Y.; Pedro, H.; Li, M.; Coimbra, C.F. Real-time forecasting of solar irradiance ramps with smart image processing. Sol. Energy 2015, 114, 91–104. [Google Scholar] [CrossRef]
Feng, C.; Zhang, J. SolarNet: A sky image-based deep convolutional neural network for intra-hour solar forecasting. Sol. Energy 2020, 204, 71–78. [Google Scholar] [CrossRef]
Zhao, X.; Wei, H.; Wang, H.; Zhu, T.; Zhang, K. 3D-CNN-based feature extraction of total cloud images for direct normal irradiance prediction. Sol. Energy 2019, 181, 510–518. [Google Scholar] [CrossRef]
Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
Ge, Y.; Nan, Y.; Bai, L. A Hybrid Prediction Model for Solar Radiation Based on Long Short-Term Memory, Empirical Mode Decomposition, and Solar Profiles for Energy Harvesting Wireless Sensor Networks. Energies 2019, 12, 4762. [Google Scholar]
Huynh, A.N.-L.; Deo, R.C.; An-Vo, D.-A.; Ali, M.; Raj, N.; Abdulla, S. Near Real-Time Global Solar Radiation Forecasting at Multiple Time-Step Horizons Using the Long Short-Term Memory Network. Energies 2020, 13, 3517. [Google Scholar]
Law, E.W.; Prasad, A.A.; Kay, M.; Taylor, R.A. Direct normal irradiance forecasting and its application to concentrated solar thermal output forecasting—A review. Sol. Energy 2014, 108, 287–307. [Google Scholar] [CrossRef]
Andreas, A.; Stoffel, T. NREL Solar Radiation Research Laboratory (SRRL): Baseline Measurement System (BMS); Golden, Colorado (Data). NREL Report No. DA-5500-56488. 1981. Available online: http://dx.doi.org/10.5439/1052221 (accessed on 15 July 2021).
Feng, C.; Yang, D.; Hodge, B.-M.; Zhang, J. OpenSolar: Promoting the openness and accessibility of diverse public solar datasets. Sol. Energy 2019, 188, 1369–1379. [Google Scholar] [CrossRef]
Eduardo, W.F.; Ramos, M.R.; Santos, C.E.; Pereira, E.B. Comparison of methodologies for cloud cover estimation in Brazil—A case study. Energy Sust. Dev. 2018, 43, 15–22. [Google Scholar]
Zhu, T.; Wei, H.; Zhao, X.; Zhang, C.; Zhang, K. Clear-sky model for wavelet forecast of direct normal irradiance. Renew. Energy 2017, 104, 1–8. [Google Scholar] [CrossRef]
Zhu, X.; Zhou, H.; Zhu, T.T.; Jin, S.; Wei, H. Pre-processing of Ground-based Cloud Images in Photovoltaic System. Autom. Electr. Power Syst. 2018, 42, 140–145. [Google Scholar]
Pfister, G.; Mckenzie, R.L.; Liley, J.B.; Thomas, A.; Forgan, B.W.; Long, C.N. Cloud Coverage Based on All-Sky Imaging and Its Impact on Surface Solar Irradiance. J. Appl. Meteorol. 2003, 42, 1421–1434. [Google Scholar] [CrossRef]
Rodríguez-Benítez, F.J.; López-Cuesta, M.; Arbizu-Barrena, C.; Fernández-León, M.M.; Pamos-Ureña, M.; Tovar-Pescador, J.; Santos-Alamillos, F.J.; Pozo-Vázquez, D. Assessment of new solar radiation nowcasting methods based on sky-camera and satellite imagery. Appl. Energy 2021, 292, 116838. [Google Scholar] [CrossRef]
Wealliem, D.L. A Critique of the Bayesian Information Criterion for Model Selection. Sociol. Methods Res. 1999, 27, 359–397. [Google Scholar] [CrossRef]
Ni, C.; Wang, D.; Vinson, R.; Holmes, M.; Tao, Y. Automatic inspection machine for maize kernels based on deep convolutional neural networks. Biosyst. Eng. 2019, 178, 131–144. [Google Scholar] [CrossRef]
Chopra, S.; Hadsell, R.; Lecun, Y. Learning a similarity metric discriminatively, with application to face verification. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR′05), San Diego, CA, USA, 20–25 June 2005. [Google Scholar]
Xiaoyu, L.; Zhang, L.; Wang, Z.; Dong, P. Remaining useful life prediction for lithiumion batteries based on a hybrid model combining the long short-term memory and Elman neural networks. J. Energy Storage 2019, 21, 510–518. [Google Scholar]

Figure 1. The total sky images under four different weather conditions.

Figure 2. The preprocessing of a total sky image to extract the regions of interest.

Figure 3. The structure of the Siamese convolutional neural network–long short-term memory (CNN–LSTM) model for predicting the direct normal irradiance (DNI) 10 min in advance, where C means convolution layer, D means dense layer, and F means fully connected layer.

Figure 4. Bayesian information criterion (BIC) thermal diagram for the 10-min DNI clear sky indexes time series.

Figure 5. The structure of the siamese convolutional neural network with three subnetworks.

Figure 6. The nRMSE of the proposed method without the horizontal axis variable, where Z means solar zenith angle, RH means relative humidity, AM means air mess and image means total sky image.

Figure 7. Distribution of the prediction errors of the forecast models in Table 4.

Figure 8. Forecast skills of the five last models in Table 4 compared to the persistent model.

Figure 9. The predictions of the last three forecast models in Table 6 for four different weather conditions.

Table 1. The details of meteorological variables in the open database.

Variables	Names in Database	Units	Instruments
DNI	Direct CH1	Wm⁻²	Kipp and Zonen pyrheliometer
Solar zenith angle	Zenith angle	Degrees	-
Relative humidity	Relative humidity (Tower)	-	Vaisala probe
Air mass	Airmass	%	-

Note: solar zenith angle and air mass were obtained after data collection using the solar position algorithm.

Table 2. The experimental data for DNI forecast.

Dateset	Time	Number of Data Groups
Training set	February, March, April, May, June, August, Spetember, October, November and December in 2013	16,843
Validation set	January 2013, July 2013	3678
Testing set	2014	20,618

Table 3. The performance of the SCNN-LSTM model with different AlexNet improvement structures.

Models	Number of Neurons	r	nMBE (%)	nMAE (%)	nRMSE (%)	Fs (%)
1	10	0.9560	−0.22	13.94	24.84	20.84
2	512, 10	0.9596	0.14	13.75	23.47	24.51
3	512, 256, 10	0.9590	0.07	13.37	23.95	22.97

Note: r, correlation coefficient; nMBE, normalized mean bias error; nMAE, normalized mean absolute error; nRMSE, normalized root mean squared error; Fs, forecast skill.

Table 4. The performance of the SCNN-LSTM model with different LSTM layers and numbers of neurons.

Models	Number of Neurons	r	nMBE (%)	nMAE (%)	nRMSE (%)	Fs (%)
A	30	0.9585	−0.52	13.41	23.89	23.16
B	50	0.9596	0.14	13.75	23.47	24.51
C	50, 30	0.9544	−2.44	14.65	25.13	19.17

Table 5. The performance of different models for predicting the DNI 10-min in advance using the testing set.

Models	r	nMBE (%)	nMAE (%)	nRMSE (%)
Persistent model	0.9311	0.50	15.54	31.09
MLP ¹	0.9348	2.73	16.89	29.92
LSTM ²	0.9351	0.65	16.78	29.69
SolarNet [20]	0.9505	−1.07	17.28	26.18
3D-CNN [21]	0.9564	1.13	13.92	24.49
SCNN-LSTM	0.9596	0.14	13.75	23.47

¹ Multilayer perceptron (MLP) was a neural network (NN) model with one hidden layer. ² LSTM was a model with one layer.

Table 6. The performance of different models for predicting the DNI 10-min in advance under different weather conditions.

Weather Conditions	Models	r	nMBE (%)	nMAE (%)	nRMSE (%)	Fs (%)
Clear sky	Persistent model	0.9684	0.24	2.33	6.32	0
	MLP	0.9731	0.37	2.30	5.83	7.82
	LSTM	0.9719	−1.16	2.60	6.05	4.28
	SolarNet [20]	0.9650	−0.92	3.74	6.69	−5.86
	3D-CNN [21]	0.9789	−1.10	3.16	5.27	16.66
	SCNN-LSTM	0.9838	0.16	2.50	4.53	28.25
Partly cloud	Persistent model	0.8895	0.98	17.06	29.21	0
	MLP	0.8949	1.92	17.41	27.96	4.28
	LSTM	0.8940	−0.31	17.48	27.98	4.22
	SolarNet	0.9080	−6.71	21.40	27.66	5.29
	3D-CNN	0.9249	−0.48	15.39	23.69	18.88
	SCNN-LSTM	0.9323	1.33	15.19	22.62	22.56
Cloudy	Persistent model	0.8888	0.90	30.90	51.35	0
	MLP	0.8918	7.34	33.44	49.88	2.87
	LSTM	0.8938	5.66	33.07	49.12	4.36
	SolarNet	0.9226	−4.79	30.66	42.28	17.66
	3D-CNN	0.9356	3.78	25.44	38.59	24.85
	SCNN-LSTM	0.9274	4.65	27.09	41.05	20.07
Rainy	Persistent model	0.8912	0.52	32.33	65.62	0
	MLP	0.8957	7.20	36.81	63.27	3.58
	LSTM	0.8949	3.87	36.42	63.08	3.88
	SolarNet	0.9320	0.13	37.40	52.92	19.35
	3D-CNN	0.9291	2.91	28.43	52.44	20.09
	SCNN-LSTM	0.9405	−0.79	26.33	47.84	27.10

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhu, T.; Guo, Y.; Li, Z.; Wang, C. Solar Radiation Prediction Based on Convolution Neural Network and Long Short-Term Memory. Energies 2021, 14, 8498. https://doi.org/10.3390/en14248498

AMA Style

Zhu T, Guo Y, Li Z, Wang C. Solar Radiation Prediction Based on Convolution Neural Network and Long Short-Term Memory. Energies. 2021; 14(24):8498. https://doi.org/10.3390/en14248498

Chicago/Turabian Style

Zhu, Tingting, Yiren Guo, Zhenye Li, and Cong Wang. 2021. "Solar Radiation Prediction Based on Convolution Neural Network and Long Short-Term Memory" Energies 14, no. 24: 8498. https://doi.org/10.3390/en14248498

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Solar Radiation Prediction Based on Convolution Neural Network and Long Short-Term Memory

Abstract

1. Introduction

2. Data Collection and Preprocessing

2.1. Data Collection

2.2. Data Preprocessing

2.2.1. DNI Clear-Sky Index

2.2.2. Image Preprocessing

3. SCNN-LSTM Prediction Model

3.1. Input Dimension

3.2. Siamese Convolutional Neural Network

3.3. Long Short-Term Memory

3.4. Loss Function

4. Results and Discussion

4.1. Evaluation Index

4.2. Performance of the SCNN-LSTM

4.3. Performance of Different Forecast Models for the Inter-Hour DNI Forecast

4.4. Performance of Different Forecast Models under Different Weather Conditions

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI