Next Article in Journal
A Single-Phase Bidirectional AC/DC Converter for V2G Applications
Previous Article in Journal
Numerical Analysis of Flatback Trailing Edge Airfoil to Reduce Noise in Power Generation Cycle
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

SVR-Based Model to Forecast PV Power Generation under Different Weather Conditions

1
Department of Computer System and Technology, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur 50603, Malaysia
2
School of Engineering, Deakin University, Melbourne 3216, Australia
3
Power Electronics and Renewable Energy Research Laboratory (PEARL), Department of Electrical Engineering, Faculty of Engineering, University of Malaya, Kuala Lumpur 50603, Malaysia
4
School of Software and Electrical Engineering, Swinburne University of Technology, Melbourne, Victoria 3122, Australia
*
Author to whom correspondence should be addressed.
Energies 2017, 10(7), 876; https://doi.org/10.3390/en10070876
Submission received: 24 May 2017 / Revised: 27 June 2017 / Accepted: 27 June 2017 / Published: 29 June 2017

Abstract

:
Inaccurate forecasting of photovoltaic (PV) power generation is a great concern in the planning and operation of stable and reliable electric grid systems as well as in promoting large-scale PV deployment. The paper proposes a generalized PV power forecasting model based on support vector regression, historical PV power output, and corresponding meteorological data. Weather conditions are broadly classified into two categories, namely, normal condition (clear sky) and abnormal condition (rainy or cloudy day). A generalized day-ahead forecasting model is developed to forecast PV power generation at any weather condition in a particular region. The proposed model is applied and experimentally validated by three different types of PV stations in the same location at different weather conditions. Furthermore, a conventional artificial neural network (ANN)-based forecasting model is utilized, using the same experimental data-sets of the proposed model. The analytical results showed that the proposed model achieved better forecasting accuracy with less computational complexity when compared with other models, including the conventional ANN model. The proposed model is also effective and practical in forecasting existing grid-connected PV power generation.

1. Introduction

Energy, especially electrical energy, plays a key role in a country’s development; it also improves the living standard of people. Therefore, the demand for electrical energy has been increasing day by day due to industrialization and modernization. However, the conventional generation of electrical energy has caused global warming and significant climate changes. Subsequently, the modern world has undertaken different renewable energy initiatives to mitigate global warming and meet the growing electricity demand. Nowadays, many countries have produced a significant portion of their energy demands from renewable energy resources, particularly solar generation power plants [1]. Among the potential renewable energies, photovoltaics (PV) have undergone enormous growth over the last few years. The total installed capacity of PV systems has reached around 227 GW worldwide, an increase of more than 28% in 2015 (International Energy Agency (IEA) report, 2016); such growth is expected to continue at similar or higher rates in the future. Decreasing prices (i.e., the lowest at below $1.5/WP for fixed tilt systems) and improved PV technology can also boost PV system installations [2,3].
However, the generation of PV power fully depends on random and ungovernable solar irradiance and other metrological factors, such as atmospheric temperature, module temperature, wind speed, wind direction, and humidity. The power output of a PV system dynamically changes with time due to the variability of environmental factors. Unpredictable PV power output adversely affects system stability and reliability, the scheduling of system operations, and related economic benefits [4,5]. Meanwhile, accurate forecasting of PV power generation can reduce the impact of PV power uncertainty on the grid, improve system reliability, maintain power quality, and increase the penetration level of PV systems [6]. Therefore, accurate forecasting of PV power generation has become an important task for researchers at present.
In a significant number of previous research, solar irradiance on different time scales was forecasted using various approaches, including numerical weather prediction methods, image-based methods, and statistical methods [7,8,9,10]. The forecasted solar irradiance and other associated data are used as inputs for PV power generation by commercial PV simulation software, such as TRNSYSM, PVFORM, and HOMER. In [11], first the fuzzy theorem was used to forecast the solar irradiance levels and then the recurrent neural network (RNN) method was used to forecast the 24 hour ahead output power of the PV system. However, most previous research on this problem employed direct methods to forecast PV power generation based on historical time-series data, such as historical PV power output and corresponding meteorological data. The research by Kudo et al. [12] demonstrated that the direct method of forecasting next-day PV power generation is better when compared with indirect methods.
In direct PV power forecasting, persistence modeling [13] is generally conducted to justify and select other models, and to decide on benchmarks. Persistence modeling is mainly used for one-hour ahead forecasting; hence, as the time range of forecasting increases, the accuracy of persistence modeling decreases [14]. Both autoregressive and moving-average modeling and their generalizations (i.e., autoregressive-moving-average models) [15] are widely used in statistical and time-series data analyses. These are based on classical time-series analysis such as following the Box–Jenkins method [16]. Yang et al. [17] proposed an autoregressive method, with an exogenous input (ARX)-based spatio-temporal (ST) model, in order to improve the accuracy of the developed PV output power forecasting technique. However, these time-series models have limitations because they require stationary data-sets [18]. A stationary time series is one whose statistical properties, such as mean, variance, and autocorrelation, are all constant over time. However, the PV power output and related meteorological data are non-stationary. By contrast, autoregressive-integrated-moving-average (ARIMA) models integrate non-stationarity elements from time-series data [19], but these models are computationally intensive because of the inclusion of a summation/integration function. The most general time series analysis model is called NARX (Nonlinear Autoregressive with Exogenous Input). In [20], NARX was chosen as a dynamic artificial neural network (ANN) to forecast the PV power generation, and the result showed that the NARX is more efficient because of its capability to learn and generalize formulas, compared with other ANN models. However, the NARX models have limitations in learning long time dependences because of the “vanishing gradient”. Furthermore, similar to any dynamical system, the models were affected by instability and lack a procedure for optimizing embedded memory. Moreover, time series models need lots of data when the model is structured, and the parameters are difficult to update when new data are uploaded.
ANN modeling is excellent for complicated and non-linear data analysis, and it does not require any prior assumption. Neural networks are widely used in PV power generation forecasting, and forecasted results are better when compared with those from regression analysis methods [21]. NARX and feed-forward neural networks with tapped delay lines have been used to forecast PV energy production, and they result in errors (MAPE) of less than 5% [22]. However, the performance of this forecasting model depends on meteorological factors and the correlation between explanatory variables and dependent variables. If the correlation factor between variables is less significant, then the forecasting result will be inaccurate. Multilayer perceptron (MLP) [23] is an example of the feed-forward NN model that has been widely deployed in PV power generation forecasting. In this model, the input of each neuron is transformed into output through the sigmoid function. The performance of this model is better than the Box–Jenkins method due to its non-linear approximation. The radial basis function neural network (RBFNN) [24] is a type of ANN model that makes use of linear combinations of radial basis functions. For the radial basis function, Gaussian functions are often used to transform inputs into outputs. If tuned accurately, the RBFNN has better performance than MLP. However, the tuning procedures of the RBFNN sometimes cause problems, such that good results are not obtained due to poor parameter adjustments. Among the ANN-based forecasting methods, back propagation NN (BPNN) has been widely used because of its excellent nonlinear mapping function, which is especially suitable for solving complex regression problems [25]. BPNN has the advantages of a complex nonlinear systems simulation ability, a strong learning ability, good approximation performance, and a large fault data tolerance. However, inherent defects are found, such as a slow convergence rate as well as a susceptibility to easily fall into local minimum values; thus BPNN is unable to obtain the global optimal solution [26].
Accuracy is a main consideration when developing models for PV power generation forecasting. For short-term forecasting, errors should be less than 20% [6]. However, for changing conditions (e.g., morning and evening, or rainy or cloudy weather), forecasting accuracy decreases to a point where the relative mean square errors (RMSE) are sometimes higher than 50%. To build better forecasting models, some studies classified forecasted days into different categories on the basis of weather conditions. Kang et al. [27] developed an algorithm by utilizing k-means clustering; Chen et al. [28] presented a RBFNN model; Shi et al. [6] proposed a model based on weather classification and support vector machines (SVM); Yang et al. [29] presented a weather-based hybrid method; and Liu et al. [26] proposed a back-propagation NN model to forecast PV power generation. All of these works classified days into different categories like sunny, cloudy, foggy, and rainy, and then built separate forecasting models for each classification. This approach implies that sub-models should be chosen on the basis of the weather condition of a forecasted day, apart from applying meteorological data to the model. However, although the accuracy is satisfactory for forecasting, sub-modeling is limited by complexity and computational costs.
In this paper, a generalized PV power forecasting model is proposed on the basis of support vector regression (SVR), historical data of PV power output, and meteorological data. In particular, SVR is supervised as a learning method and utilized for model development. In the study, PV power output characteristics and influential factors are analyzed to achieve forecasting accuracy. Subsequently, forecasted days are classified according to Malaysian weather conditions and historical PV power output data. The two types of weather in Malaysia are normal days (clear sky) and abnormal days (cloudy or rainy sky). Consequently, a generalized SVR-based model is introduced to forecast PV power generation accurately for any Malaysian weather condition. The proposed model is applied to and validated by three PV power stations situated in an institutional building of the University of Malaya in Kuala Lumpur. A generalized hourly resolution and day-ahead forecasting model is established to compare the forecasting accuracies of different weather conditions. This proposed single model, which is applicable for different weather conditions, is very simple to use and can reduce the complexities and computational costs.
The paper is organized as follows: Section 2 presents a brief description of the methods applied to forecast the PV power generation including real PV plant data collection and analysis; Section 3 indicates the performance metrics to evaluate the forecasting models; and Section 4 discusses the results of the proposed model including comparison and validations. Finally, Section 5 summarizes and concludes the study.

2. Methodology

2.1. Data Collection and Analysis

Three PV power generation systems were selected to collect PV power output and related meteorological data. These systems are located in Kuala Lumpur (latitude = 03°09′ N; longitude = 101°41′ E). The PV systems were installed on the rooftop of an institutional building of the University of Malaya. Table 1 presents the details of these PV systems. The three PV plants are of monocrystalline, polycrystalline, and thin-film types with installed capacities of “1875”, “2000”, and “2700” Wp, respectively. PV power outputs were collected individually from each plant; however, meteorological data (i.e., solar irradiance, atmospheric temperature, module temperature, and wind pressure) were collected as a single dataset for all plants because of their similar geographical locations.
Table 2 shows the collected data for each unit parameter identified for the study. The PV power output and related meteorological data were collected by an automatic data acquisition system in 5 min durations from 1 January 2016 to 31 December 2016.
Figure 1a illustrates the mean daily PV power output generated by plant-1 for January 2016, and Figure 1b shows the daily energy generated by each PV plant for May 2016. Average power generation differed across days due to variations in the weather conditions.
For plant-1 (Figure 1a), the three days of 1, 15, and 16 January generated comparatively low power due to cloudy or rainy weather conditions (abnormal day). Meanwhile, due to clear skies (normal day), average power generation was comparatively higher for the days of 2, 7, 10, 11, 20, 24, and 25 January. In Figure 1b, the daily average PV energy production shows a rather stable fluctuation in relation to the different weather conditions in Malaysia, which suggests the potential of the country to generate PV energy.
Photovoltaic power generation is closely related to meteorological parameters, such as solar irradiance, ambient/atmospheric temperature, module temperature, and wind speed, among others. Figure 2a shows the patterns of solar irradiance and PV power output of different PV plants on a particular day. During clear-sky days (normal day), PV power output very strongly matched the solar irradiance curve. Similarly, for cloudy or rainy days (abnormal day), a pattern harmony is observed between PV power output and solar irradiance as shown in Figure 2a. A similar pattern for PV power output and solar irradiance was observed for the different weather conditions because the generation of PV power fully depends on solar irradiance. If the irradiance increases, then PV power will be increased, and vice versa for any weather conditions. In Malaysian weather conditions, a linear relationship occurs between them. Therefore, a high correlation coefficient between PV power and solar irradiance has been observed. Figure 2b shows a strong positive correlation between solar irradiance and PV power output. Therefore, solar irradiance is an important input vector when developing an appropriate PV power forecasting model, as evidenced by the high correlation coefficient of R2 = 0.9888.
The analytical results also indicate that the other meteorological variables, such as ambient temperature, module temperature, and wind pressure, were also correlated with PV power generation. A comparatively weak correlation was established between PV power output and atmospheric temperature, while an extremely weak correlation was observed between PV power output and wind speed. However, wind speed has been considered an input of the proposed model to build an appropriate model. By contrast, a strong correlation between PV power output and module temperature was observed. Nevertheless, it has been ignored in the selection as an input vector of the proposed model because the module temperature is highly dependent on other variables, such as atmospheric temperature, wind speed, wind direction, humidity, and the amount of PV power generation.

2.2. Data Preparation

Some meteorological variables showed extremely weak correlations with the PV power generation. In this study, the influence on PV power generation by all of the aforementioned independent meteorological variables was considered.
Sample data on actual PV power generated and related meteorological variables were collected at 5 min intervals. The obtained PV power output data and meteorological data were averaged as hourly datasets. However, during data collection, PV power output samples may be lost due to recording errors or other special events. If abnormal data are collected, the training of SVR becomes unstable. Thus, before averaging, missing data should be replaced by same-hours data from the latest similar day.

2.3. Pre-Processing of Data

The nonlinear SVR model can map nonlinear inputs into higher-dimensional space to make them linear. However, a wider data range results in imprecisions both in terms of fitting and regression. If data are pre-processed into smaller ranges before they are inputted into the model, then regression precision can be increased. A well-known solution to the aforementioned limitation is a normalization process, wherein data are restricted to the range of 0 and 1. Normalization minimizes regression error, improves precision, and maintains correlation in the data-set. The formula for normalization is [30]:
D Normal = D actual D min D max D min
where D Normal is the normalized input data; D actual is the original input data (PV power output and meteorological data); and D min and D max are the minimum and maximum values of the utilized input data, respectively.

2.4. Support Vector Regression

SVM [31] is a supervised machine-learning method that follows the structural risk minimization principle. SVMs have greater generalization ability compared with other approaches, and they are widely used in resolving classification and regression problems. SVMs are excellent for time-series analysis due to its global minima. When applied to time-series prediction, SVM modeling is referred to as support vector regression (SVR). Forecasting PV power generation is a typical time-series analysis problem; in this case, SVR is an appropriate method.
The SVR algorithm is a nonlinear regression algorithm. Inputs from time-series data samples are mapped into high-dimensional feature space for nonlinear mapping. Subsequently, linear regression is conducted (Figure 3).
A set of training data { ( x 1 , y 1 ) ,   ( x 2 , y 2 ) , , ( x l ,   y l ) } is considered, where x i R n is the input vector (meteorological data), and y i R 1 is the corresponding output value (PV power output). The estimation function f ( x ) is shown in Function (2):
y = f ( x ) = w × ψ ( x ) + b
where w R n is a weight vector, and b R is the bias term, which can be estimated by minimizing the regularized risk function.
Using the ε -insensitive loss function in SVR, the regression problem can be changed into the following optimization problem:
min :   1 2 ǀ ǀ w ǀ ǀ 2 + C i = 1 l ( ξ i + ξ i * ) Subject   to :   y i f ( x ) ε + ξ i * ;   f ( x ) y i ε + ξ i ;   and   ξ i * , ξ i 0
where ε is the radius of the tube (margin of tolerance), which refers to the data inside the tube that should be ignored during regression. The feature vector, which lies on the boundary of the tube, is known as the support vector.
In line with the Lagrange multiplier method, the Lagrange function is acquired as follows:
L ( w , b , ξ , ξ * , α , α * , γ , γ * ) = 1 2 ǀ ǀ w ǀ ǀ 2 + C i = 1 l ( ξ + ξ * ) i = 1 l α i ǀ ξ i + ε y i + f ( x i ) ǀ i = 1 l α i * ǀ ξ i * + ε + y i f ( x i ) ǀ i = 1 l ( ξ i γ i + ξ i * γ i * )
where ξ   and   ξ * are the slack variables representing the distance from actual values to the corresponding boundary values of the ε-tube. Hence, α , α * , γ , γ * 0 .
Next, the saddle point of L is calculated. The following equations can thus be obtained:
w L = 0       w = i = 1 l ( α i α i * ) ψ ( x i ) b L = 0         i = 1 l ( α i α i * ) = 0 ξ i L = 0       C α i γ i = 0
By substituting the original Function (4) with Equation (5), the following model can be obtained:
m a x :   w ¯ ( α , α * ) w , b , ξ , ξ * = 1 2 i , j = 1 l ( α i α i * ) ( α j α j * ) < ψ ( x i ) , ψ ( x j ) > i = 1 l ( α i + α i * ) ε + i = 1 l ( α i α i * ) y i S u b j e c t   t o :       i = 1 l ( α i α i * ) = 0 ;               0 α i , α i * C
where C determines the penalties of the estimation errors.
Subsequently, the kernel function k ( x i , x j ) is introduced with a mercer condition to replace the original function < ψ ( x i ) , ψ ( x j ) > , and the following model can thus be obtained:
m a x : w ¯ ( α , α * ) w , b , ξ , ξ * = 1 2 i , j = 1 l ( α i α i * ) ( α j α j * ) k ( x i , x j ) i = 1 l ( α i + α i * ) ε + i = 1 l ( α i α i * ) y i S u b j e c t   t o :       i = 1 l ( α i α i * ) = 0 ;               0 α i , α i * C
The kernel function is one of the key factors of SVR. The performance of SVR is largely dependent on the selection of the kernel function and its parameters. Four traditional kernel functions are commonly used in SVR:
  • Liner kernel function: k ( x i , x j ) = x i T x j ;
  • Polynomial: k ( x i , x j ) = ( γ x i T x j + r ) d ,   γ > 0 ;
  • Gaussian RBF (radial bias function): k ( x i   , x j ) = e x p ( ǀ ǀ x i x j ǀ ǀ 2 2 σ 2 ) = exp ( γ ǀ ǀ x i x j ǀ ǀ 2 ) ,   γ > 0 ;
  • Sigmoid: k ( x i , x j ) = tanh ( γ x i T x j + r ) .
The radial bias function (RBF) is often used as the kernel in SVR because it requires only one parameter and it has a wide scope of application. RBFs also have the ability to universally approximate any distribution in feature space. Therefore, the RBF was used as kernel in this study. For the RBF, σ 2 was set as the bandwidth of the kernel function.
To optimally solve the problems, the best α i   and   α i * should be obtained. The best-fit regression function can be expressed as follows:
f ( x ) = i = 1 l ( α i α i * ) × k ( x i , x j ) + b
where α i and α i * are the Lagrange multipliers; and k ( x i ,   x j ) is the kernel function. The nonlinear separable cases can be easily transformed into linear cases by mapping the original variable into a new high-dimension feature space using k ( x i ,   x j ) .

2.5. Proposed SVR-Based Model to Forecast PV Power Generation

To establish the generalized SVR-based model for PV power generation forecasting, the LIBSVM package [32] proposed by Chang and Lin (2001) was adopted in present study. For the proposed generalized day-ahead hourly resolution model using the SVR approach, the average hourly data samples were considered for training and testing purposes. The diagram flow of the proposed model is shown in Figure 4.
Historical data of meteorological variables (i.e., solar irradiance, atmospheric temperature, and wind speed) were considered as the input data-set of the model. Meanwhile, historical data of PV power output were considered as the output data-set for training the proposed model. All of the original historical PV power data and meteorological data were normalized within the range of [0, 1] to minimize regression error. Subsequently, the datasets for training (i.e., more than 70% of the total data) and testing were separated. To develop the model, the initialization values of the three dominating parameters (C, ε , and γ) in SVR should be also included. The training and the testing of the model were conducted first-time using the stipulated historical data-set. A non-optimization model implies that the dominating parameter values should be changed, and model training and testing should be conducted again; this procedure should be continued until the model is optimized. In the present study, the parameters were chosen on the basis of experience-based trial and error. An optimized model implies that PV power generation can be forecasted for a particular day. To forecast PV power generation using the proposed model, the hourly averages of normalized meteorological data for a given forecasted day (i.e., data collected from the Malaysian meteorological department or numerical predicted data) should be applied. Given that the input data of this model were normalized, the output data should be anti-normalized to extract the original values of PV power; this process consolidates the performance analysis of the proposed model.
A standard three-layer back-propagation ANN with the number of epochs set to 1000 was utilized as the alternative model for a comparative evaluation of the performance of the proposed model. To evaluate for better performance of the ANN model, the number of hidden neurons of the ANN model was changed to be between 5 and 20. To establish the benchmark of the PV power forecasting model, a persistence model was used to validate the proposed model.

3. Evaluation of Forecasting Accuracy

Several types of performance metrics were used to evaluate the accuracy of the forecasting model. In this study, normalized root-mean-square error (nRMSE) (i.e., the most commonly used metric), mean absolute error (MAE), and mean bias error (MBE) were adopted for the proposed model:
nRMSE % = ( 1 N   i = 1 N ( W Predicted W actual ) 2 ) × 100 / W actual   ( max )
MAE = 1 N i = 1 N | W Predicted W actual |
MBE = 1 N i = 1 N ( W Predicted W actual )
where W predicted and W actual are the forecasted value and actual measured value of the PV power output at each time point, respectively. In addition, W actual ( max ) is the maximum value of the actual measured PV power output, and N is the number of test data samples. The datasets for training and testing were normalized; accordingly, the forecasted PV power output data of the model have to be anti-normalized before calculating the nRMSE, MAE, and MBE.

4. Result and Discussion

In this research, a generalized forecasting model based on SVR was developed. The model was applied for hourly resolution day-ahead PV power generation forecasting. The actual PV power generation data of three different PV plants and their related meteorological data (i.e., solar irradiance, atmospheric temperature, and wind speed) were utilized in the study. In Malaysia, solar irradiance is available from 08:00 to 19:00 at almost all seasons of the year; accordingly, daily PV power outputs are received during these times only. Experimental data covering three months (January, May, and September 2016) were used to verify the proposed model. However, Malaysian weather conditions have not changed drastically over the year. Nevertheless, these three months have been selected in this analysis because all of the weather conditions throughout the year were considered. Sample data were selected randomly for training and test purposes due to different weather conditions. The forecasted PV power of the proposed model should be anti-normalized to extract the actual value of the PV power forecast. Consequently, the forecasted values of the proposed model were compared with those for the standard back-propagation NN model and the persistence model.
Figure 5a shows the measured PV power outputs (actual) and forecasted PV power outputs of the different models (i.e., proposed model, ANN, and persistence model) of PV plant-1 in normal weather conditions. In this case, 21 and 22 May were set as the forecast days. As shown in Figure 1b and based on the analysis of the PV power output patterns, these days (21 and 22 May) are clear-sky days (normal day). Hence, the hourly PV power output average and related metrological data of 14 days (7–20 May) were used to train the model. As shown in Figure 5a, the forecasted curve of the proposed model almost matched the actual measured curve of PV power generation.
Figure 5b represents the actual measured PV power outputs and forecasted PV power outputs of the proposed model, ANN model, and persistence model of plant-1 in abnormal weather conditions. In this figure, 26 and 27 May were set as forecast day. As shown in Figure 1b and based on the analysis of PV power output patterns, these days are cloudy or rainy days (abnormal day). Hence, the hourly PV power output average and related meteorological data of 14 days (12–25 May) were used to train the model. As shown in Figure 5b, the forecasted curve of the proposed model almost matched the actual measured curve of PV power generation. However, a significant deviation for the persistence model curve was observed, which might have been affected by abnormal weather conditions.
The same meteorological data-set and related PV power output data of PV plant-2 were used to train and test the models in different weather conditions. Similarly to plant-1, normal forecast days (21 and 22 May) and abnormal forecast days (26 and 27 May) were selected. Figure 6a,b shows the actual measured PV power output and forecasted PV power output of the different models (i.e., the proposed model, the ANN model, and the persistence model) for normal and abnormal weather conditions, respectively. As shown by Figure 6a,b, the forecasted result of the proposed model almost matched the actual measured PV power output of plant-2.
Similarly to the previous approach, the same meteorological data-set and related PV power output data of plant-3 were used to train and test the proposed model, the ANN model, and the persistence model in different weather conditions. Normal and abnormal forecast days were selected, similarly to plant-1 and plant-2, in consideration of same-month data. Figure 7a,b shows the actual measured PV power output and forecasted PV power output of the different models for normal and abnormal weather conditions, respectively. The figures show that the proposed PV power curve almost matched the actual measured PV power curve of plant-3.
The performances of the proposed model, the ANN model, and the persistence model were evaluated using the experimental data of May 2016. In this case, nRMSE, MAE, and MBE were adopted for the forecasting performance evaluation. Calculation results are shown in Table 3.
Similar work has been done by using other practical datasets collected from the months of January and September of 2016 for further validation of the proposed model. In this case, the analyses have been completed by considering the normal and abnormal weather conditions separately. From the analysis of the PV power output pattern of January, it is clear that 24 and 25 January are normal days, and 15 and 16 January are abnormal days. On the other hand, it is also clear from the analysis of the PV power output pattern of September that 29 and 30 September are normal days and 21 and 22 September are abnormal days. From this study, it has been found that the actual measured output and forecasting output of the proposed model are almost matched in all PV plants for the data-sets of both months. The detail of the error calculation result is shown in Table 4 for the month of January and Table 5 for the month of September.
The performances of the proposed model, the ANN model, and the persistence model were calculated by averaging the performance values for the three-month period (Table 6). This table shows the average nRMSE, MAE, and MBE result for the different models. The overall average result for the different models and those for other ANN-based forecasting models [33] are also presented in Table 6.
Based on the figures, tables, and discussions presented in this paper, the proposed generalized SVR-based model performed very well for PV power generation forecasting in different weather conditions. The results also showed that the model could forecast the PV power generation accurately in various seasonalities of Malaysian weather conditions. The errors obtained for all of the plants using the proposed model are very similar for normal and abnormal weather conditions. In the proposed model, the errors computed in normal weather conditions are always slightly lower compared with those for the abnormal weather conditions. The reason for this situation is that the model can be fitted well in normal weather conditions because of less variation in sample data. Hence, the forecasted results are nearly the same in various months because of the absence of drastic changes in the weather conditions of Malaysia. The average forecasting results of the proposed model were 3.08% in nRMSE, 34.57 W in MAE, and 11.34 W in MBE, which were better compared with the ANN model and the persistence model. The proposed model also outperformed other ANN-based forecasting models [33].
Figure 8 shows the actual measured PV power output and the forecasted PV power output of the proposed model. In this case, 14 and 15 January were set as forecast days. As shown in Figure 1a and based on the PV power output pattern of plant-1, these two days have cloudy or rainy weather conditions (abnormal day), which suggest comparatively low electrical energy. The shape of the second part of this figure is quite different from the other figures because some hours during these days have clear skies while the rest are cloudy or rainy. However, the highest peak of the first part of the figure is comparatively low due to the existence of some clouds that day. Nonetheless, the forecasting PV power of the proposed model almost matched the actual measured PV power in any situation. Therefore, the proposed model can forecast PV power generation accurately in any weather condition.
Figure 9a shows the deviation of the actual measured PV power output and the PV power output of the proposed forecasting model of plant-2 in different weather conditions. The deviation of the measured and forecasted PV power generation at each particular point was not higher than 10%, a percentage in the allowable range. The figure also shows that deviations in different weather conditions do not change drastically. The deviations for all plants were then evaluated on the basis of all datasets. In all cases, the results were almost similar to the deviation.
For the proposed model, Figure 9b shows the correlation of the actual measured PV power output and the forecasted PV power of plant-2 in different weather conditions. The correlation coefficient of R 2 = 0.988 in normal weather conditions and R 2 = 0.9746 in abnormal weather conditions explains the intensity of the correlation; that is, the correlation between the measured power and the forecasted power is very good. The positive sign of this value expresses the proportional relationship between the two powers. Based on this analysis, the proposed model performs very well.

5. Conclusions

Accurate forecasting of PV power generation has become a key point in the reliable operation of electrical grids due to the related rapid connection of PV systems to the grid. In this study, a generalized SVR-based forecasting model was proposed to accurately forecast PV power generation in different weather conditions. The proposed model was applied to three different PV power stations, and then its performance was analyzed. The generalized model could forecast PV power generation in both normal days (clear-sky) and abnormal days (cloudy or rainy days) with nearly the same accuracy. The proposed model could also accurately forecast PV power generation in various seasonalities and different abnormal weather conditions. Deviations in the actual measured and forecasted PV powers at each particular point were in an acceptable range (i.e., not greater than 10%), and no drastic fluctuations at any point were observed. The correlation between the actual measured PV power and the forecasted PV power for the proposed model was also very good. Subsequently, this simple model could significantly reduce computational complexity yet increase model accuracy. The model was experimentally validated by deploying it to three different types of PV plants in the same location. The average forecasting errors of the model were 3.08% in nRMSE, 34.57 W in MAE, and 11.34 W in MBE. Finally, the proposed model outperformed the ANN model, the persistence model, and other conventional models.

Acknowledgment

This work was supported by the Fundamental Research Grant Scheme (FRGS), Project No: FP061-2016.

Author Contributions

Utpal Kumar Das, Kok Soon Tey and Mehdi Seyedmahmoudian conceived the theoretical approaches and contributed in designing and developing the proposed forecasting method. Mohd Yamani Idna Idris, Saad Mekhilef, Ben Horan and Alex Stojcevski contributed in analyzing the simulation results and provided constructive inputs in order to develop the proposed method and implement the simulation model. In terms of writing the paper all authors contributed jointly to preparing this manuscript and all have read and approved the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Bizzarri, F.; Bongiorno, M.; Brambilla, A.; Gruosso, G.; Gajani, G.S. Model of photovoltaic power plants for performance analysis and production forecast. IEEE Trans. Sustain. Energy 2013, 4, 278–285. [Google Scholar] [CrossRef]
  2. Byrne, J.; Kurdgelashvili, L. The role of policy in PV industry growth: Past, present and future. In Handbook of Photovoltaic Science and Engineering, 2nd ed.; John Wiley & Sons Ltd.: New York, NY, USA, 2011. [Google Scholar]
  3. Roselund, C. The Latest Report by National Renewable Energy Laboratory. Available online: http://www.nrel.gov/ (accessed on 28 June 2017).
  4. Strzalka, A.; Alam, N.; Duminil, E.; Coors, V.; Eicker, U. Large scale integration of photovoltaics in cities. Appl. Energy 2012, 93, 413–421. [Google Scholar] [CrossRef]
  5. Woyte, A.; Van Thong, V.; Belmans, R.; Nijs, J. Voltage fluctuations on distribution level introduced by photovoltaic systems. IEEE Trans. Energy Conv. 2006, 21, 202–209. [Google Scholar] [CrossRef]
  6. Shi, J.; Lee, W.J.; Liu, Y.Q.; Yang, Y.P.; Wang, P. Forecasting power output of photovoltaic systems based on weather classification and support vector machines. IEEE Trans. Ind. Appl. 2012, 48, 1064–1069. [Google Scholar] [CrossRef]
  7. Wang, F.; Mi, Z.Q.; Su, S.; Zhao, H.S. Short-term solar irradiance forecasting model based on artificial neural network using statistical feature parameters. Energies 2012, 5, 1355–1370. [Google Scholar] [CrossRef]
  8. Jang, H.S.; Bae, K.Y.; Park, H.-S.; Sung, D.K. Solar power prediction based on satellite images and support vector machine. IEEE Trans. Sustain. Energy 2016, 7, 1255–1263. [Google Scholar] [CrossRef]
  9. Mellit, A.; Pavan, A.M. A 24-h forecast of solar irradiance using artificial neural network: Application for performance prediction of a grid-connected PV plant at Trieste, Italy. Sol. Energy 2010, 84, 807–821. [Google Scholar] [CrossRef]
  10. Hocaoglu, F.O.; Gerek, O.N.; Kurban, M. Hourly solar radiation forecasting using optimal coefficient 2-d linear filters and feed-forward neural networks. Sol. Energy 2008, 82, 714–726. [Google Scholar] [CrossRef]
  11. Yona, A.; Senjyu, T.; Funabashi, T.; Kim, C.-H. Determination method of insolation prediction with fuzzy and applying neural network for long-term ahead PV power output correction. IEEE Trans. Sustain. Energy 2013, 4, 527–533. [Google Scholar] [CrossRef]
  12. Kudo, M.; Takeuchi, A.; Nozaki, Y.; Endo, H.; Sumita, J. Forecasting electric power generation in a photovoltaic power system for an energy network. Electr. Eng. Jpn. 2009, 167, 16–23. [Google Scholar] [CrossRef]
  13. Yang, X.; Ren, J.; Yue, H. Photovoltaic power forecasting with a rough set combination method. In Proceedings of the 2016 UKACC 11th International Conference on Control (CONTROL), Belfast, UK, 31 August–2 September 2016; pp. 1–6. [Google Scholar]
  14. Perez, R.; Kivalov, S.; Schlemmer, J.; Hemker, K.; Renne, D.; Hoff, T.E. Validation of short and medium term operational solar radiation forecasts in the US. Sol. Energy 2010, 84, 2161–2172. [Google Scholar] [CrossRef]
  15. Hassanzadeh, M.; Etezadi-Amoli, M.; Fadali, M.S. Practical approach for sub-hourly and hourly prediction of PV power output. In Proceedings of the North American Power Symposium 2010, Arlington, TX, USA, 26–28 September 2010; pp. 1–5. [Google Scholar]
  16. Box, G.E.P.; Jenkins, G.M. Time Series Analysis: Forecasting and Control; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2015. [Google Scholar]
  17. Yang, C.; Thatte, A.A.; Xie, L. Multitime-scale data-driven spatio-temporal forecast of photovoltaic generation. IEEE Trans. Sustain. Energy 2015, 6, 104–112. [Google Scholar] [CrossRef]
  18. Diagne, M.; David, M.; Lauret, P.; Boland, J.; Schmutz, N. Review of solar irradiance forecasting methods and a proposition for small-scale insular grids. Renew. Sustain. Energy Rev. 2013, 27, 65–76. [Google Scholar] [CrossRef]
  19. Wan Ahmad, W.K.A.; Ahmad, S.; Ishak, A.; Hashim, I.; Ismail, E.S.; Nazar, R. Arima model and exponential smoothing method: A comparison. AIP Conf. Proc. 2013, 1522, 1312–1321. [Google Scholar]
  20. Sansa, I.; Missaoui, S.; Boussada, Z.; Bellaaj, N.M.; Ahmed, E.M.; Orabi, M. PV power forecasting using different artificial neural networks strategies. In Proceedings of the 2014 International Conference on Green Energy, Sfax, Tunisia, 25–27 March 2014; pp. 54–59. [Google Scholar]
  21. Oudjana, S.H.; Hellal, A.; Mahamed, I.H. Short term photovoltaic power generation forecasting using neural network. In Proceedings of the 2012 11th International Conference on Environment and Electrical Engineering, Venice, Italy, 18–25 May 2012; pp. 706–711. [Google Scholar]
  22. Cococcioni, M.; Andrea, E.D.; Lazzerini, B. 24-hour-ahead forecasting of energy production in solar PV systems. In Proceedings of the 2011 11th International Conference on Intelligent Systems Design and Applications, Cordoba, Spain, 22–24 November 2011; pp. 1276–1281. [Google Scholar]
  23. Ting-Chung, Y.; Hsiao-Tse, C. The forecast of the electrical energy generated by photovoltaic systems using neural network method. In Proceedings of the 2011 International Conference on Electric Information and Control Engineering, Wuhan, China, 15–17 April 2011; pp. 2758–2761. [Google Scholar]
  24. Atsushi, Y.; Tomonobu, S.; Saber, A.Y.; Toshihisa, F.; Hideomi, S.; Kim, C.H. Application of neural network to 24-hour-ahead generating power forecasting for PV system. In Proceedings of the 2008 IEEE Power and Energy Society General Meeting—Conversion and Delivery of Electrical Energy in the 21st Century, Chicago, IL, USA, 20–24 July 2008; pp. 1–6. [Google Scholar]
  25. Smolensky, P.; Mozer, M.C.; Rumelhart, D.E. Mathematical Perspectives on Neural Networks; Psychology Press: Hove, UK, 2013. [Google Scholar]
  26. Liu, J.; Fang, W.; Zhang, X.; Yang, C. An improved photovoltaic power forecasting model with the assistance of aerosol index data. IEEE Trans. Sustain. Energy 2015, 6, 434–442. [Google Scholar] [CrossRef]
  27. Kang, M.C.; Sohn, J.M.; Park, J.Y.; Lee, S.K.; Yoon, Y.T. Development of algorithm for day ahead PV generation forecasting using data mining method. In Proceedings of the 2011 IEEE 54th international midwest symposium on circuits and systems, Seoul, Korea, 7–10 August 2011; pp. 1–4. [Google Scholar]
  28. Chen, C.S.; Duan, S.X.; Cai, T.; Liu, B.Y. Online 24-h solar power forecasting based on weather type classification using artificial neural network. Sol. Energy 2011, 85, 2856–2870. [Google Scholar] [CrossRef]
  29. Yang, H.-T.; Huang, C.-M.; Huang, Y.-C.; Pai, Y.-S. A weather-based hybrid method for 1-day ahead hourly forecasting of PV power output. IEEE Trans. Sustain. Energy 2014, 5, 917–926. [Google Scholar] [CrossRef]
  30. Changsong, C.; Shanxu, D.; Jinjun, Y. Design of photovoltaic array power forecasting model based on neutral network. Trans. China Electrotech. Soc. 2009, 24, 153–158. [Google Scholar]
  31. Muller, K.R.; Smola, A.J.; Rätsch, G.; Schölkopf, B.; Kohlmorgen, J.; Vapnik, V. Predicting Time Series with Support Vector Machines. Available online: http://www.svms.org/regularization/MSRS97a.pdf (accessed on 28 June 2017).
  32. Chang, C.C.; Lin, C.J. Libsvm: A Library for Support Vector Machines. Available online: http://www.csie.ntu.edu.tw/~cjlin/papers/libsvm.pdf (accessed on 28 June 2017).
  33. Yan, X.Y.; Francois, B.; Abbes, D. Operating power reserve quantification through PV generation uncertainty analysis of a microgrid. In Proceedings of the 2015 IEEE Eindhoven PowerTech, Eindhoven, The Netherlands, 29 June–2 July 2015; pp. 1–6. [Google Scholar]
Figure 1. (a) Daily average power generated by PV plant-1; (b) Daily generated energy of PV plants.
Figure 1. (a) Daily average power generated by PV plant-1; (b) Daily generated energy of PV plants.
Energies 10 00876 g001
Figure 2. (a) Pattern of solar irradiance and PV power output for abnormal days; (b) Correlation between solar irradiance and PV power output.
Figure 2. (a) Pattern of solar irradiance and PV power output for abnormal days; (b) Correlation between solar irradiance and PV power output.
Energies 10 00876 g002
Figure 3. Changing nonlinear regression into linear regression.
Figure 3. Changing nonlinear regression into linear regression.
Energies 10 00876 g003
Figure 4. Flowchart of the proposed support vector regression (SVR)-based forecasting model.
Figure 4. Flowchart of the proposed support vector regression (SVR)-based forecasting model.
Energies 10 00876 g004
Figure 5. PV power output of plant-1 in (a) normal weather conditions; (b) abnormal weather conditions. “ANN” means artificial neural network.
Figure 5. PV power output of plant-1 in (a) normal weather conditions; (b) abnormal weather conditions. “ANN” means artificial neural network.
Energies 10 00876 g005
Figure 6. PV power output of plant-2 in (a) normal weather conditions; (b) abnormal weather conditions.
Figure 6. PV power output of plant-2 in (a) normal weather conditions; (b) abnormal weather conditions.
Energies 10 00876 g006
Figure 7. PV power output of plant-3 in (a) normal weather conditions; (b) abnormal weather conditions.
Figure 7. PV power output of plant-3 in (a) normal weather conditions; (b) abnormal weather conditions.
Energies 10 00876 g007
Figure 8. PV power output of plant-1 in abnormal weather conditions (January 2016).
Figure 8. PV power output of plant-1 in abnormal weather conditions (January 2016).
Energies 10 00876 g008
Figure 9. (a) Point-wise deviation of PV plant-2 in different weather conditions; (b) Correlation between measured and forecasted PV power.
Figure 9. (a) Point-wise deviation of PV plant-2 in different weather conditions; (b) Correlation between measured and forecasted PV power.
Energies 10 00876 g009
Table 1. Details of photovoltaic (PV) power plants utilized in the study.
Table 1. Details of photovoltaic (PV) power plants utilized in the study.
Data SourcesNature of PlantInstalled CapacityMeasurement Items
Plant-1Monocrystalline1875 Wp (75 W × 25 pcs)(i) PV power generation;
(ii) Solar irradiance (W/m2);
(iii) Atmospheric temperature (°C);
(iv) PV module temperature (°C);
(v) Wind speed (m/s)
Plant-2Polycrystalline2000 Wp (125 W × 16 pcs)
Plant-3Thin-Film2700 Wp (135 W × 20 pcs)
Table 2. Set of collected input/output variables of the PV system.
Table 2. Set of collected input/output variables of the PV system.
Input/Output ParametersUnitResolutionVariables
Input ParametersSolar IrradianceW/m25 minIrradiance
Ambient Temperature°C5 minTemp_amb
Module Temperature°C5 minTemp_module
Wind Speedm/s5 minWind_speed
Output ParameterPV Output PowerWatt5 minPower_pv
Table 3. Summary of the forecasting performance for the data of May 2016.
Table 3. Summary of the forecasting performance for the data of May 2016.
MetricsModelsPlant-1Plant-2Plant-3MeanAverage
nRMSE (%)ProposedN = 2.77N = 2.56N = 2.58N = 2.643.06
A = 3.66A = 3.53A = 3.21A = 3.47
ANNN = 3.53N = 3.32N = 3.45N = 3.433.83
A = 4.58A = 4.47A = 3.65A = 4.23
PersistenceN = 15.97N = 15.64N = 14.81N = 15.4716.22
A = 17.25A = 16.97A = 16.68A = 16.97
MAE (W)ProposedN = 17.12N = 24.87N = 46.99N = 29.6632.36
A = 23.81A = 34.68A = 46.65A = 35.05
ANNN = 25.51N = 40.74N = 66.62N = 44.2944.26
A = 27.94A = 46.48A = 58.27A = 44.23
PersistenceN = 72.40N = 120.43N = 170.47N = 121.10115.36
A = 64.04A = 106.50A = 158.30A = 109.61
MBE (W)ProposedN = 6.91N = 23.38N = 6.56N = 12.2816.11
A = 13.86A = 16.59A = 29.35A = 19.93
ANNN = 14.89N = 23.79N = 34.77N = 24.4814.42
A = 3.95A = 4.68A = 4.41A = 4.35
PersistenceN = 23.93N = 44.13N = 57.38N = 41.8160.39
A = 45.56A = 73.30A = 118.05A = 78.97
Hence, “N” means normal day and “A” means abnormal day. “nRMSE” means normalized root-mean-square error. “MAE” means mean absolute error. “MBE” means mean bias error.
Table 4. Summary of the forecasting performance for the data of January 2016.
Table 4. Summary of the forecasting performance for the data of January 2016.
MetricsModelsPlant-1Plant-2Plant-3MeanAverage
nRMSE (%)ProposedN = 2.78N = 2.27N = 2.96N = 2.673.12
A = 3.96A = 3.46A = 3.30A = 3.57
ANNN = 4.16N = 3.24N = 3.17N = 3.523.86
A = 4.27A = 4.11A = 4.20A = 4.19
PersistenceN = 12.99N = 13.34N = 13.61N = 13.3113.03
A = 13.32A = 12.49A = 12.41A = 12.74
MAE (W)ProposedN = 19.71N = 23.41N = 36.93N = 26.6833.63
A = 24.57A = 36.49A = 60.66A = 40.57
ANNN = 28.90N = 39.42N = 67.48N = 45.2750.69
A = 32.36A = 50.37A = 85.57A = 56.10
PersistenceN = 68.48N = 108.33N = 170.47N = 115.76109.17
A = 60.89A = 96.21A = 150.60A = 102.57
MBE (W)ProposedN = 10.62N = 5.65N = 11.98N = 9.428.08
A = 10.43A = 2.32A = 7.45A = 6.73
ANNN = 9.62N = 6.58N = 14.62N = 10.2712.52
A = 20.79A = 8.93A = 14.58A = 14.77
PersistenceN = 28.99N = 49.81N = 76.00N = 51.6045.88
A = 25.47A = 37.49A = 57.52A = 40.16
Table 5. Summary of the forecasting performance for the data of September 2016.
Table 5. Summary of the forecasting performance for the data of September 2016.
MetricsModelsPlant-1Plant-2Plant-3MeanAverage
nRMSE (%)ProposedN = 2.85N = 2.67N = 2.72N = 2.753.06
A = 3.71A = 3.36A = 3.04A = 3.37
ANNN = 3.31N = 3.03N = 3.11N = 3.153.90
A = 5.02A = 4.73A = 4.16A = 4.64
PersistenceN = 10.80N = 11.47N = 11.42N = 11.2310.45
A = 8.95A = 10.16A = 9.88A = 9.66
MAE (W)ProposedN = 22.02N = 34.64N = 53.82N = 36.8337.72
A = 23.24A = 40.70A = 51.89A = 38.61
ANNN = 26.39N = 41.45N = 65.14N = 44.3351.54
A = 35.68A = 60.63A = 79.95A = 58.75
PersistenceN = 44.27N = 75.78N = 113.62N = 77.8978.25
A = 40.94A = 77.90A = 117.00A = 78.61
MBE (W)ProposedN = 2.40N = 11.92N = 21.83N = 12.059.82
A = 8.43A = 9.61A = 4.71A = 7.58
ANNN = 11.81N = 23.67N = 27.10N = 20.8613.81
A = 3.85A = 3.47A = 12.93A = 6.75
PersistenceN = 25.31N = 48.19N = 75.27N = 49.5934.37
A = 4.87A = 20.42A = 32.13A = 19.14
Table 6. Summary of overall forecasting performance.
Table 6. Summary of overall forecasting performance.
MetricsMethodsAverage (January)Average (May)Average (September)Average (Proposed Model)Average (ANN Model)Average of Persistence ModelOther ANN Models [33]
nRMSE (%)Proposed3.123.063.063.083.8613.235.95
ANN3.863.833.90
Persistence13.0316.2210.45
MAE (W)Proposed33.6332.3637.7234.5748.83100.93-
ANN50.6944.2651.54
Persistence109.17115.3678.25
MBE (W)Proposed8.0816.119.8211.3413.5846.88-
ANN12.5214.4213.81
Persistence45.8860.3934.37

Share and Cite

MDPI and ACS Style

Das, U.K.; Tey, K.S.; Seyedmahmoudian, M.; Idna Idris, M.Y.; Mekhilef, S.; Horan, B.; Stojcevski, A. SVR-Based Model to Forecast PV Power Generation under Different Weather Conditions. Energies 2017, 10, 876. https://doi.org/10.3390/en10070876

AMA Style

Das UK, Tey KS, Seyedmahmoudian M, Idna Idris MY, Mekhilef S, Horan B, Stojcevski A. SVR-Based Model to Forecast PV Power Generation under Different Weather Conditions. Energies. 2017; 10(7):876. https://doi.org/10.3390/en10070876

Chicago/Turabian Style

Das, Utpal Kumar, Kok Soon Tey, Mehdi Seyedmahmoudian, Mohd Yamani Idna Idris, Saad Mekhilef, Ben Horan, and Alex Stojcevski. 2017. "SVR-Based Model to Forecast PV Power Generation under Different Weather Conditions" Energies 10, no. 7: 876. https://doi.org/10.3390/en10070876

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop