For any EMS to take optimal decisions on when to charge/discharge the ESS and feed in/draw power to/from the grid, it is of utmost importance that such an EMS be equipped with a forecasting module. The objective of such a module is to estimate the amount of power that can be generated from the corresponding installed solar panels. The generated forecasts can be later used for day-ahead planning purposes.
In this section, we first present our proposed methodology to identify the relevant parameters impacting on the power generation forecasts of solar panels. Furthermore, based on the derived analysis, we then evaluate the accuracy of those forecasts by considering three different types of networks, namely FFNN, RBFN, and NEAT.
4.2. Correlation Analysis
In this section, we present the results of the analysis carried out on the consolidated dataset by applying the methodology of
Section 4.1. We started (Step 2) with Pearson’s correlation (see
Section 2.2.3) and then, based on the results of this correlation, analysis was performed using a neural network (Step 3).
Table 2 illustrates the corresponding Pearson’s correlation of the different considered attributes of the consolidated dataset given in percent. Note that green and red cells denote positive and negative correlations, respectively. When inspecting the table, one must remember that correlation does not imply causation (e.g., indicates that one event is the result of the occurrence of the other event). For instance, the relatively high correlation between the clear-sky GHI (e.g., CSG) on the one hand and the temperature (e.g., TP) on the other hand is a good example for that. The clear-sky GHI is dependent on the sun’s current activity in terms of insolation and the angle between the site of observation and the sun.
Still in comparison with aforementioned clarification, the correlation of the GHI with the cloud cover (e.g., CC) might show causation. The high correlation between clear-sky GHI and cloud cover is argued to be random, but the difference to the correlation between GHI and cloud cover can be interpreted as causal, where the rationale is explained next. GHI is the product of the clear-sky GHI and the clear-sky index, which indicates how little the insolation reaching the atmosphere is absorbed on the way to ground level. This means GHI and cloud cover, which is the percentage of the sky covered, must be indirect proportional effects and therefore have a lower Pearson’s correlation than the random one between clear-sky GHI and cloud cover.
There is also a causal explanation for the high correlation between temperature and GHI, and clear-sky GHI, respectively. Fluctuation in the irradiation reaching the atmosphere (e.g., respectively the ground plane) is the main reason for deviation in temperature. Meteorologic phenomena like wind and cloud cover are a direct consequence of regional temperature differences which weaken the correlation but not the causation. Therefore, the irradiation is a direct source of increase in temperature.
Temperature, on one hand, and wind speed (WS), respectively, wind gust on the other hand, are negatively correlated since wind is not a consequence of local temperature but rather the temperature difference between two arbitrary regions where neither of those regions is the site of observation. In fact, wind in the region of the considered site is often caused by temperature differences between the Atlantic Ocean and the Eurasian land mass. This also explains the correlation of temperature and cloud cover being close to zero as well. Cloud cover is not caused by local temperature, but depending on whether the wind moves during cloudy or clear weather conditions towards the considered site. Those attributes have a causal relation between each other, but it is a far more complex system than a linear correlation.
Humidity (HM) could be seen as combination of effects like cloud cover and precipitation intensity (PI) and its probability (PP). Therefore, the low correlation between cloud cover and humidity is unexpected at first sight, but can be explained by weather conditions of dry cloudy days as well as humid mornings and evenings with clear sky. The negative correlation between humidity and insolation (DHI and GHI), with the absolute values of 0.46 and 0.63, might indicate a meaningful relation.
At a certain level, a higher temperature causes higher panel efficiency and, therefore, higher power generation (MP) for the same amount of insolation. Nevertheless, the high correlation of temperature and power generation is mostly caused by the correlation of temperature and insolation, and rather unlikely to be the consequence of the minor increase in panel efficiency. We know that the amount of generated power is directly dependent on the irradiate. Hence, as expected, correlation is highest between the different types of irradiation (i.e. CSG, CSD, GHI, DHI) and power generation.
Visibility (VS) and cloud cover both indicate sky clearness and therefore influence the amount of irradiation reaching the ground plane. Their effect on power generation is already implicitly taken into account when using the GHI instead of the clear-sky GHI. The differences in correlation between those attributes (i.e., VS and CC) with respect to GHI and clear-sky GHI, respectively, confirm this. The negative correlation of those attributes among each other illustrates that irradiation is the most important influencing factor.
To conclude, from the Pearson’s correlation analysis presented in this section, it can be clearly noted that both GHI and temperature can be used as useful parameters in estimating the power generation of the solar panels. This fact is illustrated in
Figure 2,
Figure 3 and
Figure 4 that capture power generation, global horizontal irradiance, as well as ambient temperature, respectively, during the period between 1 May 2017 and 30 April 2018 such that the X-axis of those figures denotes the hour of the day scaled between 00:00 and 23:00 on an hourly basis. Furthermore,
Figure 5 demonstrates the average yearly power generation and irradiation. It can be seen from those figures that peak power generation happens at 13:00, which coincides with the corresponding peak values of irradiation and temperature. It is important to note that, although the gathered data sources are from the southern region of Bavaria (Germany), the derived conclusions of this section are generic enough to be applicable to any region having similar characteristics as the one considered in this paper.
Next, we present the different neural-network types that were used in order to train the corresponding networks by having both GHI and temperature as input parameters.