Solar Energy Prediction Model Based on Artificial Neural Networks and Open Data
Abstract
:1. Introduction
- It can be built from Open Data. Therefore, it only requires data that can be freely found available on the web, and anyone with technical knowledge can build their own tailored version.
- It focuses on energy production, rather than solar radiation. Consequently, we are adding an abstraction layer that can answer higher-level questions. These questions range from analyzing the suitability of a specific place for a solar installation to its return on investment (ROI) from an economic point of view.
- It provides solid accuracy results. The approach provides competitive results on the MSE (Mean Squared Error) and MAE (Mean Absolute Error) metrics that have been found in the literature.
2. Related Work
3. Input Data for Solar Energy Prediction
- Direct radiation: Direct radiation is the component that is neither reflected nor scattered, reaching the directly the surface.
- Diffuse radiation: Diffuse radiation is the component of the solar energy that is scattered by the atmosphere and reaches the surface.
- Reflected radiation: Reflected radiation is the part of the radiation reflected by the surface or other elements and reaches the surface.
- Solar azimuth angle: It is the angle from due north in a clockwise direction of the sun. It influences solar energy generation. Usually, this information would need to be input into the model. However, our approach provides an abstraction layer that deals properly with this parameter.
- Air temperature: Air temperature has been demonstrated to have a strong influence on solar models [27] and a high correlation with solar photovoltaic generation. Solar panels are tested at room temperature (25 °C) and, therefore, the information provided by the manufacturer most often corresponds to a rather unusual situation of a solar panel operating under strong sunlight but with low temperature [28].
- Wind speed: Wind speed is closely related with air temperature. In the same way as air temperature affects photovoltaic panels, wind speed can influence temperature variance and consequently on performance of energy conversion [29].
- Performance of the photovoltaic (PV) panel: The electrical energy conversion ratio of solar panels greatly influences the amount of energy generated, depending on the radiation received. Currently, there is a great variety of photovoltaic panels, with a great variety of different compositions, offering very varied performances.
- Size of the generating installation: The larger the generating installation, the greater the collection of solar energy, and therefore, the greater the amount of electrical energy produced. This is one of the foremost factors in order to get a properly electrical energy production.
4. Solar Energy Prediction Model Based on Neural Networks and Open Data
4.1. Data Selection
- Information about the amount of energy generated by that installation
- Information about solar factors
- Information about geological and atmospheric factors
- Information about electrical factors that conform that installation.
- PVOutput [31]: A free service for sharing and comparing Photovoltaic output data. It provides large volumes of raw data related to solar photovoltaic production. From this data source, we can obtain information about the amount of energy generated by an installation in a range of time (item 1), information about the solar azimuth angle (item 3), and information about the electrical factors that conform that installation (item 4)
- Photovoltaic Geographical Information System (PVGIS) [32]: PVGIS is a system developed by the European Commission Joint Research Centre, at the JRC site in Ispra, Italy, since 2001. The focus of PVGIS is research in solar resource assessment, photovoltaic (PV) performance studies, and the dissemination of knowledge and data about solar radiation and PV performance. The information available on the platform is monthly, daily and even hourly solar radiation. Therefore, it fulfills the requirement of solar factors in a specific range of time (item 2) and the information about air temperature and winds speed for that range of time (item 3).
- Location, with Universal Transverse Mercator (UTM) coordinates
- Brand and model of the solar panels installed
- Full data (or almost full data) over the selected period (between 2014 and 2017, including data of the energy generated each day)
- Orientation and tilt of the solar panels installed.
- Size of installation (kW)
- Energy generated (kWh/day)
- Ratio of solar energy conversion (%). This value can be obtained from manufacturer of the solar panel (taking in account model and brand)
- Beam (direct) irradiance on the inclined plane (on photovoltaic panel) (W/m2)
- Diffuse irradiance on the inclined plane (on photovoltaic panel) (W/m2)
- Reflected irradiance on the inclined plane (on photovoltaic panel) (W/m2)
- Sun height (degrees)
- Temperature of the air (degree Celsius)
- Wind speed (m/s)
4.2. Artificial Neural Network
- Solar forecasting can be done using global energy (GR) as input parameter or it can be separated into its components: directed radiation (DR), diffused radiation (FR), and reflected radiation (RR), as we can see in Figure 3. Depending how this information is provided, the input will have one structure or other.
- Information related about solar parameters provided by the satellites is supplied every 5 min. That information can be connected with a dense layer (where all the information related with solar parameters are interconnected with each other), or we can use a stratified layer to isolate solar radiation data. With this last approach, the solar parameters from one range of time are not interfering in other ranges and it forms an isolated information core (Figure 4).
- Number of hidden layers (1 or 2). It has been shown that the number of hidden layers of the ANN can influence the precision of the result, therefore different hypotheses were tested [33]. An additional layer is necessary if broken-down radiation is used, whereas if global radiation is the input element, then only one hidden layer is necessary (Figure 5).
- Formula for calculating the number of neurons in the hidden layers. It has been shown that the number of neurons in the hidden layers can influence the result, so the number of neurons in the hidden layers was varied, following examples from the literature [34]. The proposed formulas are as follows:
- MAE (Mean Absolute Error): MAE measures the average magnitude of the errors in a set of predictions, without considering their direction. It is the average over the test sample of the absolute differences between prediction values (A) and real values (B) where all individual differences have equal weight. The mathematical representation is:
- MSE (Mean Squared Error): MSE measures the average magnitude of the squared errors in a set of predictions. As the error is squared, it will be always positive, and the direction of the error is irrelevant. The mathematical representation is:
5. Results, Discussion, and Limitations
5.1. ANN Topologies Results
- In approach 1, we can see oscillations in the training and validation datasets. In addition, it must be pointed that optimal iterations are centered around 500 epochs. From that point on, the validation dataset loses its minimum error value (0.2) and starts to grow.
- Approach 2 is very similar to the previous one. Global radiation, number of hidden layers, and dense connections are maintained (it changes the formula used for the number of neurons on the hidden layer). As we can see in the figures in the Appendix A, there are greater oscillations in MSE and MAE values than in the previous case, with similar results. As a result, this approach would be worse than previous one.
- In approach 3, we can see oscillations on the initial phases of the training/test phases, until it smooths from epoch 300 onwards. However, the results are not interesting, due to higher values of MSE and MAE (0.067 and 0.024, respectively).
- Approach 4 is the first pair of graphs with the dense layer connection and decomposed radiation. As it can be easily seen, there are too many oscillations, due to the fully connected approach with the decomposed radiation that conforms a huge multidimensional space of possibilities. Thus, the distribution function can be very difficult to learn properly. In addition, the results are not interesting due to higher values of MSE and MAE (0.10 and 0.27, respectively).
- Similar to the previous case, there are too many oscillations in approach 5. As a result, we can conclude that the formula used for calculating the number of neurons on hidden layer is less important than other parameters. As in the previous case, error values are relatively higher compared to best performing topologies.
- The third approach with dense layer and decomposed radiation is reflected in approach 6. It follows the trend of the other two: too many oscillations and elevated values of MSE (0.11) and MAE (0.27) for the validation dataset.
- Approach 7 is a hybrid of topologies with one hidden layer and topologies with two hidden layers. In this approach, we are using decomposed radiation, but only a simple add layer is used to obtain the distribution function. As it can be observed in the provided figures, the results are the worst across all the topologies, and they must be represented with a different scale.
- In approach 8, we find the first pair of figures with the stratified layer connection and global radiation approach. We can observe in the graphs that the optimal value of test is acquired with a notably higher number of epochs than in previous cases (10,000 vs 500). However, the oscillations have disappeared, and the results are quite more promising, with lower values of MSE and MAE (0.046 and 0.18, respectively).
- The figures that we can see in approach 9 follow the trends of previous one. With the modification of the number of neurons in the hidden layers, we try to tune the algorithm, in order to provide better results. However, the results are not clearly improved.
- Same as in the previous approach, with the modification of number of neurons in the hidden layers, we try to tune the algorithm, in order to provide better results. Nevertheless, with the formula used, the results are clearly worse in approach 10 than in the previous approach.
- To start closing up the analysis of the Appendix A, we present the three approaches based on stratified layer and decomposed radiation, starting with approach 11. As we can see in the graphs, these approaches obtain similar values to the best approach until now (approach 9), with similar values in MSE and MAE (0.04646 and 0.1757, respectively).
- In approach 12, we are tuning the parameters of number of hidden neurons, looking for slight improvements on MAE and MSE. With the second formula, the results are not clearly improved compared to previous cases.
- Finally, in approach 13, we are tuning the parameter of the number of hidden neurons, looking at slight improvements on MAE and MSE. These results slightly improve the MSE and MAE value obtained in approach 8, obtaining 0.0161 (vs. 0.1757) and 0.04025 (vs. 0.04646), respectively. Thus, these results make this ANN configuration the best one out of the studied topologies.
5.2. Results Compared to the State-Of-The-Art
5.3. Discussion and Limitations
6. Conclusions
- It is trained with Open Data. Consequently, everyone with proper technical knowledge can access the data sources and get the data for building their own ANN model.
- It is focused on solar power forecasting, rather than solar radiation prediction. As a result of that, we can answer high-level questions (focused on return on investment (ROI) of the installation), thanks to the abstraction layer that has been included between solar radiation and its conversion to solar power.
- It provides solid accuracy results. The presented method provides competitive results on MAE and MSE metrics compared to other approaches that we have found in the literature.
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
Appendix A
References
- Shafeek, H. Maintenance Practices in Cement Industry. Asian Trans. Eng. 2012, 1, 10–20. [Google Scholar]
- Kiesecker, J.; Baruch-Mordo, S.; Heiner, M.; Negandhi, D.; Oakleaf, J.R.; Kennedy, C.M.; Chauhan, P. Renewable Energy and Land Use in India: A Vision to Facilitate Sustainable Development. Sustainability 2019, 12, 281. [Google Scholar] [CrossRef] [Green Version]
- King, P.; Sansom, C.; Comley, P. Photogrammetry for Concentrating Solar Collector Form Measurement, Validated Using a Coordinate Measuring Machine. Sustainability 2019, 12, 196. [Google Scholar] [CrossRef] [Green Version]
- National Renewable Energy Laboratory. Best Research-Cell Efficiency Chart: Photovoltaic Research. Available online: https://www.nrel.gov/pv/cell-efficiency.html (accessed on 21 February 2020).
- Noticias CIEMAT: El CIEMAT Presenta en Alpedrete el Mapa Solar del Municipio. Available online: http://www.ciemat.es/cargarAplicacionNoticias.do;jsessionid=E2015D198EA7C11FCD180ACB22A03F3C?idArea=-1&identificador=1500 (accessed on 24 August 2020).
- Nia, M.; Chegaar, M.; Benatallah, M.F.; Aillerie, M. Contribution to the quantification of solar radiation in Algeria. Energy Procedia 2013, 36, 730–737. [Google Scholar] [CrossRef]
- Gautier, C.; Diak, G.; Massé, S. A Simple Physical Model to Estimate Incident Solar Radiation at the Surface from GOES Satellite Data. J. Appl. Meteorol. 1980, 19, 1005–1012. [Google Scholar] [CrossRef] [Green Version]
- Chineke, T.C. Equations for estimating global solar radiation in data sparse regions. Renew. Energy 2008, 33, 827–831. [Google Scholar] [CrossRef]
- Khatib, T.; Mohamed, A.; Mahmoud, M.M.; Sopian, K. Modeling of Daily Solar Energy on a Horizontal Surface for Five Main Sites in Malaysia. Int. J. Green Energy 2011, 8, 795–819. [Google Scholar] [CrossRef]
- Li, H.; Ma, W.; Wang, X.; Lian, Y. Estimating monthly average daily diffuse solar radiation with multiple predictors: A case study. Renew. Energy 2011, 36, 1944–1948. [Google Scholar] [CrossRef]
- Şen, Z. Simple nonlinear solar irradiation estimation model. Renew. Energy 2007, 32, 342–350. [Google Scholar] [CrossRef]
- Akinoǧlu, B.; Ecevit, A. Construction of a quadratic model using modified Ångstrom coefficients to estimate global solar radiation. Sol. Energy 1990, 45, 85–92. [Google Scholar] [CrossRef]
- Şen, Z. Angström equation parameter estimation by unrestricted method. Sol. Energy 2001, 71, 95–107. [Google Scholar] [CrossRef]
- Rizwan, M.; Jamil, M.; Kirmani, S.; Kothari, D. Fuzzy logic based modeling and estimation of global solar energy using meteorological parameters. Energy 2014, 70, 685–691. [Google Scholar] [CrossRef]
- Chugh, A.; Chaudhary, P.; Rizwan, M. Fuzzy logic approach for short term solar energy forecasting. In Proceedings of the 12th IEEE International Conference Electronics, Energy, Environment, Communication, Computer, Control: (E3-C3) INDICON, Piscataway, NJ, USA, 17–20 December 2015. [Google Scholar]
- Monís, J.I.; López-Luque, R.; Reca, J.; Martínez, J. Multistage Bounded Evolutionary Algorithm to Optimize the Design of Sustainable Photovoltaic (PV) Pumping Irrigation Systems with Storage. Sustainability 2020, 12, 1026. [Google Scholar] [CrossRef] [Green Version]
- Che, S.; Boyer, M.; Meng, J.; Tarjan, D.; Sheaffer, J.W.; Skadron, K. A performance study of general-purpose applications on graphics processors using CUDA. J. Parallel Distrib. Comput. 2008, 68, 1370–1380. [Google Scholar] [CrossRef]
- Jin, L.; Kuang, X.; Huang, H.; Qin, Z.; Wang, Y. Study on the Overfitting of the Artificial Neural Network Forecasting Model. J. Meteorol. Res. 2005, 19, 216–225. [Google Scholar]
- Karatepe, E.; Boztepe, M.; Colak, M. Neural network based solar cell model. Energy Convers. Manag. 2006, 47, 1159–1178. [Google Scholar] [CrossRef]
- Fadare, D. Modelling of solar energy potential in Nigeria using an artificial neural network model. Appl. Energy 2009, 86, 1410–1422. [Google Scholar] [CrossRef]
- Mellit, A.; Benghanem, M.; Bendekhis, M. Artificial neural network model for prediction solar radiation data: Application for sizing stand-alone photovoltaic power system. In Proceedings of the 2005 IEEE Power Engineering Society General Meeting, San Francisco, CA, USA, 12–16 June 2005; Volume 1, pp. 40–44. [Google Scholar]
- Amrouche, B.; Le Pivert, X. Artificial neural network based daily local forecasting for global solar radiation. Appl. Energy 2014, 130, 333–341. [Google Scholar] [CrossRef]
- Quiles, E.; Roldán-Blay, C.; Escrivá-Escrivá, G.; Porta, C.R. Accurate Sizing of Residential Stand-Alone Photovoltaic Systems Considering System Reliability. Sustainability 2020, 12, 1274. [Google Scholar] [CrossRef] [Green Version]
- Abuella, M.; Chowdhury, B. Solar power forecasting using artificial neural networks. In Proceedings of the 2015 North American Power Symposium, Charlotte, NC, USA, 4–6 October 2015. [Google Scholar]
- Cococcioni, M.; D’Andrea, E.; Lazzerini, B. 24-h-ahead forecasting of energy production in solar PV systems. In Proceedings of the 11th International Conference on Intelligent Systems Design and Applications, Cordoba, Spain, 22–24 November 2011; pp. 1276–1281. [Google Scholar]
- Gala, Y.; Fernández, A.; Dorronsoro, J.; García, M.; Rodríguez, C. Machine Learning Prediction of Global Photovoltaic Energy in Spain. Renew. Energy Power Qual. J. 2014, 605–610. [Google Scholar] [CrossRef]
- Ibrahim, S.; Daut, I.; Irwan, Y.; Irwanto, M.; Gomesh, N.; Farhana, Z. Linear Regression Model in Estimating Solar Radiation in Perlis. Energy Procedia 2012, 18, 1402–1412. [Google Scholar] [CrossRef] [Green Version]
- Dupré, O.; Vaillon, R.; Green, M.A. Thermal Behavior of Photovoltaic Devices; Springer Science and Business Media: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
- Jacques, S.; Caldeira, A.; Ren, Z.; Schellmanns, A.; Batut, N. Impact of the cell temperature on the energy efficiency of a single glass PV module: Thermal modeling in steady-state and validation by experimental data. Renew. Energy Power Qual. J. 2013, 1, 291–294. [Google Scholar] [CrossRef]
- Uhlir, P.F.; Schröder, P. Open Data for Global Science. Data Sci. J. 2007, 6, OD36–OD53. [Google Scholar] [CrossRef] [Green Version]
- PVOutput. Available online: https://pvoutput.org/ (accessed on 10 March 2020).
- EU Science Hub. Photovoltaic Geographical Information System (PVGIS). Available online: https://ec.europa.eu/jrc/en/pvgis (accessed on 24 August 2020).
- Behera, M.K.; Nayak, N. A comparative study on short-term PV power forecasting using decomposition based optimized extreme learning machine algorithm. Eng. Sci. Technol. Int. J. 2020, 23, 156–167. [Google Scholar] [CrossRef]
- Shaft, I.; Ahmad, J.; Shah, S.I.; Kashif, F.M. Impact of varying neurons and hidden layers in neural network architecture for a time frequency application. In Proceedings of the 10th IEEE International Multitopic Conference (INMIC), Islamabad, Pakistan, 23–24 December 2006; pp. 188–193. [Google Scholar]
- Hyndman, R.J.; Koehler, A.B. Another look at measures of forecast accuracy. Int. J. Forecast. 2006, 22, 679–688. [Google Scholar] [CrossRef] [Green Version]
- Kim, S.; Kim, H. A new metric of absolute percentage error for intermittent demand forecasts. Int. J. Forecast. 2016, 32, 669–679. [Google Scholar] [CrossRef]
- Wan, C.; Zhao, J.; Song, Y.; Xu, Z.; Lin, J.; Hu, Z. Photovoltaic and solar power forecasting for smart grid energy management. CSEE J. Power Energy Syst. 2015, 1, 38–46. [Google Scholar] [CrossRef]
- Zhang, W.; Zhang, H.; Liu, J.; Li, K.; Yang, D.; Tian, H. Weather prediction with multiclass support vector machines in the fault detection of photovoltaic system. IEEE/CAA J. Autom. Sin. 2017, 4, 520–525. [Google Scholar] [CrossRef]
Approach | Description |
---|---|
1 | Dense layer. Formula (1). Hidden layers = 1. Global radiation |
2 | Dense layer. Formula (2). Hidden layers = 1. Global radiation |
3 | Dense layer. Formula (3). Hidden layers = 1. Global radiation |
4 | Dense layer. Formula (1). Hidden layers = 1. Decomposed radiation |
5 | Dense layer. Formula (2). Hidden layers = 1. Decomposed radiation |
6 | Dense layer. Formula (3). Hidden layers = 1. Decomposed radiation |
7 | Stratified layer. Add layer Hidden layers = 1. Decomposed radiation |
8 | Stratified layer. Formula (1) Hidden layers = 2. Global radiation |
9 | Stratified layer. Formula (2) Hidden layers = 2. Global radiation |
10 | Stratified layer. Formula (3) Hidden layers = 2. Global radiation |
11 | Stratified layer. Formula (1) Hidden layers = 2. Decomposed radiation |
12 | Stratified layer. Formula (2) Hidden layers = 2. Decomposed radiation |
13 | Stratified layer. Formula (3) Hidden layers = 2. Decomposed radiation |
Approach | MSE | Type of Prediction | Prediction Granularity | MAE | |
---|---|---|---|---|---|
Solar power forecasting using artificial neural networks [24] | 0.069–0.055 | Energy | Daily, Monthly | - | |
24-h-ahead forecasting of energy production in solar PV systems [25] | - | Energy | Daily | Spring: | 0.122 |
Summer: | 0.211 | ||||
April–Sept: | 0.266 | ||||
Contribution of quantification of solar radiation in Algeria [6] | 0.0157–0.0418 | Radiation | Monthly | - | |
Solar energy prediction model based on artificial neural networks and open data | 0.040 | Energy | 5 Min, Daily, Monthly | 0.161 |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Barrera, J.M.; Reina, A.; Maté, A.; Trujillo, J.C. Solar Energy Prediction Model Based on Artificial Neural Networks and Open Data. Sustainability 2020, 12, 6915. https://doi.org/10.3390/su12176915
Barrera JM, Reina A, Maté A, Trujillo JC. Solar Energy Prediction Model Based on Artificial Neural Networks and Open Data. Sustainability. 2020; 12(17):6915. https://doi.org/10.3390/su12176915
Chicago/Turabian StyleBarrera, Jose Manuel, Alejandro Reina, Alejandro Maté, and Juan Carlos Trujillo. 2020. "Solar Energy Prediction Model Based on Artificial Neural Networks and Open Data" Sustainability 12, no. 17: 6915. https://doi.org/10.3390/su12176915
APA StyleBarrera, J. M., Reina, A., Maté, A., & Trujillo, J. C. (2020). Solar Energy Prediction Model Based on Artificial Neural Networks and Open Data. Sustainability, 12(17), 6915. https://doi.org/10.3390/su12176915