Next Article in Journal
Logging Identification Methods for Oil-Bearing Formations in the Chang 6 Tight Sandstone Reservoir in the Qingcheng Area, Ordos Basin
Previous Article in Journal
Exergy Analysis in Highly Hydrogen-Enriched Methane Fueled Spark-Ignition Engine at Diverse Equivalence Ratios via Two-Zone Quasi-Dimensional Modeling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Enhanced Forecasting Method of Daily Solar Irradiance in Southwestern France: A Hybrid Nonlinear Autoregressive with Exogenous Inputs with Long Short-Term Memory Approach

by
Oubah Isman Okieh
1,2,*,
Serhat Seker
1,
Seckin Gokce
3 and
Martin Dennenmoser
3
1
Electrical Engineering, Istanbul Technical University, 34469 Istanbul, Turkey
2
Energy and Environment Research Center, University of Djibouti, Djibouti 1904, Djibouti
3
BayWa r.e. Solar Projects GmbH, 81925 München, Germany
*
Author to whom correspondence should be addressed.
Energies 2024, 17(16), 3965; https://doi.org/10.3390/en17163965 (registering DOI)
Submission received: 18 June 2024 / Revised: 12 July 2024 / Accepted: 18 July 2024 / Published: 10 August 2024
(This article belongs to the Section A2: Solar Energy and Photovoltaic Systems)

Abstract

:
The increasing global reliance on renewable energy sources, particularly solar energy, underscores the critical importance of accurate solar irradiance forecasting. As solar capacity continues to grow, precise predictions of solar irradiance become essential for optimizing the performance and reliability of photovoltaic (PV) systems. This study introduces a novel hybrid forecasting model that integrates Nonlinear Autoregressive with Exogenous Inputs (NARX) with Long Short-Term Memory (LSTM) networks. The purpose is to enhance the precision of predicting daily solar irradiance in fluctuating meteorological scenarios, particularly in southwestern France. The hybrid model employs the NARX model’s capacity to handle complex non-linear relationships and the LSTM’s aptitude to manage long-term dependencies in time-series data. The performance metrics of the hybrid NARX-LSTM model were thoroughly assessed, revealing a mean absolute error (MAE) of 9.58 W/m2, a root mean square error (RMSE) of 16.30 W/m2, and a Coefficient of Determination (R2) of 0.997. Consequently, the proposed hybrid model outperforms the benchmark model in all metrics, showing a significant improvement in prediction accuracy and better alignment with the observed data. These results highlight the model’s effectiveness in enhancing forecasting accuracy under unpredictable conditions, improving solar energy integration into power systems, and ensuring more reliable energy predictions.

1. Introduction

The escalating demand for energy sources in many sectors, such as transportation, residential, and industry, along with the adverse environmental effects of fossil fuels, has led to the emergence of renewable energy sources as a viable alternative, such as wind, geothermal, biomass, solar, and hydropower energy [1,2,3,4]. In the year 2022, the worldwide renewable energy capacity amounted to 3381 GW. Among these renewable sources, solar energy emerged as the second most extensively implemented, with a capacity of 1053 GW. It is positioned second after hydropower, which leads with a total capacity of 1392 GW [5,6]. Solar energy refers to the process of harnessing sunlight to generate electricity. Solar energy is widely recognized as a highly significant form of renewable energy on a global scale, owing to its abundant availability and environmentally advantageous characteristics [7,8,9,10]. The global installed solar energy capacity has had a significant growth, rising from 140,514 MW to 1053 GW over the course of the past decade (2013–2022) [11]. The predicted capacity for solar photovoltaic (PV) power generation is expected to reach 7000 terawatt-hours (TWh) by the year 2050. Multiple variables, including solar irradiance, wind speed, ambient and module temperatures, cloud cover, and humidity, influence solar power production. The generation of solar power is directly tied to solar irradiance, as higher levels of sunlight exposure result in increased energy output from solar panels. Therefore, precise predictions of solar irradiance are essential for assessing the solar energy potential in regions with fluctuating environmental conditions, such as France [12]. France is positioned as the fifth-ranked nation in Europe and the thirteenth-ranked nation in the world in terms of its significant solar energy capacity, trailing behind Germany, Italy, Spain, and the Netherlands. Over the past decade, the nation has witnessed a threefold increase in its cumulative installed solar energy capacity, rising from 5277 MW to 17,419 MW [11]. Figure 1 provides a graphical representation of the growth in solar energy capacity from 2013 to 2022, comparing the growth of France’s solar infrastructure with the global trend. This visual comparison highlights France’s role in the global solar market, with a consistent growth in installed capacity throughout the decade. This growth reflects France’s dedication to expanding its renewable energy resources. Simultaneously, the global capacity has shown a significant rise, indicating a robust worldwide shift towards solar energy.
The research on solar irradiance forecasting has gained significant attention, driven by the growing demand for renewable and environmentally friendly energy sources. Hence, the accurate prediction of solar irradiance is crucial for enhancing the effectiveness and reliability of solar power systems, and numerous techniques have been developed to meet this requirement [13,14,15,16]. Statistical techniques [17] like autoregressive integrated moving average (ARIMA) use past data to forecast future levels of irradiance, providing accurate predictions for short-term forecasts. Numerical weather prediction (NWP) models [18], which employ the principles of atmospheric physics to make accurate predictions, are well-suited for forecasting weather patterns over a time frame ranging from medium- to long-term, particularly in regions with complex weather patterns. Additionally, support vector machines (SVMs) [19], artificial neural networks (ANNs) [20], and random forests (RFs) [21] are machine learning approaches that have become popular because of their capacity to handle nonlinear relationships and adjust to various geographical situations. They have proven to be useful in both short-term and intra-day forecasting [22,23,24,25]. Furthermore, recent studies have developed hybrid models that combine different forecasting techniques to take advantage of their strengths and minimize their weaknesses, resulting in improved accuracy and reliability of forecasts [26].
Nevertheless, models frequently have difficulties in dealing with the fundamental instability of weather conditions. Severe weather phenomena, such as cloud cover and storms, have the potential to rapidly decrease the amount of sun radiation, making precise and consistent prediction difficult [27].
This study presents an advanced hybrid Nonlinear Autoregressive Network with Exogenous Inputs Long Short-Term Memory (NARX-LSTM) model to address the increasing demand for accurate daily solar irradiance forecasting, particularly in unstable weather conditions in southwestern France. The utilization of a NARX-LSTM hybrid model for predicting solar irradiance in areas with unpredictable environmental variables is significant progress in this study. By integrating the autoregressive capabilities of the NARX model with the long-term memory functions of the LSTM, this approach effectively addresses the complexity of environmental variability. The robustness of the NARX-LSTM model not only contributes to more precise energy planning and management but also ensures better operational efficiency of solar energy systems in regions where traditional models fail to perform adequately. This combination has the potential to establish a new benchmark model for accurate forecasting in locations with unpredictable environmental conditions.
The present paper is structured as follows: Section 2 provides an overview of the existing literature on the topic of solar energy forecasting. Section 3 provides a thorough explanation of the materials and the methodologies used in the present study. The findings of the simulation are analyzed and presented in Section 4. Section 5 presents the concluding remarks.

2. Related Works

Several studies have been undertaken to predict solar radiation through various methodologies. In other words, some research opted to employ a single methodological approach, while others combined many methodologies in a hybrid strategy to improve the accuracy and reliability of the network’s predictions.
In their study, Rangel et al. [28] proposed several NARX models for three sites. The focus of the study was to enhance the performance of NARX models for forecasting solar radiation an hour in advance by appropriately configuring input vectors. The study employed Engle–Granger causality tests to investigate collinearity among meteorological variables in databases from Chihuahua, Temixco, and Zacatecas, three Mexican cities characterized by distinct climates. The authors discovered that using input vectors containing only solar radiation and temperature data was sufficient for accurately predicting solar radiation. Including pressure, wind speed, and relative humidity data did not significantly improve the accuracy of the forecasts. The comparison of forecasting accuracy between the improved NARX models and the smart persistence model revealed notable enhancements in accuracy with the NARX models across all three cities.
Additionally, Hissou et al. [29] introduced a novel method for predicting solar radiation using a multivariate time series (MVTS) model that employed recursive feature elimination (RFE) with various models such as decision tree (DT), logistic regression (LogR), gradient boosting models (GBMs), Pearson correlation (Pr), and RF to improve the accuracy of forecasting by identifying the most important features. The LogR model particularly stands out, showing excellent performance with a low root mean square error (RMSE) (0.003) and a high R2 score (0.002), while other models also demonstrate promising results with consistently high accuracy. The paper demonstrates how different machine learning approaches contribute to refining solar radiation forecasts by handling multiple variables and their complex relationships. This comprehensive approach not only enhances the precision and effectiveness of solar radiation forecasts but also provides a valuable understanding of the most influential meteorological variables that impact solar radiation levels, such as temperature and humidity.
Furthermore, Natava et al. [30] proposed a novel approach to forecast daily global solar radiation in Dumaguete using a NARX-LSTM hybrid network. The study discovered that the optimized Gaussian Process Regression (GPR) model and NARX were successful in accurately predicting outcomes using limited datasets. Despite their inherent inaccuracies, incorporating these models into an LSTM network enhanced their predictive precision. The LSTM model alone was highly effective with minimal losses, suggesting its robustness in predicting global solar radiation with just six variables. The hybrid NARX-LSTM model improved the results by achieving an RMSE of about 0.10, potentially decreasing the inconsistency and unpredictability in the output of PV cells.
Another study by Ref. [25] introduced a novel hybrid model for predicting solar radiation, combining the strengths of autoregressive and moving average (ARMA) and NARX models. Initially, the ARMA model is employed to predict solar radiation for specific winter and summer days, as well as to estimate daily and weekly averages. The hybrid approach benefits from ARMA’s initial predictive accuracy and NARX’s capabilities, aiming to deliver highly accurate and efficient solar radiation forecasts. The choice of models for the hybrid system is strategically determined by their simplicity, adaptability, and performance. Each model undergoes individual testing to verify its usefulness before being integrated. Therefore, the study confirms that while the ARMA model is effective in predicting weekly solar radiation averages when variations are under 10%, its performance declines with larger fluctuations. In such cases, the NARX model is employed, utilizing temperature and historical solar radiation data as inputs to handle higher variability (over 10%). The effectiveness of the resulting hybrid model is evaluated using metrics such as mean square error (MSE), mean absolute error (MAE), and RMSE. These metrics demonstrate improved performance in comparison to several prior research, highlighting the proposed model’s capacity to estimate weekly averages of solar radiation with more accuracy.
Moreover, Ref. [31] developed a hybrid algorithm combining k-means clustering and support vector regression (SVR) to predict daily global solar radiation. The model’s effectiveness was assessed using six years of meteorological data from Ibadan (2010–2015) to predict solar radiation for 2016 and 2017. The model was compared to three well-established models, namely ANN, Angstrom-Prescott, and ARMA, using a range of statistical tests, including RMSE, relative root mean square error (RRMSE), mean absolute percentage error (MAPE), and R2. The results demonstrated the k-means-SVR model’s excellent accuracy and reliability, surpassing the performance of other current models. This indicates that the k-means-SVR model has great potential as a tool for energy planning.
Mutavhatsindi et al. [7] employed machine learning models to predict hourly solar irradiation. The performance of LSTM, SVR, and feed-forward neural networks (FFNNs) was evaluated and compared to the principal component regression model (PCR). The process of selecting variables was performed using the least absolute shrinkage and selection operator (Lasso). Based on the data, the FFNN model outperformed the other two machine learning models in terms of forecast accuracy, as measured by MAE and RMSE. The study found that when combining forecasts from machine learning models using both the convex combination approach and quantile regression averaging (QRA), QRA proved to be the superior method. It exceeded other models in all statistical tests and forecast accuracy measures employed.
The study conducted by Huang et al. [32] at Ganzhou station used historical meteorological data spanning from 1980 to 2016; twelve machine learning model networks (extreme gradient boosting (XGBoost), multiple linear regression (MLR), radial basis function neural network (RBFNN), RF, DT, back-propagation algorithm-based neural network (BPNN), extreme learning machine (ELM), GPR, gradient boosting regression tree (GBRT), adaptive boosting (AdaBoost), k-nearest neighbor (KNN), and SVR) were assessed for their effectiveness in predicting solar radiation both daily and monthly. The most frequently utilized models were GBRT, XGBoost, GPR, and RF, compared to the other models. These models were assessed using metrics such as R2, RMSE, MAE, and BIAS. They found that GBRT showed the highest level of accuracy when it came to daily forecasts, although XGBoost outperformed other models for monthly predictions. Additionally, they conducted experiments with a stacking model that integrated these models. This stacking model demonstrated enhanced accuracy in daily radiation predictions. However, its effectiveness in monthly predictions was diminished, most likely due to limited data. This comprehensive investigation by Huang et al. demonstrated the potential of advanced machine learning methods, including hybrid approaches like stacking models, to significantly enhance the predictive accuracy of solar radiation, which is critical for strategic energy sector management. In their results, they determined that the XGBoost model excels at predicting solar radiation values, recommending the use of either the XGBoost or a stacking fusion model when handling large datasets.
The researchers of Ref. [21] focused on enhancing solar radiation prediction using the RF algorithm, comparing it to models like AdaBoost, linear regression (LR), and recurrent neural networks (RNNs). The study found that optimizing RF’s hyperparameters led to a significant accuracy improvement of 95.98%, surpassing other approaches. The favorable and consistent climate in Queretaro contributed to the high accuracy of the predictive models. The study also investigated the correlations among solar radiation, temperature, and humidity, confirming that humidity has a major influence on this relationship.
Chodakowska et al. [33] examined the use of ARIMA models to predict solar radiation using data from Jordan and Poland. The study evaluated the efficiency of ARIMA models under diverse meteorological conditions and for different time intervals, including monthly and hourly periods. The models had robust prediction skills, particularly under stable weather circumstances, and were observed to possess a high level of accuracy in Jordan owing to the persistent presence of sunlight. Nevertheless, the effectiveness of these methods showed inconsistencies in Warsaw, indicating limitations in less stable circumstances. This study highlights the significance of adjusting ARIMA models to geographic and climatic circumstances to enhance the accuracy of solar radiation predictions. This is essential for effective management and planning of energy systems.
In their study, Woldegiyorgis et al. [34] conducted a comprehensive analysis on forecasting daily and monthly averaged horizontal global solar radiation (HGSR) in Fiche town, Ethiopia, by employing various ANN models. They used ANN models like cascade-forward back-propagation (CFBP), feed-forward back-propagation (FFBP), Elman back-propagation (EBP), and layer recurrence (LayR) to estimate HGSR, analyzing their performance through metrics such as MAPE, MSE, and RMSE. The findings suggest that ANN models are capable of accurately forecasting solar radiation levels, although the level of accuracy may vary among different models.
Overall, these studies collectively contribute to the understanding of solar radiation forecasting by employing diverse methodological frameworks that adapt to the specifics of the datasets and environmental conditions they study. This review highlights the potential of combining traditional forecasting models with advanced machine learning techniques to enhance predictive accuracy and reliability. Table 1 provides a concise summary of the various forecasting models discussed in the literature review. It outlines the models employed, input parameters, geographical location of the studies, and significant conclusions.

3. Materials and Methods

3.1. Site Assessment and Dataset

The solar farm is situated in southwestern France with a total capacity of 41.2 MWp of ground-mounted installation. The project comprises four PV plants; however, our study focuses exclusively on one specific part, namely Project A, with a capacity of 11.1 MW, as illustrated in Figure 2. The project has a prolonged wet season that lasts for 8.5 months, starting in September and ending in June. During this period, there is a minimum 25% chance of rainfall occurring every day. Simultaneously, this period is marked by substantial cloud cover. Precise prediction of solar irradiance is crucial for maximizing the incorporation of solar energy into the power grid and reducing uncertainty related to solar power generation.
The study employs a dataset consisting of solar power generation data collected from 1 January 2020 to 25 October 2023. The dataset has 33,456 observations that were recorded at hourly intervals. The data comprises the following variables:
  • Date_Time: The exact date and time of each observation, formatted in “DD/MM/YYYY HH:MM”;
  • T01-t Solar Irradiance [W/m2]: Solar irradiance of Transformer 1;
  • T01-t Module Temperature [°C]: Temperature of the solar module of Transformer 1;
  • T03-Ta Ambient Temperature [°C]: Ambient temperature of Transformer 3;
  • T03-Wind Speed [m/s]: Wind speed of Transformer 3.

3.2. Data Preprocessing

The preprocessing phase is crucial for ensuring that the dataset is properly prepared for further analysis and modeling. Preprocessing involves making efforts to improve the quality and integrity of the data, as well as configuring the dataset to better align with the requirements of the forecasting models. Here, this section will present a comprehensive description of each step included in the preprocessing procedure.
Data Cleaning:
  • Handling Missing Values: Due to the continuous nature of solar irradiance and related environmental data, missing values can significantly impact the performance of predictive models. We employed an interpolation method to replace the missing data points.
Feature Engineering:
  • Lagged Features: Historical data of the variables can provide context for current values, helping to capture trends and inertia in the data. We created lagged features for up to 24 h prior for the variables.
  • Integration of Exogenous Variables: In the NARX framework, the exogenous variables (ambient temperature, surface temperature, and wind speed) are treated as external inputs that provide additional context to the model. These are integrated into the input feature set alongside the historical values of the target variable (solar irradiance).
Feature Scaling:
  • Normalization: This was employed to standardize the values of distinct features, which had a broad range, in order to make them similar. This process improved the stability and performance of the machine learning models. Min-max scaling was used to rescale all features so that they fall within the range of 0 to 1.

3.3. Proposed Methodology

This section provides a comprehensive explanation of the method that will be employed to carry out the investigation. Forecasting solar irradiance using a Nonlinear Autoregressive Network with Exogenous Inputs (NARX) combined with a Long Short-Term Memory (LSTM) neural network, which is a type of recurrent neural network (RNN), involves several steps. This approach utilizes the memory capacity of LSTM networks to effectively manage complicated time-series data, where previous values (autoregression) and additional variables (exogenous inputs) impact the target predictions. The following steps give a comprehensive outline of the methods employed in this study, where Figure 3 illustrates the overall flowchart of the study.
  • Feature Selection: For the development of a robust hybrid NARX-LSTM model aimed at forecasting solar irradiance, it is critical to carefully select and prepare input features that capture the underlying dynamics affecting solar power generation. In this context, features such as wind speed and ambient and surface temperatures are selected using the Pearson correlation coefficient due to their direct influence on solar irradiance levels.
  • Data Segmentation: After the feature selection, the preprocessed dataset was divided into three subsets: 80% for training, 10% for validation, and 10% for testing. The training set is used to enable the model to acquire knowledge from a significant historical context. The validation set is employed for model parameter fine-tuning and overfitting prevention. Finally, the testing set assesses the model’s performance on unseen data. Table 2 presents the division of data across different periods and the number of samples in each dataset.
    Table 2. Training, validation, and test sets.
    Table 2. Training, validation, and test sets.
    DatasetsTime PeriodsNumber of Samples
    Training set1 January 2020–20 January 202326765
    Validation set20 January 2023–06 August 20233346
    Testing set6 August 2023–25 October 20233345
  • Model Architecture: The architecture of the hybrid NARX-LSTM model is shown in Figure 4.
    Figure 4. Hybrid NARX-LSTM proposed model structure.
    Figure 4. Hybrid NARX-LSTM proposed model structure.
    Energies 17 03965 g004
  • Hyperparameter Optimization: Hyperparameter optimization for LSTM networks is an essential technique focused on adjusting the parameters that control the model’s learning process and structure. Our study utilizes the Optuna package to do hyperparameter optimization when building an LSTM-based neural network with Keras. The crucial hyperparameters that were optimized were the number of LSTM layers, the number of units per layer, the dropout rate, and the learning rate. Optuna’s efficient search algorithms are used to change these parameters. The algorithms recommend values for the number of units and dropout rates within specified ranges, and they also utilize a logarithmic scale for the learning rate with an ADAM optimizer to efficiently explore a wider range of values. The model architecture is dynamically constructed based on these parameters, incorporating dropouts after each LSTM layer to prevent overfitting. During model training, early stopping is used to end the training process if the validation loss does not improve. This helps increase training efficiency and prevents overfitting. This approach is particularly effective for managing the complexities of time-series predictions where the tuning of network architecture and learning parameters can greatly influence model accuracy and generalization. The optimized hyperparameters of this study are displayed in Table 3.
    Table 3. Best hyperparameters for LSTM model.
    Table 3. Best hyperparameters for LSTM model.
    HyperparameterValueSearch Interval
    LSTM layers21–5
    Hidden units19212–200
    Learning rate0.00080.0001–0.1
    Number of epochs350100–1500
    Dropout rate0.35590.2–0.6
  • Model Evaluation: The evaluation of the hybrid NARX-LSTM model for forecasting solar irradiance is conducted using MAE, RMSE, and R2 performance metrics to ensure a comprehensive assessment of its predictive accuracy and reliability.
The data preprocessing, analysis, and model creation tasks were performed using Python, utilizing libraries such as Pandas for data manipulation, Sklearn for machine learning methods, Keras for deep learning models, and Matplotlib for data visualization.

3.4. Statistical Performance Metrics

In order to assess the precision and predictive capability of the solar irradiance forecasting of the proposed model, three primary performance metrics were employed: Coefficient of Determination (R2), root mean square error (RMSE), and mean absolute error (MAE). These metrics offer a thorough evaluation of the accuracy of predictions and the model’s ability to explain the data, enabling a strong assessment of the model’s performance [35,36].
The Coefficient of Determination, denoted as R2, measures the proportion of variance in the dependent variable that is predictable from the independent variables. It provides an indication of how well the model can predict unseen data by indicating the quality of fit. The R2 value ranges from 0 to 1, where 1 indicates perfect prediction. It is calculated using the formula:
R 2 = 1 ( y i y ^ ) 2 ( y i y ¯ ) 2
where y i is the actual value, y ^ is the predicted value, and y ¯ is the mean of the actual values.
RMSE is a frequently used measure of the differences between values predicted by a model and the values observed from the environment that is being modeled. It is particularly useful when large errors are particularly undesirable. The RMSE of an estimator with respect to an estimated variable is defined as the square root of the mean squared error:
R M S E = 1 N i = 1 N ( y i y ^ ) 2
Here:
  • N represents the number of observations.
MAE measures the average magnitude of the errors in a set of predictions without considering their direction. It is the average over the test sample of the absolute differences between prediction and actual observation, where all individual differences have equal weight:
M A E = 1 N i = 1 N y i y ^

4. Results and Discussion

4.1. Correlation Coefficient

After the preprocessing stage, the Pearson correlation coefficient is generated to visually represent the connections between the most important variables. The Pearson correlation coefficient is a statistical metric that precisely defines the magnitude and direction of a linear relationship between two variables. It ranges from −1 to +1, where +1 indicates a perfect positive linear correlation, −1 indicates a perfect negative linear correlation, and 0 indicates no linear correlation. A positive number indicates a direct relationship, where an increase in one variable will be followed by an increase in the other variable. Conversely, a negative value indicates an inverse relationship, where an increase in one variable is preceded by a reduction in the other variable [37].
Figure 5 presents a correlation heatmap that measures the relationships between solar irradiance and several environmental factors, including module temperature, ambient temperature, and wind speed. These relationships are crucial for understanding the dynamics of solar energy generation and are central to enhancing forecasting accuracy.
The following deductions can be derived from this heatmap analysis:
  • Solar Irradiance and Module Temperature: The correlation coefficient between solar irradiance and module temperature is high, indicating a strong positive relationship. As solar irradiance increases, module temperature typically rises, affecting the efficiency of PV cells. This relationship underscores the need for thermal management in solar PV systems, especially in regions with high solar irradiance.
  • Solar Irradiance and Ambient Temperature: There is also a significant positive correlation between solar irradiance and ambient temperature. This relationship is expected, as higher solar irradiance leads to increased temperatures. Ambient temperature can influence the performance of solar PV installations by affecting the physical properties of the materials involved in energy conversion. This phenomenon can be attributed to the overall rise in temperature that occurs during bright weather.
  • Solar Irradiance and Wind Speed: The correlation between solar irradiance and wind speed is moderately positive. While less intuitive, this relationship suggests that in certain geographical regions, higher solar irradiance days might also experience higher wind speeds, possibly due to local climatic conditions. Wind speed can play a role in cooling solar panels, potentially improving their performance on very sunny days.
This detailed analysis helps to highlight the complex relationship of these environmental variables with solar irradiance. By understanding these correlations, we can better predict the performance of solar energy systems under various environmental conditions. Furthermore, the insights gained from this analysis are crucial for designing more robust solar energy systems that are more robust and can effectively employ particular climatic circumstances.

4.2. Nonlinear Autoregressive with Exogenous Inputs (NARX) Prediction Model

The Nonlinear Autoregressive with Exogenous Inputs (NARX) model is a recurrent dynamic neural network specifically designed for the accurate prediction of time-series data. This model depicts the relationships between the previous values of a target variable and the previous values of external or exogenous inputs. NARX models are widely used in diverse domains, including signal processing, financial forecasting, weather prediction, and energy-related projections. The usefulness of the NARX model in forecasting and system identification comes from its inherent structure and functionality, enabling it to accurately capture complex nonlinear relationships present in time-series data [38,39]. The NARX methodology operates by utilizing a systematic analysis, as given below:
  • Nonlinear Autoregressive (NAR): This aspect of the model signifies that the future values of a series are predicted based on past values of the same series but with a nonlinear relationship. In traditional linear models, future values are a linear combination of past values; in NARX, these relationships are modeled using nonlinear functions, typically represented by a neural network.
  • Exogenous Inputs (X): These are external variables that influence the target variable but are not influenced by it within the model framework. For example, in the case of solar irradiance forecasting, exogenous inputs might include temperature, humidity, or cloud cover.
The NARX model is characterized by the following mathematical equation:
y t = f [ y t 1 ,   y t 2 ,   ,   y t d y ;   u t 1 ,   u t 2 ,   ,   u t d u ]
where y t is the target variable at time t, u t represents the exogenous inputs, d y and d u are the delays for the target and the exogenous inputs, respectively, and f represents a nonlinear function.
Moreover, the nonlinear function f is commonly represented by a neural network, although other nonlinear estimators such as polynomial models, radial basis functions, or SVM may be employed, depending on the system’s complexity and the available data. In the context of this study, neural networks are used to implement the nonlinear function f in NARX model. Hence, the nonlinear function f would be given as:
f = W o W i y t 1 ,     , y t n y ,   u t 1 ,   ,   u t n u + b i + b o
Here:
  • is the nonlinear activation function (tanh for this study).
  • W i and W o are the weight matrices for the input and output layers, respectively.
  • b i and b o are bias vectors for the input and output layers, respectively.
In this study, NARX is used as a benchmark model to compare performance metrics with the proposed model. After the training and the hyperparameter optimization, the performance metrics results of the benchmark model are as follows: an R2 of 0.983, an RMSE of 43.02, and an MAE of 37.18.
Additionally, Figure 6 illustrates the predicted solar irradiance for 7 October 2023, using the NARX model, which was specifically selected to evaluate its performance in predicting solar irradiance on rainy days.

4.3. Hybrid NARX-LSTM Prediction Model

In this study, the hybrid NARX-LSTM prediction model is proposed, which is a sophisticated hybrid neural network that combines the NARX model and LSTM networks to enhance time-series forecasting accuracy. The NARX model initially captures the nonlinear relationships between the target variable and its past values, along with exogenous inputs, effectively handling the dependencies influenced by historical data and external conditions. Afterwards, the LSTM component, known for its ability to effectively understand long-term dependencies, handles the outputs from the NARX model (see Appendix A). The utilization of a two-stage technique enables the NARX-LSTM model to exploit the initial predictions and enhance them by incorporating more complex temporal patterns that may be neglected by the NARX model alone. An integrated model is especially advantageous in complicated forecasting scenarios, such as solar irradiance prediction, where both short-term variations and long-term trends significantly affect the accuracy of predictions.
Figure 7, Figure 8 and Figure 9 illustrate the hourly, daily average, and monthly average of solar irradiance, respectively. Figure 7 presents the hourly observations of solar irradiance taken from 1 January 2020 to 25 October 2023. The vertical peaks correspond to the magnitude of solar irradiation, quantified in watts per square meter. Irradiance peaks are observed during daylight hours. The general pattern exhibits seasonal fluctuations, with greater levels of irradiance observed during the summer months. Furthermore, Figure 8 shows the daily mean solar irradiation throughout the same time period. Each vertical bar corresponds to the mean irradiance value for a specific day. The graph exhibits a distinct seasonal pattern, matching the hourly data, with elevated values in the summer and diminished values in the winter. The daily variation is also evident, affected by meteorological conditions and atmospheric transparency. Finally, Figure 9 displays the mean monthly solar irradiance. The y-axis quantifies solar irradiance in units of watts per square meter, while the x-axis corresponds to the months. The graph demonstrates a clear seasonal pattern, with irradiance reaching its highest point during the summer months and its lowest point during the winter, reflecting the previous figures. This pattern reflects the impact of the Earth’s axial tilt on sun exposure.
Additionally, the regression plot of predicted and actual solar irradiance has been constructed, as shown in Figure 10. This scatter plot depicts the correlation between forecasted and observed sun irradiance values. Each red dot corresponds to a pair of real and forecasted irradiance values, with the real values plotted on the x-axis and the forecasted values on the y-axis. The blue line indicates the optimal scenario in which the anticipated values precisely correspond to the actual values. The proximity of the red dots to the blue line demonstrates the precision of the forecasts. In general, the plot displays a robust linear correlation, indicating that the predictions are reasonably precise, although there are occasional variations, particularly at higher levels of irradiation.
Furthermore, the hybrid NARX-LSTM solar irradiance prediction model shown in Figure 11 demonstrated exceptional performance on 7 October 2023, specifically chosen to assess its effectiveness during wet days. Wet days are known to present difficulties in predicting solar irradiance due to the fluctuating cloud cover and precipitation, as mentioned in Section 3.1. The graph shows an excellent alignment between the predicted and actual solar irradiance values throughout the daylight hours, demonstrating the model’s high effectiveness under unfavorable weather conditions. This ability to closely track the actual irradiance during wet conditions underscores the robustness of the model. The precision with which the predicted curve follows the actual data during peak solar hours indicates that the model has successfully captured the complex dynamics influenced by intermittent cloud cover and varying sunlight availability. Such precision in adverse weather conditions is crucial for optimizing the operation and energy output of solar power systems, as it ensures reliable forecasting essential for grid management and energy planning in regions experiencing frequent wet weather. As a result, the hybrid model’s performance metrics include an MAE of 9.58 W/m2, an RMSE of 16.30 W/m2, and an R2 of 0.997.
Table 4 represents the comparison between the results obtained from this study and the prior studies on solar irradiance prediction models. The present study has compared its performance metrics with those of previous studies. The study in ref. [40] effectively shows that machine learning techniques can improve the accuracy of solar radiation predictions compared to standard physics-based models. The ELM method demonstrated superior performance in multiple scenarios, achieving an RMSE of 60.60 W/m2 and an R2 value of 0.96. Similarly, ref. [41] examines the effectiveness of machine learning models in predicting solar radiation in different climatic regions of Morocco. By employing models such as LSTM, SVM regression, and Multilayer Artificial Neural Networks (MLANNs), it emphasizes the LSTM model’s exceptional performance in terms of forecasting accuracy, as evidenced by its RMSE of 41.05 W/m2, MAE of 21.99 W/m2, and R2 of 0.98. Moreover, a novel multivariate hybrid WPD-CNN-LSTM-MLP has been proposed by ref. [42] that is designed for forecasting solar irradiance. This model combines wavelet packet decomposition (WPD), CNN, LSTM networks, and multi-layer perceptron (MLP) to improve prediction accuracy. It achieves this by analyzing multiple variables, such as hourly solar irradiance, temperature, relative humidity, and wind speed. The findings demonstrated a significant enhancement in forecasting accuracy using the WPD-CNN-LSTM-MLP model. This model outperformed traditional models in predicting hourly irradiance, exhibiting lower error rates and higher precision. The performance metrics, namely RMSE and MAE, were measured at 32.1006 W/m2 and 14.3523 W/m2, respectively. Additionally, the R2 value was determined to be 0.994. Focusing on the needs of distributed photovoltaic generation systems, Ref. [43] employs LSTM to model hourly day-ahead solar irradiance using readily available weather forecasting data. The LSTM model exhibited exceptional performance, outperforming conventional methods such as persistence algorithms, linear regression, and simple neural networks, achieving a good RMSE of 76.245 W/m2. Finally, ref. [44] developed a novel forecasting model that uses commonly available weather data instead of historical solar irradiance data to predict solar power output for microgrids. Utilizing a deep LSTM-RNN, the model was compared against a traditional FFNN across six experiments involving weather stations located in Germany, USA, Switzerland, and South Korea. The findings demonstrated that the LSTM-RNN model had superior performance compared to the FFNN, attaining an RMSE of 60.31 W/m2 and an MAE of 36.90 W/m2.
The proposed hybrid NARX-LSTM model demonstrated superior performance compared to previous models, as seen by its lower RMSE of 16.30 W/m2, MAE of 9.58 W/m2, and high R2 of 0.997. This proposed model shows a notable enhancement compared to other sophisticated models such as WPD–CNN–LSTM-MLP, which previously held the best reported performance among the prior studies, with an RMSE and MAE of 32.1006 W/m2 and 14.3523 W/m2, respectively, and an R2 of 0.994. The enhanced accuracy of the hybrid NARX-LSTM model not only showcases the effectiveness of combining NARX and LSTM architectures but also highlights its promise for practical solar irradiance prediction in uncertain environmental situations. Despite its complexity, the obvious improvements in prediction accuracy justify the model’s employment, suggesting that its integration could enhance operational efficiency in renewable energy management.

5. Conclusions and Future Perspectives

The research findings validate the effectiveness of the hybrid NARX-LSTM model in significantly enhancing the precision of daily solar irradiance forecasts for southwestern France. The hybrid NARX-LSTM model demonstrated superior performance, evidenced by an MAE of 9.58 W/m2, an RMSE of 16.30 W/m2, and an R2 of 0.997. These metrics demonstrate a significant enhancement compared to conventional forecasting (benchmark model) and validate the hybrid model’s capacity to successfully manage the complexity of solar irradiance data. The effective implementation of this model offers useful insights for optimizing and planning the operations of solar power plants, leading to potential cost reductions and enhanced energy reliability. This study enhances the field of renewable energy forecasting by showcasing the potential of using advanced neural network designs to improve the accuracy of predictions in areas with unpredictable climate variables. Future studies could investigate the possibility of applying this model to additional geographical areas or alternative tasks related to predicting renewable energy, thereby confirming its usefulness and flexibility in various environmental and operational contexts.

Author Contributions

Conceptualization, O.I.O.; methodology, O.I.O.; software, O.I.O. and S.G.; validation, S.S., S.G. and M.D.; formal analysis, O.I.O.; investigation, O.I.O.; resources, S.G. and M.D.; data curation, O.I.O.; writing—original draft preparation, O.I.O.; writing—review and editing, S.S., S.G. and M.D.; visualization, O.I.O.; supervision, S.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are available on request.

Acknowledgments

We would like to express our gratitude to BayWa r.e. company for generously providing the real data that were utilized in this study. In addition, we would like to express our deepest appreciation to all the employees involved in the Project in southwestern France.

Conflicts of Interest

Author Martin Dennenmoser was employed by the company BayWa r.e. Solar Projects GmbH. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Nomenclature

AdaBoostAdaptive boosting
ANNArtificial neural network
ARIMAAutoregressive integrated moving average
ARMAAutoregressive and moving average
BPNNBack-propagation algorithm-based neural network
DTDecision tree
ELMExtreme learning machine
FFNNFeed-forward neural network
GBMGradient boosting model
GBRTGradient boosting regression tree
GPRGaussian Process Regression
k-NNk-nearest neighbor
LassoLeast absolute shrinkage and selection operator
LRLinear regression
LogRLogistic regression
LSTMLong Short-Term Memory
MAEMean absolute error
MAPEMean absolute percentage error
MLRMultiple linear regression
MSEMean square error
MVTSMultivariate time series
NARXNonlinear Autoregressive Network with Exogenous Inputs
NWPNumerical weather prediction
PCRPrincipal component regression
PrPearson correlation
PVPhotovoltaic
QRAQuantile regression averaging
RBFNNRadial basis function neural network
RFRandom forest
RFERecursive feature elimination
RMSERoot mean square error
RNNRecurrent neural network
RRMSERelative root mean square error
SVMSupport vector machine
SVRSupport vector regression
XGBoostExtreme gradient boosting

Appendix A

An LSTM model is a type of RNN that is specifically developed to overcome the constraints of conventional RNNs in effectively dealing with long-term dependencies. Traditional RNNs face difficulties in capturing long-term temporal dependencies in sequential data. This is mostly caused by issues such as diminishing gradients, where the gradients decrease as they are back-propagated through each time step of the network during training. Consequently, it becomes difficult to establish relationships between occurrences that are separated by significant temporal intervals.
LSTM models address this issue through a unique architecture that incorporates memory cells and many gates that control the information flow. LSTMs possess the capability of preserving knowledge for longer durations, making them highly suitable for applications that include complex sequential data, such as speech recognition, language translation, and time-series forecasting. Below is an analysis of the fundamental characteristics and elements of Long Short-Term Memory (LSTM) models:
  • Memory cell: The central component of the LSTM unit, responsible for retaining its state over a period of time. The cell state functions as a mechanism that transfers pertinent information down the sequence chain without modifying it.
  • Gates: LSTMs have three types of gates, namely the forget gate, input gate, and output gate, which are responsible for regulating the memory cell. The forget gate decides which information is to be eliminated from the cell state, while the input gate determines the specific information that is incorporated into the cell state. It also includes creating a vector of new candidate values that could be added to the state. Finally, the output gate defines what the next hidden state should be.
The operations within an LSTM cell at time step t can be described by the following equations:
f t = σ ( W f . h t 1 , x t + b f )
i t = σ ( W i + h t 1 , x t + b i )
o t = σ ( W o + h t 1 , x t + b o )
C t ~ = t a n h ( W c + h t 1 , x t + b c )
C t = f t C t 1 + i t C t ~
where x t is the input vector at time step t, h t 1 is the output vector from the previous time step, W f , W i , W o , and W c are the weight coefficient matrices of each corresponding gate, b f , b i , b o , and b c are the bias vector parameters for different gates, C t 1 is the cell state vector from the previous time step, C t ~ is the candidate for the cell state at time step t, and C t is the newly modified cell at time step t.
These equations enable the LSTM to retain a long-term memory and to selectively forget and modify information. Figure A1 displays the structure of the LSTM model.
Figure A1. LSTM structure model.
Figure A1. LSTM structure model.
Energies 17 03965 g0a1

References

  1. Sorrell, S. Reducing energy demand: A review of issues, challenges and approaches. Renew. Sustain. Energy Rev. 2015, 47, 74–82. [Google Scholar] [CrossRef]
  2. Chakraborty, D.; Mondal, J.; Barua, H.B.; Bhattacharjee, A. Computational solar energy—Ensemble learning methods for prediction of solar power generation based on meteorological parameters in Eastern India. Renew. Energy Focus 2023, 44, 277–294. [Google Scholar] [CrossRef]
  3. Jia, L.; Cheng, P.; Yu, Y.; Chen, S.-H.; Wang, C.-X.; He, L.; Nie, H.-T.; Wang, J.-C.; Zhang, J.-C.; Fan, B.-G.; et al. Regeneration mechanism of a novel high-performance biochar mercury adsorbent directionally modified by multimetal multilayer loading. J. Environ. Manag. 2023, 326, 116790. [Google Scholar] [CrossRef]
  4. Rana, V.; Tyagi, A.; Kumar, K.; Grover, H. History and Application of Solar PV System. In Artificial Intelligence for Solar Photovoltaic Systems: Approaches, Methodologies, and Technologies; CRC Press: Boca Raton, FL, USA, 2022; pp. 1–22. [Google Scholar] [CrossRef]
  5. Pourasl, H.H.; Barenji, R.V.; Khojastehnezhad, V.M. Solar energy status in the world: A comprehensive review. Energy Rep. 2023, 10, 3474–3493. [Google Scholar] [CrossRef]
  6. Okieh, O.I.; Mohamed, Y.N.; Seker, S. Application of 30 MWp grid-connected solar photovoltaic power plant for Djibouti Vision 2035. Int. J. Renew. Energy Res. 2023, 13, 136–143. [Google Scholar] [CrossRef]
  7. Mutavhatsindi, T.; Sigauke, C.; Mbuvha, R. Forecasting Hourly Global Horizontal Solar Irradiance in South Africa Using Machine Learning Models. IEEE Access 2020, 8, 198872–198885. [Google Scholar] [CrossRef]
  8. Dhaked, D.K.; Dadhich, S.; Birla, D. Power output forecasting of solar photovoltaic plant using LSTM. Green Energy Intell. Transp. 2023, 2, 100113. [Google Scholar] [CrossRef]
  9. Okieh, O.I.; Seker, S.; Akinci, T.C.; Idriss, A.I. Optimization of Neuro-controller Application for Maximum Power Point Tracking Photovoltaic Systems Through Shannon’s Information Criteria. Electr. Power Components Syst. 2024, 1–12. [Google Scholar] [CrossRef]
  10. Franco, M.A.; Groesser, S.N. A Systematic Literature Review of the Solar Photovoltaic Value Chain for a Circular Economy. Sustainability 2021, 13, 9615. [Google Scholar] [CrossRef]
  11. International Renewable Energy Agency. Renewable Energy Statistics 2023 Statistiques D’énergie Renouvelable 2023 Estadísticas de Energía Renovable 2023 about IRENA. 2023. Available online: www.irena.org (accessed on 27 October 2023).
  12. Sharadga, H.; Hajimirza, S.; Balog, R.S. Time series forecasting of solar power generation for large-scale photovoltaic plants. Renew. Energy 2020, 150, 797–807. [Google Scholar] [CrossRef]
  13. Brahma, B.; Wadhvani, R. Solar Irradiance Forecasting Based on Deep Learning Methodologies and Multi-Site Data. Symmetry 2020, 12, 1830. [Google Scholar] [CrossRef]
  14. Qazi, A.; Fayaz, H.; Wadi, A.; Raj, R.G.; Rahim, N.; Khan, W.A. The artificial neural network for solar radiation prediction and designing solar systems: A systematic literature review. J. Clean. Prod. 2015, 104, 1–12. [Google Scholar] [CrossRef]
  15. Ssekulima, E.B.; Anwar, M.B.; Al Hinai, A.; El Moursi, M.S. Wind speed and solar irradiance forecasting techniques for enhanced renewable energy integration with the grid: A review. IET Renew. Power Gener. 2016, 10, 885–989. [Google Scholar] [CrossRef]
  16. Ziane, A.; Necaibia, A.; Sahouane, N.; Dabou, R.; Mostefaoui, M.; Bouraiou, A.; Khelifi, S.; Rouabhia, A.; Blal, M. Photovoltaic output power performance assessment and forecasting: Impact of meteorological variables. Sol. Energy 2021, 220, 745–757. [Google Scholar] [CrossRef]
  17. Zhang, Y.; Cui, N.; Feng, Y.; Gong, D.; Hu, X. Comparison of BP, PSO-BP and statistical models for predicting daily global solar radiation in arid Northwest China. Comput. Electron. Agric. 2019, 164, 104905. [Google Scholar] [CrossRef]
  18. Zhang, G.; Yang, D.; Galanis, G.; Androulakis, E. Solar forecasting with hourly updated numerical weather prediction. Renew. Sustain. Energy Rev. 2022, 154, 111768. [Google Scholar] [CrossRef]
  19. Belaid, S.; Mellit, A. Prediction of daily and mean monthly global solar radiation using support vector machine in an arid climate. Energy Convers. Manag. 2016, 118, 105–118. [Google Scholar] [CrossRef]
  20. Yadav, A.K.; Chandel, S. Solar radiation prediction using Artificial Neural Network techniques: A review. Renew. Sustain. Energy Rev. 2014, 33, 772–781. [Google Scholar] [CrossRef]
  21. Villegas-Mier, C.G.; Rodriguez-Resendiz, J.; Álvarez-Alvarado, J.M.; Jiménez-Hernández, H.; Odry, Á. Optimized Random Forest for Solar Radiation Prediction Using Sunshine Hours. Micromachines 2022, 13, 1406. [Google Scholar] [CrossRef]
  22. Almaghrabi, S.; Rana, M.; Hamilton, M.; Rahaman, M.S. Solar power time series forecasting utilising wavelet coefficients. Neurocomputing 2022, 508, 182–207. [Google Scholar] [CrossRef]
  23. Iheanetu, K.J. Solar Photovoltaic Power Forecasting: A Review. Sustainability 2022, 14, 17005. [Google Scholar] [CrossRef]
  24. Akhter, M.N.; Mekhilef, S.; Mokhlis, H.; Shah, N.M. Review on forecasting of photovoltaic power generation based on machine learning and metaheuristic techniques. IET Renew. Power Gener. 2019, 13, 1009–1023. [Google Scholar] [CrossRef]
  25. Sansa, I.; Boussaada, Z.; Bellaaj, N.M. Solar Radiation Prediction Using a Novel Hybrid Model of ARMA and NARX. Energies 2021, 14, 6920. [Google Scholar] [CrossRef]
  26. Narayanan, S.; Kumar, R.; Ramadass, S.; Ramasamy, J. Hybrid Forecasting Model Integrating RNN-LSTM for Renewable Energy Production. Electr. Power Components Syst. 2024, 52, 1–19. [Google Scholar] [CrossRef]
  27. Ghimire, S.; Deo, R.C.; Wang, H.; Al-Musaylh, M.S.; Casillas-Pérez, D.; Salcedo-Sanz, S. Stacked LSTM Sequence-to-Sequence Autoencoder with Feature Selection for Daily Solar Radiation Prediction: A Review and New Modeling Results. Energies 2022, 15, 1061. [Google Scholar] [CrossRef]
  28. Rangel, E.; Cadenas, E.; Campos-Amezcua, R.; Tena, J.L. Enhanced Prediction of Solar Radiation Using NARX Models with Corrected Input Vectors. Energies 2020, 13, 2576. [Google Scholar] [CrossRef]
  29. Hissou, H.; Benkirane, S.; Guezzaz, A.; Azrour, M.; Beni-Hssane, A. A Novel Machine Learning Approach for Solar Radiation Estimation. Sustainability 2023, 15, 10609. [Google Scholar] [CrossRef]
  30. Natava, C.J.C.; Palmares, X.L.P.S.; Doma, B.J.T. Forecasting of daily global solar radiation in Dumaguete, Philippines using NARX-LSTM Hybrid Network. In ACM International Conference Proceeding Series; Association for Computing Machinery: New York, NY, USA, 2023; pp. 66–72. [Google Scholar] [CrossRef]
  31. Ayodele, T.; Ogunjuyigbe, A.; Amedu, A.; Munda, J. Prediction of global solar irradiation using hybridized k-means and support vector regression algorithms. Renew. Energy Focus 2019, 29, 78–93. [Google Scholar] [CrossRef]
  32. Huang, L.; Kang, J.; Wan, M.; Fang, L.; Zhang, C.; Zeng, Z. Solar Radiation Prediction Using Different Machine Learning Algorithms and Implications for Extreme Climate Events. Front. Earth Sci. 2021, 9, 596860. [Google Scholar] [CrossRef]
  33. Chodakowska, E.; Nazarko, J.; Nazarko, Ł.; Rabayah, H.S.; Abendeh, R.M.; Alawneh, R. ARIMA Models in Solar Radiation Forecasting in Different Geographic Locations. Energies 2023, 16, 5029. [Google Scholar] [CrossRef]
  34. Woldegiyorgis, T.A.; Benti, N.E.; Chaka, M.D.; Semie, A.G.; Jemberie, A.A. Estimating solar radiation using artificial neural networks: A case study of Fiche, Oromia, Ethiopia. Cogent Eng. 2023, 10, 2220489. [Google Scholar] [CrossRef]
  35. Huang, J.; Rikus, L.J.; Qin, Y.; Katzfey, J. Assessing model performance of daily solar irradiance forecasts over Australia. Sol. Energy 2018, 176, 615–626. [Google Scholar] [CrossRef]
  36. Ağbulut, Ü.; Gürel, A.E.; Biçen, Y. Prediction of daily global solar radiation using different machine learning algorithms: Evaluation and comparison. Renew. Sustain. Energy Rev. 2021, 135, 110114. [Google Scholar] [CrossRef]
  37. Wang, X.; Xu, N.; Meng, X.; Chang, H. Prediction of Gas Concentration Based on LSTM-LightGBM Variable Weight Combination Model. Energies 2022, 15, 827. [Google Scholar] [CrossRef]
  38. Narvaez, G.; Giraldo, L.F.; Bressan, M.; Pantoja, A. Machine learning for site-adaptation and solar radiation forecasting. Renew. Energy 2021, 167, 333–342. [Google Scholar] [CrossRef]
  39. Ezzeldin, R.; Hatata, A. Application of NARX neural network model for discharge prediction through lateral orifices. Alex. Eng. J. 2018, 57, 2991–2998. [Google Scholar] [CrossRef]
  40. Cornejo-Bueno, L.; Casanova-Mateo, C.; Sanz-Justo, J.; Salcedo-Sanz, S. Machine learning regressors for solar radiation estimation from satellite data. Sol. Energy 2019, 183, 768–775. [Google Scholar] [CrossRef]
  41. Mendyl, A.; Demir, V.; Omar, N.; Orhan, O.; Weidinger, T. Enhancing Solar Radiation Forecasting in Diverse Moroccan Climate Zones: A Comparative Study of Machine Learning Models with Sugeno Integral Aggregation. Atmosphere 2024, 15, 103. [Google Scholar] [CrossRef]
  42. Huang, X.; Li, Q.; Tai, Y.; Chen, Z.; Zhang, J.; Shi, J.; Gao, B.; Liu, W. Hybrid deep neural model for hourly solar irradiance forecasting. Renew. Energy 2021, 171, 1041–1060. [Google Scholar] [CrossRef]
  43. Qing, X.; Niu, Y. Hourly day-ahead solar irradiance prediction using weather forecasts by LSTM. Energy 2018, 148, 461–468. [Google Scholar] [CrossRef]
  44. Husein, M.; Chung, I.-Y. Day-Ahead Solar Irradiance Forecasting for Microgrids Using a Long Short-Term Memory Recurrent Neural Network: A Deep Learning Approach. Energies 2019, 12, 1856. [Google Scholar] [CrossRef]
Figure 1. Cumulative solar energy capacity installed in France and in the world during the last ten years [11].
Figure 1. Cumulative solar energy capacity installed in France and in the world during the last ten years [11].
Energies 17 03965 g001
Figure 2. Satellite image of the solar power plant with location marker for Project A.
Figure 2. Satellite image of the solar power plant with location marker for Project A.
Energies 17 03965 g002
Figure 3. Proposed methodology flowchart.
Figure 3. Proposed methodology flowchart.
Energies 17 03965 g003
Figure 5. Correlation heatmap analysis.
Figure 5. Correlation heatmap analysis.
Energies 17 03965 g005
Figure 6. Solar irradiance prediction of 7 October 2023 using NARX model.
Figure 6. Solar irradiance prediction of 7 October 2023 using NARX model.
Energies 17 03965 g006
Figure 7. Hourly solar irradiance.
Figure 7. Hourly solar irradiance.
Energies 17 03965 g007
Figure 8. Daily average solar irradiance.
Figure 8. Daily average solar irradiance.
Energies 17 03965 g008
Figure 9. Monthly average solar irradiance.
Figure 9. Monthly average solar irradiance.
Energies 17 03965 g009
Figure 10. Regression analysis of predicted vs. actual solar irradiance values using hybrid NARX-LSTM model.
Figure 10. Regression analysis of predicted vs. actual solar irradiance values using hybrid NARX-LSTM model.
Energies 17 03965 g010
Figure 11. Solar irradiance prediction of 7 October 2023 using hybrid NARX-LSTM model.
Figure 11. Solar irradiance prediction of 7 October 2023 using hybrid NARX-LSTM model.
Energies 17 03965 g011
Table 1. Previous studies on solar radiation forecasting.
Table 1. Previous studies on solar radiation forecasting.
Ref.YearInput ParametersForecasting ModelRegionForecasting Results
[31]2019Clearness index, evaporation, sunshine duration, maximum temperature, sunshine hours, difference between max and min temperature, average of max and min temperatureHybridized k-means and SVRIbadan, NigeriaFor 2016:
R2 = 0.9816
RMSE = 0.4275
RRMSE = 2.6515%
MAPE = 1.7928%
For 2017:
R2 = 0.9842
RMSE = 0.434
RRMSE = 2.7498%
MAPE = 1.795%
[28]2020Solar radiation, wind speed, relative humidity, pressure, temperaturePersistence model, NARXThree cities in Mexico%Forecast skill:
11.5% in Chihuahua
5.7% in Temixco
27.2% in Zacatecas
[7]2020wind speed, wind direction standard deviation, rainfall, relative humidity, barometric
pressure, temperature,
hour, month, a non-linear trend, a lagged hourly solar irradiance at lags 1 and 2.
LSTM, SVR, FFNN, PCR, Convex model, QRA model RMSE = 88.6, MAE = 52.034
[25]2021Historical solar radiation, temperatureARMA, NARX, Hybrid model of ARMA and NARXSpainMSE = 0.0340
MAE = 0.1270
RMSE = 0.1863
[32]2021sunshine duration,
mean wind speed, mean ground temperature, mean relative
humidity, minimum temperature, maximum temperature,
mean temperature, mean precipitation, mean pressure,
visibility
XGBoost, ELM, DT, RF, GBRT, BPNN, AdaBoost, KNN, SVR, GPR, MLR, RBFNNGanzhou, ChinaR2 = 0.944
RMSE = 1.131 MJ/m2
MAE = 0.870 MJ/m2
[21]2022Solar radiation, humidity, wind direction, wind speed, temperature, barometric pressure, sunshine hoursLR, RNN, SVR, RF, AdaBoost, Optimization RF, Optimization AdaBoostJuriquilla, Queretaro,
Mexico
MSE = 5011.19
RMSE = 70.78
MAE = 33.44
R2 = 0.95
MAPE = 1.596019 × 1013
Accuracy % = 95.98
[30]2023Rainfall, wind direction, wind speed, relative humidity, temperature, sunshine duration, predicted global solar radiationOptimized GPR, NARX, NARX-LSTMDumaguete,
Philippines
RMSE = 0.10
[33]2023Historical solar
irradiance
ARIMA Amman,
Jordan
Warsaw,
Poland
R2 = 99.9%
RMSE = 13.53
MSE = 183.18
MAPE = 2.79%
Std. Error = 14.14
[29]2023Maximum and minimum temperature, wind speed, relative humidity, precipitationRF, DT, LR, Pr, GBM-RMSE = 0.003
R2 = 0.002
[34]2023Maximum and minimum temperature, relative humidity, surface pressure, wind speed, latitude, longitude,
elevation
FFBP, LayR, CFBP, EBPFiche town,
Oromia, Ethiopia
MSE = 0.0079
RMSE = 0.0889
MAPE = 1.2075
Table 4. Comparison of the proposed model with prior studies.
Table 4. Comparison of the proposed model with prior studies.
ReferenceModelsOptimal ModelPerformance Metrics
RMSE (W/m2)MAE (W/m2)R2
[40]SVR, GPR,
MLP, ELM
ELM60.60-0.96
[41]LSTM,
SVM, MLANN
LSTM41.0521.990.98
[42]WPD–CNN–LSTM-MLP,
BPNN, SVM, RNN, LSTM, Clim-Pers
WPD–CNN–LSTM-MLP32.100614.35230.994
[43]Persistence, LSTM, LR, BPNNLSTM76.245--
[38]NN,
LR, AdaBoost, RF
RF66330.96
[44]LSTM-RNN, FFNN, NARXLSTM-RNN60.3136.90-
Present StudyNARX,
Hybrid
NARX-LSTM
Hybrid NARX-LSTM16.309.580.997
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Isman Okieh, O.; Seker, S.; Gokce, S.; Dennenmoser, M. An Enhanced Forecasting Method of Daily Solar Irradiance in Southwestern France: A Hybrid Nonlinear Autoregressive with Exogenous Inputs with Long Short-Term Memory Approach. Energies 2024, 17, 3965. https://doi.org/10.3390/en17163965

AMA Style

Isman Okieh O, Seker S, Gokce S, Dennenmoser M. An Enhanced Forecasting Method of Daily Solar Irradiance in Southwestern France: A Hybrid Nonlinear Autoregressive with Exogenous Inputs with Long Short-Term Memory Approach. Energies. 2024; 17(16):3965. https://doi.org/10.3390/en17163965

Chicago/Turabian Style

Isman Okieh, Oubah, Serhat Seker, Seckin Gokce, and Martin Dennenmoser. 2024. "An Enhanced Forecasting Method of Daily Solar Irradiance in Southwestern France: A Hybrid Nonlinear Autoregressive with Exogenous Inputs with Long Short-Term Memory Approach" Energies 17, no. 16: 3965. https://doi.org/10.3390/en17163965

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop