1. Introduction
Solar power plants are a promising solution to reduce greenhouse gas emissions into the atmosphere. Some studies have shown that the influence of cloudiness, ambient temperature, and humidity are closely related to the accurate prediction of solar irradiance variability, which directly affects power generation (e.g., Refs. [
1,
2,
3,
4]). Concentrated solar power (CSP) uses direct solar irradiation to heat fluid until it vaporizes, then introduces the vapor into an expander to generate mechanical work, which in turn drives an electrical generator; photovoltaic (PV) technologies harness global solar irradiance and convert it into electrical energy from semiconductor materials. In both cases, the prediction of solar irradiance is crucial for the operability of power grids, allowing for better planning and regulation of variable power generation [
5]. In the residential, commercial, agricultural, and industrial sectors, it is expected that solar-activated power generation systems (SGESs) will gradually become accessible and affordable technologies for any user; they will continue their large-scale expansion in various climatic regions, and will see sustained research and development in the coming years. Therefore, the prediction of solar power generation will become increasingly necessary in the short range [
6,
7].
However, one of the greatest vulnerabilities of SGESs lies in the uncertainty of the intermittency and natural variability the resource can generate, especially during mesoscale meteorological events and extreme events; in the United States alone, it has been reported that between 2000 and 2021, approximately 83% of interruptions in the electric grid were attributed to extreme meteorological events, which annually affect the safety and public health of citizens, in addition to causing damage to the electric infrastructure. Severe weather events such as high winds and storms (58%), winter weather events (22%), and tropical cyclones (15%) were among the top causes of power outages. Texas, Michigan, and California stood out as the states with the most weather-related power outages [
8].
For example, in 2010, a cold snap in France caused a spike in natural gas demand, with approximately every degree Celsius drop in temperature resulting in an increase of 100 GWh in daily natural gas consumption. In January 2017, Spain experienced a three-day period with a minimum temperature of 1.8 °C, which, combined with winter weather conditions, market restrictions, and the need to supply electricity to France due to the failure of its nuclear power plants, drove prices up to EUR 112.8 per MWh, the highest ever recorded in the Spanish region [
9].
The need for adequate weather forecasting to increase the reliability of electricity generation in power plants supplied by renewable resources has become relevant in North America. In February 2021 in Texas, USA, the slow movement of an intense cold front associated with an Arctic air mass caused several events that interrupted the electricity supply to almost 10 million people for several days. During the influence of the weather system, the generation of the main energy sources in Texas (natural gas, wind, coal, nuclear, and solar) decreased, which, in a cascading effect resulting from the extreme winter conditions, led to a significant deficit in the generation of natural gas, causing an over-demand of the available energy; as well as the death of 111 people due to the impact of the winter system [
10].
In the western United States, southern California, Arizona, and part of the desert area of northwestern Mexico between Baja California and Sonora, there is a region of extreme desert climate that is of particular interest for study due to its interaction with extreme weather events in winter; according to the Climate and Weather Disaster Report for California and Arizona from 1980 to 2024, severe storms, freezes, and floods during the winter season can cause significant stress on energy demand in the region [
11].
Over the southwestern United States and northwestern Mexico, winter frontal systems occur between September and May, with an average of 55 events per season, according to historical weather reports from the National Meteorological Service of Mexico (SMN) over the past 10 years. Considering the historical data on the occurrence of winter frontal systems based on the synoptic analysis of the Weather Prediction Center [
12] archives, approximately 40% of the total frontal systems each season enter the western and southwestern United States, affecting Southern California and Baja California.
Weather forecasting considers two important time scales to differentiate the variability that affects the prediction of the solar resource: (1) Medium-range forecasts, which are mainly influenced by long-term weather patterns in the atmosphere that may include periods longer than 120 h or correspond to climatic oscillations that regulate the meteorological conditions of a region for long periods as annual cycles or longer [
13]. The second scale of interest, which is the focus of this analysis, involves a smaller temporal and spatial coverage and is known as (2) short-range forecasting. This scale includes variations related to meteorological phenomena lasting from a few minutes to about 72 h and with spatial coverage ranging from 0.01 km to 100 km. Some meteorological phenomena, such as storms, tornadoes, and cold fronts, produce variations in solar irradiance that develop within the spatial and temporal horizons that can be predicted by short-range weather forecast [
14], as shown in
Figure 1.
Climate models are a fundamental tool in the investigation of long-range atmospheric variations; they are capable of representing general patterns of temperature, humidity, and precipitation over a large region; however, their low resolution makes it difficult to analyze spatial scales smaller than 100 km. Among the forecasting methods capable of capturing local variability are statistical models (SMs) and numerical weather prediction (NWP) computational models. Statistical models require a robust climate and meteorological database of the site, from which meaningful statistical relationships are established for the prediction of atmospheric variables [
16]. NWP can be based on initial data available from different global models, satellite data, and reanalysis information. They also allow for the configuration of high-resolution domains for a specific region and offer the possibility of selecting different physical parameterizations of atmospheric processes for small-scale experimental purposes. Among the studies that have focused their interest on the short-range prediction of solar irradiance at sites with potential for solar energy harvesting, we can mention [
17], whose authors conducted an evaluation of the Weather Research and Forecasting (WRF) model in the region of Andalusia, Spain, using three-day periods throughout 2007 and 2008 to collect clear, partially covered and cloudy days in different seasons of the year. The results of the model showed that there was a greater number of successes in the prediction of solar irradiance during spring and summer, while fall and winter showed greater inaccuracies. Cohen et al. [
18] conducted a review of planetary boundary layer (PBL) parameterization schemes, such as Yonsei University (YSU), Quasi-Normal Scale Elimination (QNSE), Mellor–Yamada–Janjic (MYJ), Asymmetric Convective Model 2 (ACM2), and Medium-Range Forecast Model (MRF), by simulating a cold season in the southeastern United States (winter–spring), where the simulation of humidity and temperature meteorological variables under severe convective weather conditions was analyzed. The results showed the advantages and disadvantages of local and non-local parameterizations during the influence of mesoscale systems, where the implementation of the YSU parameterization in the PBL was particularly notable. In a similar effort to evaluate parameterization schemes, in 2016, the authors of [
4] conducted a study over a large region of Greece, characterized by a Mediterranean climate. This study involved the simulation and analysis of one year of horizontal solar irradiance measurements, focusing on the variation in shortwave parameterization schemes in the WRF model for clear day cases and “all cases”, which included different cloudiness conditions. The statistical results showed that clear days were satisfactorily predicted under the Dudhia parameterizations, but in the rest of the cases, different parameterizations stood out according to the seasons of the year. In 2019, Ref. [
5] evaluated different WRF model parameterizations for global solar irradiance prediction in a Mediterranean climate in the southeastern region of Turkey. The study simulated different periods of the year to characterize the model’s behavior in different seasons. Among its results, the study highlighted the favorable performance of the shortwave FLG and RRTMG parameterizations, and it retained YSU as its only variation in the PBL parameterizations.
In 2020, the authors of [
19] performed a short-range forecast in Australia with the WRF-Solar model, with an emphasis on the simulation of solar irradiance and meteorological variables for 24–48 h periods, which they called “high solar intermittency days”; these were selected over a year of on-site measurements based on the determination of the clearness index, which partially related the interaction with the influencing cloudiness. Among their most outstanding results, the authors pointed out that the prediction errors in wind speed and direction were closely related to the cloudiness circulation. These results gave rise to works such as the one carried out by the authors of [
20], in which, under the continuation of the search for improvements for short-range solar prediction and the influence of other meteorological variables related to the interaction with cloudiness, they selected several periods of temperate climate for the northeastern region in Germany, which were considered relevant to their variability on that time scale. They simulated periods in which the region was influenced by cold, warm, and occluded frontal systems. The RRTMG and Dudhia schemes were used, and it was found that the most favorable prediction results were obtained with RRTMG for almost all cases, except for occluded fronts.
Based on the background information, which shows that no studies have been conducted to predict solar irradiance in desert climates during winter seasons, this paper presents an evaluation of the WRF model for the prediction of the global horizontal irradiance (GHI) by simulating five cold fronts over the desert region of northwestern Mexico, recorded between 2017 and 2020. The events were selected for having registered abrupt changes in GHI, ambient temperature (Tamb), and relative humidity (Hrel), in addition to having a similar duration and having generated clouds at different times of the day.
Each experiment was run under four different configurations of solar radiation settings of the WRF model. The results obtained were compared with surface measurements from automatic weather stations, solar pyranometers, satellite data, and a reference to a persistence forecast. The Dudhia and RRTMG parameterizations of the WRF allow for a more accurate estimation of the behavior of the GHI variable during the passage of the cold fronts, which can be used to make a better short-term prediction of the electricity generation of the solar power plants in the event of short-range meteorological events, as well as allowing for the planning and definition of operational strategies in solar power plants.
3. Analysis and Discussion of Results
This paper presents the forecast results of the GHI in a WRF model for cold frontal systems in a desert region. In order to perform the simulation of the selected events, four different configurations of physical parameterizations were tested, opt0, opt1, opt2, and opt3, which were proposed for the prediction of short-term irradiance variability due to the influence of other meteorological variables and cloud circulation by mesoscale events, as mentioned previously in
Table 2. Given the extent of the statistical comparison performed for the five frontal events and the similarity of the results obtained for events CF01, CF02, CF03, and CF04, only the prediction errors of the most favorable case (CF01) and the least favorable case (CF05) are discussed and compared. Furthermore, the analysis of these two cases is of great interest in the evaluation of the WRF model due to the contrasting characteristics under which the frontal systems were classified: the first (CF01), as a “wet” event, is mainly associated with a polar maritime air mass, and the second (CF05) is mainly associated with a tropical maritime air mass and the influence of synoptic conditions particular to the transition season from autumn to winter in the study region, as mentioned above in
Table 1.
3.1. Variability of Cold Fronts in a Case Study
In order to analyze the correlation and variability of the forecast with respect to the observed data, the forecasted data series of frontal systems CF01 and CF05 were plotted by variables in scatter plots. They were differentiated into diurnal (10:00 am to 5:30 pm) and nocturnal (6:00 p.m. to 9:30 a.m.) periods as shown in
Figure 4 and
Figure 5. In
Figure 4, corresponding to CF01, it is possible to observe that the GHI showed a behavior with a marked linear trend and the greatest dispersion of points graphed was observed from values above 500 W m
−2 for most of the options, i.e., a large number of the predictions commit a similar estimation error. Within the GHI, it can be highlighted that some data points corresponding to opt1 are located further apart from the data cloud than the rest, mainly in overestimates in the range of 400–700 W m
−2. With a similar behavior, in the range of 300–700 W m
−2, the points corresponding to opt3 showed overestimates. The scatter plot for the GHI shows coherent results for the predicted values in the night period, ranging from 0 to 30 W m
−2. Tamb groups most of its values within a linear trend and shows a greater scatter of points at values above 12.5 °C. The lower temperature values, in the range of 7.5 to 11 °C, show a good correlation with respect to the measured values, and the night periods predominate. Temperatures between 11 and 20 °C show that the points are dispersed to the left of the data cloud as an underestimation of the day and night data of opt2, and to the right, in the same range as a diurnal, an overestimation of opt0 and opt3. In the Hrel variable, it is observed that the lowest dispersion of points is located between the values in the range of 20 and 40%, mainly in diurnal periods; while the rest of the values show a gradual increase in dispersion as they approach values greater than 60%. In the Hrel values, the daytime and nighttime periods are homogeneously dispersed in the 40–60% range, showing an overestimation, and this behavior is highlighted in the axis of the predicted values upon observing the dispersion of points in the 60–80% range for opt2; in the same range, but in the axis of the measured values, most of the options in daytime and nighttime periods are an underestimation.
Figure 5, corresponding to event CF05, shows that the GHI is the variable with the greatest dispersion in the point cloud and that the estimates show a greater dispersion within the range of 400–700 W m
−2. It is possible to distinguish its positive linear trend; however, the errors that the prediction makes by overestimating or underestimating values equal to or greater than 600 W m
−2 in most of the options stand out. With regard to the overestimates, opt2 and opt3 stand out, while for the underestimates, it is opt0. The Tamb is generally grouped with a clear positive trend and little dispersion of points in the data. The highest values in the range of 30–35 °C in diurnal periods, which show possible overestimates at values above 30 °C, stand out for their greater dispersion. The Hrel also shows a dispersion of data that tends to a linear correlation. The data cloud is more dispersed in the area of the minima and maxima values of the series, although underestimates predominate. However, although the differentiation between daytime and nighttime data is not clear, since they are homogeneously distributed in the data cloud, a normal pattern of behavior can be observed by locating a majority of nighttime points in the region of maximum values (greater than 50%), and of daytime periods in the region of the graph with the minimum values (below 30%). Both the overestimates shown by the GHI and Tamb could be due to the variability generated by the meteorological event upon arrival at the study point, which generated a significant error in the model’s prediction due to the abrupt change in these variables by influencing the humid air mass, which is reflected in the diurnal underestimates. The overestimation of the Hrel conditions of CF01 was what marked the main difference with respect to CF05, in addition to favoring a greater variability of Tamb of the former with respect to the latter. The conditions of underestimation of the GHI for CF05 were not present in CF01 in a significant way.
3.2. Taylor Diagrams
The results obtained with the different proposed configurations were also compared graphically in terms of Pearson’s correlation coefficient values and standard deviation, normalized with a Taylor diagram as shown in
Figure 6 and
Figure 7. In
Figure 6, corresponding to event CF01, it is observed that the variable with the highest similarity to the observed data was the GHI in the data measured by NASA, with correlation values around 0.98, followed very closely by the opt2 configuration prediction in the same variable. The rest of the forecast options for the GHI variable were very close to each other with values around a Pearson’s correlation of between 0.92 and 0.94 and a standard deviation close to 1. The persistence forecast representing the variability with respect to the previous day, as expected due to the influence of the frontal event, was the least favorable prediction, with correlation values close to 0.82. The predictions of the Tamb variable showed Pearson’s correlation values between 0.85 and 0.89 in the proposed configuration options (opt0, opt1, opt2, and opt3), which were grouped with similar values and a standard deviation close to 1. The values measured by NASA in Tamb obtained one of the lowest correlation values with 0.82 and a standard deviation of approximately 1.50, which could be due to the differences in resolution in the mesh of the model. The persistence forecast showed correlation values of 0.88, similar to the prediction options, which showed that there was no significant ambient temperature variability during the frontal event. The Hrel predictions in CF01 presented the least favorable correlation values compared to the rest of the normalized variables, as well as a higher standard deviation with values close to 2. The correlation values in Hrel were less than 0.60 in all the proposed options; opt0, opt2, and opt3 showed very similar values between them, ranging between correlations of 0.40 and 0.55. Given the characteristics of CF01, favorable values were obtained in the prediction of solar irradiance, followed by the Tamb variable, with the least favorable prediction being Hrel.
From
Figure 7, corresponding to the Taylor diagram of event CF05, it is possible to observe that one of the best-predicted variables was Tamb, with correlation values that ranged from 0.85 to 0.97. The Hrel and GHI variables showed similarity, with most of their predictions ranging from 0.75 to 0.9 and standard deviations between 1.5 and 2; however, opt1 in both variables can be considered as the most favorable results for Hrel and GHI, followed closely by opt2 with similar values. In this same comparison, the data measured by NASA were favorable in the prediction of the GHI and Tamb, but not so for Hrel, where, despite showing a correlation higher than 0.9, a standard deviation close to 2 was obtained. This is possibly associated with local conditions of moisture transport at low levels of the atmosphere, which NASA measurements failed to represent. As mentioned above, it is important to note that in event CF05, the GHI predictions were less favorable than in CF01, i.e., the local humidity factor at the study site during the passage of the frontal system generated errors in the model predictions. The results of the GHI predictions in the two frontal events showed that the proposed prediction options were more favorable in the CF01 event, which is associated with a polar maritime air mass; in the prediction of event CF05, the standard deviation values were higher, possibly due to the influence of the tropical maritime air mass with the influence of ODALYS, a weak and short-lived tropical storm that remained over the western portion of the Eastern Pacific basin from 3 to 8 November with a peak on 4 November and low remnants from 5 to 8 November [
40].
3.3. Analysis of Statistical Values
Table 3 shows the statistical values of the CF01 forecast classified as a system with greater influence of a polar maritime air mass. Considering the characteristics of each statistic used to quantify the error in the simulations, it is necessary to analyze and discuss every result for each of the variables. In the statistical results, it is observed that opt2 predominates over the rest of the configurations for most of the simulated variables.
For the GHI variable, the determination coefficient R2 shows values between 0.69 and 0.94. The values measured by NASA obtained the most favorable estimate with an R2 of 0.94; however, there was no significant difference between the proposed prediction options, which ranged between 0.86 and 0.87. The persistence forecast showed the lowest coefficient value with an R2 of 0.69, suggesting an attribution to the daily variability generated by the event. A similar pattern of correlation was observed in the statistical results for the Tamb and Hrel variables, where the R2 of the simulation options showed values between 0.73 and 0.78, and 0.14 and 0.23, respectively. The R2 values for the Hrel variable were the least favorable of the proposed configuration options.
In the MAE statistic, the GHI simulation option corresponding to opt2 was considered the best result because its values were the lowest among the options with 35.66 W m−2 and with an error 18% lower than the rest; in the measured data from NASA, the lowest value was obtained with 22.35 W m−2, which represents a difference of only 35% with respect to the opt2 simulation. For the Tamb and Hrel variables, the MAE values in the opt2 configuration options were also the most favorable with a difference of approximately 40% in Tamb and 9% in Hrel with respect to the rest of the options.
For the GHI variable, all the proposed simulation configurations showed a negative MBIAS: opt2 with a value of −16.22 W m−2, while the rest had higher negative values up to −39 W m−2, in contrast to opt1. The RMSE statistic showed a similar pattern for opt2, where, with a value of 81.36 W m−2, it was the best evaluated configuration option. The MBIAS values were the most favorable across the board, i.e., the lowest values of this statistic for the GHI were from NASA were from NASA and the persistence forecasts with a value of −2.96 and 4.96 W m−2, respectively; however, with only a difference of −13.2 W m−2, opt2 was the best of the predictions. Of the proposed configuration options, opt2 showed the most favorable values for most of the statistics, except for R2, where the predictions were evaluated with values that were similar to each other with only a difference of one-tenth.
The variables Tamb and Hrel obtained a favorable estimation with low values in most of the statistics. Tamb showed R2 values between 0.67 and 0.78, with the highest value in opt1, similar to that of the persistence forecast and slightly different from opt2 with 0.73, which dominated in the rest of the statistics. Finally, it is important to note that the measured NASA data, for the R2, MAE, MBIAS, and RMSE statistics in Tamb and Hrel, made errors that, at times, in the statistical evaluation, resulted in scores that were a less favorable “prediction” than some of the proposed options.
The statistical results of the CF05 event, classified as wet and with the influence of a tropical cyclone prior to its displacement in the study area, are shown in
Table 4. The GHI variable showed the most favorable values of R
2 in the opt1 with 0.86, while the opt0 was the least favorable, with 0.77 followed by the persistence forecast with a value of 0.68. In MAE, a similar pattern to the previous event could be observed, since according to the values, the best evaluated option was opt1 with 47.14 W m
−2, followed only by opt2 with 47.55 W m
−2; the MBIAS values remained negative in all options, where opt2 showed a lower bias with −22.81 W m
−2, in contrast to the highest bias, corresponding to opt3, with −43.95. W m
−2. The RMSE reflected that the values of 104.35 W m
−2 and 108.33 W m
−2 for opt1 and opt2, respectively, were the most favorable and did not show a large difference between them. As can be seen in the CF05 statistics, it is not possible to appreciate a clear trend towards any of the proposed options; however, relevant values were identified in almost all the options except opt0.
The Tamb and Hrel variables showed a satisfactory estimation in most of the statistics for each of the configuration options. However, according to the small differences shown in the evaluation, it could be complicated to define an optimal option. The Tamb variable showed good results, especially in opt3, where R2, MAE, and RMSE were the most favorable among the configuration options. For the Hrel variable, it can be highlighted that there are unfavorable statistical values compared to the rest of the variables; this was generally the least accurate prediction in the CF05 event; however, opt2 stood out as the configuration with the best statistical results. Its highest R2 value was obtained in opt1 and opt2 with 0.79.
4. Conclusions
In this work, four proposed configurations of the WRF model’s physical parameterizations are generated and evaluated, focused on the prediction of the GHI for five frontal systems in the study region. The Tamb and Hrel are of great variability during this type of winter events, so they were considered in the discussion of the predicted GHI behavior. Based on the dispersion analysis of the point series, the evaluation of statistical errors between measured and calculated values, and their respective correlations, it can be concluded that the predictions of the GHI, Tamb, and Hrel variables within a range no larger than two standard deviations were able to simulate the behavior of atmospheric conditions during the passage of a winter frontal system. However, according to the evaluation of the statistical results obtained in all the case studies, the opt2 configuration corresponding to the Dudhia and RRTM solar radiation parameterizations was the most favorable for most of the events.
During the period of influence of the frontal systems analyzed in the study region, it was perceived that the WRF model made errors of underestimation and showed a lack of sensitivity to the rapid variability generated by possible clouds and humidity at low levels of the atmosphere, which was visible during the first 12 h, in which the cold front entered the study region. For the most part, it is clear that opt2 dominated with the most favorable statistical prediction values. Predictions of the Tamb variable obtained the lowest errors despite a general overestimation evidenced in the MBIAS values. The predictions of Hrel showed different values in the errors; however, they did not exceed the MAE of 13% nor the 10.6% overestimation in the less favorable cases. The fact that the Hrel variable responded with low correlation values and a relatively high error percentage could be closely related to the behavior and natural variability of the on-site measurements. Given the characteristics of the CF05 study event, this analysis suggests that the WRF model has difficulty in predicting the variability of the humidity of a cold front associated with a tropical maritime air mass and its interaction with Pacific upwelling moisture associated with the passage of tropical cyclones that transport moist air masses from latitudes to the south.
The results of the data measured by NASA stood out as a potential tool for the adjustment of the case studies analyzed, mainly due to the low errors shown in their measurements when recording the Hrel and GHI variables during the influence of frontal events associated with humid air masses.
The comparison between the persistence forecast and the predictions generated by the four WRF model parameterization configurations showed the capacity of the model to improve the prediction of atmospheric variables under the influence of cold fronts. Based on the observation of the Taylor diagrams and the analysis of the statistical errors, it can be concluded that the model configuration corresponding to the opt2 had the best performance in the prediction of the variability of the solar irradiance GHI, under the influence of a frontal system in the desert region of northwestern Mexico. It was also possible to observe the importance of associating humidity characteristics to frontal systems as a potential factor that hindered model predictions in this study area. This analysis provides a configuration of the WRF model that can be used as a meteorological prediction tool to directly associate the variability of solar irradiation in the estimation of short-term electric power generation in solar-activated plants, under the influence of winter systems in desert regions. It is proposed in future work that the opt2 configuration be tested under different planetary boundary layer parameterization options, where its performance will be evaluated in other similar climatic regions, and the sample of frontal events with humidity characteristics associated with tropical maritime air masses be increased.