1. Introduction
The global drive towards environmental protection and energy sustainability has led to the increasing deployment of hybrid energy systems (HESs), which leverage the complementarity of various energy sources to improve system performance and enable the integration of renewable energy sources [
1,
2,
3,
4]. Among these, photovoltaic (PV) systems have emerged as a leading technology, with rapidly decreasing costs and maturing auxiliary devices contributing to explosive growth in the global installed capacity [
5,
6,
7]. However, the inherent uncertainty of solar power poses significant challenges to the photovoltaic-based HESs, particularly in the optimal design phase, when the predictive information is hard to get, making it challenging to achieve an optimal configuration that balances load demand, minimizes cost, and meets other performance indices [
8,
9,
10]. It is necessary to identify a reasonable method for the HES optimization design to quantitatively analyze the uncertainty of solar radiation.
In outdoor application scenarios, the uncertainty of solar power generation is caused by factors such as temperature, efficiency, and variations in the radiance received by the panel. Radiation is the direct source of solar energy, making it the main factor contributing to the volatility of PV systems. Typically, researchers construct physical models of PV modules to evaluate the effects of radiation changes on the final energy output of HES [
11,
12,
13,
14]. Radiation data is usually based on typical days (weeks, months, or years) or on historical measured sequences. While one could track the total global solar spectrum on an inclined plane using satellite datasets [
15], this approach may provide valuable insights into solar radiation dynamics, but it demands significant computational resources and data processing efforts. In addition, even if analytical methods potentially offer deterministic results for specific scenarios, they will not fully address the variability inherent in real-world HES conditions [
16]. For example, optimizing HES includes complex interactions between multiple energy sources, diverse optimization objectives, and the need for comprehensive uncertainty-containing radiation information. Therefore, we opted for a stochastic modeling approach, which we integrated into our HES optimization framework. This helps provide complementary benefits in addressing uncertainties, thus improving decision-making effectiveness. To this end, some researchers use stochastic sampling to incorporate radiation uncertainty into the optimal design of HES [
17]. Li et al. used Monte Carlo simulation and sensitivity analysis to quantify the uncertainties associated with the optimal design of solar heating systems [
18]. Cho et al. used the scenario-based approach to represent solar radiation uncertainty and time variability in the optimal design of HES [
19]. The scenarios are randomly chosen from twelve years of Typical Meteorological Year data. Liu proposed an uncertainty-based multi-objective design method by generating uncertainty distributions for load prediction and RESs production based on Monte Carlo simulation [
20]. The impact of the degree of uncertainty and scenario settings was analyzed simultaneously.
In addition, another effective method for addressing radiation uncertainty is to generate simulate sequences using stochastic modeling of cloud cover [
21]. The radiation outside of the atmosphere at any given location on earth can be determined based on its relative position to the sun. However, the radiation received on land is weakened due to atmospheric transmittance and induced fluctuation caused by stochastic cloud motion, and this rate varies dynamically as air mass and cloud motion change over time [
22,
23,
24]. To describe various weather states, the clearness index is introduced and widely used in solar radiation stochastic modeling. Ngoko et al. proposed a probabilistic model for the synthetic generation of 1 min global solar radiation data based on the second-order Markov model [
25]. The model was validated by confirming the reasonable agreement between the synthetic and observed datasets. Bouabdallah et al. simulated solar radiation with hour-level stochastic cloud cover sequences, which are generated based on Markov transition matrices (MTM) and used for the sizing optimization of stand-alone PV systems [
26]. Although the studies mentioned above consider uncertain solar radiation sequences, for HES optimal design, the radiation must satisfy its rationality as random time series, which is expressed as the rational variation in daily and intraday radiation intensity. Few studies have considered both stochastic processes of the daily and intraday radiation transition, which directly determine the temporal change characteristics of the final radiation sequences.
In the present study, the daily clearness states (DCSs) are introduced, based on the intraday clearness index (CI) distribution, to describe the daily radiation intensity. The stochastic evolution characteristics in DCSs is then modeled using the Markov model. The obtained model is verified based on the Markov property and stationarity. Moreover, diffusion-based kernel density estimation (DKDE) is applied to fit the intraday CI distribution at different DCS. By integrating the two stochastic processes of the daily clearness states and the intraday clearness index, the stochastic model for solar radiation is established. A total number of 2000 radiation scenarios are generated for optimal sizing of an HES consisting of a PV, storage battery, and backup diesel generator. Half are used to solve the optimization problem, with the objective of minimizing the COE, to obtain the variation range of the optimal size of each component. The other half is used to perform a techno-economic analysis of the candidate size combinations to determine the final HES configuration.
This paper is organized as follows:
Section 2 focuses on the stochastic modeling of solar radiation. The mathematical modeling of an off-grid PV-based HES is described in
Section 3.
Section 4 presents the results of the case study used to demonstrate our method. The conclusions are drawn in
Section 5.
2. Stochastic Model Description
The total solar radiation
Gtot received on the ground can be divided into three parts consisting of direct
Gdir, diffused
Gdif, and reflected
Gr irradiation, as follows:
The Hay–Davies–Klucher–Reindl (HDKR) anisotropic model provides a method that considers the effect of clouds on terrestrial radiation through the clearness index and the diffuse fraction. The clearness index is defined as the ratio between the solar radiation received on the horizontal surface
Gh and the horizontal component of the extraterrestrial radiation
G0:
The horizontal component of the extraterrestrial radiation
G0 can be calculated as:
where
θz is the zenith angle,
N is the
Nth day of the year, and
Gsc is the solar constant, set to 1367 W/m
2 in this paper. Note that there are minor fluctuations in the solar constant, primarily due to variations in solar activity, such as changes in the number and activity level of sunspots over the solar cycle [
27]. While these variations may cause some fluctuations in the solar constant, they are relatively small and typically fall within acceptable ranges. Therefore, the solar constant is generally regarded as an approximately constant value.
A generic piecewise function of the diffuse fraction
f(
kt) is proposed by Bouabdallah et al., expressed as [
28]:
Then, the three components of the radiation can be expressed as:
where
Rb is the ratio of direct radiation between the tilted and horizontal plane,
Rdif is the tilt factor of the diffused component,
Rr is the tilt factor for the reflected component,
ρ is the albedo, and
Ai is the anisotropic index. The detailed calculation process of the HDKR model can be found in the relative literature [
29,
30,
31].
A dataset of one year of measured solar radiation is adopted in this paper, which was measured in Yulin, China, with a longitude of 109.16° E, and a latitude of 38.29° N. The measured radiation data covers a complete year from 1 January 2021 to 31 December 2021, with a time interval of 1 h, and was provided by the electric power department of Shaanxi Province, in which Yulin is located. The intention of the proposed stochastic model is to generate the stochastic CI sequence and then bring it into the HDKR model to obtain the corresponding Gtot. In this paper, the stochastic CI sequences are obtained by constructing daily and intraday stochastic models, which are based on Markov models and DKDE, respectively.
2.1. Daily Clearness State
A fundamental constraint in the optimal design of an energy system is the energy balance between supply and demand, as the daily radiation intensity transition will directly affect the size configuration. For instance, continuous daily low radiation increases the potential PV capacity for more power generation. Therefore, the stochastic radiation sequences must consider the daily radiation intensity transition for an optimal design. As described in Equation (2), the clearness index represents the extent to which extraterrestrial radiation is attenuated by the atmosphere and clouds at a given time.
Considering this, the daily clearness states (DCSs) are introduced according to the intraday CIs distribution to represents the daily radiation intensity. The median of intraday CIs is used to classify DCSs, given its ability to reflect the concentrated trend and avoid the effects of extreme outliers. A total of nine DCSs are defined in this paper, uniformly distributed between 0 and 1, with a step of 0.1, as shown in
Table 1. Note that intervals [0.8, 0.9] and [0.9, 1] both are classified as
s9, due to the existence of atmospheric transmittance, which will inevitably weaken the radiation received on the ground.
Figure 1 shows the statistical information for the intraday clearness index (CI) values at different DCS, along with the corresponding normalized radiation curves. Note that the DCSs reflect the overall daily state of radiation intensity, independent of its intraday fluctuations. Although the range of intraday CIs in some states, like states 5 and 6, are dispersed and span multiple state intervals, the difference in radiation intensity can still be distinguished from the
Gh area diagram. The dispersion of intraday CI values represents, to some extent, the fluctuating state of radiation, which is considered by the intraday CI stochastic model.
2.2. State Transition Matrix
The Markov model is capable of describing the transition probability of a finite state in a stochastic process through the Markov transition matrices (MTMs). In general, weather-related parameters, including the DCS proposed in this paper, are highly complex and nonlinear, and do not fully conform to the Markov properties. However, in certain simplified models or specific circumstances, the transition of weather phenomena can be approximated as Markov processes, which has been applied and validated in many studies [
24,
25,
32]. In this paper, the Markov model is employed for stochastic modeling of the DCSs transition, which provides transition probabilities between any two DCSs, as illustrated in
Figure 2. The frequency counting method is used to estimate the transition probabilities, based on observed frequencies of events in the historical dataset. Specifically, the estimated transition probability can be calculated by:
where
Aij is the number of observed transitions from state
i to state
j;
M is the number of the states.
To ensure the validity of the model, the Markov chain property and stationarity of the DCS sequence must be statistically tested. The Markov chain properties can be evaluated by assessing the dependence of successive events. Based on the null hypothesis that successive events are independent, the statistic
α is asymptotically distributed as
χ2, with (
N − 1)
2 degrees of freedom. This statistic is defined as:
where
N is the total number of states, and
nij and
pij are the frequency and probability of state
j transfer from state
i, respectively.
If the transition probabilities of a Markov process remain independent over time, the process is considered stationary, and the statistic
β has a
χ2 distribution with
N(
N − 1)(
T − 1) degrees of freedom.
where
T is the number of subintervals and is set to four here. Each subinterval represents about one season of the total 365 days.
As summarized in
Table 2, the value of
α is larger than the
χ2, with 64 df at the 5% level, which means we can reject the null hypothesis and conclude that the transition of DCS possesses the first-order Markov chain property. Meanwhile, the stationary property has been proven, as the value of
β is smaller than the
χ2 at the 5% level, with 216 df.
Figure 3 shows the construction method for the DCS sequences, where
v refers to random variables uniformly distributed in the interval [0, 1], and
p is the corresponding transition probability from the obtained Markov transition matrices. Note that the probability vector used on the first day is calculated by the stationary distribution of the proposed Markov model. For the rest of the sequence, the probability vector is the corresponding transition vector of the previous state.
2.3. Distribution Fitting of Intraday CIs
The CIs at each time of the day are treated as stochastic variables, satisfying a certain distribution that can be fitted based on the CI sets at different DCSs from the samples. The diffusion-based kernel density estimator (DKDE) has been shown to be superior in distribution fitting, as it addresses traditional KDE concerns regarding boundary bias and optimal bandwidth, while also improving computational speed [
33,
34,
35].
Given one-dimensional
n sample feature space
, the estimated probability density function (PDF) obtained by KDE is:
where
K(·) is the kernel function, and
w is the bandwidth.
The Gaussian kernel function is commonly used in KDE, which can be expressed as:
The Gaussian kernel density estimator (GKDE) is the only solution to the diffusion partial differential equation, as shown in Equation (12).
By defining the domain
R belong to [0, 1], the initial and boundary condition can be expressed as:
where
δ(·) is Dirac function.
Substituting the CI sets under each DCS, the estimated PDFs can be obtained by solving the diffusion partial differential equation in Equation (12). The obtained PDFs and corresponding CDFs by DKDE at different DCSs are shown in
Figure 4. Due to the uniform [0, 1] property of the CDF, the stochastic CI can be obtained from the inverse CDF by generating a random variable of uniform [0, 1], as shown in Equation (15).
where
v is the random variable, and
Fi is the CDF at
i-th DCS.
2.4. Stochastic Solar Radiation
The final solar radiation sequences are determined by generating stochastic DCS transition chains, with their corresponding intraday CIs.
Figure 5 shows an annual hourly stochastic solar radiation curve; as the stochastic model is based on a clear-sky model, the simulated radiation data changes are consistent with the quarterly trend.
4. Experimental Results
A total of 2000 sets of annual hourly stochastic solar radiation scenarios are generated by the proposed stochastic model. The data are equally divided into two blocks of 1000 sets, with half used for the optimal design of the proposed HES and the other half used for analysis of operational performance of the candidate sizes. The size of the PV panel, diesel generator, and battery are treated as decision variables in the optimal design process. The optimization models are solved using the Python 3.9 optimization package RSOME with Gurobi version 10.0.
The load profiles are randomly generated based on the typical load profile for a local village at the same location as the solar radiation data. Note that the study area consists of the simple hybrid energy systems of remote mountain villages. Their loads are mainly determined by essential production and living needs, and fluctuations due to changes in seasons, temperatures, etc., are typically small.
Figure 7 illustrates the typical load profile, and the load variation rate is set as 0.2.
Technical data employed in the computations are provided in
Table 3. These data cover the efficiency of each component, the costs, and other necessary parameters for the calculation process. Note that the estimates of some parameters may deviate from the actual scenarios, which, due to the costs, efficiency, reliability, and durability of the system devices, are intricately linked to their origin, manufacturer, and the current stage of development. Additionally, prices often include not only the cost of the equipment itself, but also the associated expenses, such as those related to installation and transportation, especially for the remote rural household users studied in this paper. The results obtained from the proposed optimization model only represent the optimal configuration under the set conditions. However, this framework is universal and replicable.
Battery aging paths are determined by nonlinear functions of the state of charge (SOC), depth of discharge (DOD), C-rate (defined as the charge or discharge current divided by the battery’s capacity to store an electrical charge, with the unit h − 1), temperature, etc. Commonly, battery aging models adopted in microgrid assessment and control references are mainly simplified to be linearly related with DOD and cycle amounts [
36]. In this paper, we have separately defined the lifetime and life cycles of the battery, representing the lifespan under standby conditions and its cycle lifespan, respectively.
The cost of auxiliary equipment depreciation, including the system controller, battery chargers, enclosures, switch gear, fuses, wiring, etc., was taken into account in the calculation. Note that the cost of the inverter, a key component of the HES, is not explicitly calculated in the optimization model, but is considered along with the cost of the auxiliary equipment. As a matter of fact, the selection of inverters is directly related to the capacity of the generating devices; once the capacity of generating devices is determined, compatible inverters can be determined. The cost of emissions was also considered; this is the responsibility of the user and is not always considered in some countries.
4.1. Optimal Sizing
The time horizon of one year is chosen for the optimal design of the HES, as it encompasses a full cycle of solar motion. The results of the optimization are summarized in
Figure 8, which shows a clear correspondence between the yearly total radiation and the annual COE. Specifically, the optimal sizing of the HES tends to involve a larger battery and a smaller diesel generator capacity for scenarios with higher yearly total radiation, resulting in a smaller annual COE. Conversely, scenarios with lower yearly total radiation require a larger diesel generator capacity to meet the load demand, resulting in a higher annual COE. The maximum and minimum yearly total radiation are 1874.6 kW/m
2 and 1607.4kW/m
2, resulting in a difference of 14.25%, which corresponds to a COE of USD 0.372/kWh and USD 0.388/kWh, respectively, which represents a difference of 4.3%. This result underscores the sensitivity of the optimal sizing of the HES to variations in yearly total radiation levels and highlights the importance of the radiation uncertainty in the design of HES. Meanwhile, the optimal sizes in different radiation scenarios are maintained within a certain range, with PV in [10.58, 13.86] m
2, battery in [7.3, 15.1] kWh, and diesel generator in [1.54, 2.22] kW.
Additionally, the obtained operation schedule for each component in the HES over the entire simulation period is the optimal scheduling strategy for minimizing the COE.
Figure 9 shows an example of the power dispatch schedule, with two time periods taken from winter and summer, respectively. The scenario of the example has the max yearly total radiation of 1874.6 kW/m
2, with the sizes of PV of 13.07 m
2, battery of 14.51 kWh, and diesel generator of 1.67 kW, resulting in an annual COE of USD 0.372/kWh.
From
Figure 9, as the demand is low during the daytime and high after sunset; excess power generated by the PV panel is stored in batteries and used along with the diesel generators for the night load. In the winter, the low intensity of solar radiation makes it necessary to combine with a diesel generator to meet the load supply during the daytime. Conversely, during the summer months, the increased generation of solar power allows for greater flexibility in the deployment of energy storage, resulting in a reduced reliance on diesel generators. As shown in
Figure 9b, sufficient PV power generation causes the battery to undergo a full charge/discharge cycle per day, within the feasible SOC. Additionally, it is worth noting that during the winter months, all the PV power is utilized for the load supply or battery charge, whereas there is PV power curtailment during the summer months.
4.2. Techno-Economic Analysis
The obtained 1000 sets of optimal sizes are only used for the respective radiation scenarios to acquire the minimum COE by optimal energy scheduling. However, the range of optimal sizes obtained for each component in the optimal design, with sufficient radiation scenarios, provides a reference for sizing configurations of systems under radiation uncertainty. In this section, five candidate sizes are selected from the optimal range, while the techno-economic analysis is performed based on the remaining 1000 sets of radiation scenarios. Respectively, sizes (1), (2), and (3) are the minimum, maximum, and mean sizes from the optimal size range of each component, and sizes (4) and (5) are the optimal sizes of the maximum/minimum yearly total radiation scenario.
Table 4 gives the statistical index for each size of the HES, including the specific size information, initial capital cost, annual COE, average energy supply, average PV power generation, etc. Overall, the main difference between the various size combinations comes from the battery capacity, which also creates a gap in their initial investment. The annual COE fluctuated within USD 0.0157 to 0.0194/kWh for each HES in the random scenario, with a difference of USD 0.0284/kWh between the maximum and minimum COE for all HES. The smallest COE occurs at the size (5) of USD 0.3821/kWh, and since it is taken from the smallest yearly total radiation scenario, a larger diesel generator capacity must be configured to meet the load demand. The largest COE occurs under size (2), with USD 0.4105/kWh; the size consists of the maximum sizes from the optimal size range of each component, leading to the highest initial capital cost of USD 6909.8. Only size (1) shows a loss of energy supply, with a yearly average of 12.03 kWh and a loss probability of 0.15%. The average proportion of PV generation is positively correlated to its capacity, with a maximum of 64.1% for size (2), while the average curtailment rate is correlated to battery capacity, with a minimum of 13.17% for size (4) (752.17kWh/year). Size (4) is the most notable among all candidate size combinations, with average COE of USD 0.3931/kWh, which is only USD 0.0041/kWh higher than the minimum, yet it exhibits the second-highest PV proportion and the smallest curtailment rate.
Specifically, the COE composition for each size of HES is shown in
Figure 10. The fuel and emission costs of diesel generators comprise the largest share for each size combination, followed by the depreciation cost of auxiliary equipment, which is directly related to the initial capital cost of the system. The depreciation cost of the PV and diesel generators is not sensitive to the HES size, while the depreciation cost of the battery shows a significant increase with the increase in battery capacity. Meanwhile, higher battery depreciation costs mean that the battery meets the flexible take-up of PV power through more frequent charging and discharging, which reduces the fuel costs of the diesel generator. The main advantage of distributed HES is the flexibility to meet the load demand, as well to express the environmental friendliness of the renewable energy it contains. The COE composition of both size (2) and size (4) exhibit high battery depreciation costs and low fuel costs, echoing the high PV proportion of both in
Table 4. Nevertheless, size (4) is better than size (2) in terms of the cost of the initial investment, the COE, and the PV curtailment rate, and seems to be the best choice in terms of both economic and environmental considerations.
5. Conclusions
A novel method for the stochastic modeling of solar radiation is proposed, with consideration of both intraday variation and daily stochastic evolution. By introducing the clearness index, the daily radiation intensity is divided into nine states, and the evolution characteristics are modeled based on the Markov model. Diffusion-based kernel density estimation is used to obtain the probabilistic distributions of the intraday clearness index at different daily clearness states. The use of the clear sky radiation model naturally ensures that the obtained radiation data comply with the local solar seasonality and variations in sunlight duration. Meanwhile, the two-layer stochastic model ensures the rationality of daily variations and intraday fluctuations in radiation intensity, providing a data foundation for the optimization of the designing of solar-powered hybrid energy systems. A methodology for the application of the proposed stochastic radiation model is illustrated by optimal sizing in the case of a solar-powered hybrid energy system. The main conclusions are presented below.
- (1)
The proposed generation model can effectively reflect variations in solar radiation, providing a reliable data foundation for the optimization of the designing of solar-powered hybrid energy systems (HES).
- (2)
The optimal size range for each device in the hybrid energy system under random solar radiation variations can be obtained based on the proposed model.
- (3)
The size combination in the maximum total yearly radiation scenario exhibits a better overall performance. Compared to the worst-case scenario, its PV curtailment rate decreases by 7.02%, and the COE is only USD 0.0041/kWh higher than the current minimum.
This work considers a simple off-grid PV-battery-diesel generator system, but it can be easily extended to more complex architectures, including other renewable energy sources and grid connections.