1. Introduction
Uncertainty becomes a critical factor in power system planning as the capacity of renewable generators increases in response to global warming and climate change caused by electricity generation. In this context, generation capacity planning models that consider the uncertainty of renewable resources have been developed [1,2,3]. In those models, renewable resources such as wind and solar power are represented by random variables or uncertainty sets as a means of quantifying uncertainty.
In [1], hydropower resources with uncertainty are quantified as scenario sets, and a mathematical generation planning model is formulated with two-stage stochastic programming. In [2], uncertainty sets for electric demand and wind power generation capability are formed via a statistical method, k-means clustering. A multi-stage stochastic generation planning model is presented in [3], where seasonal daytime and nighttime random samples are generated for wind power availability, solar direct normal irradiance (DNI), and electric load using a Gaussian copula. Seasonal variability is represented in the scenarios; however, short-term variability and chronology are not included. In [4], generation capacity planning models that apply methods for representing the uncertainty of renewable resources are surveyed.
In the generation capacity planning models described above, the uncertainty of the random parameters is well represented from the viewpoint of long-term planning, and optimal solutions are obtained within a reasonable amount of time; however, temporal variability is not fully captured because one random variable or scenario represents a long time period. Moreover, the chronology of the random parameters is ignored, which may affect the optimal generation capacity to be built. The effect of including the chronology and temporal details of random parameters such as electric loads in the planning models has therefore been discussed in [5,6,7,8,9,10,11,12].
In [5], operating costs for generation capacity building portfolios are evaluated with 1-h time steps, where a detailed operational model with unit commitment decisions is included in the optimization problem. Scaled time-series data for electric demand and wind power are used. An energy planning tool is employed in [6] to examine the relation between a long-term planning model and the short-term dynamics of renewable resources. As a scenario, a typical load curve is selected from historical time-series data for each month. Four different generation capacity building plans are evaluated based on the given electric load scenario.
In [7], scenario paths are formed for each season with different time intervals (1-, 2-, 6-, 12-, and 24-h intervals), and the aggregated scenario paths from the spring to the winter form yearly scenario paths.
A unit commitment decision is included in the operational problem with 1-h time resolution in [8], and a test system without transmission lines is applied to investigate the performance of the model.
For more precise representation of the operational model, unit commitment decision-making is included in a generation capacity expansion planning problem with hourly time steps [9], and typical scenarios with 1-h time steps of demand and renewable energy sources for four seasons are selected as representative scenarios.
In [10], linked simulations with pre-developed long-term and short-term models with 1-h time intervals are performed. With the planned (fixed) future capacity of renewable generation systems, scenarios for electricity demand growth and capacity factors for renewable generators are used based on assumed and historical data for the year 2010, respectively.
Experiments are performed with scenarios in [11] with 1- and 4-h time steps for a generation capacity investment problem without considering transmission capacity limits. The scenarios are generated via the aggregation method from the hourly historical data.
The models above capture detailed temporal variability by applying short time intervals in long-term planning models; however, the uncertainty of renewable resources is not explicitly modeled in the mathematical problem.
It is challenging to maintain a high resolution of the planning horizon with short time intervals because the problem size increases. In particular, in the stochastic optimization context, computational difficulty increases drastically as the time steps become shorter, since the number of decision variables scales with both the number of scenarios and the number of time steps. There may be a tradeoff between computational cost and the quality of approximated solutions, because solution quality becomes relatively poor if only a small number of scenarios is applied.
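The scaling argument above can be made concrete with a small count. The sketch below assumes, purely for illustration, one dispatch variable per generating unit, scenario, and time step; the unit count and the four-block coarse horizon are hypothetical, while 20 scenarios and 24 hourly steps echo values used later in the paper.

```python
def operational_variable_count(n_units, n_scenarios, n_time_steps):
    """Count operational variables, assuming one per unit, scenario, and time step."""
    return n_units * n_scenarios * n_time_steps


# Hypothetical system with 50 units and 20 scenarios.
coarse = operational_variable_count(n_units=50, n_scenarios=20, n_time_steps=4)
hourly = operational_variable_count(n_units=50, n_scenarios=20, n_time_steps=24)
growth_factor = hourly // coarse  # how much larger the hourly model is
```

Under these assumptions, moving from four time blocks to 24 hourly steps multiplies the operational variable count by six, before any unit commitment or transmission variables are added.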
When integrating solar power into power systems, it is important to capture the unavailability of solar power during nighttime and the rapid changes in availability around sunrise and sunset. Therefore, any method in which a scenario at a particular time interval represents a long time period may not reflect the actual behavior of solar power accurately.
Several stochastic generation planning models containing a high-resolution operational model have been introduced, e.g., [12], where a stochastic model is developed under a two-stage stochastic programming framework with hourly operational intervals. However, transmission capacity constraints are not considered, and the scenario paths for renewable resources such as wind and solar power availability are based simply on historical information. The obtained solutions therefore rely heavily on historical events rather than on future events that may occur with a certain level of probability.
To overcome the shortcomings of the generation planning models in the literature stated above, this paper proposes a stochastic generation capacity planning model using hourly random scenarios and investigates the impact of including short-term variability by applying the model to the IEEE 300-bus system with transmission constraints.
The contributions of this paper are summarized as follows: (1) The impacts of an hourly resolution of the operating problem on a stochastic generation planning model are investigated. (2) An optimization model is developed to find an optimal solution to the generation capacity planning problem with sample paths under a stochastic programming framework and is implemented on a high-voltage 300-bus system with transmission constraints. (3) An ARTA stochastic process is implemented with the optimization model to generate hourly random paths, which enables Monte Carlo simulation.
The rest of this paper is organized as follows. Section 2 describes the method by which sample paths are generated and explains the generation procedure step by step. The mathematical formulation of a two-stage stochastic generation capacity planning model, including the decision process, is presented in Section 3. Simulation results for a modified IEEE 300-bus system are presented in Section 4, together with a comparison of the results from the models with and without consideration of the short-term variability. The paper concludes with some remarks in Section 6.
2. Scenario Generation Method
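As a method to generate random sample paths representing the short-term variability of the uncertain parameters, the AutoRegressive-To-Anything (ARTA) process is applied [13,14]. The ARTA process is obtained by finding an AR($p$) base process, $\{Z_t\}$, from the known autocorrelations and the marginal distribution of the desired process $\{Y_t\}$ and transforming $\{Z_t\}$ into the process $\{Y_t\}$, where $Y_t = F_Y^{-1}[\Phi(Z_t)]$, $F_Y$ is the marginal cumulative distribution function of $\{Y_t\}$, and $\Phi$ is the standard normal cumulative distribution function.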
First, a stationary desired process needs to be defined. The corresponding time-series data (solar DNI, wind power availability, and electric load) are not stationary in general, so a stationary desired process $\{Y_t\}$ is defined from the target process $\{X_t\}$, where $Y_t$ satisfies $Y_t = X_t - X_{t-1}$. The base process, $\{Z_t\}$, is derived using the autocorrelations of the stationary process and the marginal cumulative distribution, $F_Y$, of $\{Y_t\}$.
The whole procedure for the ARTA process can be summarized in several steps: (1) finding the autocorrelation structure of the AR($p$) process $\{Z_t\}$ using the autocorrelations and the marginal distribution of $\{Y_t\}$ in Section 2.2, (2) finding the AR parameters of $\{Z_t\}$ from the autocorrelations of $\{Z_t\}$ in Section 2.3, (3) forming $\{Z_t\}$ and transforming it into the desired process $\{Y_t\}$, and (4) transforming the process $\{Y_t\}$ back into the target process, $\{X_t\}$. These steps are illustrated in Figure 1. All calculations for generating the sample paths are done with MATLAB. Detailed steps to generate the ARTA process are described in the following subsections.
2.1. Historical Seasonal Data
The historical time-series data comprise hourly potential wind power, solar irradiance, and electric load in Texas for one year, from November 2009 to October 2010 [15]. To obtain sample paths that reflect different seasonal characteristics, the data are classified into four seasons, and the sample paths are generated for each season. Marginal distributions and autocorrelation structures are found separately for each seasonal dataset. For solar DNI, only daytime data are used, and nighttime values are set to zero without generating samples. The daytime hours for solar DNI are assumed to be 07:00 to 19:00 for spring and fall, 07:00 to 20:00 for summer, and 08:00 to 19:00 for winter.
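The daytime assumption can be expressed as a simple mask over a 24-value hourly profile. The following is a minimal sketch: the window hours come from the text, whereas the function names and the choice to treat the end hour as inclusive are assumptions made here for illustration.

```python
# Daytime windows (start hour, end hour) per season, as stated in the text.
DAYTIME_HOURS = {
    "spring": (7, 19),
    "summer": (7, 20),
    "fall": (7, 19),
    "winter": (8, 19),
}


def is_daytime(season, hour):
    """Return True if hour (0-23) lies in the season's window (end hour inclusive, assumed)."""
    start, end = DAYTIME_HOURS[season]
    return start <= hour <= end


def mask_nighttime(season, hourly_dni):
    """Set the nighttime entries of a 24-value DNI profile to zero."""
    return [v if is_daytime(season, h) else 0.0 for h, v in enumerate(hourly_dni)]


winter_profile = mask_nighttime("winter", [1.0] * 24)
summer_profile = mask_nighttime("summer", [1.0] * 24)
```

With this mask, only the daytime entries need to be sampled from the stochastic process; the remaining entries stay at zero.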
2.2. Autocorrelation Structure of $\{Z_t\}$
A marginal distribution and the autocorrelations of each process for solar DNI, wind power availability, and electric load are derived from the classified data described in Section 2.1. Johnson's translation system is fit to the historical data for the marginal distributions of the process [16], since the Johnson translation system offers great flexibility for numerical analysis. Parameter estimation for the distributions is done with the statistical package R. The autocorrelation structure composed of the autocorrelations of the process $\{Y_t\}$ is represented by $\rho_Y = (\rho_Y(1), \rho_Y(2), \ldots, \rho_Y(p))$, where $\rho_Y(h)$ refers to the correlation between two time series, $Y_t$ and $Y_{t+h}$.
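As an illustration of how the lag-$h$ autocorrelation of a seasonal series might be estimated from data, the sketch below uses the common biased sample estimator. The text does not state which estimator was used, so this choice and the function name are assumptions.

```python
def sample_autocorrelation(series, lag):
    """Estimate the lag-`lag` autocorrelation of a series (biased sample estimator)."""
    n = len(series)
    mean = sum(series) / n
    variance_sum = sum((x - mean) ** 2 for x in series)
    covariance_sum = sum(
        (series[t] - mean) * (series[t + lag] - mean) for t in range(n - lag)
    )
    return covariance_sum / variance_sum


rho_1 = sample_autocorrelation([1.0, 2.0, 3.0, 4.0, 5.0], lag=1)
```

Applying the function for lags 1 through $p$ to each seasonal dataset would give the autocorrelation structure of the desired process.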
The autocorrelation structure of the underlying standard normal process, $\{Z_t\}$, can be represented by a vector, $\rho_Z = (\rho_Z(1), \rho_Z(2), \ldots, \rho_Z(p))$, and the autocorrelation, $\rho_Z(h)$, can be found with

$$\rho_Y(h) = \frac{E[Y_t Y_{t+h}] - (E[Y])^2}{\mathrm{Var}[Y]}, \tag{1}$$

where

$$E[Y_t Y_{t+h}] = \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} F_Y^{-1}[\Phi(z_t)] \, F_Y^{-1}[\Phi(z_{t+h})] \, \varphi_{\rho_Z(h)}(z_t, z_{t+h}) \, dz_t \, dz_{t+h}. \tag{2}$$

In Equations (1) and (2), $E$, $\Phi$, and $\varphi_{\rho_Z(h)}$ represent expectation, the standard normal cumulative distribution function, and a standardized bivariate normal distribution function, respectively. Since $\rho_Y(h)$, $E[Y]$, and $\mathrm{Var}[Y]$ are known in (1), the left-hand-side (LHS) value, $E[Y_t Y_{t+h}]$, in (2) is known. The unknown value, $\rho_Z(h)$, which is implicitly incorporated in $\varphi_{\rho_Z(h)}$, is adjusted until the LHS and right-hand-side (RHS) values are equal within a tolerance value specified for the simulation. Using this method, the autocorrelation structure $\rho_Z$ can be determined. The autocorrelations of the desired process and the derived stationary base process for solar DNI, wind power availability, and electricity demand with different lags, $\rho_Y(h)$ and $\rho_Z(h)$, are listed in Table 1 and Table 2.
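The search for the base-process autocorrelation can be sketched as a one-dimensional root-finding problem: the product moment of the desired process is computed from its known autocorrelation, mean, and variance, and the base correlation is adjusted until the double integral over the bivariate normal density matches it. The sketch below makes several assumptions that are not taken from the paper: a Johnson S_U marginal (for which the transformation from a standard normal value has a closed form), a simple midpoint quadrature, bisection as the search method, and a tolerance of 1e-6. All function names are hypothetical.

```python
import math


def johnson_su_transform(gamma, delta, xi, lam):
    """Return z -> F_Y^{-1}[Phi(z)] for a Johnson S_U marginal (closed form)."""
    def transform(z):
        return xi + lam * math.sinh((z - gamma) / delta)
    return transform


def normal_grid(n=81, limit=6.0):
    """Midpoint nodes on [-limit, limit] with normalized standard normal weights."""
    step = 2.0 * limit / n
    nodes = [-limit + (i + 0.5) * step for i in range(n)]
    raw = [math.exp(-0.5 * z * z) for z in nodes]
    total = sum(raw)
    return nodes, [w / total for w in raw]


def expected_product(rho_z, transform, grid):
    """Approximate E[Y_t Y_{t+h}] when the base pair has correlation rho_z."""
    nodes, weights = grid
    s = math.sqrt(max(0.0, 1.0 - rho_z * rho_z))
    total = 0.0
    for z1, w1 in zip(nodes, weights):
        # Write the second base variable as rho_z * z1 + s * u with u independent of z1.
        inner = sum(w2 * transform(rho_z * z1 + s * u) for u, w2 in zip(nodes, weights))
        total += w1 * transform(z1) * inner
    return total


def solve_base_correlation(rho_y, transform, tol=1e-6, max_iter=60):
    """Bisection on rho_z until the product moment matches its target within tol."""
    grid = normal_grid()
    nodes, weights = grid
    mean = sum(w * transform(z) for z, w in zip(nodes, weights))
    var = sum(w * transform(z) ** 2 for z, w in zip(nodes, weights)) - mean ** 2
    target = rho_y * var + mean ** 2  # known product moment of the desired process
    lo, hi = -1.0, 1.0
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        value = expected_product(mid, transform, grid)
        if abs(value - target) <= tol:
            return mid
        if value < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


rho_z_normal = solve_base_correlation(0.5, lambda z: z)
rho_z_su = solve_base_correlation(0.5, johnson_su_transform(0.0, 1.5, 0.0, 1.0))
```

Bisection is applicable because the product moment is nondecreasing in the base correlation. For a normal marginal the base and desired correlations coincide, while for a non-normal marginal the base correlation is at least as large in magnitude; not every desired correlation is attainable for every marginal, which the sketch does not check.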
2.3. AR(p) Parameters of Base Process
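The matrix composed of the variances and covariances of the standardized normal stationary process $\{Z_t\}$ with lag $p$ can be denoted by

$$\Psi_Z = \begin{bmatrix} 1 & \rho_Z(1) & \cdots & \rho_Z(p-1) \\ \rho_Z(1) & 1 & \cdots & \rho_Z(p-2) \\ \vdots & \vdots & \ddots & \vdots \\ \rho_Z(p-1) & \rho_Z(p-2) & \cdots & 1 \end{bmatrix}.$$

The parameters of $\{Z_t\}$, $\alpha = (\alpha_1, \alpha_2, \ldots, \alpha_p)^{\top}$, can be found via

$$\alpha = \Psi_Z^{-1} \rho_Z,$$

where $\rho_Z = (\rho_Z(1), \rho_Z(2), \ldots, \rho_Z(p))^{\top}$ is the autocorrelation structure found in Section 2.2. A random variable, $\varepsilon_t$, can be generated using the variance $\sigma_\varepsilon^2$, which is given by

$$\sigma_\varepsilon^2 = 1 - \sum_{h=1}^{p} \alpha_h \rho_Z(h).$$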
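The AR parameters follow from the Yule-Walker equations: the correlation matrix of the base process is built from its autocorrelations, the resulting linear system is solved for the AR coefficients, and the innovation variance is one minus the weighted sum of the autocorrelations. The sketch below is a minimal pure-Python illustration; the function names are hypothetical, and the small Gaussian elimination routine stands in for whatever linear solver was actually used.

```python
def solve_linear_system(a, b):
    """Solve a small dense system by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x


def ar_parameters(rho_z):
    """Return (alpha, sigma2_eps) from base autocorrelations at lags 1..p."""
    p = len(rho_z)
    full = [1.0] + list(rho_z)  # autocorrelation at lag 0 is 1
    psi = [[full[abs(i - j)] for j in range(p)] for i in range(p)]
    alpha = solve_linear_system(psi, list(rho_z))
    sigma2_eps = 1.0 - sum(a * r for a, r in zip(alpha, rho_z))
    return alpha, sigma2_eps


alpha, sigma2_eps = ar_parameters([0.5, 0.25])
```

The example autocorrelations 0.5 and 0.25 are those of a first-order process, so the second coefficient comes out as zero and the innovation variance is 0.75.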
2.4. Generating Scenarios Using ARTA Process
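In Section 2.3, the AR parameters, $\alpha_1, \alpha_2, \ldots, \alpha_p$, and the variance, $\sigma_\varepsilon^2$, are determined. Using these parameters, a stationary AR($p$) base process, $\{Z_t\}$, can be formed as follows:

$$Z_t = \alpha_1 Z_{t-1} + \alpha_2 Z_{t-2} + \cdots + \alpha_p Z_{t-p} + \varepsilon_t,$$

where $\varepsilon_t$ is a random variable satisfying $\varepsilon_t \sim N(0, \sigma_\varepsilon^2)$. Once $\{Z_t\}$ is formed, a stationary process, $\{Y_t\}$, can be generated by $Y_t = F_Y^{-1}[\Phi(Z_t)]$, and the final target process, $\{X_t\}$, is generated. The procedure to generate random sample paths $\{X_t\}$ is described below: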
(1) Generate initial values, $X_0, X_1, \ldots, X_p$, using cumulative distribution functions for the historical initial value data, where $p$ represents the lag. Initial values for $\{Y_t\}$, composed of the series of differences, $Y_t = X_t - X_{t-1}$, are calculated; e.g., if the lag $p$ is equal to 2, three initial values, $X_0$, $X_1$, and $X_2$, are generated, and the differences, $Y_1$ and $Y_2$, are obtained, where $Y_1 = X_1 - X_0$ and $Y_2 = X_2 - X_1$;
(2) Initial values for the AR($p$) base process, $Z_1, \ldots, Z_p$, are generated via $Z_t = \Phi^{-1}[F_Y(Y_t)]$ using the differences, $Y_t$, derived in the previous step;
(3) Generate $Z_t$ using the AR parameters, $\alpha_h$, for $t = p+1, \ldots, T$;
(4) Transform the time series, $Z_t$, into $Y_t$ using $Y_t = F_Y^{-1}[\Phi(Z_t)]$;
(5) If the transformed series is out of the expected, realistic range at any $t$, go back to Step (1); otherwise, go to the next step;
(6) Transform $\{Y_t\}$ back into the target sample path with $X_t = X_{t-1} + Y_t$, where the initial value, $X_0$, is pre-determined in Step (1).
To generate $N$ random sample paths for a year, the steps described above need to be carried out for each season and repeated $N$ times. With the given procedure, $N$ sample paths with AR order $p = 2$ for $T = 24$ are generated in this simulation. The 20 generated sample paths are illustrated in Figure 2. The solar DNI sample paths are generated only for daytime, as described in Section 2.1, and the values for nighttime are assumed to be zero.
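The generation procedure of this section can be sketched end to end as follows. This is an illustration under stated assumptions rather than the implementation used in the paper (which was done in MATLAB): the marginal of the differenced series is taken to be Johnson S_U so that both transformations have closed forms, the initial values are passed in instead of being sampled from historical distributions, the range check is applied to the rebuilt target path, and a retry limit is added. All names and parameter values are hypothetical.

```python
import math
import random


def to_base(y, gamma, delta, xi, lam):
    """Map a difference y to the base scale: Phi^{-1}[F_Y(y)] for Johnson S_U."""
    return gamma + delta * math.asinh((y - xi) / lam)


def from_base(z, gamma, delta, xi, lam):
    """Map a base value z to a difference: F_Y^{-1}[Phi(z)] for Johnson S_U."""
    return xi + lam * math.sinh((z - gamma) / delta)


def generate_sample_path(initial_values, alpha, sigma2_eps, marginal, horizon,
                         lower, upper, rng, max_attempts=1000):
    """Generate one target path of length `horizon` with an AR(p) base process."""
    p = len(alpha)
    if len(initial_values) != p + 1:
        raise ValueError("expected p + 1 initial values")
    for _ in range(max_attempts):
        x = list(initial_values)                          # initial target values (given)
        y = [x[t] - x[t - 1] for t in range(1, p + 1)]    # initial differences
        z = [to_base(v, *marginal) for v in y]            # initial base values
        while len(x) < horizon:
            eps = rng.gauss(0.0, math.sqrt(sigma2_eps))
            # AR(p) recursion: alpha[0] multiplies the most recent base value.
            z_next = sum(a * z[-h] for h, a in enumerate(alpha, start=1)) + eps
            z.append(z_next)
            y_next = from_base(z_next, *marginal)         # back to a difference
            x.append(x[-1] + y_next)                      # rebuild the target path
        if all(lower <= v <= upper for v in x):           # range check (assumed on x)
            return x
    raise RuntimeError("no admissible path found within max_attempts")


rng = random.Random(0)
path = generate_sample_path(
    initial_values=[0.50, 0.52, 0.55],
    alpha=[0.5, 0.0],
    sigma2_eps=0.75,
    marginal=(0.0, 1.5, 0.0, 0.02),  # hypothetical (gamma, delta, xi, lambda)
    horizon=24,
    lower=0.0,
    upper=1.0,
    rng=rng,
)
```

Calling the function repeatedly for each season would give the set of sample paths for a year; rejecting out-of-range paths alters the sampled distribution slightly, which is a side effect of the range check itself.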
6. Discussions
Long-term planning models using stochastic programming generally do not consider the short-term variability and chronology of renewable resources because of the high computational effort required. Consequently, the effect of incorporating the short-term variability together with the uncertainty has not been properly addressed, and the expected changes in the economic value of renewable resources due to the short-term variability have not been examined.
Notably, when accommodating utility-scale solar PV systems, which are unavailable during nighttime and experience rapid changes in generation around sunrise and sunset, the inclusion of chronology and a high temporal resolution of the planning horizon affects the optimal generation capacity solutions in long-term planning models. There is a tradeoff between the level of operational detail in the model and the computational burden in stochastic generation planning models. In many cases, very detailed operational information in the model may make the problem intractable. It remains challenging to obtain optimal solutions to the stochastic generation capacity expansion planning problem considering the short-term variability, and few studies introduce stochastic models incorporating the short-term variability of renewable resources because of the excessive computational costs.
A limitation of the model is its high computational cost. Solution times with different numbers of samples are compared in this paper, and the comparison shows that incorporating the short-term variability with a high-resolution operational model dramatically increases the computational time. Therefore, an optimal level of operational detail needs to be found to efficiently estimate the value of uncertain, intermittent renewable resources. Computational cost and the accuracy of the operational model trade off against each other in stochastic capacity planning models.