1. Introduction
Wind generation capacity in the United States has grown rapidly in the past two decades due to the precipitous decline in costs as well as financial incentives and state level mandates for renewable energy playing a central role in US environmental policy [
1,
2]. Consistently, annual electricity generation from wind energy increased from 6 billion kWh (0.1% of total) in 2000 to 338 billion kWh (8.4% of total) in 2020 in the US [
3]. Since the environmental benefits are cited as the main justification for allocating public funds to incentivize renewable energy sources, accurately measuring these environmental benefits is key for evaluating existing policies as well as setting future policy directions. For this reason, this topic has been the focus of a recently growing strand of economic and engineering studies. This paper extends the literature by significantly expanding the time period and hence the scope of wind generation capacities within which this subject is studied and by using more flexible functional forms in the estimation models allowing for an additional source of heterogeneity in the estimated effect.
Fixed costs make up the majority of the total cost of operating a wind farm, and as there are no fuel costs, the marginal operating cost is nearly negligible [
4]. Furthermore, each unit of generation receives a production tax credit from the federal government; hence, wind energy generators have the incentive to produce whenever possible, including the periods of time when the wholesale market price is zero or even negative (except when there is curtailment due to transmission constraints which constitutes a very small portion of wind generation). This implies that wind turbines displace fossil-fuel burning generators which account for 25% of greenhouse gas emissions in the US, one of the world’s top greenhouse gas emitters [
5]. Meanwhile, an important challenge associated with this dramatic transformation in the electricity system is that renewable sources are highly intermittent. The need to instantaneously match generation (supply) and load (demand) in the absence of large-scale storage in the grid necessitates dispatchable sources such as natural gas and coal plants to ramp up and down more frequently in the face of stochastic wind and solar generation. Ramping up and down production is fuel intensive and has the potential to increase per-unit emissions from fossil-fuel burning generators, limiting the environmental benefits of renewable energy technologies [
6]. On the other hand, as explained in [
7], this additional need for flexibility can force the system operator to dispatch more flexible natural gas generators at a higher intensity. While this can imply less generation from coal and hence less emissions, it is possible that operating at lower output levels can cause the coal burning units to have higher heat and emissions rate [
8]. The net effect of all these, sometimes competing, mechanisms are not clear ex ante without solving a complex dynamic grid optimization problem using a detailed dispatch model.
An alternative route would be an ex post econometric data analysis, which is what this paper takes. Specifically, using 13 years of hourly and sub-hourly data from the Electric Reliability Council of Texas (ERCOT), we estimated the causal effects of wind generation and its intermittency on CO
, SO
, and NO
emissions from the electricity sector. ERCOT is an ideal setting for such a study as it has a high level of wind power penetration with 25 GW of installed wind capacity as of October 2020 (82 GW total) [
9]. This was also confirmed by our own calculations compiled using EIA data presented in
Table 1. Based on the data presented here, in 2007, the share of wind generation constituted about 2.9% of the total generation, while in 2019, this number was about 20%. The second factor that renders ERCOT a good fit for this study is that it does not have significant grid scale storage capacity (0.25 GW). High levels of storage capacity would make the estimation of the causal effect more complicated as wind energy would inevitably be stored to be used (and displace fuels) in the later hours. Therefore, the full effect would be captured over the course of multiple hours requiring a more dynamic estimation approach. The third factor is that ERCOT is rather isolated from the other electrical grids, making it sufficient to confine the study to the plants in the grid to capture the full effect [
10].
Economic theory shows that the efficient way of correcting for externalities is to charge a tax equal to marginal damages, i.e., Pigovian taxes [
11,
12]. However, in the US, the use of these policies have so far been rather limited. Current environmental policies catering to the energy sector are predominantly geared toward mandates and financial incentives for renewable energy sources. At the state level, renewable portfolio standards (RPS) require a minimum share of electricity to be generated from renewable resources [
13]. Currently, 38 states have an RPS goal in place, with 12 of those states requiring 100% clean electricity by 2050 or earlier [
14]. At the federal level, there is a production tax credit for renewable electricity [
15]. Such a policy environment motivated researchers from both economics and engineering to estimate the emission reductions attributable to renewable energy sources with the overarching objective to evaluate whether government incentives are justified on the grounds of net economic benefit.
Although at a first glance, calculating emission reductions due to renewable energy sources might seem as straightforward as multiplying average emissions per kWh of fossil-fuel generation by total renewable energy output, this would, in fact, not be the correct approach. A more careful way of thinking about this problem involves identifying which power plant(s) would pause or reduce operations when the renewable energy sources are dispatched. Emissions avoided due to these marginal power plants reducing their operations would amount to the environmental benefits of renewable sources. In principle, marginal power plants can be identified using a dispatch model based on the assumption that renewables would replace high-marginal-cost plants. The papers that take this general approach include [
16,
17,
18,
19,
20]. The prevailing insight from this group of studies is that while an increased level of renewable integration would reduce emissions, the magnitude of the effect depends on the existing fuel mix and technical characteristics of the system. While this method is more accurate compared to using the average emissions, performing a full dynamic optimization analysis would require often hard-to-access grid dispatch models and proprietary data on transmission lines and individual generating units.
More recently, an alternative approach pioneered by [
21] emerged in the economics literature. The core idea here is that instead of calculating the avoided emissions as a result of solving a grid optimization problem using a dispatch model, we could estimate it using publicly available emissions and generation data and econometric methods. There is a growing strand in the energy economics literature adopting this method (e.g., [
10,
22,
23,
24]).
There are two important areas in the current literature studying the avoided emissions attributable to wind that are open to further elaboration. The first one is that these studies are conducted using hourly data from relatively short periods of time. With the exception of [
10] who studied a 5-year period, the time periods of the studies in the literature are limited to one to two years. Therefore, their results can only be informative within a limited range of installed capacity, implying a need for studies using data from longer periods.
Second, while many studies investigate the environmental benefits of renewable energy, the subset of these studies specifically taking intermittency into account are rather limited. Notable recent exceptions are [
7,
24]. Using two years of sub-hourly data from the Southern Power Pool, [
24] estimates the effect of wind and its intermittency on emissions. They find that while the increased renewable capacity does reduce emissions, the effect of intermittency is modest in comparison. In their analysis using data aggregated to a 5 h level from a 2 year period, [
7] presents estimates for CO
within ERCOT. The study concludes that intermittency in wind reduces CO
emissions by indirectly giving natural gas facilities an advantage due to their flexibility. Our study mainly builds on these papers while complementing them in three important ways.
First, our study uses a dataset that ranges from January 2007 to December 2019, which makes it the first paper that econometrically estimates the causal effect of wind and its intermittency using sub-hourly data that spans well over a decade. Throughout this period, we see a seven-fold increase in the installed wind capacity (see the last two columns of
Table 1). Second, the fact that we use a much longer period of time allows us to present two sets of results for each pollutant: results obtained using the entire dataset (henceforth referred to as the “pooled results”) and results from breaking down the data into 10 sub-groups based on hourly wind generation. Therefore, unlike any of the existing studies in the literature, we provide the comparison of the relationship between the wind generation, its volatility, and the emissions for a variety of wind output levels. Third, the majority of the existing studies consider the effects of wind and its intermittency on emissions linearly in their main econometric specifications, and a small minority present the results of their non-linear analyses in their appendices. In this paper, we incorporate non-linearity in two distinct ways: (i) by breaking data into different wind output intervals (ii) by specifying wind and its intermittency in squared and cubed form in addition to linear form in the estimation models.
Preview of Results
Our main variables of interest are hourly wind generation and its volatility which we measured via the the standard deviation calculated using the 15 min electricity generation data from ERCOT. We find that for all three pollutants we studied (CO, NO, and SO) wind generation had a clear negative effect throughout the study period with moderate variation in the magnitude depending on the wind output. However, for the intermittency, size, magnitude, and statistical significance of the effect have a large variation across different wind generation levels as well as pollutants in an irregular way that would not be captured in the pooled results. This finding is consequential for accurately estimating the environmental benefits of renewable energy and designing effective and efficient environmental and energy policy.
The rest of the paper continues as follows:
Section 2 presents data sources and the econometric strategy. Results and their discussions can be found in
Section 3.
Section 4 offers concluding remarks.
3. Results and Discussion
We ran the regression laid out in Equation (
3) for CO
, NO
, and SO
emissions. The exercise started out with a pooled analysis where different econometric specifications with an increasing degree of polynomials for wind and its intermittency were used on the entire dataset. In addition, load and temperature were included in polynomial forms up to cubic. This flexible approach is suitable for the non-linear nature of the grid operations. Presenting results for alternative specifications with different polynomial degrees also allowed us to demonstrate the variation in the robustness of different coefficients. For each pollutant, we selected a specification from the pooled analysis and then continued with running that same specification over subsets of observations. We divided the dataset into 10 subsets based on hourly wind generation level. Within the sub-sample, the means and the standard deviations (in parentheses) of wind generation and its intermittency are presented in
Table 3. The motivation for this exercise is to further increase the flexibility of the models by allowing for irregularities and/or discontinuities in the relationship pattern that a polynomial functional form could not capture despite being non-linear. As noted above, hour-month dummies implicitly control for the growing trend in installed capacity. The insight to be gained here is that controlling for capacity and other factors, whether the marginal effects of wind generation and its intermittency, vary across hours with different wind generation output levels.
We first present the pooled analyses of each pollutant. The results in
Table 4 indicate a significant effect of wind generation on CO
emissions. The size, magnitude, and the significance of the coefficient is fairly robust and consistent across different specifications. The coefficient estimates for wind generation are between −0.53 and −0.65, which is in line with previous estimates published in the literature (e.g., [
10,
24]). We note that all standard errors reported in this study are clustered at the week level.
The goodness-of-fit measures are very close across the models; therefore, for model selection, we take statistical significance and size of the coefficients as well as parsimony into consideration. The coefficients of the squared and cubed wind generation are statistically significant; however, they are negligible in size. The coefficient of cubed wind intermittency measure is also negligible statistically and in size. Therefore, for the sake of parsimony, our preferred specification for CO is model (4) where wind generation is in linear form and intermittency is in quadratic form. The coefficient of wind generation, 0.649, gives its marginal effect since it is in linear form.
While it is not as robust as the coefficient of wind generation, that of intermittency has positive and significant coefficients in most of the models. The sign reverses with the inclusion of the squared intermittency—the coefficient of which is negative and significant. This is consistent with our original conjecture that higher degree polynomials can offer a more suitable functional form when estimating emission reductions. Intermittency is included in quadratic form in the preferred specification; therefore, the marginal effect is given by the following partial derivative:
where the term “Wind Int.” is substituted with its overall sample mean, which is 36.41 as indicated in
Table 2. The marginal effect is calculated as 0.603, while the 95% confidence interval (CI) constructed using the Delta method is [0.21, 0.99]. All marginal effects are summarized in the results section.
The results of the pooled analysis for NO
are presented in
Table 5. For this pollutant, we select specification (7). Squared and cubic wind generation terms are statistically significant, but their size is negligible. Similar to CO
, the results indicate an unequivocally negative effect of wind generation on NO
emissions, and the size of the effect is fairly robust across different specifications. This result is also consistent with other studies while, as it would be expected, the size of the effect is different for different regions. The marginal effect of wind generation on NO
indicated in our models ranges between −0.65 and −0.99. This result is similar to that presented in [
10] who also studies the ERCOT region.
For intermittency, while the cubic term is negligible in size, its inclusion makes a difference for the squared term (comparing models (4) and (7)). In the cubic form, the marginal effect is given by the following partial derivative:
It calculates to −3.918; hence, we conclude that intermittency has a reducing effect on NO
. This is qualitatively similar to the results presented in [
24] for NO
. Again, the size of the effect differs as they study a different transmission region.
The final pooled analysis we present is that of SO
, another local pollutant like NO
. The results can be found in
Table 6. Similar to CO
and the coefficient of wind generation is negative, significant, and robust across different specifications with marginal effect varying between −1.03 and 1.7, which again is very consistent with the current literature [
10]. Again, squared and cubed coefficients are negligible in size. For the same reasons as CO
, the preferred model is (4). For intermittency, for most of the models the effect is positive and significant. The marginal effect of intermittency from model (4) is calculated as 3.07.
Table 7,
Table 8 and
Table 9 extend the analysis into sub-samples for CO
, NO
, and SO
, respectively. We see a general pattern that is common to all three pollutants here: wind generation has a negative significant effect on all pollutants at all generation levels. While this effect is very stable for CO
, we see more of a variation for NO
and SO
at different generation levels. As far as wind intermittency goes, the results significantly differ at various levels of wind generation, and they all together differ from the pooled analysis for each of the pollutants. This implies that the wind generation level matters in net benefit calculations and the degree to which the results can be extrapolated is rather limited when we take intermittency into consideration.
Marginal effects of wind and its intermittency on all pollutants both for pooled and group level results together with their respective standard errors are presented in
Table 10. This table presents a summary of all the findings. The overarching results from these exercises are that (i) wind generation has unequivocal negative effect on all pollution and the effect is relatively stable; (ii) intermittency is an important factor that needs to be taken into consideration as it can matter for certain pollutants for certain generation intervals; and (iii) extrapolating results derived using data within limited ranges of wind output can be misleading when conducting cost–benefit calculations.
4. Conclusions
The electricity systems in the United States has seen a dramatic change toward clean but intermittent energy technologies over the recent years. While economic theory points to Pigovian taxes on carbon emitters as the efficient solution, currently the prevailing policy instruments in the US are mandates and tax-payer-funded subsidies for non-emitters. This brings about the question of how much emission can be attributable to renewable energy sources, where accurate calculations are key to the evaluation of the existing policies as well as informing future decision making.
Large-scale integration of these intermittent energy sources not only brings about technical challenges and potential cost increases but also raises concerns about the actual environmental efficacy of these sources. In this paper, we investigate the extent to which the intermittency in wind effects emissions from the electricity sector in ERCOT—a large grid system in the US with a high degree of wind penetration. For this, we use hourly and sub-hourly data from ERCOT and other sources recorded over the course of 13 years during which installed wind capacity increased by about seven folds. This allows us to observe the grid over a much wider range of wind generation interval than the existing studies.
We find that while wind generation has a clear negative effect on all emissions we studied, the effect of intermittency shows a large and irregular variation across different wind output levels. This implies that running such analyses with the pooled data can be misleading by masking this heterogeneity and can lead to inaccuracies in the emission savings estimates. Therefore, future studies estimating the environmental benefits of renewable energy need to do so at a higher granularity of wind output levels.
Additionally, another notable observation throughout the estimation process is that model selection can be increasingly complex especially with a growing range of wind capacity. For this reason, although it is outside the scope of this paper, future work should certainly include state-of-the-art machine learning methods to estimate emissions savings with an increased level of granularity in the wind generation intervals. This seems to be the current frontier of many economic analyses as the datasets and the variety of variables observed grow conjointly with computational power and memory.
Finally, while the COVID-19 pandemic was not a prominent factor in energy use within the time period of our study, future work should expand this analysis to further study the impacts of the additional uncertainty that the pandemic has introduced into the system.