Next Article in Journal
Two-Dimensional Shape Analysis of Complex Geometry Based on Photogrammetric Models of Iconostases
Previous Article in Journal
Usability Textual Data Analysis: A Formulaic Coding Think-Aloud Protocol Method for Usability Evaluation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Latent-Factor System Model for Real-Time Electricity Prices in Texas

1
Department of Economics, Hong Kong Baptist University, Hong Kong, China
2
Department of Information, Risk and Operations Management, McCombs School of Business, University of Texas in Austin, Austin, TX 78712, USA
3
Department of Economics, University of Texas in Austin, Austin, TX 78712, USA
*
Author to whom correspondence should be addressed.
Appl. Sci. 2021, 11(15), 7039; https://doi.org/10.3390/app11157039
Submission received: 18 June 2021 / Revised: 26 July 2021 / Accepted: 27 July 2021 / Published: 30 July 2021
(This article belongs to the Section Energy Science and Technology)

Abstract

:
A novel methodology to model electricity prices and latent causes as endogenous, multivariate time-series is developed and is applied to the Texas energy market. In addition to exogenous factors like the type of renewable energy and system load, observed prices are also influenced by some combination of latent causes. For instance, prices may be affected by power outages, erroneous short-term weather forecasts, unanticipated transmission bottlenecks, etc. Before disappearing, these hidden, unobserved factors are usually present for a contiguous period of time, thereby affecting prices. Using our system-wide latent factor model, we find that: (a) latent causes have a highly significant impact on prices in Texas; (b) the estimated latent factor series strongly and positively correlates to system-wide prices during peak and off-peak hours; (c) the merit-order effect of wind significantly dampens prices, regardless of region and time of day; and (d) the nuclear baseload generation also significantly lowers prices during a 24-h period in the entire system.
JEL Codes:
Q02; Q04; Q41; Q42

1. Introduction

Information about energy prices is known in the day-ahead market, but actual real-time prices will deviate from the day-ahead prices for many “hidden” reasons; see [1]. For example, an error in load forecasts, wind forecasts, solar output forecasts, or the outage of a power plant or transmission line, and many other unforeseen events will cause real-time prices to deviate from day-ahead prices. These latent factors are difficult to measure and adjust in real-time, and yet their impact on prices can be significant. This reveals itself in the fact that the new real-time price is set at most every five minutes.
A typical approach to explaining real-time prices is to start with the ex-post day-ahead price, and model deviations of the real-time price from the day-ahead price as a function of forecasting errors. While this approach may be helpful, it fails to consider the myriad of unobserved latent factors. Also, system-wide hidden factors are difficult to forecast in single-equation models that are used to explain real-time prices.
One of the two main aims of this paper is to present a novel methodology that uses unobserved latent factors and exogenous variables to explain energy prices in Texas by modeling these prices as endogenous, multivariate time-series. This system-wide approach then leads to estimating the attendant merit-order effects of baseload generation (nuclear energy) as well as renewable energy generation (wind and solar).
The hourly real-time market (RTM) energy price used in our analysis originates from the 5-min real-time energy prices based on the real-time operation of the Electric Reliability Council of Texas (ERCOT). ERCOT uses a security-constrained economic dispatch model (SCED) to simultaneously manage energy, system power balance, and network congestion, yielding 5-min locational marginal prices (LMPs) for each electrical bus within the market. The SCED process seeks to minimize offer-based costs, subject to power balance and network constraints. The zonal settlement price for a load-serving entity’s real-time energy purchase is a load-weighted average of all 5-min LMPs in a load zone, which is converted to 15-min values or hourly values by ERCOT.
Economic merit order effects attributable to renewable energy generation have been analyzed for many of the world’s competitive wholesale markets using linear regression models. These include studies of the market in Spain [2], Germany [3,4,5,6], Denmark [7,8], Italy [9], Australia [10], Ireland [11], the U.S. mid-continent or MISO [12,13], Texas [14,15,16], PJM [17], the Pacific Northwest [18,19], and California [20]. Also, quantile regression approaches have been employed to study merit-order effects in Turkey [21] and Germany [22].
Methodological Contribution. We explore this topic using hourly real-time price data for the years 2015–2018 from the ERCOT market. Divided into eight zones—North, Houston, South, West, Austin Energy, CPS Energy, Lower Colorado River Authority, and Rayburn Electric Cooperative—ERCOT serves the electrical needs of the largest electricity-consuming state in the U.S.; it accounts for about 8% of the nation’s total electricity generation, and is repeatedly cited as North America’s most successful attempt to introduce competition in both generation and retail segments of the power industry (Distributed Energy Financial Group, 2015). In the interests of brevity, we report the findings for Houston, Austin, and West regions, since the results from the other regions are similar.
To the best of our knowledge, this study is the first attempt at developing a latent-factor system-wide model for estimating the merit-order effects of baseload and renewable energy generation. While we use the Texas energy market to exemplify the methodology, the models developed here are readily applicable to other markets as well. Moreover, while we focus on real-time prices, the methodology readily lends itself to the study of day-ahead prices as well.
Section 2 describes the data and variables used in the study. The system-wide latent factor model for prices is detailed in Section 3. Section 4 provides the results, followed by a discussion and conclusion in Section 5.

2. Data and Variables

This section describes the data used in the analytic models, including the geographical scope and sample period.

2.1. Geographical Scope

The current ERCOT market with its eight zones is the focus of the paper; see Figure 1 for a map of ERCOT. The North and Houston zones account for about 37% and 27%, respectively, of ERCOT market energy sales, while the South and West zones contribute 12% and 9%. Further, these four zones account for nearly all of the state’s retail competition, and most of the competitive generation resides within these zones.

2.2. Sample Period and Variables

The sample period starts on 1 January 2015 and ends on 31 December 2018. Thus, we have a very large dataset since all the eight price series will appear together as endogenous variables in the multivariate response matrix.
As noted earlier, we discuss at length the results for the following three zones: Houston, Austin, and the West. Additionally, we examine the merit-order effects stemming from three hours in a 24-h cycle: 4:00 a.m., 12:00 p.m. and, 4:00 p.m. The first is off-peak and the other two correspond to peak hours. There is nothing special about the specific hours we chose to work with; a similar analysis with other hours yields the same overall conclusions reported here.
Table 1 provides the summary statistics for the prices ($/MWH) for the three hours and the three zones, respectively. The corresponding time-series plots of the nine series, shown in red, appear in Figure 2. In the analysis, however, we work with the natural log of the price data.
One of the insights we hope to gain is to see how the one-hour-ahead in-sample estimates of the latent factor time series track the price plots in Figure 2. If we can show that there is a strong correlation between the estimated latent factor series and the price series, then that bodes well for the estimation of merit-order effects from alternative energy and baseload generation. On the other hand, if there is a very weak relationship between latent factors and the price series, then exogenous factors should suffice in understanding the fluctuations in the price data.
A brief discussion of each of the independent variables now follows. These variables were selected based on careful data exploration via summary plots/correlation tables, practical considerations of data size, modeling aims, and computational complexities. Additionally, price formation in the ERCOT market has been analyzed in a variety of antecedent studies using many of the same data sources and variables employed in this study [14,15,16].
The exogenous variables used in this study are split into those that appear in the observation and latent factor equations, respectively; these equations are detailed in the next section.
Observation Equation Exogenous Variables. Wind generation, nuclear generation, solar generation, the Henry Hub gas price, and a dummy variable for spikes in prices that exceed $500 MWH are the exogenous variables. ERCOT analysts have noted that industrial customers tend to significantly scale back when prices exceed USD 500. So, a binary dummy variable for extreme price spikes is used. The solar generation variable and the dummy variable do not appear in the 4:00 a.m. equations. We downloaded daily natural gas prices for Henry Hub from the DOE/EIA (See: http://www.eia.gov/dnav/ng/hist/rngwhhdd.htm. Last accessed 18 July 2020). We use the Henry Hub price instead of the local natural gas price (e.g., Houston Ship Channel) since the Henry Hub price is highly correlated with the local natural gas price (r > 0.95). Finally, the latent factor variable, which is estimated from within the system endogenously, appears as an exogenous variable in the observation equation. All variables are on the natural log scale except, of course, the dummy variable.
Latent Factor Equation Exogenous Variables. Recall that the latent factors are unobserved variables; there is no data for them. The parameters corresponding to these variables are recursively estimated from within the system at each point in time, which leads to the following intuition: if one could observe these latent causes, then they are most likely going to be related to load and prices. For instance, power outages, erroneous short-term weather forecasts, unanticipated transmission bottlenecks, etc., would most certainly impact demand and price distributions across ERCOT. Therefore, we use system-wide load (MWH) and lagged weighted price ($/MWH) across all eight zones as the exogenous factors that could likely associate with the unobserved factor variables. Additionally, a first-order autoregressive process for the latent factor is used. This allows us to capture the potential lingering effects of hidden variables over time. As described in the next section, while we could use higher-order lags, we do not do so in the interests of parsimony. Also, the lagged weighted prices do capture some of the previous time period’s effect on the latent factor. Note that the system-wide load is, in one sense, endogenous to the observation equation via the latent factor. Finally, we work with the natural logs of all these variables.

3. The Latent Factor Systems Model

Following [23,24], suppose there are k endogenous variables. Let n f < k denote the number of unknown or hidden latent factors. Then, the system of equations that represent the prices in the k = 8 zones in ERCOT with n f = 1 is given by:
y t = λ f t + β x t + u t f t = δ z t + ρ f t 1 + w t ,
where the first equation is called the observation equation and the second is termed the latent factor equation. The dimensions of the various quantities in Equation (1) are: y t is a k × 1 vector of endogenous variables; λ is n f × n f ; f t is n f × n f ; β is a k × n x vector of parameters; x t is an n x × 1 vector of exogenous variables; u t is a k × 1 vector of random errors that are assumed to be normally distributed with mean zero and unknown standard deviation σ u ; δ is an n f × n z of parameters; z t is an n z × 1 vector of exogenous variables; ρ is an n f × n f matrix of parameters; and w t is an n f × 1   vector of random errors that is normally distributed with mean zero and unknown standard deviation σ w .
It is possible to introduce another Equation in (1) that represents an autoregressive structure for the observation error u t . However, this leads to a larger number of parameters than is dictated in most applications. Moreover, convergence issues abound when the parameter space and the sample size are large. As it is, the class of models contained in (1) is quite rich. By appropriately restricting n f , p and q, we can obtain Zellner’s Seemingly Unrelated Regression model, Vector Autoregressive models; Dynamic Factors with Errors models, etc; see, for example [25,26]. Williamson et al. [1] developed an alternative Bayesian latent factor model, using nonparametric methods, that complements the latent factor model in Equation (1).
We could also add higher-order latent factors ( n f > 1 ), but again we err on the side of parsimony. Indeed, we could also increase the dimension of the autoregressive component of the latent factor vector f t which we have set as an AR(1) process. But we refrain from doing this since we also include the lagged weighted price of all the zones as an exogenous variable in the vector z t ; i.e., we allow the weighted values of lagged prices from the eight zones to guide the hidden factors that could drive each zone’s price in the observation equation where these prices are endogenous in the system given in (1).
Thus, y t is the endogenous matrix of prices from the eight zones; x t contains the exogenous variables wind, nuclear and solar generation, where the last one appears only in the sunlight hours; the Henry Hub gas price; and a dummy variable for real-time prices exceeding USD 500, which will not appear in the night and early morning hours since prices do not rise to very high levels at these times. The endogenous factor variable ft also appears as an exogenous input in the observation equation. The implication is that these hidden factors could influence prices throughout the day. In the latent factor equation, the exogenous variables in z t use system-wide load (MWH) and lagged weighted price (USD/MWH) across all eight zones; these are contemporaneous in time. Additionally, we assume the latent factor follows a first-order autoregressive process. Since we separate our analysis for each hour of the day, the lagged variables are the variables of the previous day. Since ft enters the observation equation exogenously, the system-wide load affects system-wide prices via f t . Lastly, the AR(1) specification for f t in the second equation captures the lagged nature of hidden factors; for example, poor weather forecasts, which could be one of the latent factors, tend to be contiguous over time.
The maximum likelihood estimates (MLEs) for all the parameters (including δ and ρ ) are found via an iterative method that combines the two algorithms developed in [27,28]. All analyses were conducted in STATA.

4. Results

Here, we report and discuss the results for three regions: Houston, Austin, and West; details on all other regions are available on request. Where appropriate, we highlight the empirics from the other regions as well. For the three regions, we report the results for 4:00 a.m., 12:00 p.m., and 4:00 p.m.; these are representative of the other off-peak and peak hours. Thus, we estimate Equation (1) nine times since we have nine models in total. We have the following major results.
Wald Test. This test has a chi-square distribution. It tests the null hypothesis of whether or not all the unknown parameters in the observation and latent factor equations are jointly significant; this is similar to the F-test in multiple linear regression. For all nine models, the Wald Statistic soundly rejects the null hypothesis at any significance level (p < 0.00001).
Actual versus predicted price series. Consider Figure 3 which shows the actual and predicted series. As expected, there are some outliers in the data, especially during the 4:00 p.m. hour for all three zones. Also, again consider Figure 2. Note that the predicted time series, shown in grey, track the original price series in red quite well for the three different hours, barring the time points corresponding to the outliers.
Correlations between actual price series and estimated factor series f t . Table 2 shows the correlations between each of the price series from all eight zones for the three hours. They are all positively correlated to the predicted latent factors. We highlighted the correlations for the regions Houston, Austin, and West in Table 2 in order to emphasize two points. First, note that the West zone has the weakest correlation during the peak hours of 12:00 p.m. and 4:00 p.m., compared to other regions. This is because of the larger impact of wind generation in the West during these hours, compared to other zones. Second, consider Figure 4, Figure 5 and Figure 6. Each comprises four plots. For the sake of clarity, let us focus on just Figure 6 corresponding to the 4:00 p.m. hour. The top left plot is the latent factor one-step-ahead estimated time series. The other three plots in each of the panels are the actual price series for Houston, Austin, and West. The corresponding correlations between the latent factor series and these three price series from Table 2 are: 0.587, 0.591, and 0.457, respectively. It is evident that the latent factor series structurally evolves like the three price series, which are representative of the price series for the entire ERCOT system. The presence of outliers in the price series is unavoidable in the ERCOT data. This would explain why some of the correlations are not as high as one might expect. We experimented with higher-order lags in the autoregressive error structure for the latent factor series in Equation (1). But such an increase in model dimensionality does not change the overall conclusions by much. Hence, we err on the side of parsimony.
Significance of the latent factor coefficient. From Table 3, the endogenous latent factor variable, f t , when it appears as an exogenous variable in the observation equation is statistically significant for all the nine models (p < 0.00001). This result confirms one of the principal assertions in this paper, namely that there are hidden, unobserved factors that influence the distribution of real-time prices throughout a 24-h cycle across all zones. Damien et al. [29] do not use latent factors in their system-wide price and demand ERCOT model. It is evident from this research that latent factors play a significant role in ERCOT’s pricing structure.
The marginal effects of the exogenous variables. Consider Table 4, Table 5 and Table 6 which show the maximum likelihood estimates (MLEs) for coefficients that appear in the observation and latent factor equations in (1); the corresponding p-values; and the 95% confidence intervals for Hours 4:00 a.m., 12:00 p.m., and 4:00 p.m., respectively, for the three zones.
Since we are dealing with the natural logs of all the variables, the MLEs represent elasticities. We first describe some overarching conclusions from all three tables here, saving for later the discussion of the merit-order effects.
First, from the latent factor equations for all three hours and zones, system-wide load (SystemLoad) positively and significantly impacts the hidden factors. Second, lagged weighted price (LagWtPr) is not significant in the off-peak hour but is significant during the peak hours. Interestingly, it impacts the hidden factors negatively at the noon hour and positively at the 4:00 p.m. hour. Third, the lagged latent factor variable is positive and statistically significant at all three hours for all three zones in the latent factor equation. In conjunction with the plots shown in Figure 4, Figure 5 and Figure 6, this further confirms the importance of the latent factor dynamics on energy prices in all eight zones. Fourth, from the observation equation for the three zones, during all three hours, as expected, wind generation and nuclear generation have negative elasticities, and Henry Hub gas has positive elasticity. Fifth, solar generation is a mixed bag, largely because this resource is still growing in Texas, and as such its data are non-stationary. Thus, solar generation is not significant at 12:00 p.m. and its elasticities are positive and weak at 4:00 p.m. Finally, the impact of extreme spikes in real-time prices (the dummy variable) at 12:00 p.m. and 4:00 p.m. is highly significant in all three zones.
System-wide merit-order effects. To best understand the merit-order effects shown as elasticities in Table 4, Table 5 and Table 6, consider the price boxplots shown in Figure 7. The top, middle and bottom panels, corresponding to hours 4:00 a.m. 12:00 p.m., and 4:00 p.m., respectively, have three boxplots in each panel. On the X-axis, the box titled “Before Price” is the group of mean prices in the eight ERCOT zones before accounting for any merit-order effect. The second and third boxes are the change in mean prices after accounting for merit-order effects in wind and nuclear generation, respectively. The Y-axis represents the mean price values ($/MWH). Each value on this axis is the mean price from each of the eight zones during the years 2015–2018. Focus on the 4:00 a.m. panel at the top. The interquartile range (IQR) of the mean prices of the eight zones in ERCOT at this hour is $16.18 to $16.61; see the left-most box in blue. Next, assume wind generation increases by 10%. Using the MLE estimates of the price elasticities for wind generation for each of the eight zones from our latent-factor system model, we adjust the mean prices in the blue box and construct the resulting change in prices due to increased wind generation. The corresponding distribution of the adjusted mean prices in ERCOT is shown as the second box in red. From the caption, the IQR for the prices, after accounting for increased wind generation, is between $15.59 and $16.07. Finally, we do a similar adjustment to energy prices using the parameter estimates for nuclear generation; this is shown as the green box in the top row of Figure 7. The IQR is between $14.98 and $15.42. Observing the three panels, it is also interesting to note that there is less volatility in the mean prices in the entire ERCOT system during the off-peak hour.
Consider the middle panel which corresponds to the 12:00 p.m. hour. While the reduction in energy prices is less now, wind and nuclear generation still have a measurable impact on real-time prices in ERCOT as a whole. Also, there is more volatility in real-time prices during this peak hour.
Finally, the bottom panel shows the impact on prices due to the merit-order effects at 4:00 p.m. Nuclear generation is much more influential than wind at this hour of the day; its boxplot barely intersects with the boxplot from wind generation. Also, the volatility in ERCOT’s prices is lesser at 4:00 p.m. when compared to 12:00 p.m.

5. Conclusions

This paper demonstrated the relevance of latent factors on real-time energy prices using a system-wide approach. The ERCOT system served as the case study. Using energy prices from eight inter-connected zones as endogenous variables, we found that hidden factors significantly impact the merit-order effects of baseload and renewable energy generation.
The latent-factor approach developed here can be improved and extended. Damien et al. [29] use a hierarchical Bayesian approach to compare the impact of day-ahead and real-time prices on wholesale demand in ERCOT. However, they do not model latent factors. This paper clearly shows the importance of accounting for such factors. A Bayesian latent factor system-wide model for prices and/or demand is possible in principle; see [30]. However, the challenges are formidable. First, since the parameter space is very large, convergence issues will be a difficult problem to overcome. Concurrently, while studying energy prices or demand, the attendant datasets tend to be very large, as in this paper. This too will add to convergence issues since the likelihood function will have to be evaluated many-fold in any Markov chain Monte Carlo scheme that is required to obtain posterior distributions.
Another future topic for research that this paper proposes is to model the system of equations in Equation (1) via non-normal errors. For example, Williamson et al. [1] use a nonparametric error distribution—the Indian Buffet Process—to develop a new class of latent factor models. But with large datasets, such nonparametric approaches are even more computationally involved compared to parametric formulations.
Why should a non-normal error structure matter in the context of energy prices, and in the estimation of merit-order effects? Recent studies [21,22] have shown that prices have asymmetric distributions with large kurtosis. Subsequently, error distributions from normal linear models tend to be non-normal heteroscedastic and autocorrelated. Hence, quantile regressions have been proposed and exemplified in the energy literature. However, there is a trade-off. Because of the mathematics underlying them, quantile regressions are essentially single-equation models. Thus, the prices of each of ERCOT’s eight zones can be modeled separately using quantile regressions; see [31]. But the results in this paper clearly demonstrate the importance of treating the eight zones as part of an interconnected system so that we can better understand how latent factors influence prices jointly. This leads to an open question: how should one construct a system-wide, latent-factor quantile regression model that is equivalent to Equation (1) in this paper? This is a very challenging problem for multiple reasons. For example, consider a bivariate time-series that represent prices from, say two of ERCOT’s eight zones. Further, suppose the error term in the observation model in Equation (1) follows a bivariate skew-t distribution since this distribution allows for varying degrees of skewness. How should one jointly model the quantiles of this bivariate distribution as functions of latent factors and exogenous variables? The answer is not at all evident even in this simple bivariate setup. Therefore, instead of multivariate quantile regression systems, we believe, as a first step, it may be easier to recast Equation (1) using nonparametric prior distributions. Indeed, this could also lead to stronger correlations between the factor and price series since nonparametric priors can better treat outliers. The resulting estimation of the merit-order effects in energy markets would be a useful advancement.

Author Contributions

Conceptualization, P.D. and J.Z.; methodology, P.D.; software, K.H.C.; validation, K.H.C., P.D. and J.Z.; formal analysis, K.H.C. and P.D.; investigation, P.D. and J.Z.; resources, J.Z.; data curation, K.H.C. and J.Z.; writing—original draft preparation, K.H.C., P.D. and J.Z.; writing—review and editing, K.H.C., P.D. and J.Z.; visualization, K.H.C.; supervision, P.D. and J.Z.; project administration, K.H.C. and P.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not Applicable.

Informed Consent Statement

Not Applicable.

Data Availability Statement

Data presented in this study are available from the third author upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Williamson, S.; Zhang, M.; Damien, P. A new class of time-dependent latent factor models with applications. J. Mach. Learn. Res. 2020, 21, 1–24. [Google Scholar]
  2. Gelabert, L.; Labandeira, X.; Linares, P. An ex-post analysis of the effect of renewable and cogeneration on Spanish electricity prices. Energy Econ. 2011, 22, 559–565. [Google Scholar] [CrossRef]
  3. Sensfuß, F.; Ragwitz, M.; Genoese, M. The merit-order effect: A detailed analysis of the price effect of renewable electricity generation on spot market prices in Germany. Energy Policy 2008, 36, 3086–3094. [Google Scholar] [CrossRef] [Green Version]
  4. Ketterer, J.C. The impact of wind power generation on the electricity price in Germany. Energy Econ. 2014, 44, 270–280. [Google Scholar] [CrossRef] [Green Version]
  5. Cludius, J.; Hermann, H.; Matthes, F.C.; Graichen, V. The merit order effect of wind and photovoltaic electricity generation in Germany 2008–2016: Estimation and distributional implications. Energy Econ. 2014, 44, 302–313. [Google Scholar] [CrossRef]
  6. Paraschiv, F.; Erni, D.; Pietsch, R. The impact of renewable energies on EEX day-ahead electricity prices. Energy Policy 2014, 73, 196–210. [Google Scholar] [CrossRef] [Green Version]
  7. Munksgaard, J.; Morthorst, P.E. Wind power in the Danish liberalized power market–Policy measures, price impact, and investor incentives. Energy Policy 2008, 36, 3940–3947. [Google Scholar] [CrossRef]
  8. Jacobsen, H.K.; Zvingilaite, E. Reducing the market impact of large shares of intermittent energy in Denmark. Energy Policy 2010, 38, 3403–3413. [Google Scholar] [CrossRef]
  9. Clo, S.; Cataldi, A.; Zoppoli, P. The merit-order effect in the Italian power market: The impact of sola and wind generation on national wholesale electricity prices. Energy Policy 2015, 77, 79–88. [Google Scholar] [CrossRef]
  10. Cutler, N.J.; MacGill, I.F.; Outhred, H.R.; Boerema, N.D. High penetration wind generation impacts on spot prices in the Australian national electricity market. Energy Policy 2011, 39, 5939–5949. [Google Scholar] [CrossRef]
  11. Denny, E.; O’Mahoney, A.; Lannoye, E. Modelling the impact of wind generation on electricity market prices in Ireland: An econometric versus unit commitment approach. Renew. Energy 2017, 104, 109–119. [Google Scholar] [CrossRef]
  12. Quint, D.; Dahlke, S. The impact of wind generation on wholesale electricity market prices in the midcontinent independent system operator energy market: An empirical investigation. Energy 2007, 169, 456–466. [Google Scholar] [CrossRef]
  13. Zarnikau, J.; Tsai, C.H.; Woo, C.K. Determinants of the wholesale prices of energy and ancillary services in the US Midcontinent electricity market. Energy 2020. [Google Scholar] [CrossRef]
  14. Zarnikau, J.; Woo, C.K.; Zhu, S.S. Zonal merit-order effects of wind generation development on day-ahead and real-time electricity market prices in Texas. J. Energy Mark. 2016, 9, 17–47. [Google Scholar] [CrossRef]
  15. Zarnikau, J.; Woo, C.K.; Zhu, S.S.; Baldick, R.; Tsai, C.H.; Meng, J. Electricity market prices for day-ahead ancillary services and energy: Texas. J. Energy Mark. 2018, 12, 1–32. [Google Scholar] [CrossRef]
  16. Zarnikau, J.; Woo, C.K.; Zhu, S.S.; Tsai, C.H. Market price behavior of wholesale electricity products: Texas. Energy Policy 2019, 125, 418–428. [Google Scholar] [CrossRef]
  17. Gil, H.A.; Lin, J. Wind power and electricity prices at the PJM market. IEEE Trans. Power Syst. 2013, 28, 3945–3953. [Google Scholar] [CrossRef]
  18. Woo, C.K.; Zarnikau, J.; Kadish, J.; Horowitz, I.; Wang, J.; Olson, A. The impact of wind generation on wholesale electricity prices in the hydro-rich Pacific Northwest. IEEE Trans. Power Syst. 2013, 28, 4245–4253. [Google Scholar] [CrossRef]
  19. Woo, C.K.; Moore, J.; Schneiderman, B.; Olson, A.; Jones, R.; Ho, T.; Toyama, N.; Wang, J.; Zarnikau, J. Merit-order effects of day-ahead wind generation forecast in the hydro-rich Pacific Northwest. Electr. J. 2015, 28, 52–62. [Google Scholar] [CrossRef]
  20. Woo, C.K.; Moore, J.; Schneiderman, B.; Olson, A.; Jones, R.; Ho, T.; Toyama, N.; Zarnikau, J. Merit-order effects of renewable energy and price divergence in California’s day-ahead and real-time electricity markets. Energy Policy 2016, 92, 299–312. [Google Scholar] [CrossRef]
  21. Sirin, S.M.; Yilmaz, B.N. Variable renewable energy technologies in the Turkish electricity market: Quantile regression analysis of the merit-order effect. Energy Policy 2020, 144, 111660. [Google Scholar] [CrossRef]
  22. Maciejowska, K. Assessing the impact of renewable energy sources on the electricity price level and variability–A quantile regression approach. Energy Policy 2020, 85, 104532. [Google Scholar] [CrossRef]
  23. Geweke, J. The dynamic factor analysis of economic time series models. In Latent Variables in Socio-Economic Models; Aigner, D.J., Goldbergered, A.S., Eds.; North–Holland: Amsterdam, The Netherlands, 1977; pp. 365–383. [Google Scholar]
  24. Watson, M.W.; Engle, R.F. Alternative algorithms for the estimation of dynamic factor, MIMIC and varying coefficient regression models. J. Econ. 1983, 23, 385–400. [Google Scholar] [CrossRef] [Green Version]
  25. Bernanke, B.S.; Jean, B.; Pitr, E. Measuring the effects of monetary policy: A Factor-Augmented Vector Autoregressive (FAVAR) approach. Q. J. Econ. 2008, 120, 387–422. [Google Scholar]
  26. Zagaglia, P. Macroeconomic factors and oil futures prices: A data-rich model. Energy Econ. 2010, 32, 409–417. [Google Scholar] [CrossRef] [Green Version]
  27. De Jong, P. The likelihood for a state-space model. Biometrika 1988, 75, 165–169. [Google Scholar] [CrossRef]
  28. De Jong, P. The diffuse Kalman filter. Ann. Stat. 1991, 19, 1073–1083. [Google Scholar] [CrossRef]
  29. Damien, P.; Fuentes-García, R.; Mena, R.H.; Zarnikau, J. Impacts of day-ahead and real-time market prices on wholesale electricity demand in Texas. Energy Econ. 2019, 81, 259–272. [Google Scholar] [CrossRef]
  30. Petris, G.; Petrone, S.; Campanogli, P. Dynamic Linear Models with R; Springer: New York, NY, USA, 2009. [Google Scholar]
  31. Ekin, T.; Damien, P.; Zarnikau, J. Estimating marginal effects of key factors that influence wholesale electricity demand and price distributions in Texas via quantile variable selection methods. J. Energy Mark. 2020, 13, 1–30. [Google Scholar] [CrossRef]
Figure 1. The eight ERCOT zones (Source: www.ercot.com, accessed on 18 July 2020).
Figure 1. The eight ERCOT zones (Source: www.ercot.com, accessed on 18 July 2020).
Applsci 11 07039 g001
Figure 2. Time-series plots of the actual prices (solid red line) and model predicted prices (dash grey line) in Houston, Austin, and the West for the hours 4:00 a.m., 12:00 p.m. and 4:00 p.m.
Figure 2. Time-series plots of the actual prices (solid red line) and model predicted prices (dash grey line) in Houston, Austin, and the West for the hours 4:00 a.m., 12:00 p.m. and 4:00 p.m.
Applsci 11 07039 g002
Figure 3. Scatter plots of actual and predicted prices, along with 45-degree lines, in Houston, Austin, and the West for the hours 4:00 a.m., 12:00 p.m., and 4:00 p.m.
Figure 3. Scatter plots of actual and predicted prices, along with 45-degree lines, in Houston, Austin, and the West for the hours 4:00 a.m., 12:00 p.m., and 4:00 p.m.
Applsci 11 07039 g003
Figure 4. 4:00 a.m.—Latent Factor and Houston series (top left and right); Austin and West (Bottom left and right).
Figure 4. 4:00 a.m.—Latent Factor and Houston series (top left and right); Austin and West (Bottom left and right).
Applsci 11 07039 g004
Figure 5. 12:00 p.m.—Latent Factor and Houston price series (top left and right); Austin and West (Bottom left and right).
Figure 5. 12:00 p.m.—Latent Factor and Houston price series (top left and right); Austin and West (Bottom left and right).
Applsci 11 07039 g005
Figure 6. 4:00 p.m.—Latent Factor and Houston price series (top left and right); Austin and West (Bottom left and right).
Figure 6. 4:00 p.m.—Latent Factor and Houston price series (top left and right); Austin and West (Bottom left and right).
Applsci 11 07039 g006
Figure 7. ERCOT merit-order effects for wind and nuclear generation.
Figure 7. ERCOT merit-order effects for wind and nuclear generation.
Applsci 11 07039 g007
Table 1. Summary statistics of the hourly prices ($/MWH) for Houston, Austin, and the West.
Table 1. Summary statistics of the hourly prices ($/MWH) for Houston, Austin, and the West.
MeanS.D.MinMax
Houston
4:00 a.m.16.405.83−14.8375.66
12:00 p.m.31.1347.959.591110.26
4:00 p.m.47.5499.100.241348.34
Austin
4:00 a.m.16.146.31−14.8997.65
12:00 p.m.27.0018.186.72338.09
4:00 p.m.42.3885.960.111348.11
West
4:00 a.m.17.7914.82−15.56132.71
12:00 p.m.27.5523.43−16.88408.91
4:00 p.m.43.2388.60−15.671384.76
Table 2. Correlations between the actual price series and estimated factor series for all zones.
Table 2. Correlations between the actual price series and estimated factor series for all zones.
AustinHoustonLCRANorthRAYBCPSSouthWest
4:00 a.m.0.3430.3230.3430.3390.3180.3440.3380.346
12:00 p.m.0.5320.4730.5310.5410.5170.5040.4280.365
4:00 p.m.0.5910.5870.5940.6180.5930.5590.5330.457
Note: Certain values are bold in order to better understand the Figure 6 discussion.
Table 3. Coefficients for the Latent Factor f t .
Table 3. Coefficients for the Latent Factor f t .
AustinHoustonLCRANorthRAYBCPSSouthWest
4:00 a.m.0.5710.5320.5810.5330.5100.5500.5410.621
12:00 p.m.0.2660.2840.2730.2230.2040.2720.2680.292
4:00 p.m.0.4330.4300.4410.4270.4250.4230.4120.461
Note: All coefficients have p-values < 0.00001. The latent factor is a vector quantity; hence it appears in bold font to be consistent with the notation in Section 3.
Table 4. ML coefficients for the 4:00 a.m. hour.
Table 4. ML coefficients for the 4:00 a.m. hour.
Coefficientp-Value95% Confidence Intervals
Latent Factor Equation
f t 1 0.16810.00300.05790.2783
SystemLoad1.60920.000011.35731.8610
Lag(WtPr)−0.12480.1660−0.30150.0518
Houston Equation
f t 0.53310.000010.51170.5545
Wind−0.35700.00001−0.4027−0.3112
Nuclear−0.70340.00001−0.9073−0.4995
Henry Hub1.15300.000010.93281.3733
Austin Equation
f t 0.57090.000010.55020.5917
Wind−0.36740.00001−0.4143−0.3204
Nuclear−0.77800.00001−0.9948−0.5612
Henry Hub1.12120.000010.89191.3505
West Equation
f t 0.62070.000010.58310.6583
Wind−0.55170.00001−0.6190−0.4845
Nuclear−0.71530.00001−0.9645−0.4662
Henry Hub1.13170.000010.82881.4345
Table 5. ML coefficients for the 12:00 p.m. sample.
Table 5. ML coefficients for the 12:00 p.m. sample.
Coefficientp-Value95% Confidence Intervals
Latent Factor Equation
f t 1 0.22400.000010.14120.3067
SystemLoad2.09970.000011.86462.3348
Lag(WtPr)−0.25420.0430−0.4999−0.0085
Houston Equation
f t 0.28450.000010.27130.2977
Wind−0.09430.00001−0.1151−0.0736
Nuclear−0.55320.00001−0.6576−0.4487
Solar0.02430.0630−0.00130.0500
Henry Hub0.61540.000010.47860.7522
Dummy2.40060.000012.10302.6982
Austin Equation
f t 0.26610.000010.25590.2763
Wind−0.14320.00001−0.1603−0.1261
Nuclear−0.44450.00001−0.5392−0.3497
Solar0.00900.4140−0.01260.0307
Henry Hub0.64540.000010.52720.7636
Dummy1.28410.000011.04441.5237
West Equation
f t 0.29160.000010.26840.3148
Wind−0.28260.00001−0.3155−0.2497
Nuclear−0.39450.00001−0.5201−0.2689
Solar0.00930.6340−0.02920.0478
Henry Hub0.59490.000010.40130.7885
Dummy0.97670.000010.48421.4692
Table 6. ML coefficients for the 4:00 p.m. sample.
Table 6. ML coefficients for the 4:00 p.m. sample.
Coefficientp-Value95% Confidence Intervals
Latent Factor Equation
f t 1 0.13260.00100.05710.2080
SystemLoad1.85130.000011.63652.0661
Lag(WtPr)0.22430.00100.08990.3587
Houston Equation
f t 0.43020.000010.41170.4487
Wind−0.17610.00001−0.2085−0.1438
Nuclear−0.72920.00001−0.8540−0.6043
Solar0.07030.000010.04130.0993
Henry Hub0.36080.000010.20060.5211
Dummy2.02120.000011.84192.2004
Austin Equation
f t 0.43340.000010.41690.4499
Wind−0.22220.00001−0.2520−0.1924
Nuclear−0.68290.00001−0.8060−0.5599
Solar0.04820.000010.02110.0752
Henry Hub0.34300.000010.19330.4927
Dummy1.85560.000011.69162.0195
West Equation
f t 0.46150.000010.43070.4922
Wind−0.43260.00001−0.4827−0.3825
Nuclear−0.58350.00001−0.7351−0.4319
Solar0.04730.03200.00400.0906
Henry Hub0.52720.000010.28930.7651
Dummy1.24480.000010.96291.5268
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Cao, K.H.; Damien, P.; Zarnikau, J. A Latent-Factor System Model for Real-Time Electricity Prices in Texas. Appl. Sci. 2021, 11, 7039. https://doi.org/10.3390/app11157039

AMA Style

Cao KH, Damien P, Zarnikau J. A Latent-Factor System Model for Real-Time Electricity Prices in Texas. Applied Sciences. 2021; 11(15):7039. https://doi.org/10.3390/app11157039

Chicago/Turabian Style

Cao, Kang Hua, Paul Damien, and Jay Zarnikau. 2021. "A Latent-Factor System Model for Real-Time Electricity Prices in Texas" Applied Sciences 11, no. 15: 7039. https://doi.org/10.3390/app11157039

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop