A Stackelberg Game Approach for Price Response Coordination of Thermostatically Controlled Loads

Wang, Peng; Zou, Suli; Wang, Xiaojuan; Ma, Zhongjing

doi:10.3390/app8081370

Open AccessArticle

A Stackelberg Game Approach for Price Response Coordination of Thermostatically Controlled Loads

by

Peng Wang

¹

,

Suli Zou

^2,*,

Xiaojuan Wang

³ and

Zhongjing Ma

^1,*

¹

School of Automation, Beijing Institute of Technology (BIT), Beijing 100081, China

²

Automatic Control Laboratory, Department of Information Technology and Electrical Engineering, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland

³

China Electronics Standardization Institute (CESI), Beijing 100007, China

^*

Authors to whom correspondence should be addressed.

Appl. Sci. 2018, 8(8), 1370; https://doi.org/10.3390/app8081370

Submission received: 27 May 2018 / Revised: 26 July 2018 / Accepted: 1 August 2018 / Published: 15 August 2018

(This article belongs to the Special Issue Smart Home and Energy Management Systems)

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, we study the demand response of the thermostatically controlled loads (TCLs) to control their set-point temperatures by considering the tradeoff between the electricity payment and TCL user’s comfort preference. Based upon the dynamics of the TCLs, we set up the relationship between the set-point temperature and the energy demand. Then, we define a discomfort function with respect to the associated energy demand which represents the discomfort level of the set-point temperature. More specifically, the system is equipped with a coordinator named electric energy control center (EECC) which can buy energy resources from the electricity market and sell them to TCL users. Due to the interaction between EECC and TCL users, we formulate the specific energy trading process as a one-leader multiple-follower Stackelberg game. As the main contributions of this work, we show the existence and uniqueness of the equilibrium for the underlying Stackelberg games, and develop a DR algorithm based on the so-called Backward Induction to achieve the equilibrium. Several numerical simulations are presented to verify the developed results in this work.

Keywords:

thermostatically controlled loads; Stackelberg game; set-point temperature; price response; energy management

1. Introduction

Demand response (DR) can be defined as a program, which induces the end-users to adjust their energy usage in response to changes in the electricity price over time [1,2]. Rapid growth of energy demand has greatly increased the supply burden of the power system. In addition, reliable operation of the system necessitates a perfect balance between supply and demand in real time, which is not easy to achieve because both of them can change rapidly and unexpectedly. Based on the advanced information technologies, DR has been considered as a promising way to resolve these emerging challenges and achieve potential cost saving [1,3,4].

Thermostatically controlled loads (TCLs), as a large fraction of the flexible demand in power grid, offer significant potential for DR [5,6]. They use local hysteresis control to maintain the internal temperature within a dead-band around the set-point temperature. Real-time pricing (RTP) is one of the most important DR programs, where the price rates vary continuously to reflect wholesale market demand changes. Because of the high efficiency gains from a long-term perspective [7], many works have applied the RTP program to manage the flexible electric demand in power grid, e.g., [8,9,10,11]. In this paper, we also specify a RTP based DR program to coordinate the set-point temperature of TCLs to accomplish some objectives.

In order to coordinate TCLs, model formulation of TCLs should be illustrated. Based upon the dynamics of TCL [12,13], two different aggregated TCL models were proposed to mitigate the imbalance of the power gird, say homogeneous model [14] and heterogeneous one [15]. In [5,16], modeling and control of the aggregated TCLs were studied aiming at different goals. However, the preference of each TCL user is not reflected in these works, which is an important indicator to describe the comfort level of TCL users. As stated in [17,18,19,20], the discomfort functions can be defined to reflect the discomfort level w.r.t the energy demand of TCL users. In this paper, we propose a discomfort function with respect to the dynamics of each TCL user.

We study the coordination of TCLs in a typical office or residential building. An electric energy control center (EECC), as a coordinator, is equipped to play the role of buying energy from the wholesale market and selling it to TCL users. Then, an energy trading process occurs between EECC and the TCL users, such that, EECC determines a selling price to maximize the utility benefits and each TCL user adjusts its set-point temperature to maximize their own profits with respect to the selling price from EECC. Considering the dynamics of TCLs, we build a relationship between the set-point temperature of TCLs and the energy demand to reach this temperature. Based upon the above relationship, the energy trading process between EECC and TCL users can be formulated w.r.t the energy demand of TCLs. Moreover, since the decisions between EECC and the TCL users are interacted, we apply a Stackelberg game which is an effective method in power systems [11,21]. Specifically, in this paper, a one-leader N-follower Stackelberg game is established such that EECC serves as a leader and the TCL users are the N followers. We show that the Stackelberg equilibrium exists and is unique, which can be achieved by a backward induction method [22].

Above all, the main contributions of this work can be summarized as below:

We study the demand response of the TCLs to control their set-point temperatures by considering the tradeoff between the electricity payment and TCL user’s comfort preference;
According to the dynamics of TCLs, we set up the relationship between the energy demand and set-point temperature. Besides, we formulate the dissatisfaction function to represent the discomfort level of the set-point temperature;
Based upon the interaction between EECC and TCL users, we formulate the specific energy trading process as a one-leader N-follower Stackelberg game;
We show the existence and uniqueness of the equilibrium for the underlying Stackelberg games, and develop a DR algorithm based on the Backward Induction method to achieve the equilibrium.

The reminder of the paper is organized as follows. In Section 2, we specify the relationship between the energy demand and the set-point temperature and formulate the energy trading process as a DR problem under the RTP scheme. In Section 3, a one-leader N-follower Stackelberg game is established and the existence and uniqueness of the Stackelberg equilibrium is observed. Section 4 presents numerical simulations for the proposed method. In Section 5, we provide a conclusion for the developed work.

The key variables and parameters used in this paper are listed in Table 1.

2. Problem Formulation

In this paper, we consider a typical office or residential building equipped with a coordinator called EECC, whose role is to collect energy resources from the electricity market and allocate them to a group of TCL users

N \equiv {1, 2, \dots, N}

. The buying price of EECC is the market price, denoted by

P

, and the selling price is determined by itself, denoted by p. Each TCL user i (

i \in N

) chooses its set-point temperature, denoted by

{\hat{θ}}_{i}

, based on the broadcasted price p from EECC. Then, EECC will provide the energy demand

u_{i}

to TCL i to reach the set-point temperature

{\hat{θ}}_{i}

. The above energy trading process is shown in Figure 1.

In this paper, we suppose that each TCL user is a price-taker and its decision will not affect the market price

P

. This is a universal assumption when the market involves a large population of users [23,24]. Denote the time horizon by

T

with

T \equiv [t_{k}, t_{k + 1}]

, where

t_{k}

is the start time of this horizon, and

T \equiv t_{k + 1} - t_{k}

is the length of the time horizon.

Section 2.1 provides the model of the TCL dynamics, based on which the relationship between the set-point temperature of a TCL and its energy demand is established in Section 2.2. Then, in Section 2.3, the energy trading process is introduced together with the preferences of TCL users and EECC.

2.1. TCL Dynamics

As stated in [14,16,25], for each TCL

i \in N

at any time

t \in T

, the evolution of the temperature can be expressed as a first-order differential equation, such that,

\frac{d θ_{i} (t)}{d t} = - \frac{1}{C_{i} R_{i}} (P_{i} R_{i} W_{i} (t) + θ_{i} (t) - θ_{a, i} (t)),

(1)

where the notations are specified as below:

$θ_{i} (t)$ and $θ_{a, i} (t)$ represent respectively the internal temperature ( $^{\circ} C$ ) and the ambient temperature ( $^{\circ} C$ ) of TCL i at time t.
$R_{i}$ , $C_{i}$ and $P_{i}$ are thermal parameters which express the thermal resistance (kWh $/^{\circ} C$ ), thermal capacitance ( $^{\circ} C$ /kW) and cooling thermal power (kW) of TCL i, respectively. For notational simplicity, we denote the thermal constant by $τ_{i}$ , such that $τ_{i} \equiv R_{i} C_{i}$ .
The binary variable $W_{i} (t) \in {0, 1}$ represents the switch state of TCL i at instant t.

Remark 1.

In (1), we consider the TCLs in different houses or offices where the evolution of the internal temperature is mainly effected by the heat exchange between inside and outside, hence there is no heat exchange among the TCLs [26]. Besides, Equation (1) is formulated for cooling TCLs such as air conditioners. Then,

P_{i}

in (1) is a positive constant.

To avoid TCL i switching frequently around its set-point temperature

{\hat{θ}}_{i}

, we adopt a temperature dead-band

δ \equiv | [θ_{i}^{-}, θ_{i}^{+}] |

, where

θ_{i}^{-}

and

θ_{i}^{+}

are the lower and upper limit of the dead-band respectively, such that:

θ_{i}^{-} = {\hat{θ}}_{i} - δ / 2,

(2a)

θ_{i}^{+} = {\hat{θ}}_{i} + δ / 2 .

(2b)

Then, the switch state function in (1) is defined as follows [27,28]:

W_{i} (t + △ t) = \{\begin{matrix} 0 in case θ_{i} (t + △ t) \leq θ_{i}^{-} \\ 1 in case θ_{i} (t + △ t) \geq θ_{i}^{+} \\ W_{i} (t) otherwise \end{matrix}

(3)

where

△ t

is an arbitrarily small time interval. The temperature evolution procedure is shown in Figure 2.

T_{o n, i}

appeared in the figure is the time length that the “on” state of TCL i lasts during one time horizon

[t_{k}, t_{k + 1}]

.

2.2. Energy Demand of TCLs

At the start time

t_{k}

of any time horizon, each TCL user chooses a set-point temperature

{\hat{θ}}_{i}

according to the broadcast price p. Then, each TCL needs to consume some energy to make its current internal temperature

θ_{i} (t_{k})

reach the set-point temperature

{\hat{θ}}_{i}

. Denote this energy demand by

u_{i} \equiv f_{i} (\hat{θ_{i}})

which is a function of

{\hat{θ}}_{i}

. In this section, we derive the relationship between

{\hat{θ}}_{i}

and

u_{i}

based on the dynamics of TCLs.

Remark 2.

In this paper, we consider a 15-minute time horizon (

T = 15

min), which is small enough to neglect the variation of

θ_{a, i} (t)

within

T

. That is,

θ_{a, i} (t) \equiv θ_{a, i} (t_{k}),

for all

t \in T

[29].

As the cooling thermal power

P_{i}

is a preknown parameter of TCL i, the energy demand to reach

{\hat{θ}}_{i}

from the current

θ_{i} (t_{k})

within

T

can be expressed as the following form:

u_{i} = T_{o n, i} P_{i}, T_{o n, i} \in [0, T] .

(4)

Recall that

T_{o n, i}

is the time length that the “on” state of TCL i lasts during one time horizon

[t_{k}, t_{k + 1}]

, as shown in Figure 2.

Since the maximum value of

T_{o n, i}

is T, the maximum energy demand is

β_{i}^{+} \equiv T P_{i}

. Then,

0 \leq u_{i} \leq β_{i}^{+} .

(5)

The feasible set of

u_{i}

is denoted by

U_{i}

such that,

U_{i} ≜ [0, β_{i}^{+}], \forall i \in N .

(6)

By (3),

W_{i} (t)

remains unchanged over

T

if the internal temperature

θ_{i} (t), t \in T

always lies in the dead-band. And

W_{i} (t)

changes only if

θ_{i} (t)

hits the limits of the dead-band

θ_{i}^{-}

or

θ_{i}^{+}

for some

t \in T

. Then by (1), after the first change of

W_{i} (t)

,

θ_{i} (t)

will need some certain time to reach the limit

θ_{i}^{-}

or

θ_{i}^{+}

. Therefore, if given appropriate parameters in (1),

θ_{i} (t)

will not hit the boundary of dead-band twice within

T

. Then, we have the following assumption in this paper.

Assumption 1.

The switch state

W_{i} (t)

of each TCL i changes no more than once over

T

.

Based upon Assumption 1, the operation process of TCL i in

T

can be divided into two cases w.r.t.

W_{i} (t_{k})

.

Case 1 ( $W_{i} (t_{k}) = 1$ ): By (1), we have the internal temperature at time $t_{k} + T_{o n, i}$ , such that,

$θ_{i} (t_{k} + T_{o n, i}) = (θ_{a, i} (t_{k}) - P_{i} R_{i}) (1 - e^{- T_{o n, i} / τ_{i}}) + θ_{i} (t_{k}) e^{- T_{o n, i} / τ_{i}} .$

(7)

Combining (2a) and (3), it gives

$θ_{i} (t_{k} + T_{o n, i}) = {\hat{θ}}_{i} - δ / 2 .$

(8)

Then by (4), (7) and (8), the relationship between $u_{i}$ and ${\hat{θ}}_{i}$ is

$u_{i} = τ_{i} P_{i} ln \frac{θ_{i} (t_{k}) + P_{i} R_{i} - θ_{a, i} (t_{k})}{{\hat{θ}}_{i} + P_{i} R_{i} - θ_{a, i} (t_{k}) - δ / 2} .$

(9)
Case 2 ( $W_{i} (t_{k}) = 0$ ): Similar with Case 1, by (1)–(4), we have:

$u_{i} = T P_{i} - τ_{i} P_{i} ln \frac{θ_{a, i} (t_{k}) - θ_{i} (t_{k})}{θ_{a, i} (t_{k}) - {\hat{θ}}_{i} - δ / 2} .$

(10)

In summary, we obtain the relationship between the energy demand

u_{i}

and the set-point temperature

{\hat{θ}}_{i}

such that

u_{i} = f_{i} ({\hat{θ}}_{i}) = \{\begin{matrix} τ_{i} P_{i} ln \frac{θ_{i} (t_{k}) + P_{i} R_{i} - θ_{a, i} (t_{k})}{{\hat{θ}}_{i} + P_{i} R_{i} - θ_{a, i} (t_{k}) - δ / 2}, in case W_{i} (t_{k}) = 1 \\ T P_{i} - τ_{i} P_{i} ln \frac{θ_{a, i} (t_{k}) - θ_{i} (t_{k})}{θ_{a, i} (t_{k}) - {\hat{θ}}_{i} - δ / 2}, in case W_{i} (t_{k}) = 0 \end{matrix}

(11)

2.3. Energy Trading Process

As shown in Figure 1, EECC first collects the energy from the wholesale market under the market price

P

, and then sells the energy to TCL users at a broadcasted price p. Each TCL user adjusts its set-point temperature based on the broadcasted price from EECC. Suppose that EECC and TCL users are strategic players, and all of them make decisions by optimizing their individual objectives. Next we will introduce the preference of EECC and TCL users.

For TCL user

i \in N

, determining

{\hat{θ}}_{i}

is equivalent to determining

u_{i}

as we have a relationship between them in (11). Hence, TCL user can optimize its energy demand by minimizing its individual cost, which contains the electricity payment and the cost associated with its discomfort level. The individual cost of the i-th TCL user with respect to

u_{i}

is given in the following:

C_{i} (u_{i}; p) ≜ p u_{i} + ω d_{i} (u_{i}),

(12)

wherein the first term represents the electricity payment and the second is the dissatisfaction cost, and

ω

denotes a weighting coefficient concerning the importance of the TCL user’s discomfort during

T

. For a rational TCL user, its discomfort level continuously decreases with the reduction of the set-point temperature. By (11), the dissatisfaction cost is a function of

u_{i}

, say

d_{i} (u_{i})

.

At time

t_{k}

, before choosing the set-point temperature

{\hat{θ}}_{i}

, each TCL user has a reference temperature, denoted by

θ_{i}^{r}

, representing its comfortable temperature. Then, the corresponding reference demand, denoted by

q_{i}

, can be computed by (11) such that:

q_{i} = \{\begin{matrix} 0, in case θ_{i}^{r} > {\hat{θ}}_{i, j}^{+} \\ f_{i} (θ_{i}^{r}), in case {\hat{θ}}_{i, j}^{-} \leq θ_{i}^{r} \leq {\hat{θ}}_{i, j}^{+} \\ β_{i}^{+}, in case θ_{i}^{r} < {\hat{θ}}_{i, j}^{-} \end{matrix}

(13)

where

{\hat{θ}}_{i, j}^{+}

and

{\hat{θ}}_{i, j}^{-}

represent the i-th TCL user’s maximum and minimum set-point temperature in Case j respectively, with

j = {1, 2}

.

Remark 3.

In (13), the reference temperature

{\hat{θ}}_{i}^{r}

is the threshold value of the comfortable temperature, which is related to each TCL user’s preference and external environment. It can be recognized as a criterion of the comfort level of TCL users.

Remark 4.

By (11), the expression of

f_{i} ({\hat{θ}}_{i})

is distinct in different cases. Then, by the feasible set of energy demand in (6), we have

{\hat{θ}}_{i, j}^{+}

and

{\hat{θ}}_{i, j}^{-}

are related to the case j for all

i \in N

.

As specified in [18,19,20], the dissatisfaction cost function

d_{i} (u_{i})

is continuous and has the following properties:

\begin{matrix} d_{i} (u_{i}) \{\begin{matrix} > 0, in case u_{i} < q_{i} \\ = 0, in case u_{i} = q_{i} \\ < 0, in case u_{i} > q_{i} \end{matrix} \\ d_{i}^{^{'}} (u_{i}) < 0, d_{i}^{^{″}} (u_{i}) > 0 \end{matrix}

For

u_{i} < q_{i}

, i.e.,

{\hat{θ}}_{i} > θ_{i}^{r}

, the TCL user is dissatisfied with the current temperature and the discomfort level will increase rapidly as

{\hat{θ}}_{i}

(demand

u_{i}

) is away from the reference temperature

θ_{i}^{r}

(demand

q_{i}

). For

u_{i} > q_{i}

, i.e.,

{\hat{θ}}_{i} < θ_{i}^{r}

, the TCL user is satisfied with the current temperature, but the comfort level will not increase infinitely and change slowly as

{\hat{θ}}_{i}

is away from the reference temperature

θ_{i}^{r}

.

Based on the above properties, we apply the dissatisfaction cost function

d_{i} (u_{i})

as the following form [18]:

d_{i} (u_{i}) ≜ e^{b_{i} (1 - u_{i} / q_{i})} - 1,

(14)

with

b_{i} > 0

, where

b_{i}

represents the priority factor of TCL user i.

For EECC, it can obtain benefits by buying energy from the market and selling it to TCL users. Thus as a rational EECC, the selling price should be larger than the market price, i.e.,

p \geq P

. Define the feasible set for the broadcast price p such that

P ≜ \{p | p \geq P\} .

(15)

Besides, EECC should consider the discomfort of all the TCL users, otherwise EECC may set the selling price very high to get more benefits. Hence, the utility function of EECC can be expressed as the following form:

S_{E} (p; u) ≜ (p - P) \sum_{i = 1}^{N} u_{i} - ω \sum_{i = 1}^{N} d_{i} (u_{i}),

(16)

where

u = [u_{1}, u_{2}, \dots, u_{N}]

represents the energy demand of all the TCL users.

3. Stackelberg Game Coordination

As stated in the previous section, EECC buys the total energy that all the TCL users demand from the wholesale market under the market price

P

and broadcasts a selling price p to each TCL user. Then, based on the broadcast price p, each TCL user determines the energy demand

u_{i}

i.e., setting its set-point temperature

{\hat{θ}}_{i}

.

Note that the decisions between EECC and the TCL users are actually interdependent. We establish a Stackelberg game to describe the interplay of TCL users and EECC in Section 3.1. Furthermore, the existence and uniqueness of the Stackelberg equilibrium are specified in Section 3.2.

3.1. Stackelberg Game

Since there exists a hierarchy among players in Stackelberg games, leaders are in a position to enforce their strategies on the followers. In this leader-follower competition, the followers find the best response function first, i.e., getting to know how they will respond once they observe the strategies of leaders. The leaders are aware of the fact that each follower will choose its best response with respect to the leaders strategies. Hence, the leaders are able to maximize their payoffs anticipating the predicted response of the followers. This is actually observed by the followers to adapt their expected strategy accordingly as a response.

We introduce a one-leader, N-follower Stackelberg game to characterize the electricity transaction process between EECC and TCL users, where EECC serves as the leader and TCL users act as followers. Thus, the system proceeds by the following two stages:

Stage I: Each TCL user i implements the best response function with respect to the broadcasted price p from EECC.
Stage II: EECC optimizes the broadcasted price $p^{*}$ considering TCL users’ best response $u^{*} (p)$ at Stage I.

Then observing EECC’s best strategy, each TCL user i determines its optimal energy demand

u_{i}^{*}

under the broadcast price

p^{*}

from Stage II. Based on the above set-up, the optimization problem can be formally formulated as the following:

Leader level:

$p^{*} = \underset{p \in P}{arg max} S_{E} (p; u^{*} (p))$

(17)
Follower level:

$u_{i}^{*} (p) = \underset{u_{i} \in U_{i}}{arg min} C_{i} (u_{i}; p)$

(18)

The optimal strategies of the game take the form of the Stackelberg equilibrium [30,31]. At the equilibrium, the leader’s strategy

p^{*}

is a solution to the optimization problem specified in (17) based on the best strategy trajectories

u^{*} (p)

of the followers. Each follower’s strategy is also a solution to (18) when it is informed of the equilibrium strategy of the leader. The optimal strategies

u_{i}^{*} (p^{*}), i \in N

therefore constitute the equilibrium for all the followers.

Then, we have the following definition of the Stackelberg equilibrium [18,22].

Definition 1 (Stackelberg equilibrium).

The strategy

(p^{*}, u^{*})

is a Stackelberg equilibrium if it satisfies:

S_{E} (p^{*}; u^{*} (p^{*})) \geq S_{E} (p; u^{*} (p)),

(19)

C_{i} (u_{i}^{*}; p^{*}) \leq C_{i} (u_{i}; p^{*}), for all i \in N .

(20)

3.2. Existence and Uniqueness of Stackelberg Equilibrium

Based on the above analysis of the game process, we can deduce the Stackelberg equilibrium by backward induction method [22]. Firstly, each follower determines its best strategy trajectory by solving (18) with respect to a strategy p from the leader. Then, combining the best strategy trajectory

u^{*} (p)

with (17), the leader obtains its best strategy

p^{*}

. Subsequently, each follower determines its best strategy

u_{i}^{*} (p^{*})

when it is informed of the best strategy

p^{*}

of the leader.

Lemma 1.

Given a broadcast price p from EECC, each follower has a unique optimal strategy

u_{i}^{*} (p)

, such that:

u_{i}^{*} (p) = \{\begin{matrix} β_{i}^{+}, in case p \leq \frac{w b_{i}}{q_{i}} e^{b_{i} (1 - β_{i}^{+} / q_{i})} \\ q_{i} - \frac{q_{i}}{b_{i}} ln \frac{p q_{i}}{ω b_{i}}, in case \frac{w b_{i}}{q_{i}} e^{b_{i} (1 - β_{i}^{+} / q_{i})} < p < \frac{w b_{i}}{q_{i}} e^{b_{i}} \\ 0, in case p \geq \frac{w b_{i}}{q_{i}} e^{b_{i}} \end{matrix}

Proof of Lemma 1.

By (12) and (14), we obtain that each follower’s utility function

C_{i} (u_{i}; p)

is continuous and differentiable over a convex set

U_{i}

.

Then, by (12), we have

\begin{matrix} \partial^{2} C_{i} (u_{i}; p) / \partial u_{i}^{2} = \frac{ω b_{i}^{2}}{q_{i}^{2}} e^{b_{i} (1 - u_{i} / q_{i})} > 0 . \end{matrix}

Hence,

C_{i} (u_{i}; p)

is a strictly convex function w.r.t.

u_{i}

.

By

\partial C_{i} (u_{i}; p) / \partial u_{i} = 0

, we obtain the optimal trajectory w.r.t p as below:

{\tilde{u}}_{i} (p) = q_{i} - \frac{q_{i}}{b_{i}} ln \frac{p q_{i}}{ω b_{i}} .

(21)

Furthermore, because the feasible set

U_{i}

defined in (6) is a bounded set, the boundary conditions of the optimal strategy in (21) is determined by (21). ☐

Based on the best strategies

u^{*} (p)

, EECC determines the best electricity prices

p^{*}

by maximizing its utility function (16).

Lemma 2.

The leader has a unique optimal strategy

p^{*}

, such that:

\begin{matrix} p^{*} = \underset{p \in [P, p_{max})}{arg max} S_{E} (p; u^{*} (p)), \end{matrix}

(22)

where

p_{max} \equiv {max}_{i \in N} \{\frac{ω b_{i}}{q_{i}} e^{b_{i}}\}

.

Proof of Lemma 2.

The Proof of Lemma 2 is given in Appendix A. ☐

Remark 5.

From Lemma 2, there exists a unique optimal strategy (22) when

p \in [P, p_{m a x})

. Considering

p \geq p_{m a x}

, we have

u_{i}^{*} (p) = 0

by (21), for all

i \in N

. In addition, by (16), we obtain that the utility function

S_{E} (p; u_{i}^{*} (p)) = - ω \sum_{i = 1}^{N} d_{i} (0)

is a constant for all

p \geq p_{m a x}

. Therefore, there is no unique optimal strategy in the given range of p.

Theorem 1.

Considering

p_{max} > P

, there exists a unique Stackelberg equilibrium

(p^{*}, u^{*})

for the proposed game.

Proof of Theorem 2.

By Lemma 1, we have

C_{i} (u_{i}^{*}; p^{*}) \leq C_{i} (u_{i}; p^{*})

for all

i \in N

, then (20) holds. Then, by Lemma 2, (19) is satisfied. Therefore, according to the Definition 1,

(p^{*}, u^{*})

is the unique equilibrium of the Stackelberg game.

Remark 6.

If

p_{max} \leq P

, then we have EECC’s utility function

S_{E} (p; u_{i}^{*} (p)) = - ω \sum_{i = 1}^{N} d_{i} (0)

. Thus, considering

p_{max} \leq P

, EECC cannot find a unique optimal strategy.

Based upon Theorem 1, we specify Algorithm 1 to achieve the Stackelberg equilibrium of the game.

Algorithm 1 DR algorithm by Backward Induction.

Require:

Initialize the time horizon $T \equiv [t_{k}, t_{k} + T]$ ;
Initialize the switch state $W_{i} (t_{k})$ of TCL user i;
Initialize the market price $P$ ;
Initialize the reference temperature $θ_{i}^{r}$ of TCL user i;
Set the reference demand $q_{i}$ of TCL user i by (13) w.r.t $θ_{i}^{r}$ .

Ensure:

EECC’s optimal broadcast price $p^{*}$ ;
Each TCL user’s adjusted set-point temperature ${\hat{θ}}_{i}^{*}$ .

1:: By (22), EECC determines the optimal broadcast price $p^{*}$ w.r.t the best strategy trajectory $u_{i}^{*} (p)$ in (21);
2:: Each TCL user $i \in N$ determines the optimal strategy $u_{i}^{*}$ w.r.t. $p^{*}$ by (21);
3:: Each TCL user $i \in N$ computes their optimal set-point temperature ${\hat{θ}}_{i}^{*}$ by ${\hat{θ}}_{i}^{*} = f_{i}^{(- 1)} (u_{i}^{*})$ .

As stated in [18,19], compared with other methods which usually involve interactive iteration processes between the leader and the followers, which are EECC and the TCL users respectively in the underlying games, the DR algorithm based on Backward Induction proposed in our work can significantly reduce the computational time in implementing the equilibrium of the underlying Stackelberg games.

4. Simulation

In this part, some case studies are analyzed to demonstrate the price response coordination of TCLs. The proposed Stackelberg game model and control scheme are validated by the simulations in MATLAB 2014a. Besides, we use the interior-point method to solve the optimization problems and the computational time of all cases are limited in

2.0

s.

We adopt a typical 15-min based pricing by dividing 9-h into 36 equal time instants [19], as shown in Figure 3. An ambient temperature profile from 11:00 to 20:00 in a typical summer day is shown in Figure 4.

4.1. Homogeneous Case

We first consider

N = 100

homogeneous TCLs, and the parameters of the TCLs are specified in Table 2 [32]. We set the weighting factor of the importance of the discomfort level as

ω = 0.2

.

Without loss of generality, assume that

W_{i} (t_{k}) = 0

with

t_{k} = 11 : 00

, for all

i \in N

, i.e., the switch state of each TCL is “off” at 11:00. As specified in Section 2.2, the “off” state implies that

j = 2

.

We also consider the internal temperature

θ_{i} (t_{k}) = 27^{\circ} C

for all

i \in N

and the temperature dead-band

δ = 0.25^{\circ} C

.

Then, according to the reference temperature

θ_{i}^{r}

of TCL user i shown in Figure 4, we obtain the reference demand energy

q_{i}

by (13), which is displayed by the blue dash-dot line in Figure 5. More specifically, taking one time horizon [11:00, 11:15] as an example and given

θ_{a, i} (t_{k}) = 31.2^{\circ} C

,

θ_{i} (t_{k}) = 27^{\circ} C

and

θ_{i}^{r} (t_{k}) = 26^{\circ} C

, we calculate that

{\hat{θ}}_{i}^{-} = f_{i}^{(- 1)} (β_{i}^{+}) = 26.56^{\circ} C

in Case 2. Then by (13), we have

q_{i} (t_{k}) = β_{i}^{+} = 2.275

kWh.

By applying Algorithm 1, EECC implements the optimal price

p^{*}

w.r.t

u^{*} (p)

by (22), which is displayed by the red line in Figure 6.

The broadcast price

p^{*}

satisfies

p^{*} \in (\frac{ω b_{i}}{q_{i}} e^{(1 - β_{i}^{+} / q_{i})}, \frac{ω b_{i}}{q_{i}} e_{i}^{b})

. Then, by (21), the optimal energy demand of each TCL user

u_{i}^{*}

increases as

p^{*}

decreases from 11:00 to 20:00, which is displayed in Figure 5.

Subsequently, according to the relationship between the set-point temperature and the energy demand specified in Section 2.2, each TCL user adjusts its set-point temperature by

{\hat{θ}}_{i}^{*} = f_{i}^{(- 1)} (u_{i}^{*})

, which is displayed by the red line in Figure 7.

Consider the time horizon [13:00, 13:15] as an example. Based upon the reference demand

q_{i} = 0.691

kWh, the market price

P = 0.12

$/kWh and the optimal reaction curve

u_{i}^{*} (p) = q_{i} - \frac{q_{i}}{b_{i}} ln \frac{p q_{i}}{ω b_{i}}

given in Lemma 1, we obtain the optimal broadcast price

p^{*} = 0.219

$/kWh by (22). Afterwards, TCL users observe the best strategy of EECC and compute their best strategies

u_{i}^{*} (p^{*})

by (21). The corresponding set-point temperature is

{\hat{θ}}_{i}^{*} = 25.98^{\circ} C

. Because of the existence of the dead-band

δ

, the internal temperature varies by (1) in [13:00, 13:15], and the switch state will change when the internal temperature hits the upper limit

{\hat{θ}}_{i}^{*} + δ / 2 = 26.11^{\circ} C

.

Moreover, after 13:00, for keeping the internal temperature around the reference temperature

26^{\circ} C

, the switch state changes one time within each time horizon and the optimal set-point temperature stays around

26^{\circ} C

, as illustrated in Figure 7. However, given the same ambient temperature

{\hat{θ}}_{a, i}

, by (9) and (10), the associate reference energy demand

q_{i}

are distinct in different cases. This causes the fluctuation of the energy demand trajectory as displayed in Figure 5.

4.2. Heterogeneous Case

In general, the aggregated TCLs’ switch state are different [14]. For the purpose of demonstration, we suppose that the total 100 TCLs are partitioned into two categories, say 50 TCLs are with

W_{i} (t_{k}) = 1

and another 50 TCLs with

W_{i} (t_{k}) = 0

. As a sequence, the profile of the aggregated energy demand of the 100 TCLs is displayed by the black line in Figure 8.

As observed in Figure 8, the fluctuations of individual TCLs are alleviated by the aggregated TCLs with different

W_{i} (t_{k})

. Thus, we may induce the TCL users to adjust its set-point temperature, to mitigate the fluctuation of the power grid by broadcasting different prices to the groups of TCL1 and TCL2 respectively.

Furthermore, because of the different characteristics of the TCL users, the reference temperature will change with respect to the variational external environment, such as the ambient temperature and the human actions in the room. Therefore, in Figure 9, we consider a scenario with variational reference temperature. EECC broadcasts price

p^{*}

(displayed by the red line) and TCL user i implements the set-point temperature

{\hat{θ}}_{i}^{*}

accordingly (displayed by the purple line) at each instant to maximize the utility benefit and minimize the individual cost of each TCL user.

In reality, the TCLs’ properties vary according to the different preferences of TCL users. Thus, besides the above study for homogeneous TCLs, here we also apply Algorithm 1 for the heterogeneous cases.

We first consider different priority factors of TCL users. By (14), we obtain that the TCL user with higher b will have more discomfort when the set-point temperature exceeds the reference temperature. Therefore, the set-point temperature of TCL2 with

b_{2} = 1.2

decreases faster than TCL1 with

b_{1} = 1.1

, which is displayed in Figure 10.

Furthermore, we consider the heterogeneous case with variational reference temperature and different properties of TCLs. The parameters of heterogeneous TCLs are shown in Table 2. Figure 11 and Figure 12 display the best broadcast price from EECC and the set-point temperature of different TCLs respectively.

5. Conclusions and Ongoing Reasearch

We have studied the coordination of TCLs under a Stackelberg game based price response scheme. Based upon the dynamics of the TCLs, we first establish the relationship between the set-point temperature and the energy consumed to reach the set-point temperature. Then, a discomfort function is defined to represent the discomfort level of the set-point temperature. Based upon the interplay of TCL users and EECC during the electricity trading process, a one-leader N-follower Stackelberg game is established. EECC optimizes its selling price considering the tradeoff of its electricity gross benefit and the dissatisfaction cost of TCL users, while TCL users make decisions by minimizing the electricity payments and the dissatisfaction cost. Compared with other iteration methods in the literature, a more effective DR algorithm by backward induction method is proposed to achieve the unique Stackelberg equilibrium. At the equilibrium, EECC maximizes its utility function and each TCL user adjusts its set-point temperature to minimize its cost.

In the future, unlike the model considered in the current work, we will extend our work by considering the heat exchanges among the TCLs which are interactive with each other. Besides, we would like to design a different electricity price scheme to satisfy different users’ preferences and maximize the utility benefits.

Author Contributions

This paper is a result of the collaboration of all authors. P.W. and Z.M. conceived and designed this work. X.W. performed the experiments. P.W. and S.Z. wrote the paper.

Funding

This research was funded by International S&T Cooperation Program of Beijing Institute of Technology grant number GZ2016065101.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

TCL	Thermostatically controlled loads
EECC	Electric energy control center
DR	Demand response
RTP	Real-time pricing

Appendix A. Proof of Lemma 2

According to the value of market price

P

, we have the following two cases.

Case 1:

P \leq {min}_{i \in N} \{\frac{ω b_{i}}{q_{i}} e^{b_{i} (1 - β_{i}^{+} / q_{i})}\}

Based on the boundary conditions in (21), the value of the leader’s strategy p in Case 1 can be divided into two subcases.

Case (1A):

P \leq p \leq {min}_{i \in N} \{\frac{ω b_{i}}{q_{i}} e^{b_{i} (1 - β_{i}^{+} / q_{i})}\}

By (21), we have

u_{i}^{*} (p) = β_{i}^{+}, \forall i \in N

. Then by (17), we obtain the leader’s optimal strategy

p^{*}

in the following:

p^{*} = \underset{p \in P_{1}}{arg max} (p - P) \sum_{i = 1}^{N} β_{i}^{+} - ω \sum_{i = 1}^{N} d_{i} (β_{i}^{+}),

(A1)

where

P_{1} \equiv \{p | P \leq p \leq {min}_{i \in N} \{\frac{ω b_{i}}{q_{i}} e^{b_{i} (1 - β_{i}^{+} / q_{i})}\}\}

.

Case (1B):

{min}_{i \in N} \{\frac{ω b_{i}}{q_{i}} e^{b_{i} (1 - β_{i}^{+} / q_{i})}\} < p < p_{m a x}

We denote the feasible set of p in Case (1B) by

P_{2}

, such that,

P_{2} ≜ \{p | min_{i \in N} \{\frac{ω b_{i}}{q_{i, j}} e^{b_{i} (1 - β_{i}^{+} / q_{i})}\} < p < p_{m a x}\} .

(A2)

In addition, we specify three sets

N_{1}

,

N_{2}

and

N_{3}

, such that,

N_{1} ≜ \{m | u_{m}^{*} (p) = β_{m}^{+}, m \in N\},

(A3)

N_{2} ≜ \{n | u_{n}^{*} (p) = q_{n} - \frac{q_{n}}{b_{n}} ln \frac{p q_{n}}{ω b_{n}}, n \in N\},

(A4)

N_{3} ≜ \{l | u_{l}^{*} (p) = 0, l \in N\} .

(A5)

By Lemma 1 and (A2), we have

N = N_{1} \cup N_{2} \cup N_{3}

and

N_{2} \neq \emptyset

.

Then, together with (17), we obtain that,

\begin{matrix} max_{p \in P_{2}} S_{E} (p; u^{*} (p)) & = max_{p \in P_{2}} {(p - P) (\sum_{m \in N_{1}} β_{m}^{+} + \sum_{n \in N_{2}} (q_{n} - \frac{q_{n}}{b_{n}} ln \frac{p q_{n}}{ω b_{n}})) \\ - ω (\sum_{m \in N_{1}} d_{m} (β_{m}^{+}) + \sum_{n \in N_{2}} (\frac{p q_{n}}{ω b_{n}} - 1) + \sum_{l \in N_{3}} d_{l} (0))} \end{matrix}

(A6)

Take the second derivative of the utility function

S_{E} (p; u^{*} (p))

with respect to p, we have,

\begin{matrix} \frac{\partial^{2}}{\partial p^{2}} S_{E} (p; u^{*} (p)) = \sum_{n \in N_{2}} (- \frac{q_{n}}{p b_{n}} - \frac{P}{p^{2} b_{n}}) < 0 . \end{matrix}

Hence, the optimization problem (A6) has a unique optimal strategy

p^{*}

.

Case 2:

{min}_{i \in N} \{\frac{ω b_{i}}{q_{i}} e^{b_{i} (1 - β_{i}^{+} / q_{i})}\} < P < p_{m a x}

By (15), we have

p \in [P, p_{m a x})

. In addition, by Lemma 1, TCL users will have different optimal strategies

β_{i}^{+}

,

q_{i} - \frac{q_{i}}{b_{i}} ln \frac{q_{i} p}{ω b_{i}}

, 0. Similar with Case (1B), there exists a unique strategy of the optimization problem (A6).

In sum, consider

p \in [P, p_{m a x})

, there exists a unique optimal strategy

p^{*}

in (22).

References

Siano, P. Demand response and smart grids—A survey. Renew. Sustain. Energy Rev. 2014, 30, 461–478. [Google Scholar] [CrossRef]
Meng, F.L.; Zeng, X.J. An optimal real-time pricing for demand-side management: A Stackelberg game and genetic algorithm approach. In Proceedings of the 2014 International Joint Conference on Neural Networks, Beijing, China, 6–11 July 2014; pp. 1703–1710. [Google Scholar]
Ipakchi, A.; Albuyeh, F. Grid of the future. IEEE Power Energy Mag. 2009, 7, 52–62. [Google Scholar] [CrossRef]
Molderink, A.; Bakker, V.; Bosman, M.G.C.; Hurink, J.L.; Smit, G.J.M. Management and Control of Domestic Smart Grid Technology. IEEE Trans. Smart Grid 2010, 1, 109–119. [Google Scholar] [CrossRef] [Green Version]
Callaway, D.S. Tapping the energy storage potential in electric loads to deliver load following and regulation, with application to wind energy. Energy Convers. Manag. 2009, 50, 1389–1400. [Google Scholar] [CrossRef]
He, H.; Sanandaji, B.M.; Poolla, K.; Vincent, T.L. Aggregate Flexibility of Thermostatically Controlled Loads. IEEE Trans. Power Syst. 2013, 30, 189–198. [Google Scholar]
Borenstein, S. The Long-Run Efficiency of Real-Time Electricity Pricing. Energy J. 2005, 26, 93–116. [Google Scholar] [CrossRef]
Maharjan, S.; Zhu, Q.; Zhang, Y.; Gjessing, S.; Başar, T. Demand Response Management in the Smart Grid in a Large Population Regime. IEEE Trans. Smart Grid 2015, 7, 189–199. [Google Scholar] [CrossRef]
Yu, M.; Hong, S.H. Supply-demand balancing for power management in smart grid: A Stackelberg game approach. Appl. Energy 2016, 164, 702–710. [Google Scholar] [CrossRef]
Ma, Z.; Zou, S.; Ran, L.; Shi, X.; Hiskens, I.A. Efficient decentralized coordination of large-scale plug-in electric vehicle charging. Automatica 2016, 69, 35–47. [Google Scholar] [CrossRef]
Dai, Y.; Gao, Y.; Gao, H.; Zhu, H. Real-time pricing scheme based on Stackelberg game in smart grid with multiple power retailers. Neurocomputing 2017, 260, 149–156. [Google Scholar] [CrossRef]
Mortensen, R.E.; Haggerty, K.P. A stochastic computer model for heating and cooling loads. IEEE Trans. Power Syst. 1988, 3, 1213–1219. [Google Scholar] [CrossRef]
Ucak, C.; Caglar, R. The effects of load parameter dispersion and direct load control actions on aggregated load. In Proceedings of the 1998 International Conference on Power System Technology, Beijing, China, 18–21 August 1998; Volume 1, pp. 280–284. [Google Scholar]
Bashash, S.; Fathy, H.K. Modeling and Control of Aggregate Air Conditioning Loads for Robust Renewable Power Management. IEEE Trans. Control Syst. Technol. 2013, 21, 1318–1327. [Google Scholar] [CrossRef]
Koch, S.; Mathieu, J.L.; Callaway, D.S. Modeling and control of aggregated heterogeneous thermostatically controlled loads for ancillary services. In Proceedings of the Power Systems Computation Conference, Stockholm, Sweden, 22–26 August 2011. [Google Scholar]
Ghanavati, M.; Chakravarthy, A. Demand-Side Energy Management by Use of a Design-Then-Approximate Controller for Aggregated Thermostatic Loads. IEEE Trans. Control Syst. Technol. 2017, 26, 1439–1448. [Google Scholar] [CrossRef]
Mathieu, J.L.; Koch, S.; Callaway, D.S. State Estimation and Control of Electric Loads to Manage Real-Time Energy Imbalance. IEEE Trans. Power Syst. 2013, 28, 430–440. [Google Scholar] [CrossRef]
Yu, M.; Hong, S.H. A Real-Time Demand-Response Algorithm for Smart Grids: A Stackelberg Game Approach. IEEE Trans. Smart Grid 2017, 7, 879–888. [Google Scholar] [CrossRef]
Yang, P.; Tang, G.; Nehorai, A. A game-theoretic approach for optimal time-of-use electricity pricing. IEEE Trans. Power Syst. 2013, 28, 884–892. [Google Scholar] [CrossRef]
Samadi, P.; Mohsenian-Rad, A.H.; Schober, R.; Wong, V.W.S.; Jatskevich, J. Optimal Real-Time Pricing Algorithm Based on Utility Maximization for Smart Grid. In Proceedings of the IEEE International Conference on Smart Grid Communications, Gaithersburg, MD, USA, 4–6 October 2010; pp. 415–420. [Google Scholar]
Tushar, W.; Chai, B.; Yuen, C.; Smith, D.B. Three-Party Energy Management With Distributed Energy Resources in Smart Grid. IEEE Trans. Ind. Electron. 2015, 62, 2487–2498. [Google Scholar] [CrossRef] [Green Version]
Osborne, M.J.; Rubinstein, A. A Course in Game Theory; MIT Press: Cambridge, MA, USA, 1994. [Google Scholar]
Ladurantaye, D.D.; Gendreau, M.; Potvin, J.Y. Strategic Bidding for Price-Taker Hydroelectricity Producers. IEEE Trans. Power Syst. 2007, 22, 2187–2203. [Google Scholar] [CrossRef]
Conejo, A.J.; Nogales, F.J.; Arroyo, J.M. Price-Taker Bidding Strategy under Price Uncertainty. IEEE Power Eng. Rev. 2002, 22, 57. [Google Scholar] [CrossRef]
Liu, M.; Shi, Y. Model Predictive Control for Thermostatically Controlled Appliances Providing Balancing Service. IEEE Trans. Control Syst. Technol. 2016, 24, 2082–2093. [Google Scholar] [CrossRef]
Barata, F.A.; Igreja, J.M.; Rui, N.S. Demand Side Management Energy Management System for Distributed Networks; Springer International Publishing: Basel, Switzerland, 2016; pp. 455–471. [Google Scholar]
Perfumo, C.; Braslavsky, J.H.; Ward, J.K. Model-Based Estimation of Energy Savings in Load Control Events for Thermostatically Controlled Loads. IEEE Trans. Smart Grid 2014, 5, 1410–1420. [Google Scholar] [CrossRef]
Yong, T.Y.; Jin, Y.G. Methods for Adding Demand Response Capability to a Thermostatically Controlled Load with an Existing On-off Controller. J. Electr. Eng. Technol. 2015, 10, 755–765. [Google Scholar] [Green Version]
Tsui, K.M.; Chan, S.C. Demand Response Optimization for Smart Home Scheduling Under Real-Time Pricing. IEEE Trans. Smart Grid 2012, 3, 1812–1821. [Google Scholar] [CrossRef]
Meng, F.L.; Zeng, X.J. A Stackelberg game-theoretic approach to optimal real-time pricing for the smart grid. Soft Comput. 2013, 17, 2365–2380. [Google Scholar] [CrossRef]
Maharjan, S.; Zhu, Q.; Zhang, Y.; Gjessing, S.; Basar, T. Dependable Demand Response Management in the Smart Grid: A Stackelberg Game Approach. IEEE Trans. Smart Grid 2013, 4, 120–132. [Google Scholar] [CrossRef]
Mathieu, J.L.; Callaway, D.S. State Estimation and Control of Heterogeneous Thermostatically Controlled Loads for Load Following. In Proceedings of the Hawaii International Conference on System Science, Maui, HI, USA, 4–7 January 2012; pp. 2002–2011. [Google Scholar]

Figure 1. The framework of the energy trading process.

Figure 2. Temperature evolution procedure of thermostatically controlled loads (TCLs).

Figure 3. Market price data.

Figure 4. Ambient temperature and Reference temperature from 11:00 to 20:00 on a summer day.

Figure 5. Optimal energy demand of the TCL.

Figure 6. Broadcast price

p^{*}

from electric energy control center (EECC).

Figure 6. Broadcast price

p^{*}

from electric energy control center (EECC).

Figure 7. Set-point temperature and Internal temperature of the TCLs.

Figure 8. One hundred TCLs with the same

W (t_{k})

vs. 100 TCLs with different

W (t_{k})

.

Figure 8. One hundred TCLs with the same

W (t_{k})

vs. 100 TCLs with different

W (t_{k})

.

Figure 9. Variational reference temperature.

Figure 10. TCLs with different priority factor b.

Figure 11. Heterogeneous TCLs: set-point temperature of TCL1.

Figure 12. Heterogeneous TCLs: set-point temperature of TCL2.

Table 1. Variables and parameters.

i	Index of the TCL, $i = 1, 2, \dots, N$
$T$	Time interval
p	Broadcast price from the EECC
${\hat{θ}}_{i}$	Set-point temperature of TCL user i
$u_{i}$	Energy demand of TCL i in $T$
$P$	Value of RTP
$θ_{i}$	Internal temperature of TCL user i
$θ_{a, i}$	Ambient temperature of TCL user i
$R_{i}$	Thermal resistance of TCL i
$C_{i}$	Thermal capacitance of TCL i
$W_{i} (t)$	Switch state of TCL i at instant t
$δ$	Temperature deadband
T	Length of the time interval $T$
$T_{o n}$	Length of the “on” state in $T$
$β_{i}^{+}$	Maximum energy demand in $T$
$θ_{i}^{r}$	Reference temperature of TCL user i
$q_{i}$	Reference energy demand of TCL user i in $T$
j	Case of the switch state $W_{i} (t_{k})$ , $j = 1, 2$
$b_{i}$	Priority factor of the TCL user i

Table 2. TCL parameters.

Parameter	Homogeneous TCL	Heterogeneous TCL
R	$2^{\circ} C$ /kW	$2^{\circ} C$ /kW
C	5 kWh/ $^{\circ} C$	6 kWh/ $^{\circ} C$
P	11 kW	14 kW
$β^{+}$	2.75 kWh	3.5 kWh
b	1.1	1.5

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, P.; Zou, S.; Wang, X.; Ma, Z. A Stackelberg Game Approach for Price Response Coordination of Thermostatically Controlled Loads. Appl. Sci. 2018, 8, 1370. https://doi.org/10.3390/app8081370

AMA Style

Wang P, Zou S, Wang X, Ma Z. A Stackelberg Game Approach for Price Response Coordination of Thermostatically Controlled Loads. Applied Sciences. 2018; 8(8):1370. https://doi.org/10.3390/app8081370

Chicago/Turabian Style

Wang, Peng, Suli Zou, Xiaojuan Wang, and Zhongjing Ma. 2018. "A Stackelberg Game Approach for Price Response Coordination of Thermostatically Controlled Loads" Applied Sciences 8, no. 8: 1370. https://doi.org/10.3390/app8081370

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Stackelberg Game Approach for Price Response Coordination of Thermostatically Controlled Loads

Abstract

1. Introduction

2. Problem Formulation

2.1. TCL Dynamics

2.2. Energy Demand of TCLs

2.3. Energy Trading Process

3. Stackelberg Game Coordination

3.1. Stackelberg Game

3.2. Existence and Uniqueness of Stackelberg Equilibrium

4. Simulation

4.1. Homogeneous Case

4.2. Heterogeneous Case

5. Conclusions and Ongoing Reasearch

Author Contributions

Funding

Conflicts of Interest

Abbreviations

Appendix A. Proof of Lemma 2

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI