Distributed Optimization in Low Voltage Distribution Networks via Broadcast Signals †

Wei, Boyuan; Deconinck, Geert

doi:10.3390/en13010043


by Boyuan Wei ^‡ and Geert Deconinck ^*,‡

Department of Electrical Engineering (ESAT), Research Division Electa, KU Leuven, 3001 Leuven, Belgium

^* Author to whom correspondence should be addressed.

^† This paper is an extended version of our paper published in CIGRE Chengdu Symposium 2019.

^‡ Kasteelpark Arenberg 10, bus 2445, 3001 Leuven, Heverlee, Belgium.

Energies 2020, 13(1), 43; https://doi.org/10.3390/en13010043

Submission received: 20 November 2019 / Revised: 9 December 2019 / Accepted: 16 December 2019 / Published: 19 December 2019

(This article belongs to the Section A1: Smart Grids and Microgrids)


Abstract:

With the development of distributed energy resources, the low voltage distribution network (LVDN) is expected to be the integrator of small distributed energy sources. This makes the users in LVDNs multifarious, which leads to more complex modeling. Additionally, data acquisition could be tricky due to rising privacy concerns. These impose severe demands on control schemes in LVDNs that classical centralized control might not be able to fulfill. To tackle this, a model-free control approach with a distributed decision-making architecture is proposed in this paper. Employing statistical methods and game theory, individual users in LVDNs achieve a local optimum autonomously. Compared with conventional approaches applied in LVDNs, the proposed approach is able to achieve active control with less communication burden and fewer computational resources. The paper proves the convergence to the Nash Equilibrium (NE) and uses player compatible relations to form the specific equilibrium. A variant of the log-linear trial and error learning process is applied in a novel “suggest-convince” mechanism to implement the proposed approach. In the case study, a 103-node test network based on a real Belgian semiurban LVDN is presented. The proposed approach is validated and analyzed with practical load profiles on the 103-node network. In addition, centralized control is implemented as a benchmark to show the performance of the proposed approach by comparing it with the classical optimization result. The results demonstrate that the proposed approach is able to achieve player compatible equilibrium as expected, resulting in a good approximation to the local optimum.

Keywords:

low voltage distribution network; player compatible equilibrium; voltage control; active distribution network; Nash equilibrium; broadcast signals

1. Introduction

The rising penetration level of distributed energy resources (DER) has become a clear trend in modern distribution networks. This makes the control of low voltage distribution networks (LVDNs) especially challenging, as LVDNs are expected to facilitate the penetration of DER. On the one hand, higher DER penetration levels in LVDNs are synonymous with greater variability, which needs a larger flexibility reserve and active control. On the other hand, consumer privacy becomes an important concern when controlling users with the deployment and adoption of smart grid technologies, which sets severe barriers to data collection from users [1]. User uncertainty and the awareness of privacy bring more difficulties in modeling and information acquisition, both of which are critical in conventional active control approaches [2]. Thus, there is a demand for active control schemes with limited information or even without specific models (model-free).

There are quite a few existing approaches that are deemed able to fulfill such a demand to a certain degree. Droop control [3,4] is a classic and reliable way to implement distributed control without relying on communication. Nevertheless, although drawbacks such as load sharing can be solved by some modified schemes [5,6], some issues remain. For instance, most DERs, if not all, have no droop characteristic originally, and the control performance is affected by cable impedance [7]. Moreover, since droop control is a passive approach, some notable features of the smart grid such as active management and dynamic optimization are not easy to implement with it. Most of the existing active control is centralized and comes with high communication demand, aiming at offering ancillary services [8,9]. Additionally, the modeling of users remains a concern, especially when they are multifarious.

All of these limits and restrictions suggest that attempts to achieve distributed model-free control are needed. As the decision-making process of control is decentralized, information is collected and used locally in a distributed manner. This eases both the communication burden and the privacy concerns. Furthermore, with the ability to perform parallel computations, distributed algorithms have the potential to be computationally superior to centralized algorithms, both in terms of solution speed and the maximum problem size that can be addressed [10]. Distributed control, including optimization, has found applications in power systems, especially in electrical vehicle charging and demand response. Readers are referred to the survey literature [11,12] for more instances. Most of these works are based on computational models or techniques, such as Augmented Lagrangian Decomposition [13] and the decentralized solution of the Karush–Kuhn–Tucker (KKT) necessary conditions for local optimality [14]. Besides these computational approaches, game theory, as the study of strategic interaction among rational decision-makers, is well established in distributed control structures. Ref. [15] proposed a strategy for distributed energy allocation between providers and consumers, demand response among residential consumers considering irrational behavior is studied in [16], and a set of distributed robust adaptive computation algorithms for a class of generalized convex games that computes the Nash Equilibrium is proposed in [17]. In this paper, a control scheme under a distributed decision-making architecture is studied, and the Nash Equilibrium (NE) is used to drive users toward local optimality in flexibility management in a given LVDN.

As one of the most famous concepts in game theory, if not the most famous, NE has been attracting research interest for several decades in various fields. Generally, there are two major threads for seeking NE in practical applications: employing a mathematical framework and solving a class of generalized convex games locally or globally, the so-called “mathematical approaches”, and designing rules of learning or evolving that can strike a dynamic equilibrium within finite iterations, which is sometimes described as “trial and error learning”. The “mathematical approaches” are widely studied in the control field. Examples include ODE-based generalized NE computation [18], a nonlinear Gauss–Seidel-type approach [19], best-response dynamics [20], generalized convex games over unreliable networks [17], and Newton-type methods that find NE at a super-linear convergence rate [21]. “Trial and error learning” has myriad applications in economics and has been employed in engineering fields over time. The research of Hart, Foster, and Young [22,23,24] has proved that decentralized rules can be devised that converge to NE or correlated equilibrium in general n-person games. Ref. [25] uses the classic model of rational Bayesian to maximize the discounted expected utility under the belief that the environment is constant. Based on NE, Player-Compatible Equilibrium (PCE) was proposed by [26], which extends the consideration of “trembles” in the NE by imposing cross-player restrictions on the game, in a way that is invariant to the utility representations of players’ preferences over game outcomes. A trembling hand perfect equilibrium is an equilibrium that takes the possibility of off-the-equilibrium play into account by assuming that the players, through a “tremble”, may choose unintended strategies, albeit with negligible probability. Readers are referred to [27] for more explanations.

Although there are quite a few existing works that involve game theory in power systems, most of them focus on electricity market issues [28,29,30,31,32] or employ game theory as an auxiliary to the main control algorithm [33]. For instance, Ref. [34] employs game theory in a hybrid energy system for voltage and frequency control, but the game theory algorithm is only used to decide which energy source should be used at a certain point. Meanwhile, most of the works use one of the Shapley [32], Aumann–Shapley [35], or Nucleolus-based algorithms [36].

This paper attempts to use novel game theoretic algorithms to tackle technical, non-economic issues in LVDNs. The contribution of this paper is three-fold. Firstly, a control scheme that can be implemented by LVDN users in a distributed manner, under a broadcaster-users architecture, is proposed. For clarity, the concept “users” in this paper indicates all the households and individual devices connected to the LVDN, including small distributed generators, PV, small wind turbines, and so on. Without massive communication and complicated modeling, users employ local information and simple public information broadcast by the broadcaster to decide their own strategies independently. As no specific information is required in the control scheme, hot-plugging is feasible, which allows users to join or quit the network freely, making LVDNs flexible. Secondly, the paper proves that the proposed scheme is able to drive users to converge to PCE within a limited period to achieve the specific control objectives. Thirdly, a benchmark with centralized optimization is presented, with remarks on the performance comparison. The paper is organized as follows: the specific problem is elaborated in Section 2; Section 3 introduces the necessary concepts and then illustrates the scheme of the proposed approach; a simulation study based on a practical network and a benchmark are presented in Section 4; and Section 5 concludes the paper.

2. Problem Statement

2.1. Notations

Assume a strategic-form game with N players, in which each player i has a finite strategy set S_{i}, where strategy s_{i} \in S_{i}. The set of mixed strategies σ_{i} is M_{i}, and the set of strictly mixed strategies, where every pure strategy in S_{i} has nonzero probability, is denoted by M_{i}^{°}. For s_{i}, the correlated strategies of the other N - 1 users are s_{-i}. The cost of user i is indicated by J_{i}, and the utility of user i is U_{i} = -J_{i}.

2.2. Problem Statement

Consider an LVDN with N users, where each user is located in a different location. The users are not necessarily homogeneous but consume or provide power on a comparable order of magnitude. Each user i has a power consumption profile \tilde{p}_{t}^{i}, where generation is regarded as negative consumption in this paper. Assume the given LVDN has a limited flexibility reserve. This means that if all the users operate as they want, there will be a mismatch in the balance, which leads to voltage issues. In an LVDN, due to the high R/X ratio, voltage is more sensitive to the active power balance, but it is still related to reactive power as well [37,38]. A distributed control is needed to optimize the operations among users to strike a balance between user comfort and voltage regulation. Poor user comfort leads to dissatisfaction. Users will become less satisfied if they cannot operate as they wish, and partial operation or stepless adjustment of power is not feasible. For instance, if user i wants to increase its power from 1.5 to 2 kW, increasing its power to 1.8 kW instead will not make it satisfied, and there is no guarantee that 1.8 kW is a feasible working state for user i. This is very common for most household appliances and distributed generators. Meanwhile, the communication and computation should not be too intensive, to allow easy implementation.

Essentially, power regulation in LVDNs can be abstracted as a flexibility allocation game. If user i changes its working status against the power balance, it consumes flexibility. If the flexibility reserve is not sufficient, this leads to a voltage problem. Conventionally, the LVDN is supported by a backbone distribution network, which is supposed to provide sufficient flexibility. However, the flexibility allocation can be optimized to achieve a more robust and independent LVDN, which is in line with the concept of the active distribution network (ADN) [39]. In this game, each user i has to implement its strategy s_{i} \in S_{i} in every control cycle, which is decided by a centralized optimization algorithm in conventional control schemes. According to \tilde{p}_{t}^{i}, each user i has an initial plan δ_{t}^{i}, which denotes the original profile change tendency of user i. In the given problem of this paper, S_{i} is a finite set, so that

p_{t + Δt}^{i} = \begin{cases} p_{t}^{i} + Δp^{i}, & s_{i} = 1 \\ p_{t}^{i}, & s_{i} = 0 \\ p_{t}^{i} + δ_{t}^{i}, & s_{i} = -1 \end{cases} \quad where δ_{t}^{i} = \tilde{p}_{t + 1}^{i} - \tilde{p}_{t}^{i}. \qquad (1)

Here p_{t}^{i} is the actual power consumption of user i, with t + Δt \in (t, t + 1), and Δp^{i} is the admissible regulation that is proposed by the control. Thus, the cost of user i is given by

J_{i}(p_{t + Δt}^{i}, v_{t + Δt}^{i}) dt = a \cdot (p_{t + Δt}^{i} - \tilde{p}_{t + 1}^{i})^{2} dt + b \cdot (v_{t + Δt}^{i} - ξ)^{2} dt, \quad ξ = (1 + sgn(v_{t}^{i} - v^{r}) \cdot 0.1γ) \cdot v^{r}, \qquad (2)

where v_{t + Δt}^{i} is the corresponding local nodal voltage of i after s_{i} is implemented, and 0 < 2a < b. γ \in [0, 0.99] is the boundary coefficient. The objective of each user i is to minimize its J_{i}. Although p_{t + Δt}^{i} is controlled by s_{i}, user i cannot always choose s_{i} = -1 (following its own plan) to minimize J_{i}: since 0 < 2a < b, the voltage term carries the larger weight, and J_{i} grows if v_{t + Δt}^{i} turns out to be far away from ξ.
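The update rule (1) and the cost (2) can be sketched in a few lines of Python. This is only an illustrative sketch: the function names and the weights passed for a and b are assumptions made for the example, and the only requirement taken from the text is 0 < 2a < b.

```python
def sgn(x):
    """Sign function returning -1, 0 or 1."""
    return (x > 0) - (x < 0)


def next_power(p_t, s_i, delta_p, delta_plan):
    """Eq. (1): power of user i after strategy s_i is implemented."""
    if s_i == 1:
        return p_t + delta_p       # regulation proposed by the control
    if s_i == 0:
        return p_t                 # keep the current working status
    if s_i == -1:
        return p_t + delta_plan    # follow the user's own plan
    raise ValueError("s_i must be -1, 0 or 1")


def voltage_set_point(v_t, v_rated, gamma):
    """xi in Eq. (2): set point shifted by 0.1*gamma towards the side of v_t."""
    return (1 + sgn(v_t - v_rated) * 0.1 * gamma) * v_rated


def cost_rate(p_next, p_plan_next, v_next, xi, a, b):
    """Integrand of Eq. (2); a and b are hypothetical weights with 0 < 2a < b."""
    if not 0 < 2 * a < b:
        raise ValueError("weights must satisfy 0 < 2a < b")
    return a * (p_next - p_plan_next) ** 2 + b * (v_next - xi) ** 2
```

With these definitions, a user that follows its plan exactly while its voltage sits on the set point pays zero cost, which matches the minimum described in the text.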

3. Concepts and Control Scheme

3.1. Concepts Preparation

Proposition 1. A resource allocation game [40] is a congestion game.

Congestion games are a general model for resource allocation games and are a special class of potential games [41].

Theorem 1. A potential game converges to a Nash Equilibrium (NE). Games that are close (in terms of payoffs of players) to potential games have similar limiting dynamics to those in potential games.

For simplicity, readers are referred to Theorem 3.1 in [42] for definitions and corresponding proofs.

Definition 1. For users i \neq j and the corresponding strategies s_{i}^{*} \in S_{i}, s_{j}^{*} \in S_{j}, if it holds for every correlated strategy σ_{-j} \in M_{-j}^{°} and for every σ_{-i} \in M^{°}(S_{-i}) satisfying s_{-i}|S_{-ij} = s_{-j}|S_{-ij} that

U_{j}(s_{j}^{*}, σ_{-j}) \geq \max_{s_{j}^{'} \in S_{j} ∖ \{s_{j}^{*}\}} U_{j}(s_{j}^{'}, σ_{-j}), while U_{i}(s_{i}^{*}, σ_{-i}) > \max_{s_{i}^{'} \in S_{i} ∖ \{s_{i}^{*}\}} U_{i}(s_{i}^{'}, σ_{-i}), \qquad (3)

then we say i is more player compatible with s_{i}^{*} than j is with s_{j}^{*}, which is denoted as (s_{i}^{*} | i) ≿ (s_{j}^{*} | j). This compatibility relation is transitive and asymmetric, as stated in the following propositions. Readers are referred to the appendix in [26] for the corresponding proofs.

Proposition 2. If (s_{i}^{*} | i) ≿ (s_{j}^{*} | j) ≿ (s_{l}^{*} | l), then (s_{i}^{*} | i) ≿ (s_{l}^{*} | l).

Proposition 3. If (s_{i}^{*} | i) ≿ (s_{j}^{*} | j), then (s_{j}^{*} | j) ≿ (s_{i}^{*} | i) does not hold.

Proposition 4. If the game only has two players i \neq j, then (s_{i}^{*} | i) ≿ (s_{j}^{*} | j) ≿ (s_{l}^{*} | l) never holds, as this concept considers third parties, whose best response is affected by the relative tremble probabilities of i and j.

Definition 2. Assume there is a tremble profile ϵ, which assigns a positive number (no matter how small it is) ϵ(s_{i} | i) > 0 to every s_{i} of every player i. Using Π_{i}^{ϵ} for the set of these strategies of player i, we write

Π_{i}^{ϵ} := \{σ_{i} \in M(S_{i}) \; s.t. \; σ_{i}(s_{i}) \geq ϵ(s_{i} | i) \; \forall s_{i} \in S_{i}\}. \qquad (4)

Then σ^{°} is an ϵ-equilibrium if

σ_{i}^{°} \in \underset{σ_{i} \in Π_{i}^{ϵ}}{\arg\max} \; U_{i}(σ_{i}, σ_{-i}^{°}). \qquad (5)

Π_{i}^{ϵ} is convex and compact. Whenever ϵ is small enough so that Π_{i}^{ϵ} is non-empty for every i, the existence of an ϵ-equilibrium holds.

Definition 3. ϵ is player compatible if ϵ(s_{i}^{*} | i) \leq ϵ(s_{j}^{*} | j) for all i, j, and s_{i}^{*}, s_{j}^{*} such that (s_{i}^{*} | i) ≿ (s_{j}^{*} | j). An ϵ-equilibrium where ϵ is player compatible is called a player compatible ϵ-equilibrium (ϵ-PCE).

Theorem 2. PCE exists in every finite strategic-form game [26].

Definition 4. State z(t) \in Z^{*} is a stochastically stable state if, for every i and given any small φ > 0,

U_{i}(s_{i}^{*}, σ_{-i}) \geq \max_{s_{i}^{'} \in S_{i} ∖ \{s_{i}^{*}\}} U_{i}(s_{i}^{'}, σ_{-i}) \qquad (6)

holds for at least the fraction 1 - φ of all periods τ.

3.2. Control Scheme

3.2.1. Architecture Setup

Besides the users in the LVDN, we assume that there is a global broadcaster who periodically monitors the voltages v_{t}^{k} from K key points over the network at time t, then calculates the general situation parameter g_{t} according to

g_{t} = (1 - \sin(\frac{e^{-(\frac{μ}{10})^{2}}}{2} π)) \cdot sgn(μ), \quad where μ = \bar{v}_{t} - v^{r}. \qquad (7)

\bar{v}_{t} is the average of v_{t}^{k} and v^{r} is the rated voltage, so that g_{t} indicates the general voltage situation of the whole LVDN. Then g_{t} is broadcast to every user as public information, and users decide and implement their own strategies independently within S_{i}. No communication other than the broadcast is needed.
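As an illustration of Equation (7), the broadcaster's computation might look like the following Python sketch. The function name and the list-of-voltages input are assumptions for the example, and the voltages are assumed to be expressed in volts, since μ is divided by 10 without further scaling.

```python
import math


def situation_parameter(key_voltages, v_rated):
    """Eq. (7): map the mean key-point voltage deviation to a signal in (-1, 1)."""
    mu = sum(key_voltages) / len(key_voltages) - v_rated
    sign = (mu > 0) - (mu < 0)
    # The bracket is 0 when mu = 0 and approaches 1 as |mu| grows.
    magnitude = 1 - math.sin(math.exp(-(mu / 10) ** 2) / 2 * math.pi)
    return magnitude * sign


# Hypothetical readings from three key points on a 230 V network.
g_t = situation_parameter([236.0, 238.5, 234.0], 230.0)
```

Under this mapping a mean deviation of 23 V (10% of 230 V) gives a magnitude slightly above 0.99, which is consistent with the 0.99 threshold used in the algorithms that follow.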

3.2.2. Control Scheme

As there is no communication besides broadcast, the decision-making process of users is a non-cooperative game. Since the decision-making process is decentralized, the public information provided by the broadcast is what promotes coherence among users; in centralized control approaches, by contrast, coherence is guaranteed by centralized decision making and bi-directional communication.

As stated in the previous section, the control can be regarded as a flexibility allocation game. According to Proposition 1, this turns out to be a congestion game, which is a special class of potential game. Theorem 1 suggests that a potential game converges to an NE. An NE is a stable state of a system involving the interaction of different participants, in which no participant can gain by a unilateral change of strategy if the strategies of the others remain unchanged. According to the definition of NE, it is reasonable to conclude that J_{i} has achieved its minimum if the game converges to an NE. However, just like local optima in numerical optimization, there might be multiple NE in one potential game. Therefore, the proposed control scheme has to fulfill two requirements: firstly, the game should be able to converge to an NE within an acceptable time; secondly, there should be a guarantee that the game converges to a specific admissible NE.

One can figure out that the minimum of J_{i} is obtained where s_{i} = -1 and v_{t + Δt}^{i} ⟶ ξ. In other words, user i reaches its minimum cost when it follows its own plan and the local voltage is close to the set point. Practically speaking, this is the idea of user deregulation. In this paper, a so-called “suggest-convince” mechanism is proposed to configure the decision making of user i, to minimize \int_{0}^{τ} J_{i}(p_{t + Δt}^{i}, v_{t + Δt}^{i}) dt in (2), where τ is the period. g_{t} is the incentive in the proposed control. Whenever user i receives g_{t}, it first runs Algorithm 1 independently to figure out the suggestion parameter ζ_{t}^{i}, in which Equation (8) is given by

Δv_{t}^{i} = (1 - \sin(\frac{e^{-(\frac{μ}{10})^{2}}}{2} π)) \cdot sgn(μ), \quad where μ = v_{t}^{i} - v^{r}. \qquad (8)

Step 5 in Algorithm 1 works in a seesaw manner. When v_{t}^{i} has a large deviation from the rated voltage, it dominates the computation of ζ_{t}^{i}; otherwise, g_{t} dominates. This means that user i always regards its own situation as a priority.

Algorithm 1: Suggest Phase

Input: Public voltage information, g_{t}; Local voltage, v_{t}^{i}; Rated voltage, v^{r};

Output: Suggestion parameter, ζ_{t}^{i};

1: calculate Δv_{t}^{i} according to Equation (8)
2: if |Δv_{t}^{i}| > 0.99 then
3:     Δv_{t}^{i} = sgn(Δv_{t}^{i});
4: end if
5: ζ_{t}^{i} = (1 - |Δv_{t}^{i}|) g_{t} + |Δv_{t}^{i}| \cdot Δv_{t}^{i};
6: Return ζ_{t}^{i}
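The suggest phase can be written as a short Python sketch that follows the six steps of Algorithm 1 literally. The function names are illustrative assumptions, and voltages are assumed to be in volts so that Equation (8) can be applied without rescaling.

```python
import math


def local_voltage_signal(v_local, v_rated):
    """Eq. (8): same mapping as Eq. (7), applied to the local voltage."""
    mu = v_local - v_rated
    sign = (mu > 0) - (mu < 0)
    return (1 - math.sin(math.exp(-(mu / 10) ** 2) / 2 * math.pi)) * sign


def suggest(g_t, v_local, v_rated):
    """Algorithm 1: blend the broadcast signal with the local voltage signal."""
    dv = local_voltage_signal(v_local, v_rated)   # step 1
    if abs(dv) > 0.99:                            # steps 2-4: saturate
        dv = math.copysign(1.0, dv)
    # Step 5 (seesaw): the weight of the local signal grows with its magnitude.
    return (1 - abs(dv)) * g_t + abs(dv) * dv
```

For example, `suggest(0.3, 230.0, 230.0)` returns the broadcast value 0.3 unchanged because the local deviation is zero, while `suggest(0.3, 260.0, 230.0)` returns 1.0 because the saturated local signal fully overrides the broadcast.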

Each user i maintains a two-dimensional vector Λ_{i} = (\begin{bmatrix} λ_{11}^{i} \\ λ_{21}^{i} \end{bmatrix}, \begin{bmatrix} λ_{12}^{i} \\ λ_{22}^{i} \end{bmatrix}, \dots, [λ_{m}^{i}]), where m \in [1, 2, 3 \dots K], [λ_{m}^{i}] = \begin{bmatrix} λ_{1m}^{i} \\ λ_{2m}^{i} \end{bmatrix}, λ_{m}^{i} \in (0, 1), and λ_{1m}^{i} > λ_{2m}^{i}. This is the so-called stubborn vector. The suggestion parameter ζ_{t}^{i} computed by Algorithm 1, which indicates the suggested adjustment, is used to “convince” the corresponding stubborn vectors according to Algorithm 2.

Algorithm 2: Convince Phase

Input: Suggestion parameter, ζ_{t}^{i}; Stubborn vector, Λ_{i}; Period index, m;

Output: Strategy, s_{i};

1: if ζ_{t}^{i} > λ_{1m}^{i} then
2:     s_{i} = 1;
3: else if λ_{1m}^{i} \geq ζ_{t}^{i} > λ_{2m}^{i} then
4:     s_{i} = 0;
5: else if λ_{2m}^{i} > ζ_{t}^{i} then
6:     s_{i} = -1;
7: end if
8: Return s_{i}
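A minimal Python sketch of the convince phase is given below. The function name is an assumption, and one detail goes beyond the listing: Algorithm 2 does not say what happens when ζ_{t}^{i} equals λ_{2m}^{i} exactly, so this sketch assigns that boundary case to s_{i} = -1 as an explicit assumption.

```python
def convince(zeta, lam1, lam2):
    """Algorithm 2: compare the suggestion with the two stubborn thresholds."""
    if not 0 < lam2 < lam1 < 1:
        raise ValueError("thresholds must satisfy 0 < lam2 < lam1 < 1")
    if zeta > lam1:
        return 1     # change towards the suggested direction
    if zeta > lam2:
        return 0     # keep the current working status
    # zeta < lam2 per the listing; zeta == lam2 lands here by assumption.
    return -1        # follow the user's own plan
```

The thresholds split the range of ζ_{t}^{i} into three zones, so raising both thresholds makes a user more "stubborn" (more likely to follow its own plan), while lowering them makes it more responsive to the suggestion.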

Depending on s_{i}, the corresponding control will be implemented according to (1). Nevertheless, Δp^{i} needs to be specified if s_{i} = 1, as ζ_{t}^{i} only indicates the direction. The specific Δp^{i} is determined by Algorithm 3. Note that Δp^{i} = δ_{t}^{i} is possible even when s_{i} = 1; this differs from the situation when s_{i} = -1, as will be explained later on. The last admissible status {p^{i}}^{'} is the most recent p_{t}^{i} for which v_{t}^{i} was within \pm 10% of the rated voltage. In order to make the control robust and stable, users are encouraged to trace back to their {p^{i}}^{'} if a regulation needs to be applied. Unlike in centralized control, no explicit set point is given to users. If {p^{i}}^{'} is not applicable, user i can keep the current working status and increment c_{i} by 1. This gives user i some time to wait for other users to contribute to the necessary regulation. If the situation remains when c_{i} reaches 2, user i will do an experiment by adjusting its p_{t}^{i} by 2% in the direction suggested by ζ_{t}^{i} and reset c_{i} to zero. For users who cannot adjust their working status by such a step, the closest working state will be chosen. Performing experiments is the least preferred operation, and it will rarely occur once the game has converged to a proper NE.

Algorithm 3: The calculation of Δp^{i}

Input: Suggestion parameter, ζ_{t}^{i}; Initial plan, δ_{t}^{i}; Last admissible status, {p^{i}}^{'}; Current working status, p_{t}^{i}; Trace-back count, c_{i};

Output: Admissible adjustment, Δp^{i}; Trace-back count, c_{i};

1: if sgn(δ_{t}^{i}) = sgn(ζ_{t}^{i}) then
2:     Δp^{i} = δ_{t}^{i}, c_{i} = 0;
3: else if sgn({p^{i}}^{'} - p_{t}^{i}) = sgn(ζ_{t}^{i}) then
4:     Δp^{i} = {p^{i}}^{'} - p_{t}^{i}, c_{i} = 0;
5: else if c_{i} \leq 2 then
6:     Δp^{i} = 0, c_{i} = c_{i} + 1;
7: else
8:     Δp^{i} = 0.02 \cdot p_{t}^{i} \cdot sgn(ζ_{t}^{i}), c_{i} = 0;
9: end if
10: Return Δp^{i}, c_{i}
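The decision cascade of Algorithm 3 can be sketched in Python as follows. The function and argument names are assumptions for illustration, and the waiting condition is copied from the listing as printed (the counter is compared with 2 using "less than or equal"), without attempting to reconcile it with the prose description.

```python
def sgn(x):
    """Sign function returning -1, 0 or 1."""
    return (x > 0) - (x < 0)


def admissible_adjustment(zeta, delta_plan, p_last_ok, p_now, count):
    """Algorithm 3: return (delta_p, updated trace-back count)."""
    if sgn(delta_plan) == sgn(zeta):
        # The user's own plan already moves in the suggested direction.
        return delta_plan, 0
    if sgn(p_last_ok - p_now) == sgn(zeta):
        # Trace back to the last admissible status.
        return p_last_ok - p_now, 0
    if count <= 2:
        # Wait and let other users contribute first.
        return 0.0, count + 1
    # Experiment: a 2% step of the current power in the suggested direction.
    return 0.02 * p_now * sgn(zeta), 0
```

A caller would keep `count` as per-user state between control cycles, for example `dp, c = admissible_adjustment(zeta, delta, p_ok, p, c)`.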

As shown in Algorithm 2, Λ_{i} is a vector that can affect the distribution of s_{i}, where K = \frac{T}{τ}. T is the timescale of Λ_{i} while τ is the timescale of [λ_{m}^{i}]. In this paper, T is 24 h and τ is 0.5 h, therefore K = 48. For each period m, there is a scalar pair [λ_{m}^{i}], in which λ_{1m}^{i} and λ_{2m}^{i} are two thresholds for s_{i} as illustrated in Algorithm 2. Namely, λ_{1m}^{i} is the threshold between staying with the current working status and making a change in the direction that ζ_{t}^{i} suggests, while λ_{2m}^{i} is the threshold between staying with the current working status and implementing a change according to \tilde{p}_{t + 1}^{i}. It benefits the whole LVDN most if user i makes a change towards ζ_{t}^{i}, while implementing a change according to \tilde{p}_{t + 1}^{i} results in perfect user comfort. The scalar pair [λ^{i}] indicates the control policy of user i, as it configures the corresponding probability distribution of σ_{i} \in M_{i}^{°}. Given that users have different behavioral characteristics during one day, different [λ^{i}] are needed to seek the NE in different games during different periods of the day. This is why [λ_{m}^{i}] applies. In this paper, the optimal values {[λ_{m}^{i}]}^{*} are determined by a trial and error learning approach illustrated by Algorithm 4, where bino stands for the Bernoulli trial.
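To make the indexing concrete, the stubborn vector can be stored as a list of K = 48 threshold pairs and looked up by time of day. The sketch below is only illustrative: the initial threshold values (0.7, 0.3), the helper name `period_index`, and the choice of hours as the time unit are assumptions, not values given in the paper.

```python
T_HOURS = 24.0      # timescale of the stubborn vector
TAU_HOURS = 0.5     # timescale of one scalar pair
K = int(T_HOURS / TAU_HOURS)   # number of periods per day


def period_index(hour_of_day):
    """Return the 1-based period index m for a time given in hours."""
    return int((hour_of_day % T_HOURS) // TAU_HOURS) + 1


# One (lam1, lam2) pair per period; the starting values are hypothetical.
stubborn_vector = [(0.7, 0.3) for _ in range(K)]

# Thresholds that apply at 13:15.
m = period_index(13.25)
lam1, lam2 = stubborn_vector[m - 1]
```

Keeping one pair per half-hour period lets each pair be learned separately, so that the thresholds for the evening peak do not interfere with those for the night.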

In Algorithm 4, a variant of the so-called log-linear trial and error learning [43] is implemented. Note that ζ_{t}^{i} is given by a linear combination of two log models, and [λ_{m}^{i}] is adjusted with probabilities in proportion to J_{i} when v_{t + Δt}^{i} approaches the median between γ and the technical boundaries (\pm 10% of v^{r}).

Algorithm 4: Learning Phase

Input: Strategy, s_{i}; Local voltage, v_{t + Δt}^{i}; Corresponding scalar pair, [λ_{m}^{i}]; Boundary coefficient, γ; Suggestion parameter, ζ_{t}^{i}; Adjust step, α = 0.02;

Output: Corresponding scalar pair, [λ_{m}^{i}];

1: calculate Δv_{t + Δt}^{i} according to Equation (8)
2: Ξ = (0.99 - γ) / 2;
3: if ζ_{t}^{i} = 1 then
4:     λ_{1m}^{i} = λ_{1m}^{i} - α;
5:     λ_{2m}^{i} = λ_{2m}^{i} - α;
6: else if s_{i} = 1 then
7:     if Δv_{t + Δt}^{i} < γ then
8:         λ_{1m}^{i} = λ_{1m}^{i} + α;
9:     else if Δv_{t + Δt}^{i} > 0.99 then
10:         [λ_{m}^{i}] = [λ_{m}^{i}]; ## do nothing
11:     else
12:         λ_{1m}^{i} = λ_{1m}^{i} - α \cdot bino(4 \cdot (Δv_{t + Δt}^{i} - Ξ)) \cdot sgn(Δv_{t + Δt}^{i} - Ξ);
13:     end if
14: else if s_{i} = 0 then
15:     if Δv_{t + Δt}^{i} < γ then
16:         λ_{2m}^{i} = λ_{2m}^{i} + α;
17:         if λ_{2m}^{i} > λ_{1m}^{i} then
18:             λ_{1m}^{i} = λ_{1m}^{i} + α;
19:         end if
20:     else if Δv_{t + Δt}^{i} > 0.1 then
21:         λ_{1m}^{i} = λ_{1m}^{i} - α;
22:         if λ_{1m}^{i} < λ_{2m}^{i} then
23:             λ_{2m}^{i} = λ_{2m}^{i} - α;
24:         end if
25:     else
26:         λ_{2m}^{i} = λ_{2m}^{i} - α \cdot bino(4 \cdot (Δv_{t + Δt}^{i} - Ξ)) \cdot sgn(Δv_{t + Δt}^{i} - Ξ);
27:     end if
28: else if s_{i} = -1 then
29:     if Δv_{t + Δt}^{i} < γ then
30:         λ_{1m}^{i} = λ_{1m}^{i} + α;
31:         λ_{2m}^{i} = λ_{2m}^{i} + α;
32:     else if Δv_{t + Δt}^{i} > 0.1 then
33:         λ_{2m}^{i} = λ_{2m}^{i} - α;
34:     else
35:         b = bino(4 \cdot (Δv_{t + Δt}^{i} - Ξ)) \cdot sgn(Δv_{t + Δt}^{i} - Ξ);
36:         λ_{1m}^{i} = λ_{1m}^{i} - α \cdot b;
37:         λ_{2m}^{i} = λ_{2m}^{i} - α \cdot b;
38:     end if
39: end if
40: if λ_{1m}^{i} > 1 - α then
41:     λ_{1m}^{i} = 1 - α;
42:     if λ_{1m}^{i} \leq λ_{2m}^{i} then
43:         λ_{2m}^{i} = λ_{1m}^{i} - α;
44:     end if
45: else if λ_{2m}^{i} < α then
46:     λ_{2m}^{i} = α;
47:     if λ_{2m}^{i} \geq λ_{1m}^{i} then
48:         λ_{1m}^{i} = λ_{2m}^{i} + α;
49:     end if
50: end if
51: Return [λ_{m}^{i}]
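The boundary check that closes Algorithm 4 (lines 40 to 50) is the part that keeps both thresholds inside (0, 1) and in the right order. A minimal Python sketch of just that step is shown below; the function name is an assumption, and the branch structure mirrors the listing as printed, including the fact that the lower bound is only checked when the upper bound was not violated.

```python
def clamp_thresholds(lam1, lam2, alpha=0.02):
    """Lines 40-50 of Algorithm 4: keep alpha <= lam2 < lam1 <= 1 - alpha."""
    if lam1 > 1 - alpha:
        lam1 = 1 - alpha
        if lam1 <= lam2:
            lam2 = lam1 - alpha    # restore the ordering lam1 > lam2
    elif lam2 < alpha:
        lam2 = alpha
        if lam2 >= lam1:
            lam1 = lam2 + alpha    # restore the ordering lam1 > lam2
    return lam1, lam2
```

Because neither threshold can reach 0 or 1, every strategy keeps a nonzero chance of being selected, which is the property the remarks below rely on when they refer to the tremble profile.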

3.3. Remarks

The proposed “suggest-convince” mechanism is essentially a simulation of the negotiation process in games. User i employs the public information g_{t} and its local information v_{t}^{i} to generate ζ_{t}^{i}, which indicates the power changes preferred by the circumstances. It then compares ζ_{t}^{i} with the three zones divided by the corresponding stubborn scalar pair [λ_{m}^{i}] \in Λ_{i} to figure out s_{i} \in S_{i}. Eventually, it reviews v_{t + Δt}^{i} to see whether there is potential to improve E(J_{i}) and adjusts the corresponding [λ_{m}^{i}], in order to approach the best mixed strategy σ_{i}^{*} \in M_{i}^{°} during the given period m.

Within a given period m, assume user i has a more moderate profile that leads to a milder δ_{t}^{i} than user j, i \neq j, or that the neighboring users of i are more supportive of grid regulation. Then v_{t + Δt}^{i} is more likely to stay within or closer to (1 \pm 0.1γ) v^{r} than v_{t + Δt}^{j}, which results in (s_{i}^{*} | i) ≿ (s_{j}^{*} | j) or (σ_{i}^{*} | i) ≿ (σ_{j}^{*} | j) depending on the network configuration. According to Propositions 2 and 3, this relationship is transitive and asymmetric, and it spreads through the whole LVDN via the coupling among the users. It is important to point out that this relationship is not necessarily uniform in the LVDN: as the coupling among users depends on the network topology, it is possible to have several player compatible relations in a given LVDN. One of the objectives of Algorithm 4 is to discover such a relation, encouraging users who have the upper hand to maximize their probabilities on strategies s_{i} = 1 and s_{i} = 0 within restrictions. Meanwhile, the boundary check on [λ_{m}^{i}] in Algorithm 4 (lines 40–49) guarantees the existence of Π_{i}^{ϵ}, therefore an ϵ-equilibrium exists. Furthermore, as stated in (1), user i has a finite strategy set S_{i} = \{-1, 0, 1\}. Therefore, this is a finite strategic-form game and PCE exists according to Theorem 2.

Ref. [43] suggests that in an interdependent N-person game with a finite strategy set, if all players use log-linear trial and error learning and the acceptance probabilities are fairly large relative to the probability of conducting an experiment, then its stochastically stable state will be either a pure NE or, if a pure NE does not exist, a mixed strategy that maximizes \sum_{i = 1}^{N} U_{i}. For user i in this paper, the “acceptance probabilities” are the probabilities that v_{t + Δt}^{i} stays within or closer to (1 \pm 0.1γ) v^{r}, and the “probability of conducting an experiment” is its tremble profile ϵ. The conditions are satisfied. Therefore, the stochastically stable state of user i will be PCE. Besides, Definition 2 essentially supposes that user i considers the set of all mixed correlated strategies of other users σ_{-i} \in M^{°}(S_{-i}). If the players can learn some prior knowledge about their counterparts’ player compatibility, user i might be able to deduce that the counterparts will only play a subset \hat{M}_{-i} \subset M^{°}(S_{-i}). This prior knowledge can be obtained by Algorithm 4 in this paper, so that the convergence of σ_{i} can be expected.

Additionally, some parameters are tunable. γ is set to 0.5 in this paper, which means the control tries to make voltages converge to \pm 5% of v^{r} instead of to v^{r} itself. This suits the proposed approach better, as it allows users to take advantage of more flexibility to improve comfort while still keeping a certain margin. It is necessary to point out that a larger γ does not lead to more available flexibility; the optimal γ still needs further study. τ, as the timescale of [λ_{m}^{i}], affects the convergence speed and quality. Although the behavior of users in LVDNs is changing, it is relatively stable hourly and has daily characteristics. This is why τ is 0.5 h and T is 24 h in the paper. It allows user i to employ [λ_{m}^{i}] and Λ_{i} to learn the hourly statistical characteristics and to fit the daily dynamic patterns of its counterparts, respectively.

4. Case Study

4.1. Grid Topology

The schematic diagram is illustrated in Figure 1. It is a three-phase 230/400 V reference grid based on the topology of a real semiurban feeder in the region of Flanders, Belgium [44]. To make the network more multifarious and ill-designed, new users such as residential wind turbines and small PV farms are added, increasing the number of nodes from 62 to 103. As listed in Table 1, the impedance values are calculated according to the Belgian standard for underground distribution cables with an assumed operating temperature of 45 °C. All of the main feeder cables are of type EAXVB 1 kV 4 × 150 mm$^{2}$. A 250 kVA 10/0.4 kV transformer is assumed with an impedance of 0.013 + 0.038j pu. From the feeder to each individual user, an EXVB 1 kV 4 × 16 mm$^{2}$ cable with a length of 15 m is used. For simplicity, the three-phase system is assumed to be symmetrical, so that the analysis can focus on a single phase. The $v^{r}$ of all the users is 230 V.

4.2. Profiles and Conditions

A model of domestic electricity use, based on a combination of patterns of active occupancy [45], is used to generate the high-resolution household consumption profiles. Lighting and appliances, occupants’ behavior, month of the year, and weekday or weekend have been taken into account in the profile generation. Household profiles in this paper are generated for a weekend day in June; the source database is based on measurements of 22 domestic dwellings around the town of Loughborough in the East Midlands, UK. It is assumed that normal households randomly have one to four family members, and that high-consuming households have four to six family members. Profiles of PV panels are taken from a 368 kWp PV system on the rooftop of EnergyVille in Genk, Belgium [46]. The system has 24 strings of PV panels, and each string can be recorded individually. The profiles are scaled, randomly selected, and combined from the raw data of the week of 6th–12th June, 2016. As all the PV strings are located close to each other, the data are well correlated, which makes them suitable as profiles in the LVDN. Regarding residential wind turbines, their profiles are obtained from Elia, a Belgian transmission system operator. The data were measured from Belgian onshore wind turbines on the 6th and 7th of June, 2016, the same time period as the profiles of PV and households. They are then scaled and assigned randomly to the residential wind turbines in the test network. Four small residential generators are connected to the test network to represent other distributed generation users, such as CHP generators. Their profiles are generated randomly, taking the occupants’ behavior into account. The operating ranges of users are given in Table 2.

In LVDNs, due to the high R/X ratio, voltage is more related to the active power distribution [37]. Moreover, users in LVDNs, especially households, do not have any devices whose reactive power output can be controlled. To simplify the case, it is assumed that users only consume active power or that the users’ power factor is at least 0.9. This does not mean that reactive power is ignored in the control, as users respond to the actual voltage changes. The influence of other factors, such as reactive power changes caused by the control, is already reflected in the actual voltage changes. For instance, although a user decides to turn on the washing machine on the basis of active power, the associated reactive power changes as well.
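The R/X argument can be checked against the cable data of Table 1. The following is a minimal sketch, not part of the paper's simulation: it uses the common approximation ΔV ≈ (R·P + X·Q)/V for the voltage drop along a cable, and the function names and the 3 kW example load are assumptions made for illustration only.

```python
import math

# Cable impedances per km, taken from Table 1 (ohm/km).
Z_FEEDER = complex(0.227, 0.078)   # EAXVB 1 kV 4 x 150 mm^2
Z_SERVICE = complex(1.265, 0.083)  # EXVB 1 kV 4 x 16 mm^2


def rx_ratio(z):
    """R/X ratio of a cable impedance."""
    return z.real / z.imag


def reactive_to_active_term(z, power_factor):
    """Ratio (X*Q)/(R*P) of the two terms in dV ~ (R*P + X*Q)/V."""
    tan_phi = math.tan(math.acos(power_factor))
    return (z.imag / z.real) * tan_phi


def approx_voltage_drop(z_per_km, length_km, p_watt, power_factor, v_nominal=230.0):
    """Approximate per-phase voltage drop in volts (hypothetical helper)."""
    q_var = p_watt * math.tan(math.acos(power_factor))
    z = z_per_km * length_km
    return (z.real * p_watt + z.imag * q_var) / v_nominal


if __name__ == "__main__":
    print("R/X feeder cable :", round(rx_ratio(Z_FEEDER), 2))
    print("R/X service cable:", round(rx_ratio(Z_SERVICE), 2))
    # Worst case allowed by the paper's assumption: power factor 0.9.
    print("Q term / P term on service cable:",
          round(reactive_to_active_term(Z_SERVICE, 0.9), 3))
    # 15 m service cable, assumed 3 kW per-phase load at unity power factor.
    print("Drop on service cable (V):",
          round(approx_voltage_drop(Z_SERVICE, 0.015, 3000.0, 1.0), 4))
```

With the Table 1 values, the R/X ratio is about 2.9 for the main feeder cable and about 15.2 for the service cable. At a power factor of 0.9, the reactive term on the service cable is only about 3% of the active term, which is consistent with treating active power as the dominant driver of voltage.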

4.3. Simulation Setup

The algorithm runs once per minute, which means 30 times during each m. The initial $[λ_{m}^{i}]$ is set to $[0.5, 0.2]$ uniformly. The power flow simulation is implemented in MATLAB, based on the backward-forward sweep method [47]. Although the control is simulated step by step, its effect is treated as continuous. Namely, to make the simulation more realistic, user behaviors are extracted from the original profile. In every simulation step, we assume that the users restore half of the regulation (if any) imposed on them in the previous step, to simulate the decaying, continuing effect of the control implemented before. Therefore, the reference profile in each step is the combination of the corresponding user behavior and the decayed previous status, instead of being taken rigidly from the original profile. Meanwhile, to simulate the daily characteristics of different days, we use the same original profiles for all days. Nevertheless, small randomly generated variations are added to $p^{i}$ so that the operating conditions do not remain exactly the same from day to day, which makes the simulation more realistic.
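The decaying effect of earlier regulation described above can be sketched as follows. This is one possible reading of the setup, not the authors' MATLAB code: the function name, the `restore` parameter, and the way the carried-over regulation is tracked as an offset from the original profile are assumptions.

```python
def simulate_reference(original, adjustments, restore=0.5):
    """Build the per-step reference profile with decaying past regulation.

    original    -- power values from the original profile, one per step
    adjustments -- control adjustment applied in each step (0.0 if none)
    restore     -- share of the previous regulation that users undo per step
                   (0.5 mirrors the "restore half" assumption in the text)
    """
    reference, actual = [], []
    carried = 0.0  # regulation still in effect from earlier steps
    for base, adjustment in zip(original, adjustments):
        ref = base + carried      # user behavior plus decayed previous status
        power = ref + adjustment  # power after this step's control action
        reference.append(ref)
        actual.append(power)
        # Users restore part of the accumulated regulation before the next step.
        carried = (1.0 - restore) * (power - base)
    return reference, actual


if __name__ == "__main__":
    # A single 1 kW curtailment in the first step fades out over later steps.
    ref, act = simulate_reference([2.0, 2.0, 2.0, 2.0], [-1.0, 0.0, 0.0, 0.0])
    print(ref)
    print(act)
```

In this sketch a single adjustment halves in every following step, so the reference profile returns gradually to the original profile instead of jumping back to it.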

4.4. Simulation Results and Remarks

To present a basic view of the convergence process, the average voltage of all the users is shown in Figure 2a. The test profile has three typical scenarios. From 0–8 h, the original profile is moderate, and the users gradually adapt themselves to form an equilibrium. From 10–16 h, the system suffers from severe overvoltage due to the generation by PV modules, and the users finally reach an equilibrium that employs the $5 \%$ margin to avoid bothering the users as much as possible. From 18–24 h, the mild voltage variation does not motivate most of the users, so they increase their corresponding $[λ_{m}^{i}]$ to guarantee their comfort. To study this in numbers, the discord rate $D_{m}$ is defined as

$$D_{m} = \sum_{t = 0}^{τ} \sum_{i = 1}^{N} Ψ (s_{i} (g_{t}, v_{t}^{i}), δ_{t}^{i}), \tag{9}$$

where $Ψ$ is the operation that compares the actual change $p_{t + Δ t}^{i} - p_{t}^{i}$ with $δ_{t}^{i}$: it counts 0 if $δ_{t}^{i}$ is eventually implemented, 0.5 if it is not, and 1 if the actual adjustment is opposite to what $δ_{t}^{i}$ demands. We do not use $J_{i}$ here, as $J_{i}$ is a combination of user comfort and technical demands, which will be shown later on. The hourly $D_{m}$ is illustrated in Figure 2b. From day 1 to day 4, one can see that as the PCE forms, the discord rate decreases, which suggests better user comfort and smoother control.

A centralized control that can derive the optimal adjustment for each participating user is implemented [48,49] as the benchmark. The controller solves an optimization problem using the full non-linear network model (Alternating Current Optimal Power Flow, ACOPF). It employs two-way communication: each participating user sends the central controller a range within which it can adjust its power consumption or generation. Besides, perfect information and instant computation, communication, and implementation are assumed for this control, to obtain the theoretical best result for comparison. The objective is to minimize ${(p_{t + Δ t}^{i} - {\tilde{p}}_{t + 1}^{i})}^{2}$, subject to the constraint $0.95 \cdot v^{r} \leq v_{t}^{i} \leq 1.05 \cdot v^{r}$. The general average voltage is shown in Figure 3a, while the comparison of the discord rate is illustrated in Figure 3b. The statistical results of general voltage and nodal voltages are shown in Table 3.
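The counting rule behind the discord rate in Equation (9) can be illustrated with a short sketch. It is a simplified reading rather than the paper's implementation: here $δ_{t}^{i}$ is treated as a signed requested power change, a request counts as implemented when the actual change has the same sign, and a step without any request counts 0. These conventions, the tolerance, and the function names are assumptions.

```python
def psi(actual_change, delta, tol=1e-9):
    """Score one user in one step: 0 implemented, 0.5 ignored, 1 opposite."""
    if abs(delta) <= tol:
        return 0.0  # assumption: no request means no discord
    if abs(actual_change) <= tol:
        return 0.5  # the requested adjustment was not implemented
    same_direction = (actual_change > 0) == (delta > 0)
    return 0.0 if same_direction else 1.0


def discord_rate(actual_changes, deltas):
    """Sum psi over all steps t (outer lists) and all users i (inner lists)."""
    return sum(
        psi(change, delta)
        for step_changes, step_deltas in zip(actual_changes, deltas)
        for change, delta in zip(step_changes, step_deltas)
    )


if __name__ == "__main__":
    # Two steps, three users; values are made up for illustration.
    changes = [[-0.5, 0.0, 0.3], [0.0, 0.0, -0.2]]
    requests = [[-0.5, -0.5, -0.5], [0.0, 0.4, -0.2]]
    print(discord_rate(changes, requests))
```

In the made-up example, two requests are ignored (0.5 each) and one is answered with an opposite adjustment (1), so the sketch returns a discord rate of 2.0 for the period.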

From Figure 3 it can be concluded that the proposed approach achieves a good approximation to the centralized optimization solution. Meanwhile, the proposed approach comes with a relatively lower discord rate. This does not mean that the proposed approach is better than the centralized approach in all aspects, as the centralized approach has a rigid boundary, while the proposed approach converges to the boundary only statistically. This can be observed in Figure 4, which shows all 103 nodal voltages of the proposed approach and the centralized approach. It is clear that the proposed approach does not have an absolute hard boundary: it stochastically allows users to use the preserved margin to some extent. The conventional ACOPF uses an absolute hard boundary to guarantee the preserved margin, but it comes with a higher discord rate. Besides, a few nodal overvoltages can be observed in Figure 4a; they are caused by drastic power changes. As it is model-free, the proposed approach cannot accurately predict voltage changes. When the general condition is mild at time t, it is very likely to result in a correspondingly mild $ζ_{t}^{i}$, which gives users a higher probability of implementing ${\tilde{p}}_{t + 1}^{i}$ to maximize the user’s comfort. If ${\tilde{p}}_{t + 1}^{i}$ indicates a dramatic change compared to $p_{t}^{i}$, a slight overvoltage might happen. Nevertheless, it is corrected immediately in the next control period.

To show how the PCE is formed and how an individual user evolves, Figure 5 illustrates the changes of $[λ_{m}^{i}]$ over all four days continuously; user $i = 36$ is selected randomly as an instance. The figures of $λ_{1 m}^{i}$ and $λ_{2 m}^{i}$ together show how the two thresholds for the different $s_{i}$ in $S_{i}$ evolve. During some m, due to flexible neighbors or a mild user profile, user $i = 36$ keeps the upper hand all the time, such that both scalars in its $[λ_{m}^{i}]$ reach their maximum, to take advantage of the deregulation as much as possible. For some other m, user $i = 36$ increases its $[λ_{m}^{i}]$ at the beginning to feel out the other users, and then has to restrain its scalars, as it does not have actual superiority over the others.

The global convergence can be observed in Figure 6. The gross discord rate of each day is given by $\sum_{m = 1}^{24} D_{m}$, while the nodal voltage deviations are computed in the same manner as in Table 3, but for 7 days. It can be observed that, with the current configuration, the whole system reaches a statistically steady state from the fourth day on, which is supposed to be the PCE according to the derivations in Section 2 and Section 3. Users achieve their approximate local optimum in the non-cooperative game. Four days is not a short time. Nevertheless, although it comes with a high gross discord rate, the system is well controlled from the first day, and the equilibrium is then gradually formed via the trial and error learning process. Once the equilibrium is formed, it is robust unless the whole LVDN changes completely, which makes the LVDN hot-pluggable and flexible. These features are in line with the concept of ADN as well.
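Statistics of the kind reported in Table 3 can be computed from simulated nodal voltages as sketched below. This is an illustrative assumption about how such numbers may be derived, not the authors' post-processing: the function name, the flat list of voltage samples, and the sample values are hypothetical, while the 230 V reference and the 10% threshold come from the paper.

```python
def deviation_stats(voltages, v_ref=230.0, limit_pct=10.0):
    """Return (min deviation %, max deviation %, share outside %, share inside %).

    voltages  -- nodal voltage samples in volts (all nodes and time steps)
    v_ref     -- reference voltage v^r (230 V in the case study)
    limit_pct -- deviation threshold in percent (10% in Table 3)
    """
    deviations = [100.0 * (v - v_ref) / v_ref for v in voltages]
    outside = sum(1 for d in deviations if abs(d) > limit_pct)
    share_outside = 100.0 * outside / len(deviations)
    return min(deviations), max(deviations), share_outside, 100.0 - share_outside


if __name__ == "__main__":
    # Made-up samples: one of five lies more than 10% above the reference.
    samples = [230.0, 241.5, 253.0, 255.3, 218.5]
    low, high, out_share, in_share = deviation_stats(samples)
    print(f"range: {low:+.2f}% to {high:+.2f}%")
    print(f"outside: {out_share:.2f}%, inside: {in_share:.2f}%")
```

A sample at exactly 10% deviation is counted as inside the band, matching the "less than or equal to 10%" column of Table 3.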

5. Conclusions

A model-free control approach with a distributed decision-making architecture is proposed in this paper. Using statistical methods and game theory, it achieves a good approximation to the local optimum among individual users in the LVDN. Compared to conventional approaches applied in the LVDN, the proposed approach achieves active control with a low communication burden and few computational resources. Users and broadcasters are double-blind to each other, which allows users to enter or quit the network freely (i.e., hot-plugging). Moreover, only anonymous general information is delivered through the whole network, which addresses the privacy concern. These features bring the proposed approach in line with the developing trend of privacy protection and decentralization in the LVDN.

Although not many, there are some existing works that propose voltage control by game-theoretical algorithms. For instance, Zhou et al. [50] employ Volt/VAr control dynamics with a nonlinear power flow model to formulate a voltage control game. Compared to the proposed approach, as [50] works with an explicit model of the influence that users can exert on the voltage by changing their consumption, a more accurate result could be expected if the model is well designed. Nevertheless, if the gradient of its piecewise linear Volt/VAr control curve is too large, the algorithm may have a convergence problem. This is not a problem for the proposed approach, as its convergence is guaranteed by the log-linear trial and error learning process and the approach itself is model-free. Nassaj and Shahrtash [51] employ the Shapley–Shubik index to implement dynamic voltage control in the distribution network. Normally, this approach needs to determine the Shapley values by communication before starting the game; Nassaj and Shahrtash [51] calculate the Shapley–Shubik indices and then distribute them to implement the control, which is essentially not a completely distributed control.

Theoretically, the proposed approach can be applied in other scenarios as well, as long as the control can be formulated as a game that meets two prerequisites. Firstly, it should be a potential game, which in the power system mainly means a congestion game. Secondly, there should be a hierarchical relationship in terms of priority among the interdependent controlled agents, no matter whether this relationship is inherent or given on purpose. Examples include the control of OLTC and elastic loads in [52], smart EV charging in [53], the demand side management of large populations of thermostatically controlled loads [54], and frequency control with energy storage systems in the distribution network [55].

Although the proposed approach is able to minimize $J_{i}$ by reaching the PCE, it does not guarantee the global optimum, as the NE is an equilibrium among users in which every user i is close to its local optimum with limitations. According to the Fundamental Theorems of Welfare Economics [56], the global optimum might be achieved via peer-to-peer communication with sophisticated algorithms and configuration. This has the potential to promote Pareto improvements on the achieved NE and eventually to reach a statistically steady point on the Pareto frontier. This will be the focus of future research, as seeking the Pareto frontier is one of the classic ways to solve multi-objective optimization problems.

Author Contributions

Conceptualization, B.W. and G.D.; methodology, B.W.; software, B.W.; validation, B.W.; formal analysis, B.W.; data curation, B.W.; writing–original draft preparation, B.W.; writing–review and editing, B.W. and G.D.; visualization, B.W.; supervision, G.D.; project administration, G.D.; funding acquisition, G.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by CSC, VLAIO Flux50 ICON HBC.2018.0527 ROLECS “Roll-out of Local Energy Communities”, and KU Leuven BOF/IOF C24/16/018 “Energy Storage as a Disruptive Technology in the Energy System of the Future”.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Zeadally, S.; Pathan, A.S.K.; Alcaraz, C.; Badra, M. Towards privacy protection in smart grid. Wirel. Pers. Commun. 2013, 73, 23–50.
2. Hou, Z.; Xiong, S. On model free adaptive control and its stability analysis. IEEE Trans. Autom. Control. 2019, 64, 4555–4569.
3. Engler, A.; Soultanis, N. Droop control in LV-grids. In Proceedings of the 2005 IEEE International Conference on Future Power Systems, Amsterdam, The Netherlands, 16–18 November 2005; pp. 1–6.
4. Diaz, N.L.; Dragičević, T.; Vasquez, J.C.; Guerrero, J.M. Intelligent distributed generation and storage units for DC microgrids—A new concept on cooperative control without communications beyond droop control. IEEE Trans. Smart Grid 2014, 5, 2476–2485.
5. Liu, Y.; Wang, J.; Li, N.; Fu, Y.; Ji, Y. Enhanced load power sharing accuracy in droop-controlled DC microgrids with both mesh and radial configurations. Energies 2015, 8, 3591–3605.
6. Patel, U.N.; Gondalia, D.; Patel, H.H. Modified droop control scheme for load sharing amongst inverters in a micro grid. Adv. Energy Res. 2015, 3, 81–95.
7. Planas, E.; Gil-de Muro, A.; Andreu, J.; Kortabarria, I.; de Alegría, I.M. General aspects, hierarchical controls and droop methods in microgrids: A review. Renew. Sustain. Energy Rev. 2013, 17, 147–159.
8. Martins, V.F.; Borges, C.L. Active distribution network integrated planning incorporating distributed generation and load response uncertainties. IEEE Trans. Power Syst. 2011, 26, 2164–2172.
9. Cheng, L.; Zhang, Z.; Jiang, H.; Yu, T.; Wang, W.; Xu, W.; Hua, J. Local energy management and optimization: A novel energy universal service bus system based on energy Internet technologies. Energies 2018, 11, 1160.
10. Molzahn, D.K.; Dörfler, F.; Sandberg, H.; Low, S.H.; Chakrabarti, S.; Baldick, R.; Lavaei, J. A survey of distributed optimization and control algorithms for electric power systems. IEEE Trans. Smart Grid 2017, 8, 2941–2962.
11. Vardakas, J.S.; Zorba, N.; Verikoukis, C.V. A survey on demand response programs in smart grids: Pricing methods and optimization algorithms. IEEE Commun. Surv. Tutor. 2014, 17, 152–178.
12. Mukherjee, J.C.; Gupta, A. A review of charge scheduling of electric vehicles in smart grid. IEEE Syst. J. 2014, 9, 1541–1553.
13. Kargarian, A.; Mohammadi, J.; Guo, J.; Chakrabarti, S.; Barati, M.; Hug, G.; Kar, S.; Baldick, R. Toward distributed/decentralized DC optimal power flow implementation in future electric power systems. IEEE Trans. Smart Grid 2016, 9, 2574–2594.
14. Nocedal, J.; Wright, S. Numerical Optimization; Springer Science & Business Media: New York, NY, USA, 2006.
15. Bistritz, I.; Ward, A.; Zhou, Z.; Bambos, N. Smart Greedy Distributed Allocation in Microgrids. In Proceedings of the ICC 2019 IEEE International Conference on Communications (ICC), Shanghai, China, 20–24 May 2019; pp. 1–6.
16. Liu, X.; Wang, Q.; Wang, W. Evolutionary Analysis for Residential Consumer Participating in Demand Response Considering Irrational Behavior. Energies 2019, 12, 3727.
17. Zhu, M.; Frazzoli, E. Distributed robust adaptive equilibrium computation for generalized convex games. Automatica 2016, 63, 82–91.
18. Li, S.; Başar, T. Distributed algorithms for the computation of noncooperative equilibria. Automatica 1987, 23, 523–533.
19. Pang, J.S.; Scutari, G.; Facchinei, F.; Wang, C. Distributed power allocation with rate constraints in Gaussian parallel interference channels. arXiv 2007, arXiv:cs/0702162.
20. Palomar, D.P.; Eldar, Y.C. Convex Optimization in Signal Processing and Communications; Cambridge University Press: Cambridge, UK, 2010.
21. Li, J.; Li, C.; Xu, Y.; Dong, Z.Y.; Wong, K.P.; Huang, T. Noncooperative game-based distributed charging control for plug-in electric vehicles in distribution networks. IEEE Trans. Ind. Inform. 2016, 14, 301–310.
22. Hart, S.; Mas-Colell, A. Stochastic uncoupled dynamics and Nash equilibrium. Games Econ. Behav. 2006, 57, 286–303.
23. Young, H.P. Learning by trial and error. Games Econ. Behav. 2009, 65, 626–643.
24. Foster, D.P.; Young, H.P. Regret testing: Learning to play Nash equilibrium without knowing you have an opponent. Theor. Econ. 2006, 1, 341–367.
25. Kaufmann, E.; Cappé, O.; Garivier, A. On Bayesian upper confidence bounds for bandit problems. In Proceedings of the Artificial Intelligence and Statistics, La Palma, Canary Islands, 21–23 April 2012; pp. 592–600.
26. Fudenberg, D.; He, K. Player-compatible equilibrium. arXiv 2017, arXiv:1712.08954.
27. Bielefeld, R.S. Reexamination of the perfectness concept for equilibrium points in extensive games. In Models of Strategic Rationality; Springer: New York, NY, USA, 1988; pp. 1–31.
28. Stamtsis, G.C.; Erlich, I. Use of cooperative game theory in power system fixed-cost allocation. IEE Proc. Gener. Transm. Distrib. 2004, 151, 401–406.
29. O’Brien, G.; El Gamal, A.; Rajagopal, R. Shapley value estimation for compensation of participants in demand response programs. IEEE Trans. Smart Grid 2015, 6, 2837–2844.
30. Lin, C.H.; Chen, S.J.; Kuo, C.L.; Chen, J.L. Non-cooperative game model applied to an advanced metering infrastructure for non-technical loss screening in micro-distribution systems. IEEE Trans. Smart Grid 2014, 5, 2468–2469.
31. Marzband, M.; Javadi, M.; Domínguez-García, J.L.; Moghaddam, M.M. Non-cooperative game theory based energy management systems for energy district in the retail market considering DER uncertainties. IET Gener. Transm. Distrib. 2016, 10, 2999–3009.
32. Shaloudegi, K.; Madinehi, N.; Hosseinian, S.; Abyaneh, H.A. A novel policy for locational marginal price calculation in distribution systems based on loss reduction allocation using game theory. IEEE Trans. Power Syst. 2012, 27, 811–820.
33. Grammatico, S.; Parise, F.; Colombino, M.; Lygeros, J. Decentralized convergence to Nash equilibria in constrained deterministic mean field control. IEEE Trans. Autom. Control. 2015, 61, 3315–3329.
34. Yin, H.; Zhao, C.; Li, M.; Ma, C.; Chow, M.Y. A game theory approach to energy management of an engine–generator/battery/ultracapacitor hybrid energy system. IEEE Trans. Ind. Electron. 2016, 63, 4266–4277.
35. Molina, Y.P.; Prada, R.B.; Saavedra, O.R. Complex losses allocation to generators and loads based on circuit theory and Aumann-Shapley method. IEEE Trans. Power Syst. 2010, 25, 1928–1936.
36. Bhakar, R.; Sriram, V.; Padhy, N.P.; Gupta, H.O. Probabilistic game approaches for network cost allocation. IEEE Trans. Power Syst. 2009, 25, 51–58.
37. De Brabandere, K.; Bolsens, B.; Van den Keybus, J.; Woyte, A.; Driesen, J.; Belmans, R. A voltage and frequency droop control method for parallel inverters. IEEE Trans. Power Electron. 2007, 22, 1107–1115.
38. Laaksonen, H.; Saari, P.; Komulainen, R. Voltage and frequency control of inverter based weak LV network microgrid. In Proceedings of the 2005 IEEE, International Conference on Future Power Systems, Amsterdam, The Netherlands, 16–18 November 2005; pp. 6–13.
39. McDonald, J. Adaptive intelligent power systems: Active distribution networks. Energy Policy 2008, 36, 4346–4351.
40. Kukushkin, N.; Men’shikov, I.; Men’shikova, O.; Morozov, V. Resource allocation games. Comput. Math. Model. 1990, 1, 433–444.
41. Bistritz, I.; Leshem, A. Approximate best-response dynamics in random interference games. IEEE Trans. Autom. Control. 2017, 63, 1549–1562.
42. Candogan, O.; Ozdaglar, A.; Parrilo, P.A. Near-potential games: Geometry and dynamics. ACM Trans. Econ. Comput. 2013, 1, 11.
43. Pradelski, B.S.; Young, H.P. Learning efficient Nash equilibria in distributed systems. Games Econ. Behav. 2012, 75, 882–897.
44. Tant, J.; Geth, F.; Six, D.; Tant, P.; Driesen, J. Multiobjective battery storage to improve PV integration in residential distribution grids. IEEE Trans. Sustain. Energy 2012, 4, 182–191.
45. Richardson, I.; Thomson, M.; Infield, D.; Clifford, C. Domestic electricity use: A high-resolution energy demand model. Energy Build. 2010, 42, 1878–1887.
46. Yordanov, G.; Smolders, F.; Olaerts, A.; Verbeek, G.; Baert, K.; Driesen, J. A 368-kWp Grid-connected PV System: Known and Hidden Losses. In Proceedings of the European PV Solar Energy Conference and Exhibition (EUPVSEC), Amsterdam, The Netherlands, 25–29 September 2017; WIP: Munich, Germany, 2017.
47. Cheng, C.S.; Shirmohammadi, D. A three-phase power flow method for real-time distribution system analysis. IEEE Trans. Power Syst. 1995, 10, 671–679.
48. Liu, Y.; Li, J.; Wu, L.; Ortmeyer, T. Chordal relaxation based ACOPF for unbalanced distribution systems with DERs and voltage regulation devices. IEEE Trans. Power Syst. 2018, 33, 970–984.
49. David Fobes, C.C. ThreePhasePowerModels. Available online: https://github.com/lanl-ansi/ThreePhasePowerModels.jl (accessed on 2 August 2019).
50. Zhou, X.; Tian, J.; Chen, L.; Dall’Anese, E. Local voltage control in distribution networks: A game-theoretic perspective. In Proceedings of the IEEE 2016 North American Power Symposium (NAPS), Fargo, ND, USA, 9–11 September 2016; pp. 1–6.
51. Nassaj, A.; Shahrtash, S.M. A dynamic voltage control scheme by employing cooperative game theory. In Proceedings of the 2017 IEEE Iranian Conference on Electrical Engineering (ICEE), Tehran, Iran, 2–4 May 2017; pp. 986–990.
52. Christakou, K.; Tomozei, D.C.; Le Boudec, J.Y.; Paolone, M. GECN: Primary voltage control for active distribution networks via real-time demand-response. IEEE Trans. Smart Grid 2013, 5, 622–631.
53. Tang, Q.; Xie, M.; Yang, K.; Luo, Y.; Zhou, D.; Song, Y. A decision function based smart charging and discharging strategy for electric vehicle in smart grid. Mob. Networks Appl. 2019, 24, 1722–1731.
54. De Paola, A.; Trovato, V.; Angeli, D.; Strbac, G. A Mean Field Game Approach for Distributed Control of Thermostatic Loads Acting in Simultaneous Energy-Frequency Response Markets. IEEE Trans. Smart Grid 2019, 10, 5987–5999.
55. Sun, Y.; Bahrami, S.; Wong, V.W.; Lampe, L. Chance-constrained frequency regulation with energy storage systems in distribution networks. IEEE Trans. Smart Grid 2019.
56. Aliprantis, C.D.; Burkinshaw, O. The fundamental theorems of welfare economics without proper preferences. J. Math. Econ. 1988, 17, 41–54.

Figure 1. The topology of the test network. Cable lengths are drawn to scale.

Figure 2. General voltage (a) and discord rate (b) from day 1 to day 4.

Figure 3. Comparison between proposed approach (day 4) and centralized approach.

Figure 4. Nodal voltages of the proposed approach (day 4) and centralized approach.

Figure 5. The evolution of $[λ_{m}^{i}]$ in four days, $i = 36$.

Figure 6. The evolution of gross discord rate and nodal voltage deviations.

Table 1. Cable parameters.

Cable Type	Impedance at 45 °C (Ω/km)
EAXVB 1 kV 4 × 150 mm$^{2}$	0.227 + 0.078j
EXVB 1 kV 4 × 16 mm$^{2}$	1.265 + 0.083j

Table 2. Operating ranges of users (negative values mean production, positive values consumption of electricity).

User Type	Operating Range (kW)
Houses without PV	0∼13
Houses with PV	−2∼13
Small PV farms	−11.5∼0
Residential wind turbines	−20∼0
High-consuming unit	2∼17
Residential generators	−7∼0

Table 3. Performance statistics of individual users.

Item	General Voltage Deviation Range (%)	$| Δ v_{t}^{i} | > 10 \%$	$| Δ v_{t}^{i} | \leq 10 \%$
No control	−6.34∼+18.07	28.08%	71.92%
ACOPF	−5.39∼+5.47	0%	100%
Day 1	−4.85∼+10.03	0.26%	99.74%
Day 2	−6.14∼+9.55	0.19%	99.81%
Day 3	−6.00∼+8.93	0.12%	99.88%
Day 4	−6.01∼+8.66	0.12%	99.88%

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

