Next Article in Journal
A Joint Active Damping Strategy Based on LCL-Type Grid-Connected Inverters for Grid Current Feedback and PCC Voltage Unit Feedforward
Previous Article in Journal
Longitudinal–Torsional Frequency Coupling Design of Novel Ultrasonic Horns for Giant Magnetostrictive Transducers
Previous Article in Special Issue
YOLO-SS-Large: A Lightweight and High-Performance Model for Defect Detection in Substations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Wide-Range TCSC Based ADN in Mountainous Areas Considering Hydropower-Photovoltaic-ESS Complementarity

School of Electrical and Electronic Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
*
Author to whom correspondence should be addressed.
Sensors 2024, 24(18), 6028; https://doi.org/10.3390/s24186028
Submission received: 13 August 2024 / Revised: 12 September 2024 / Accepted: 16 September 2024 / Published: 18 September 2024

Abstract

:
Due to the radial network structures, small cross-sectional lines, and light loads characteristic of existing AC distribution networks in mountainous areas, the development of active distribution networks (ADNs) in these regions has revealed significant issues with integrating distributed generation (DGs) and consuming renewable energy. Focusing on this issue, this paper proposes a wide-range thyristor-controlled series compensation (TCSC)-based ADN and presents a deep reinforcement learning (DRL)-based optimal operation strategy. This strategy takes into account the complementarity of hydropower, photovoltaic (PV) systems, and energy storage systems (ESSs) to enhance the capacity for consuming renewable energy. In the proposed ADN, a wide-range TCSC connects the sub-networks where PV and hydropower systems are located, with ESSs configured for each renewable energy generation. The designed wide-range TCSC allows for power reversal and improves power delivery efficiency, providing conditions for the optimization operation. The optimal operation issue is formulated as a Markov decision process (MDP) with continuous action space and solved using the twin delayed deep deterministic policy gradient (TD3) algorithm. The optimal objective is to maximize the consumption of renewable energy sources (RESs) and minimize line losses by coordinating the charging/discharging of ESSs with the operation mode of the TCSC. The simulation results demonstrate the effectiveness of the proposed method.

1. Introduction

Up to now, around the world, it has become a development trend to integrate renewable energy sources (RESs) in the form of distributed generations (DGs) into a traditional AC distribution network to form an active distribution network (ADN) [1,2]. This condition has led to dramatic alterations of distribution networks from their structures to operation control modes [3,4]. In the mountainous areas of southern China, RESs are quite affluent, such as hydro, solar, and wind energies, which have been or will be developed as run-of-river small hydropower (RoR-SH) stations, rooftop photovoltaic (PV) generation stations, and distributed wind farms. However, the existing AC distribution networks in these mountainous areas are almost radial and generally have the characteristics of small-section lines and light loads. With the increase in accessed DGs in these networks, the voltage violation issue is frequently emerging, the implementation of optimization operations is difficult, and partial transformers will operate under overload conditions while surplus DG power needs to feed the grid. Therefore, it is valuable and of practical significance to develop new link circuits with low-cost and optimization operation methods for integrating local RES generation systems into the AC distribution networks in the mountainous areas of southern China.
In the past decade or more, there has been significant attention on ADNs, with explosive growth in publications researching optimization operation strategies [5,6,7,8,9], energy storage system (ESS) configuration methods [10,11,12,13], new converter controllers [14,15,16], oscillation analysis, Volt-Var control [17,18,19,20], and other related areas. However, there has been very little focus on developing suitable and cost-effective structures specifically designed to enhance the capacity of ADNs in mountainous regions for integrating DGs, including both RoR-SH and PV.
Under the condition of considering the access of DGs, literature [21] proposed a method for planning the feeder automation configurations in the 10 kV distribution network of mountainous areas. Using the proposed method, the optimal recommendation table for determining the types and quantities of automatic switches for each kind of wiring can be obtained, and then the designed distribution network in mountainous areas is of higher reliability. However, the problems caused by accessing DGs remain unsolved. In [22], focusing on maximizing network utilization and mitigating power variations, an ADN connection equipment—a multi-terminal loop power flow controller combined by voltage source converters connected via a common DC link—is presented. The scheme can balance the loads among feeders, but it should be so expensive that it is not suitable to be used in ADNs of mountainous areas. In [23], a diamond-shaped distribution network structure is designed to improve power supply reliability and line capacity utilization in the current distribution network. The design is based on the typical wiring patterns in developed cities, e.g., Tokyo and Paris, and is also inapplicable to the wiring patterns of ADN in mountainous areas. So, until now, an ADN with excellent capacity for DG integration specifically designed for mountainous areas is wanted.
Motivated by the aforementioned facts, in this paper, we innovatively propose a wide-range thyristor-controlled series compensation (TCSC)-based ADN in mountainous areas, considering hydropower-photovoltaic-ESS complementarity. The proposed ADN incorporates which includes RoR-SH stations, rooftop PV generation stations, and ESSs. Also, a corresponding deep reinforcement learning (DRL)-based optimal operation strategy for the proposed ADN is designed in detail. This paper mainly achieves the following contributions:
(1) At the equipment level, a wide-range TCSC is designed to connect sub-networks in ADNs in mountainous areas, where RoR-SH stations and rooftop PV generation stations are located. Unlike traditional TCSCs, which are primarily set to capacitive mode for transient stability control of power systems, the proposed wide-range TCSC can be continuously adjusted for a larger range of impedance values. This design avoids the issues of resonance that can occur when the TCSC is regulated near its boundary values. Moreover, the designed wide-range TCSC is intended to enhance power transfer efficiency and facilitate power inversion during steady-state operation, providing essential support for the proposed optimal operation strategy for the ADN in mountainous areas.
(2) At the system level, with the goal of achieving the maximum consumption of renewable energy and minimum line losses, further reducing curtailment of solar and hydropower as well as the issue of voltage violation in the ADNs of mountainous areas, a DRL-based optimal operation strategy is proposed considering hydropower-photovoltaic-ESS complementarity.
A Markov decision process (MDP) formulation Is built for the optimal operation problem considering the coordinated control of ESSs and the wide-range TCSC to achieve RES efficient consumption. Unlike the existing math methods, the learned operation strategy based on MDP can deal with the variables of continuous space and achieve superior precision control performance.
(3) By coordinating the application of the designed wide-range TCSC with the proposed optimal operation strategy that considers the complementarity of solar, hydropower, and ESS, the integration capacity of renewable energy generation in mountainous distribution networks can be significantly improved. This approach effectively addresses current issues such as poor transmission efficiency and voltage violations in mountainous distribution networks, thereby demonstrating practical value.
The remainder of the paper is structured as follows: Section 2 describes the principle of the wide-range TCSC and introduces the topology of the proposed wide-range TCSC-based ADN. Section 3 illustrates the DRL-based hydropower-photovoltaic-ESS complementarity optimal operation strategy in detail. In Section 4, simulations are provided to show the effectiveness of the strategy. Section 5 draws the conclusion.

2. Principle of Wide-Range TCSC

TCSC is an important piece of equipment in the flexible AC transmission system due to its multi-functions such as power flow control, damping low-frequency power oscillation, improving transient stability, and suppressing sub-synchronous resonance. However, in this paper, a wide-range TCSC is designed to expand the function of TCSC from power flow control to the range of power flow reversal.
In this section, the principle of the designed wide-range TCSC used in the proposed ADN is introduced, and the topology of the proposed wide-range TCSC-based is introduced.

2.1. Wide-Range TCSC Operation Principle

The wide-range TCSC consists of a traditional TCSC in series with a group of capacitors and includes six main parts: two groups of capacitors, one reactor, one group of bi-directional thyristors, one ZnO lighting arrester, and one bypass breaker, as shown in Figure 1. In this case, the positive or negative of the sum of the reactances X TCSC + X Σ between the two sources determines the flow of power.
Obviously, by controlling the trigger angle of the thyristors, the equivalent reactance X TCSC of the wide-range TCSC can be changed within a certain inductive and capacitive range. X TCSC can be expressed as follows:
X TCSC = 1 ω C 1 1 ω C 2 + A π ω C 1 2 π α + sin 2 π α 4 A π ω C 1 k 2 1 cos 2 π α * η ω 0 = 1 L C 1 A = ω 0 2 ω 0 2 ω 2 k = ω 0 ω η = k tan k π α tan π α
where ω is the angular frequency under the practical operation condition, α π / 2 , π is the trigger angle of thyristors. The change curve of X TCSC with α under the industrial frequency is shown in Figure 2. Wherein, α 0 and α ref are the trigger angles corresponding to when X TCSC = 0 and when the wide-range TCSC generates resonance, respectively.
When α α 0 , α ref , the wide-range TCSC operates in adjustable inductive mode. And it presents a capacitive equivalent reactance when α 90 ° , α 0 α ref , 180 ° (named adjustable capacitive mode). Under the block mode of α = 180 ° , X TCSC is equal to value X C 1 + X C 2 . In the bypass mode of α = 90 ° , X TCSC is equal to X C 2 plus the reactance value of paralleling the capacitor and reactor.
Because the excessively high voltage between the two ports of the traditional TCSC will occur when α is close to α ref , α must be limited within the range 90 ° , α Lmax α C min , 180 ° .
In this paper, the wide-range TCSC plays the role of adjusting and even reversing power flow. As shown in Figure 2, the active power P flowing from Bus 1 to Bus 2 can be expressed as follows:
P = U 1 U 2 X Σ + X TCSC sin δ 1 δ 2 X Σ = X G 1 + X G 2 + X L 1 + X L 1 .
And, the power-voltage phase relationship in Figure 1 is shown in Figure 3.
It can be seen that the magnitude and direction of P can be adjusted by changing the reactance value X TCSC of the wide-range TCSC, which is determined by α .

2.2. Topology of Wide-Range TCSC-Based ADN

Figure 4 demonstrates the proposed wide-range TCSC-based and in the mountainous area, which includes two T-shaped sub-networks and a wide-range TCSC.
In each T-shaped sub-network, three 10 kV AC lines are linked together through a three-winding transformer. Wherein the ends of two AC lines are respectively connected to two 10 kV buses that belong to the substations of the 110 kV/220 kV main grid. And branches of the above two AC lines can be used for serving power users. The remaining AC line is used to connect the wide-range TCSC for the connection between two sub-networks. Meanwhile, it also takes charge of integrating DGs, ESSs, and loads. As shown in Figure 4, this paper studies the case that RoR-SHs and PVs are severally integrated in sub-network 1 and in sub-network 2. Also, both Line 3 and Line 6 are equipped with respective ESS.
Traditional TCSC always operates in the capacitive mode to improve power system stability. But it is also theoretically possible to achieve the power reversal on the line with the traditional one. However, due to the limitation of the boundary value α C min of the trigger angle, the maximum absolute value of the capacitive X TCSC may be smaller than the inductive equivalent impedance value X Σ , resulting in the impossibility of power reversal. Thus, the proposed wider range of TCSC reactance is needed.
According to Thevenin’s theorem, the proposed ADN of Figure 4 can be equated to a double-generator power system as shown in Figure 5. Therefore, the proposed wide-range TCSC analysis can be applied to the proposed ADN.
The open circuit voltage U oc 1 , U oc 2 and equivalent resistance R eq 1 , R eq 2 can be calculated as follows:
f x , y = x y x + y X n = f R S n , Y L n / 2 + Z L n × Y L n / 2 f R S n , Y L n / 2 + Z L n + Y L n / 2 + Z T 1 , n 1 , 2 , 3 , 4 Z m = X 2 m 1 X 2 m Y LT X 2 m 1 Y LT + X 2 m Y LT + X 2 m 1 X 2 m + Z T 2 , m 1 , 2 R o m = Z m Z load m Z m + Z load m X a , c = f f X a , Y LT , Z T 2 + Z load c Z T 2 + Z load c , a 1 , 2 c 1 , 2 X a , b , c = f X a , c Z T 2 + Z load c + Z T 1 , Y L b / 2 + Z L b , b 1 , 2 R eq c = X a , b , c U oc a , b , c = Z load c U s b f X a , b , c , Y L b / 2 X a , b , c R s 2 + f X a , b , c , Y L b / 2 × X a , b , c Z L b Z T 2 + Z load c U oc 1 = U oc 1 , 2 , 1 + U oc 2 , 1 , 1 U oc 2 = U oc 3 , 4 , 2 + U oc 4 , 3 , 2 .

2.3. The Feasibility Verification of Wide-Range TCSC

The feasibility of the wide-range TCSC will be verified in two cases. One is from the adjustable inductive mode regulating to the adjustable capacitive mode; the other is the opposite. Wherein, in the former case, it is required that the wide-range TCSC is firstly controlled to switch to the bypass mode and then to the block mode for the alteration. For the second transition case, the switch procedure is the reverse of the above.
However, a fast and direct TCSC control method, especially one that results in a conversion of the power flow direction in an AC line, will definitely cause violent oscillation or even the destabilization of the power system. Therefore, a smooth TCSC control strategy for the proposed is essential. In this simulation, an equal interval transformation method is adopted to change the impedance of TCSC each time. When it is necessary to adjust tande impedance value of the TCSC from X TCSC 1 to X TCSC 2 , set an interval of X TCSC 1 X TCSC 2 / n to be adjusted every 0.02 s, where n = 10 in the simulation cases of this section.
In these cases, the parameters of the propoand ADN are listed in Table 1. The network parameters are selected with reference to the actual distribution network. The trigger angles α L and α C corresponding to the two modes are randomly selected.
In case 1, with the initial power flow from sub-network 1 to sub-network 2, the initial trigger angle α of the wide-range TCSC is 132°, at which time X TCSC = 1.7708 Ω . Since 0.3 s, by adjusting α , the impedance value of the TCSC is controlled to reduce by roughly 3 Ω every 0.02 s. The adjustment of α stops at 0.48 s, when X TCSC = 27.3106 Ω , corresponding to α of 145.5°.
The simulation result of the above process is shown in Figure 6, in which P1 Q1 and P2 Q2 are the active and reactive power on the left and right sides of the TCSC, respectively. It can be seen that the wide-range TCSC functions of power inversion can be well implemented. Figure 7 proves that the variations of the load voltage Uload1 and Uload2 on the line where the TCSC is located and the voltage UTa3 and UTb3 on the third winding sides of the three-winding transformers are always within the permissible voltage fluctuation range during the above process.
For case 2, the initial power flow from sub-network 2 to sub-network 1, α is adjusted from 145.5° to 132° using the same method described above, resulting in the equivalent reactance value of the TCSC being adjusted from −27.3106 Ω to 1.7708 Ω. Figure 8 verifies the success of the power reversal, and Figure 9 demonstrates that this process does not suffer from the voltage violation.

3. DRL-Based Optimization Operation Strategy

This section proposes a DRL-based optimization operation strategy considering hydropower-photovoltaic-ESS complementarity for the proposed ADN. Firstly, the optimization problem is described in general. Then, we formulate the optimization control problem of ADN as a MDP. TD3 algorithm is introduced to solve the problem.

3.1. Problem Description

It is well known that in the distribution network of mountainous areas, RoR-SH provides stable power but cannot regulate its output and relies entirely on natural runoff. On the other hand, PV generation is intermittent and unstable, as its output is affected by sunlight intensity, weather conditions, and seasonal changes. In a considering hydropower-photovoltaic-ESS complementary system, hydropower provides stable power, PV generation offers additional peak power, and the energy storage system balances the fluctuations between the two, ensuring a stable power supply while reducing dependence on traditional fossil fuels.
In order to reduce light and water waste and promote the consumption of renewable energy, a DRL-based optimization operation strategy with hydropower-photovoltaic-ESS complementarity for the proposed ADN is proposed in this paper. The strategy aims to make the two types of DGs (i.e., RoR-SH and PV) with intermittent characteristic compensate each other in terms of power generation by controlling the wide-range TCSC and ESSs in multiple scenes, thus maximizing the local consumption of DG’s power and minimizing line losses. Wherein the former can be viewed as minimizing the supply of energy from the main grids to the ADN. Then, the objective function can be expressed as follows:
max f = n N = 4 P n + P i loss
where n N = 4 P n are the active power supplied by the main grid n in the proposed ADN, in other words, they are the active power on Busa1, Busa4, Busb1, and Busb4 of the proposed ADN, respectively; P i loss represents the active power loss in line i.

3.2. MDP Formulation of the Optimization Operation Strategy

The problem of the optimization operation strategy can be formulated as an MDP, which is defined as a tuple {S, A, P, R, γ}, where S is a set of the environment states, A is the action set, P is the state transition probability, R is the immediate reward set, and γ is the discount factor of the reward. The definitions of State st, Action at, Reward rt, and probability p at t time-step are elaborated as follows:
  • State: In the environment formed by the proposed ADN, state space st can be defined as follows:
    s t = P t PV , P t RMHS , P t load , P 1 , t ESS , P 2 , t ESS , X 1 , t SOC , X 2 , t SOC , α t TCSC , X t TCSC , P 1 , t grid , P 2 , t g r i d , P 3 , t grid , P 4 , t grid
    where P t PV , P t RMHS , and P t load denote the active power generated by the PV and RoR-SH, and demanded by the users; P 1 , t ESS and P 2 , t ESS , X 1 , t SOC and X 2 , t SOC are the active power outputs and the states of charge (SOCs) of the ESSs; α t TCSC and X t TCSC present the trigger angle and the equivalent reactance value of the wide-range TCSC, respectively.
  • Action: The action a t represents the setting value of the trigger angle α t TCSC of the wide-range TCSC and the active power P 1 , t ESS and P 2 , t ESS charged/discharged by the ESSs at timeslot t with the given state s t . Action a t can be present as follows:
    a t = α t TCSC , P 1 , t ESS , P 2 , t ESS .
    wherein, P 1 , t ESS and P 2 , t ESS are both limited by the maximum and minimum value of the charge/discharge of the ESSs. Due to the wide-range TCSC operation principle in Section 2, α t TCSC is restricted as follows:
    α t TCSC 90 ° , α Lmax α C min , 180 ° .
  • Reward: In accordance with the optimization operation objective of maximizing the local consumption of renewable energy and minimizing line losses analyzed in Section 3.1, the reward r t can be defined as follows:
    r t = n = 1 4 P n + P i loss k
    where k is a penalty term that can be used to correct an improper action strategy. For the proposed ADN, the current action may cause the SOCs of the ESSs in the next state to exceed their permissible upper and lower limits which is not allowed. Meanwhile, voltage violation is not permitted. Thus, k is set to be the following:
    k 1 = X t SOC X t SOCmax X t SOCmax < X t SOC X t SOCmin X t SOC X t SOC X t SOCmin k 2 = U i , t 1.05 U N i 1.05 U N i < U i , t 0.95 U N i U i , t U i , t 0.95 U N i k = p k 1 + q k 2
    where p and q are the penalty factor; X t SOCmax and X t SOCmin are the maximum and minimum SOC values specified for the ESSs, respectively; U i , t is the voltage at node i in the ADN, whose permissible deviation is ±5% of the rated voltage U N i .
  • State Transition: In the current state s t , the trigger angle α of the wide-range TCSC and the active power charged/discharged by the ESSs are controlled according to the action strategy a t , which refreshes the power flow of the proposed ADN and makes the node voltages fluctuate randomly. Thus, the state of the next moment s t + 1 can be obtained. The probability p of the process of s t transited to s t + 1 is as follows:
    p s t , s t + 1 = Pr s t + 1 s t , a t .

3.3. Twin Delayed Deep Deterministic Policy Gradient

Based on the above MDP model, the above hydropower-photovoltaic-ESS complementarity optimization operation strategy model can be transformed into a DRL framework so that it can be solved using DRL algorithms. In this section, we introduce a TD3 algorithm, which is an advanced policy gradient algorithm for solving continuous action control issues, to solve the optimization operation problem of the proposed ADN.
The TD3 algorithm contains a total of six neural networks, which are divided into two main categories: one pair of Actor Online-Target networks π and two pairs of Critic Online-Target networks θ 1 and θ 2 . Actor networks can directly select action value by the policy in continuous space, while Critic networks can judge the behavior of the Actor networks under the current state by fitting the action value function. Its training architecture diagram is shown in Figure 10.
During the learning process, the proposed ADN state s t is extracted and fed into the Actor online network, which will give the corresponding action a t and interact with the environment once. The action a t at time t can be expressed as a t = π s t ; μ π , where μ π is the weight of the Actor network. Then, a new action instruction a t + 1 will be given according to the new state s t + 1 . Noise can be added to the action during the training process. After an interaction is completed, the experience replay buffer stores the interaction information, and the network parameters will be updated by loss functions L when the interaction information stored in the experience replay buffer exceeds the number of network parameter update batches B. The Actor optimizer and the Critic optimizer update the Actor online network parameters and the Critic online network parameters based on the sampled data, respectively, and the Actor target network and the Critic target network are soft updated based on the current Actor online network as well as the Critic online network parameters. When the interaction information stored in the experience replay buffer exceeds the capacity, the initial interaction data are deleted.
DRL algorithms always can be divided into three categories: value-based algorithm, policy-based algorithm, and Actor-Critic algorithm, and it has been verified that the Actor-Critic algorithm always has better results. TD3 is a deterministic policy gradient algorithm with an Actor-Critic frame. It combines Deep Deterministic Policy Gradient (DDPG) and double networks, which can improve DDPG. On the basis of DDPG, TD3 adopts three pivotal technologies.
  • Double Networks: Essentially, the TD3 algorithm is a method of Actor-Critic. What is distinct is that TD3 has two Critic online networks and two Critic target networks. The two pairs of Critic Online-Target networks calculate the R-values at the same time as the parameter update, but the smaller R-value (TD Target) is selected for updating the Crttic networks, as shown in Figure 11.
This method can effectively avoid producing the problem of overestimation caused by the calculation of maximization in DDPG.
2.
Target Policy Smoothing Regularization: When the Critic network is updated in DDPG, the learning process with the deterministic policy easily occurs overfitting phenomenon, which results in a large variance of the target estimation and inaccurate estimation value. This problem can be reduced by regularization, so that target policy smoothing, a regularization method, is introduced. When the target value is calculated, noise obeying a normal distribution is added to the action value to improve the accuracy of the target value estimation and to ensure the stability of the network training process. The amended TD Target can be calculated by the following:
y = r + γ min i = 1 , 2 R i s t + 1 , π t + 1 s t + 1 + ε ε clip N 0 , σ ˜ , c , c
where γ is the discount factor; ε is the clipped noise, which is restricted in [-c, c] and follows a normal distribution.
3.
Delayed Update: After the Critic network has been updated several times, the Actor network will be updated once, so as to ensure the training of the Actor network is more stable.

4. Simulations

In this section, we evaluate the performance of the TD3-based optimization operation strategy with hydropower-photovoltaic-ESS Complementarity for the proposed ADN.

4.1. Experimental Setup

The simulation is conducted based on the ADN shown in Figure 4. The generation information for PV and RoR-SH and load demand information are derived from real-time series data downloaded from a power system research institution in a province in southern China. The data spans from 1 January 2020 for a duration of 3 years, with recordings taken at hourly intervals. Data from the first 20 days of each month are used for training, while the remaining days serve as the test set. The data are scaled appropriately for the system under consideration. Additionally, network parameters and wide-range TCSC parameters follow the specifications in Table 1. The ESSs have the same capacity of 3 MVA and a regulation range of [−0.1, 0.1] MVA.
A training episode is configured to last 24 h, with each hour representing a step, resulting in a total of 24 steps per episode. After each action is executed, feedback in the form of a reward is received from the environment. The episode concludes normally after running successfully for 24 h. However, if SOCs of ESSs or node voltages exceed their limits during any step, a large negative reward is given, and the episode is terminated immediately, after which a new training episode begins.
To evaluate the advantages of the proposed strategy in terms of renewable energy consumption capacity, four cases are set up for comparative analysis:
  • Case 1: Select data from May 15, which represent a period of abundant water with lower PV generation;
  • Case 2: Select data from November 15, which correspond to the period of reduced water availability and higher PV generation;
  • Case 3: Select data from May 15 and employ the ADN without ESSs;
  • Case 4: Select data from May 15 and employ the ADN without wide-range TCSC;
  • Case 5: Select data from May 15 and employ the ADN without wide-range TCSC and ESSs.
The information on renewable energy and load on the above two days is shown in Figure 12.

4.2. Simulation Result Analysis

4.2.1. Algorithm Performance

The loss functions of the three neural networks, the Critic Online network θ 1 and θ 2 , and the Actor Online network π that need to be trained in TD3, are shown in Figure 13. The dark curve is the result of smoothing the true light curve. It can be seen that these training loss values are towards convergence after several steps, indicating that good value judgments and action decisions have been learned to make.
In Figure 14, the training rewards are depicted, with the light-colored curves representing the real reward values under random initial value settings of network parameters, while the darker curve shows the average reward per episode. Initially, the rewards per episode are low, indicating that the training is receiving negative rewards and episodes are cut short. This early phase reflects the DRL algorithms’ exploration of the environment and the lack of effective optimization strategies. As training progresses, the algorithm improves through interaction with the environment, leading to longer episode durations. After 600 episodes, the duration of each episode reaches 24 h with minimal fluctuations, demonstrating that effective strategies have been learned to maintain safe and stable operation without exceeding limits. It can also be seen from Figure 14 that the TD3 algorithm used in this paper achieves more convergent results and higher reward values compared to the DDPG algorithm.

4.2.2. ESSs and Wide-Range TCSC Control Results

The optimization results over a 24 h period are shown in Figure 15 and Figure 16. Specifically, Figure 15a,b display the active power outputs and SOCs of the ESSs for Case 1 and Case 2, respectively, and Figure 16 illustrates the trigger angle and equivalent impedance variation of the wide-range TCSC for both cases.
From Figure 15, it can be seen that in the period of abundant water, during the period of 0:00–2:00/3:00 and 19:00–23:00, due to the low load demand and the absence of PV output, the ESSs are used to discharge the appropriate amount of power to reduce the supply pressure on the main grid and the RoR-SH. From 3:00/4:00 to 18:00, the load demand is higher during this period, the PV output also increases considerably, and the ESS is charged to absorb the excess PV/RoR-SH output to reduce the amount of abandoned light/water.
In the period of withered water and higher PV generation, as a result of the slight increase in PV output, the reduction output in the RoR-SH, and the reduction in load demand compared to case 1, the ESSs are discharged from 0:00–8:00/9:00 and 18:00/19:00–23:00 and charged from 9:00/10:00–17:00/18:00.
As shown in Figure 16, in order to increase the capacity of renewable energy consumption and improve the power transfer efficiency, the wide-range TCSC operates in capacitive mode after a certain value of PV generation is reached, prompting the power from the subnetwork side of RoR-SH to be delivered to the side of PV. Additionally, it operates in an inductive model for the rest of the time and facilitates the power flow in the reverse direction.

4.2.3. Superiority Statement

First, we present the voltage distribution of all nodes in the proposed ADN over a 24-h period for Case 1 and Case 2, as shown in Figure 17. The X-axis represents the ADN node numbers (0–19), the Y-axis represents time (0–24 h), and the Z-axis represents node voltage. The yellow plane above indicates the upper voltage limit (+3%), while the purple plane below represents the lower voltage limit (−3%). The curved surface in the middle shows the voltage distribution at different times for each node. It is evident that all node voltages remain within the voltage constraint range throughout all time periods without exceeding limits. This indicates that the proposed strategy effectively addresses the voltage violation issues in the network of RoR-SH.
Next, calculate the difference in power Δ P 1 and Δ P 2 at the junctions of the two subnetworks with and without the wide-range TCSC in Case 1 and Case 2, as shown in Figure 18. It can be concluded that the power transmission efficiency of a line equipped with wide-range TCSC is always greater than that of a line without wide-range TCSC. This design offers obvious benefits for ADNs in mountainous areas, where lines are long and the environment is complex.
In order to verify the advantages of the proposed strategy in renewable energy consumption, reward values of Case 1, Case 3, Case 4, and, Case 5 for each action are compared over 24 h, as shown in Figure 19.
Reward results in Case 3 and Case 4 illustrate that equipping ESS and wide-range TCSC individually also improves the renewable energy consumption capacity of the ADN to a certain extent, in which wide-range TCSC can balance the active power flow among feeders, improve the power delivery efficiency, and reduce line losses, while ESS further improves the load profile to match the renewable energy generation and reduces the amount of light and water rejections. Nevertheless, it can be seen that the best reward value results can only be obtained by equipping both ESS and wide-range TCSC at the same time, indicating that the renewable energy consumption capacity is the strongest in case 1.

5. Conclusions

With the wide application of high proportion DG, the development of ADNs in mountainous areas with the characteristic of small-section lines and light loads face great challenges, such as difficulties in optimization operation, light and water abandonment, and voltage violation. To address the above issue, this paper proposes a wide-range TCSC-based ADN in mountainous areas and a corresponding DRL-based optimization operation strategy considering hydropower-photovoltaic-ESS complementarity, where the analysis and the arithmetic examples can lead to the following conclusions.
(1) Theoretical analysis and simulation verification demonstrate that the proposed wide-range TCSC can better achieve the function of power reversal and improve the power transfer efficiency under the steady-state operation environment, which provides the conditions for optimization operation.
Compared with the current research popularity of converter-based soft open point (SOP), which can also achieve this function, the smaller footprint and lower cost of TCSC make it more suitable for the ADN in mountainous areas.
(2) A DRL-based optimization operation strategy is proposed considering hydropower-photovoltaic-ESS complementarity for the proposed ADN. Based on the proposed strategy, digital simulation using the proposed ADN in mountainous areas is conducted. It is verified that the proposed optimal operation strategy, combined with the wide-range TCSC, can effectively enhance the renewable energy consumption capacity and power transfer efficiency of the ADN in mountainous areas while avoiding voltage violation issues. This offers practical value for mountainous distribution networks.
Currently, the equipment in mountainous distribution networks is often quite outdated and lacks the infrastructure to support advanced monitoring and control technologies. These deficiencies need to be addressed. The optimization operation issue of the ADN networks is a comprehensive problem, and more means will be further studied and explored in the future to improve the operation of distribution networks.

Author Contributions

Conceptualization, S.W. and D.C.; methodology, S.W. and Y.G.; project administration, S.W.; supervision, S.W. and D.C.; validation, Y.G.; writing—original draft, Y.G.; writing—review and editing, Y.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are unavailable due to privacy.

Conflicts of Interest

The authors declare no conflicts of interest.

Nomenclature

U i δ i Voltage and phase angle of the power supply i.
X G i Internal resistance of the power supply i.
X L i Equivalent impedance of the line i.
Bai/BbiCircuit breaker Bai/Bbi.
U s i Voltage on the low voltage side of the transformer Ti.
R s i Equivalent resistance of the transformer Ti.
Z L i Line impedance of the line i π-equivalent circuit.
Z T i Winding i impedance of the three-winding transformer equivalent circuit.
Y L i Conductance of line i π-equivalent circuit.
Y LT Conductance of three-winding transformer equivalent circuits.

References

  1. Olivares, D.E.; Mehrizi-Sani, A.; Etemadi, A.H.; Cañizares, C.A.; Iravani, R.; Kazerani, M.; Hajimiragha, A.H.; Gomis-Bellmunt, O.; Saeedifard, M.; Palma-Behnke, R.; et al. Trends in Microgrid Control. IEEE Trans. Smart Grid 2014, 5, 1905–1919. [Google Scholar] [CrossRef]
  2. Blaabjerg, F.; Teodorescu, R.; Liserre, M.; Timbus, A.V. Overview of Control and Grid Synchronization for Distributed Power Generation Systems. IEEE Trans. Ind. Electron. 2006, 53, 1398–1409. [Google Scholar] [CrossRef]
  3. Li, P.; Ji, H.; Wang, C.; Zhao, J.; Song, G.; Ding, F.; Wu, J. Coordinated Control Method of Voltage and Reactive Power for Active Distribution Networks Based on Soft Open Point. IEEE Trans. Sustain. Energy 2017, 8, 1430–1442. [Google Scholar] [CrossRef]
  4. Carrasco, J.M.; Franquelo, L.G.; Bialasiewicz, J.T.; Galván, E.; PortilloGuisado, R.C.; Prats, M.M.; León, J.I.; Moreno-Alfonso, N. Power-Electronic Systems for the Grid Integration of Renewable Energy Sources: A Survey. IEEE Trans. Ind. Electron. 2006, 53, 1002–1016. [Google Scholar] [CrossRef]
  5. Golshannavaz, S.; Afsharnia, S.; Aminifar, F. Smart Distribution Grid: Optimal Day-Ahead Scheduling with Reconfigurable Topology. IEEE Trans. Smart Grid 2014, 5, 2402–2411. [Google Scholar] [CrossRef]
  6. Zhu, X.; Han, H.; Gao, S.; Shi, Q.; Cui, H.; Zu, G. A Multi-Stage Optimization Approach for Active Distribution Network Scheduling Considering Coordinated Electrical Vehicle Charging Strategy. IEEE Access 2018, 6, 50117–50130. [Google Scholar] [CrossRef]
  7. Li, J.; Lin, J.; Song, Y.; Xing, X.; Fu, C. Operation Optimization of Power to Hydrogen and Heat (P2HH) in ADN Coordinated with the District Heating Network. IEEE Trans. Sustain. Energy 2019, 10, 1672–1683. [Google Scholar] [CrossRef]
  8. Yi, Z.; Xu, Y.; Zhou, J.; Wu, W.; Sun, H. Bi-Level Programming for Optimal Operation of an Active Distribution Network with Multiple Virtual Power Plants. IEEE Trans. Sustain. Energy 2020, 11, 2855–2869. [Google Scholar] [CrossRef]
  9. Pilo, F.; Pisano, G.; Soma, G.G. Optimal Coordination of Energy Resources with a Two-Stage Online Active Management. IEEE Trans. Ind. Electron. 2011, 58, 4526–4537. [Google Scholar] [CrossRef]
  10. Sedghi, M.; Ahmadian, A.; Aliakbar-Golkar, M. Optimal Storage Planning in Active Distribution Network Considering Uncertainty of Wind Power Distributed Generation. IEEE Trans. Power Syst. 2016, 31, 304–316. [Google Scholar] [CrossRef]
  11. Nick, M.; Cherkaoui, R.; Paolone, M. Optimal Allocation of Dispersed Energy Storage Systems in Active Distribution Networks for Energy Balance and Grid Support. IEEE Trans. Power Syst. 2014, 29, 2300–2310. [Google Scholar] [CrossRef]
  12. Gabash, A.; Li, P. Flexible Optimal Operation of Battery Storage Systems for Energy Supply Networks. IEEE Trans. Power Syst. 2013, 28, 2788–2797. [Google Scholar] [CrossRef]
  13. Shen, X.; Shahidehpour, M.; Han, Y.; Zhu, S.; Zheng, J. Expansion Planning of Active Distribution Networks with Centralized and Distributed Energy Storage Systems. IEEE Trans. Sustain. Energy 2017, 8, 126–134. [Google Scholar] [CrossRef]
  14. Nguyen, P.H.; Kling, W.L.; Ribeiro, P.F. Smart Power Router: A Flexible Agent-Based Converter Interface in Active Distribution Networks. IEEE Trans. Smart Grid 2011, 2, 487–495. [Google Scholar] [CrossRef]
  15. Chang, J.; Wang, A. Experimental development and evaluations of VF-input high-frequency AC-AC converter supporting distributed power generation. IEEE Trans. Ind. Electron. 2004, 19, 1214–1225. [Google Scholar] [CrossRef]
  16. Zhong, W.; Tzounas, G.; Milano, F. Improving the Power System Dynamic Response Through a Combined Voltage-Frequency Control of Distributed Energy Resources. IEEE Trans. Power Syst. 2022, 37, 4375–4384. [Google Scholar] [CrossRef]
  17. Ku, T.T.; Lin, C.H.; Chen, C.S.; Hsu, C.T. Coordination of Transformer On-Load Tap Changer and PV Smart Inverters for Voltage Control of Distribution Feeders. IEEE Trans. Ind. Appl. 2019, 55, 256–264. [Google Scholar] [CrossRef]
  18. Sun, X.; Qiu, J. Two-Stage Volt/Var Control in Active Distribution Networks with Multi-Agent Deep Reinforcement Learning Method. IEEE Trans. Smart Grid 2021, 12, 2903–2912. [Google Scholar] [CrossRef]
  19. Kraiczy, M.; Stetz, T.; Braun, M. Parallel Operation of Transformers with on Load Tap Changer and Photovoltaic Systems with Reactive Power Control. IEEE Trans. Smart Grid 2018, 9, 6419–6428. [Google Scholar] [CrossRef]
  20. Tang, Z.; Hill, D.J.; Liu, T. Distributed Coordinated Reactive Power Control for Voltage Regulation in Distribution Networks. IEEE Trans. Smart Grid 2021, 12, 312–323. [Google Scholar] [CrossRef]
  21. Ye, B.; Zheng, Z.; Zhang, L.; Wang, K.; Yu, S.; Fan, S. Planning Method of Feeder Automation of Distribution Network in Mountain Area Considering Distributed Generation. In Proceedings of the 2019 9th International Conference on Power and Energy Systems (ICPES), Perth, Australia, 10–12 December 2019. [Google Scholar] [CrossRef]
  22. Xing, X.; Lin, J.; Wan, C.; Song, Y. Model Predictive Control of LPC-Looped Active Distribution Network with High Penetration of Distributed Generation. IEEE Trans. Sustain. Energy 2017, 8, 1051–1063. [Google Scholar] [CrossRef]
  23. Liu, S.; Wu, J.; Xue, L.; Wu, D.; Lu, H.; Li, W.; Wang, S.; Li, W. Research and Application of a Clear Diamond-Shaped Distribution Network Structure. Front. Energy Res. 2021, 9, 830489. [Google Scholar] [CrossRef]
Figure 1. The diagram of the wide-range TCSC.
Figure 1. The diagram of the wide-range TCSC.
Sensors 24 06028 g001
Figure 2. The change curve of X TCSC with α under the industrial frequency.
Figure 2. The change curve of X TCSC with α under the industrial frequency.
Sensors 24 06028 g002
Figure 3. The power-voltage phase relationship diagram.
Figure 3. The power-voltage phase relationship diagram.
Sensors 24 06028 g003
Figure 4. The topology of the proposed wide-range TCSC-based ADN.
Figure 4. The topology of the proposed wide-range TCSC-based ADN.
Sensors 24 06028 g004
Figure 5. Thevenin equivalent circuit of the proposed ADN.
Figure 5. Thevenin equivalent circuit of the proposed ADN.
Sensors 24 06028 g005
Figure 6. The change curve of the active and reactive power on both sides of the TCSC in case 1.
Figure 6. The change curve of the active and reactive power on both sides of the TCSC in case 1.
Sensors 24 06028 g006
Figure 7. The change curve of several voltages in case 1.
Figure 7. The change curve of several voltages in case 1.
Sensors 24 06028 g007
Figure 8. The change curve of the active and reactive power on both sides of the TCSC in case 2.
Figure 8. The change curve of the active and reactive power on both sides of the TCSC in case 2.
Sensors 24 06028 g008
Figure 9. The change curve of several voltages in case 2.
Figure 9. The change curve of several voltages in case 2.
Sensors 24 06028 g009
Figure 10. TD3 training architecture diagram.
Figure 10. TD3 training architecture diagram.
Sensors 24 06028 g010
Figure 11. Double networks update.
Figure 11. Double networks update.
Sensors 24 06028 g011
Figure 12. Information on renewable energy and load.
Figure 12. Information on renewable energy and load.
Sensors 24 06028 g012
Figure 13. TD3 training losses.
Figure 13. TD3 training losses.
Sensors 24 06028 g013
Figure 14. Rewards comparison between two DRL algorithms.
Figure 14. Rewards comparison between two DRL algorithms.
Sensors 24 06028 g014
Figure 15. Active power outputs and SOCs of ESS curve.
Figure 15. Active power outputs and SOCs of ESS curve.
Sensors 24 06028 g015
Figure 16. Optimal results of the wide-range TCSC in Case 1 and Case 2.
Figure 16. Optimal results of the wide-range TCSC in Case 1 and Case 2.
Sensors 24 06028 g016
Figure 17. Voltage distribution of all nodes in the proposed ADN.
Figure 17. Voltage distribution of all nodes in the proposed ADN.
Sensors 24 06028 g017
Figure 18. Power difference in Case 1 and Case 2.
Figure 18. Power difference in Case 1 and Case 2.
Sensors 24 06028 g018
Figure 19. Reward comparison.
Figure 19. Reward comparison.
Sensors 24 06028 g019
Table 1. Parameter setting of the prandsed ADN.
Table 1. Parameter setting of the prandsed ADN.
SymbolValueSymbolValue
U11, U21220 VL3.66 × 10−3 H
U12, U22110 VRLoad3 Ω
X/R ratio7αL132°
C1100 × 106 F α C 145.5°
C2 501 × 10 6 F α Lmax 140°
kT220/110/10 α Cmin 144°
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Guo, Y.; Wang, S.; Chen, D. A Wide-Range TCSC Based ADN in Mountainous Areas Considering Hydropower-Photovoltaic-ESS Complementarity. Sensors 2024, 24, 6028. https://doi.org/10.3390/s24186028

AMA Style

Guo Y, Wang S, Chen D. A Wide-Range TCSC Based ADN in Mountainous Areas Considering Hydropower-Photovoltaic-ESS Complementarity. Sensors. 2024; 24(18):6028. https://doi.org/10.3390/s24186028

Chicago/Turabian Style

Guo, Yao, Shaorong Wang, and Dezhi Chen. 2024. "A Wide-Range TCSC Based ADN in Mountainous Areas Considering Hydropower-Photovoltaic-ESS Complementarity" Sensors 24, no. 18: 6028. https://doi.org/10.3390/s24186028

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop