Next Article in Journal
Insights into the Chemistry and Functional Properties of Edible Mushrooms Cropped in the Northeastern Highlands of Puebla, Mexico
Previous Article in Journal
The Influence of Various Commonly Used Building Materials on the Shielding Effectiveness, Reflection and Absorption of the Electromagnetic Wave
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multiple DC Modulation Coordination Strategies Based on Transient Energy Function and Deep Deterministic Policy Gradient Algorithm

1
Power Science Research Institute, Yunnan Power Grid Co., Ltd., Kunming 650000, China
2
School of Electrical Engineering, Sichuan University, Chengdu 610000, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(6), 2519; https://doi.org/10.3390/app14062519
Submission received: 22 January 2024 / Revised: 8 March 2024 / Accepted: 14 March 2024 / Published: 16 March 2024
(This article belongs to the Section Electrical, Electronics and Communications Engineering)

Abstract

:
Ultra-low-frequency oscillation (ULFO) is a new problem of frequency stability in asynchronous network back-end power networks. The negative damping provided by hydropower units is the direct cause of ultra-low-frequency oscillation, and DC modulation is an effective means of suppressing system frequency oscillation. Based on the transient energy function and the limitation of DC modulation, this paper analyzes the conditions of suppressing system oscillation by DC modulation from the perspective of energy. The weight of unit energy participation is defined, and the calculation formula of transient oscillation energy in a certain oscillation mode is derived. DC modulation sensitivity is then used to create a DC modulation sequence table based on the Prony identification approach. Ultimately, the DC involved in the modulation is identified using the DC modulation sequence table and the transient oscillation energy, and the Deep Deterministic Policy Gradient (DDPG) algorithm is used to optimize the chosen DC modulation parameters.
Keywords:
ULFO; DC modulation; DDPG

1. Introduction

After asynchronous networking, the problem of low frequency oscillation caused by weak damping of the AC system in weakly interconnected large systems has been improved to a certain extent, but ultra-low-frequency oscillation occurs in non-synchronous hydropower high-proportion transmission systems. It has been shown that the ultra-low frequency oscillation of asynchronous network transmitter systems is mainly due to the water hammer effect of hydropower units. As an additional control of traditional direct current, DC modulation has been used to suppress ultra-low frequency oscillation. At present, there have been many studies on the use of DC additional control to suppress low-frequency oscillations. Based on the analysis of the action mechanism of governor and FLC, a decoupling coordination design method of governor and FLC was proposed in the literature [1]. The literature [2] proposed and used FLC fast modulation power to jointly improve the damping of oscillation mode of HVDC island system to suppress ultra-low-frequency oscillation and suggested that the FLC dead zone should be smaller than the governor dead zone to enhance the FLC effect.
However, with the introduction of voltage source converter based high voltage direct current transmission (VSC-HVDC) technology based on a modular multilevel converter (MMC), a line-commutated converter (LCC) combined with MMC forms a new direct current transmission topology [3,4,5,6]. With the increasing scale of power grid interconnection, inter-regional low-frequency oscillation caused by system interconnection has gradually become one of the bottlenecks that trouble the safe and stable operation of power grid and limit the transmission capacity of power grid [7,8,9,10]. By controlling the active power output of HVDC system, the oscillation can be quickly suppressed so as to improve the dynamic stability of the AC system and increase the safe transmission capacity. The literature [11,12] used emergency power support and coordinated power control to suppress low-frequency oscillation of AC power grid for a conventional HVDC system. Compared with the conventional HVDC system, VSC-HVDC technology has technical advantages such as self-phase change capability, active power and reactive power can be decoupled and controlled, and no phase change failure will occur. In recent years, it has been widely developed and applied in the world, and China is also vigorously building VSC-HVDC projects. Because VSC-HVDC technology has large degrees of control freedom and switching between operation modes of the converter station, it can further suppress the AC low-frequency oscillation in theory. For an AC–DC hybrid power grid with VSC-HVDC system, an additional controller is designed to adjust the active and reactive power of the flexible direct converter, which can effectively suppress interregional low-frequency oscillation and improve the dynamic performance of the system [13,14,15,16,17]. Based on a wide-area measurement system, the literature [13] determined the optimal value of active and reactive power of DC system through model predictive control, and it adopted fixed frequency prediction and adjustment power instructions to suppress low-frequency oscillation. In the literature [14], a design method of a centralized coordination controller suitable for multiple HVDC lines was proposed by adopting a centralized control scheme, which could improve the damping of small power systems considerably. In order to avoid the risk of losing centralized control communication, literature [15] applied the homologous ethics theory to the controller design of VSC-HVDC converter, and it obtains a robust distributed controller. In literature [16], the nonlinear system is transformed into a linearized system through input–output accurate feedback linearization. Then, the coordinated control structure of multiple HVDC lines is designed through the design method of a linear system controller to improve the stability of the system. The literature [17] proposed a robust fixed-order frequency domain controller design method to maximize the tracking performance with specified stability margin in multiple operating points. In order to suppress ultra-low frequency oscillation, the most effective strategy is governor parameter optimization to improve the system damping within the ultra-low-frequency band. The literature [18] put forward the design principle of the damping control index of ULFO. These studies can provide references for the suppression of ultra-low frequency oscillation. For low-frequency oscillation suppression, the DC modulation power required is small, and it is generally not necessary to have multiple DC simultaneous modulation. However, for ultra-low-frequency oscillation, because of the different oscillation energy, it may require multiple DC modulation at the same time. Methods of using two kinds of DC to suppress the ultra-low-frequency oscillation of the system have become a new problem, but there is some research investigating this.
In this paper, a multi-DC modulation coordination strategy for suppressing ultra-low frequency oscillations is proposed. Firstly, the mechanism of suppressing ultra-low frequency oscillation in hybrid DC modulation system is analyzed from the perspective of energy, and the calculation method of system energy in practical engineering is given. Finally, a deep deterministic policy gradient (DDPG) algorithm is used to optimize the selected DC modulation parameters.

2. Ultra-Low-Frequency Oscillation Suppression Based on Transient Energy Function

2.1. Mechanism of Hybrid DC Modulation to Suppress Ultra-Low-Frequency Oscillation Based on Transient Energy Function

There are n units in the whole system. Each unit uses the second-order classical model and takes damping into account, which is represented by the center of inertia (COI). COI is defined as follows [19]:
{ T JT = i = 1 n T J i δ COI = 1 T JT i = 1 n T J i δ i ω COI = 1 T J T i = 1 n T J i ω i P COI = i = 1 n ( P m i P e i ) ,
where T J i , δ i , ω i , P m i , and P e i are the inertia time constant, power angle, angular velocity, mechanical power, and electromagnetic power of the rotor of unit i in synchronous coordinates, respectively. T J T is the total inertia time constant of rotors of all units. P COI is the acceleration power of COI. δ COI and ω COI are the equivalent rotor angle and equivalent rotor angular velocity of COI, respectively.
When DC modulation is not considered, the motion equation of each unit in COI coordinates is:
{ d θ i / d t = ω ˜ i T ji d ω ˜ i d t = P m i P e i T J i T J T P COI D i ( ω ˜ i + ω COI ) + T J i T J T j = 1 n D j ( ω ˜ j + ω COI )
where θ i = δ i δ COI and ω ˜ i = ω i ω COI are the rotor angle and rotor angular velocity of unit i in COI coordinates, respectively.
The oscillating kinetic energy V K i , oscillating potential energy V P i and damping energy V D i of unit i are defined as follows:
{ V i ( t ) = V K i ( t ) + V P i ( t ) + V D i ( t ) V K i ( t ) = T J i ω ^ i 2 ( t ) / 2 V P i ( t ) = θ i s θ i ( t ) P m i P e i T J i T J T p C O I d θ i V D i ( t ) = θ i s θ i ( t ) D i ω i + T J i T J T j = 1 n D j ( ω ˜ j + ω C O I ) d θ i ,
where θ i ( t ) and θ i s are rotor angle and equilibrium rotor angle under COI coordinates of the system at time t respectively.
After the system is disturbed, its transient energy is:
V ( T ) = i = 1 n V i ( t ) = i = 1 n [ V K i ( t ) + V P i ( t ) + V D i ( t ) ] .
Now assume that there is only one dominant oscillation mode λ k = σ k + j χ k in the system, After the system is disturbed, according to the difference of σ k , the system may appear in the first oscillation period [ t s , t s + k ] (where ω ˜ ( t s ) = ω ˜ ( t s + k ) = 0 , t s + k = t s + 2 π / χ k ) in the following three oscillation cases:
  • σ k < 0; because of the damping energy, the transient energy of the system decreases with time Δ V t k = V ( t s + k ) V ( t s ) < 0 ( Δ V t k is the change in the system energy during the first oscillation period), and the system is stable in the first oscillation period.
  • σ k = 0; because the damping energy of the system V D ≈ 0, in the process of oscillation and oscillation, the kinetic energy and potential energy components increase and decrease, and the transient energy of the system basically remains unchanged, and the system is stable. And V ( t ) ≈ C (C is constant), so Δ V t k = V ( t s + k ) V ( t s ) = 0 .
  • σ k > 0; because of the damping energy V D , the transient energy of the system increases with time, and the system becomes unstable. At the end of the first oscillation period, the transient energy change of the system is:
    Δ V t k = V ( t s + k ) V ( t s ) = i = 1 n [ V P i ( t s + k ) + V D i ( t s + k ) ] i = 1 n [ V P i ( t s + k ) + V D i ( t s + k ) ] = i = 1 n θ i s θ i ( t ) ( P m i P e i T J i T J T p C O I ) D j ( ω ˜ j + ω C O I ) + T J i T J T j = 1 n D j ( ω ˜ j + ω C O I ) d θ i .
In order to realize the suppression effect of DC modulation on oscillation, when σ k > 0, the first oscillation period is divided into four stages. Stage [ t s , t 1 ), ω ˜ ( t ) decreases from 0 to ω ˜ min ; stage [ t 1 , t 2 ), ω ˜ ( t ) increases from ω ˜ min to 0; stage [ t 2 , t 3 ), ω ˜ ( t ) increases from 0 to ω ˜ max ; stage [ t 3 , t s + k ], decreases from ω ˜ max to 0. It is assumed that l DC in the system can control the ultra-low-frequency oscillation mode, and when p (0 < p ≤ l) DC participates in DC modulation, the system oscillation can be suppressed. The maximum percentage of each DC that can participate in power modulation is ε , where the maximum controllable DC power of article τ is Δ P d c τ = ε P d c τ * , and where P d c τ * is the rated active power of article t. Within [ t s , t 2 ), the DC of the system can reduce the transmission power so that the transient energy change of the system can be reduced.
In [ t 2 , t s + k ], the DC of the system can reduce the transient energy change of the system by increasing the transmission power. Therefore, the system transient energy reduction in the first oscillation period due to DC modulation is as follows:
Δ V dc t k = τ = 1 p ( θ i ( t s ) θ i ( t 2 ) Δ P dc τ d θ i + θ i ( t 2 ) θ i ( t s + k ) Δ P dc τ d θ i ) .
When σ k ≤ 0, the system can still maintain stability after disturbance, and DC modulation can make the system oscillation decay as soon as possible and shorten the time for the system to recover stability. When σ k > 0, based on the principle of transient energy decline, in the first oscillation period, the energy change Δ V dc t k of the system p direct current used to suppress the oscillation must be at least greater than the change of the system transient energy—that is, Δ V dc t k > Δ V t k .

2.2. Calculation of System Transient Energy

In a system with n units, there are theoretically n − 1 oscillation modes, and each unit may participate in multiple oscillation modes at the same time. Now, the left eigenvector ψ and the right eigenvector ϕ of the system are introduced. According to the definition of the left eigenvector of the system, ψ λ k can be used to measure the influence weight of each unit on each unit of oscillation mode λ k .
ψ = [ ψ 1 T ψ λ k T ψ n T ] ϕ = [ ϕ 1 ϕ λ k ϕ n ] ,
The rotor motion equation is linearized near the stable operating point,
{ T J i Δ ω ˙ i = Δ P Δ P i = Δ P m i Δ P e i D i Δ ω i ,
where Δ P i is the acceleration power deviation.
For oscillation mode λ k , the acceleration power deviation of unit i can be expressed in a generalized phasor form, namely:
Δ P i , λ k = T J i ( σ k + j χ k ) Δ ω i , λ k .
According to literature [20], under oscillation mode, the ratio of acceleration power deviation of any two units a and b is as follows:
Δ P a , λ k Δ P b , λ k = T Ja Δ ω a , λ k T Jb Δ ω b , λ k .
According to the free motion analysis theory of dynamic systems with zero input, for a certain mode, the ratio of corresponding terms of a and b units in the right characteristic phasor is equal to the ratio of corresponding terms of state variables, that is:
ϕ a , λ k ϕ b , λ k = Δ ω a , λ k Δ ω b , λ k = Δ P a , λ k / T ja Δ P b , λ k / T jb .
In the actual system, the left characteristic phasor and the right characteristic phasor have the following approximate relationship [21].
ψ λ k [ d i a g ( T J / ω s ) ] ϕ λ k d i a g ( T j / ω s ) = 1 ω s [ T Jl 0 0 0 T J 0 0 0 T Jn ] ,
where ω s is the steady-state value of the generator speed.
Then the ratio of corresponding terms of units a and b in the left characteristic phasor is:
ϕ a , λ k ϕ b , λ k = Δ ω a , λ k Δ ω b , λ k = Δ P a , λ k / T ja Δ P b , λ k / T jb .
Based on the law of conservation of energy, this paper defines the integration of power to time as an energy calculation formula, and it obtains the acceleration power p i of each unit by time domain simulation. The maximum accelerated power deviation of the unit under the leading oscillation mode is selected as the reference value, and the ratio of the accelerated power deviation amplitude of the unit i to the reference value is obtained. Then, the ratio is defined as the unit energy participation weight η λ k i —that is:
η λ k i = | Δ P i , λ k | max { | Δ P i , λ k | , ( i = 1 , 2 , n ) } .
By weighting the energy provided by each unit participating in the oscillation mode, the transient energy change of the system under the dominant oscillation mode λ k is:
Δ V t k = i = 1 n Δ V t k i = i = 1 n η λ k i t s t s + k p i d t ,
Δ V d c t k = m = 1 p ( t s t 2 Δ P d c m d t + t 2 t s + k Δ P d c m d t ) .
When the system frequency oscillation deviation is within the dead range, the DC additional control will not operate, and the DC modulation automatically adjusts with the frequency deviation and does not always stay at the maximum. Therefore, the amount of energy change used by DC to suppress oscillations needs to retain a certain margin—that is:
μ Δ V dc t k Δ V t k ,   0 < μ < 1 .

3. Prony Algorithm Identification and DC Sensitive Point Sorting

3.1. Prony Algorithm Identification

The Prony method can be used for both on-line and off-line monitoring of ULF oscillation modes. The Prony algorithm utilizes real-time data from the phase measurement unit (PMU) for online monitoring and historical data for offline monitoring. The data measured by the PMU can be categorized into time-varying, ambient, and sounding data. The Prony algorithm employs the time-varying data, which is the more densely populated data, for the estimation of various modes. Time-varying data y ( i ) can be expressed as:
y ( j ) = i = 1 n λ c i e j λ ( i ) Δ t j = 0 , 1 , , N 1 ,
where, n λ is the total characteristic root number; c i is the amplitude of the i mode; λ ( i ) is the i th eigenvalue of the system; Δ t is the sampling interval; N is the total data points.
The steps of Prony algorithm for pattern recognition of power system are as follows [7]:
Step 1. Linear prediction model (LPM) is established. The nth-order LPM is composed of the predicted value y ( k ) and the first n exact value ( y ( k 1 ) , y ( k 2 ) , , y ( k n ) ) and has: y ( k ) = a 1 y ( k 1 ) + a 2 y ( k 2 ) + + a n y ( k n ) , where a i is the coefficient of the characteristic equation.
Let k = n , n + 1 , , N 1 . There is:
[ y ( n ) y ( n + 1 ) y ( N 1 ) ] = [ y ( n 1 ) y ( n 2 ) y ( 0 ) y ( n ) y ( n 1 ) y ( 1 ) y ( N 2 ) y ( N 3 ) y ( N n 1 ) ] [ a ( 1 ) a ( 2 ) a ( n ) ] + [ ε ( n + 1 ) ε ( n + 2 ) ε ( N ) ] ,
where a ( i ) is the coefficient of the characteristic equation.
The coefficient a 1 , a 2 , …, a n of the characteristic equation is obtained by using the least square method.
Step 2. Discrete mode calculation. Variable z is introduced as the variable of the characteristic polynomial, and combined with the coefficient a of the characteristic equation obtained in step 1, the n -degree characteristic polynomial of variable z is constructed as follows:
z n ( a 1 z n 1 + a 2 z n 2 + + a n z 0 ) = 0 .
Step 3. Residual term calculation in discrete mode. Residual terms of the discrete mode ( c 1 , c 2 , …, c n ) can represent the amplitude and phase angle, and its calculation formula is as follows:
[ z 1 0 z 2 0 z n 0 z 1 1 z 2 1 z n 1 z 1 N z 2 N z n N ] [ c 1 c 2 c n ] = [ y ( 1 ) y ( 2 ) y ( N ) ] .
Step 4. Discrete mode parameter calculation. The formula for calculating amplitude ( A i ), phase angle ( P i ), frequency ( f i ), and oscillation factor (D) is as follows:
{ A i = | c i | P i = arctan | I m ( c i ) Re ( c i ) | f i = 1 2 π Δ t arctan | I m ( z i ) Re ( z i ) | σ i = I n | z i | / Δ t .
Step 5. Distinguish the real number mode and conjugate complex number mode, and calculate the energy of each mode.
For real numbers, there are:
E i = j = 1 N [ r e a l ( c i z i j ) ] 2 .
For conjugate complex modes, there are:
E i = j = 1 N [ 2 r e a l ( c i z i j ) ] 2 .
Different modes have different energy levels. By calculating the energy of each mode and classifying it, the dominant mode of the power system can be identified. Mode energy is defined as E i = j = 1 N [ r e a l ( c i z i j ) ] 2 when c i or z i mode value is large, E i is high, and this mode is the dominant mode.

3.2. DC Sensitive Points Sort

In order to sort the control sensitive points of l DC, the same disturbance is applied to each DC at the same time to stimulate the oscillation of the system, and the power deviation of l DC is taken as the input signal and the system frequency is taken as the output signal. The transient energy variation of the system’s dominant oscillation mode is obtained by the Prony identification method.
For a certain oscillation mode λ k , the mode value of each element in the mode matrix represents the controllability of a certain input to the oscillation mode λ k , so E i is defined as DC modulation sensitivity φ , where the DC modulation sensitivity corresponding to article m is φ m ( m = 1 , 2 , , l ). Set the maximum value of φ to φ max , which corresponds to the DC number 1, and the minimum value of φ to φ min , which corresponds to the DC number l, and sort the DC from the largest to the smallest. A DC modulation sequence table is formed, as shown in Table 1, where the larger the φ , the more sensitive it is to the oscillation mode λ k control.

4. Deep Deterministic Policy Gradient

Frequency limit controller (FLC) is a common DC modulation method in practical engineering [22,23], and its control block diagram is shown in Figure 1. When the system fails and the system frequency oscillation deviation exceeds the upper or lower limit of the FLC dead zone, the FLC is automatically activated and the DC power is automatically adjusted by a closed-loop PI controller.
A deep deterministic policy gradient algorithm (DDPG, Figure 2) is proposed to rectify the PI parameters of FLC. The Deep Deterministic Policy Gradient method uses the Actor–Critic algorithm as its basic framework, approximates the policy and action-value functions using a deep learning network, and uses the stochastic gradient method to train the parameters in both the policy network and value network models. A dual neural network model of real-time network and target network is used for both policy and value functions. At the same time, the algorithm borrows the experience playback mechanism from the DQN algorithm, in which the experience data generated by the interaction between the Actor and the environment will be stored in the experience pool. Then, a batch of data samples will be extracted for training so that the algorithm can be converged more easily.
At each time period t, the agent observes the current state s t , performs an action a t , receives a reward value r t from the environment E, and then transitions to the next state s t + 1 according to the state transition probability function.
For FLC, which adjusts the system frequency, the integral of the absolute value of the system frequency deviation and the product of time is selected as the reward function of DDPG optimization algorithm—that is, the optimization objective function is as follows:
min J = 0 T t | Δ f | d t .
The state space s t is:
s t = ( t , K P , K I , Δ f , Δ P ) ,
the state space includes the current period t, PI parameters K P , and K I of DC FLC. Δ f is the frequency deviation, and Δ P is the active power deviation.
Action space a t is:
a t = ( Δ K P , Δ K I ) .
The reward function r t is:
r t = 0 T t | Δ f | d t .
The first part of the reward function is chosen as the inverse of the integral of the absolute value of the system frequency deviation and the product of time. The greater the reward the agent receives, the smaller the integral of the absolute value of the system frequency deviation and the time product.
The loss function is calculated according to the following formula, where y t is the target Q value.
{ y t = r t + γ Q ( s t + 1 , π ( s t + 1 | θ π ) | θ Q ) L ( θ Q ) = E ( γ t Q ( s t , a t | θ Q ) ) 2 .
According to the gradient update rule, the current value network is updated by calculating the gradient of the loss function [24]:
{ θ k Q = θ k 1 Q μ Q 𝛻 θ Q L ( θ k 1 Q ) 𝛻 θ Q L ( θ k 1 Q ) = E ( 2 ( y t Q ( s , a | θ k 1 Q ) | s = s t , a = a t )   𝛻 Q θ Q ( s , a | θ k 1 Q ) | s = s t , a = a t )
The policy network uses a function of the value network output as a loss function. By finding the strategy gradient of the function, the updated formula is obtained [25]:
{ θ k π = θ k 1 π μ π 𝛻 θ x L ( θ k 1 π ) 𝛻 θ k 1 π π = 𝛻 a Q ( s , a θ k 1 π ) s = s t , a t = π ( s t ) 𝛻 θ x π ( s θ k 1 π ) s = s t ) .
The target network adopts the soft update mode as shown in the following formula.
{ θ k Q = τ θ k 1 Q + ( 1 τ ) θ k 1 Q θ k π = τ θ k 1 π + ( 1 τ ) θ k 1 π .
When learning the strategy, m sample data points are randomly sampled from the experience pool, and the algorithm is optimized by constantly updating the gradient value of the network. The optimization objective function constructed in this paper aims to minimize the integral of the absolute value of the system frequency deviation and the product of time.

5. Multiple DC Modulation Coordination Strategies

The coordinated optimization strategy of multi-DC modulation is shown in Figure 3, and its specific steps are as follows:
  • For the leading oscillation mode λ k , the maximum acceleration power deviation of each unit is taken as the reference value, and the energy participation weight η λ k of each unit in oscillation mode λ k is calculated;
  • Calculate the transient energy change Δ V t k of oscillation mode λ k ;
  • Apply the same power disturbance to the DC of the system l to excite the oscillation of the system;
  • Prony algorithm is used to identify the system;
  • Determine the modulation sensitivity φ m of each DC, and sort l DC to form a DC modulation sequence table;
  • Calculate Δ V d c t k and determine p DC that participates in DC modulation;
  • Using the DDPG algorithm, the FLC parameters of p DC were optimized and the optimization results were output.

6. Example Analysis

The effectiveness of multi-DC modulation coordination optimization strategy is verified by taking the power grid of a region after back-to-back DC projects are put into operation.
After back-to-back DC projects are put into operation in a certain region, all the connections between the power grid and the external power grid will be realized through the DC network, as shown in Figure 4. The dashed line box in the figure represents the synchronous power grid. A hybrid AC–DC network containing a high voltage direct current transmission (HVDC) system is taken as the main research object. Regardless of the influence of frequency change on the DC transmission power, three ultra-high-voltage direct current (UHVDC) networks are selected to develop a coordinated optimization scheme. Constant current control is adopted on the rectifier side, constant voltage control is adopted on the inverter side, and FLC control is added.
When the system’s critical AC communication channel AC2 experiences an N−1 fault, the primary oscillation modes of the power grid under operating modes are as shown in Table 2. Assume that the maximum percentage ε of each DC that can participate in power modulation is 5%. When the system frequency changes, the DC modulation amount is automatically adjusted with the frequency deviation, FLC has an adjustment dead zone (FLC dead zone is set at 0.05Hz in this simulation example), a certain DC modulation capability is retained, and μ is set at 0.68. The multi-DC modulation coordination optimization strategy under two operating modes is simulated and verified, respectively.

6.1. Example 1—Summer Peak

In summer peak operation mode, in 0.073 Hz oscillation mode, the energy of some units of the power grid participates in weight η λ k i , and the transient energy change Δ V t k i of the units after being converted according to η λ k i is shown in Table 3. It is calculated that the transient energy change of the system under this oscillation mode is Δ V t k = 523   kW h .
A 2% power disturbance is applied to three DCS at the same time, and the DC modulation sensitivity φm is obtained by Prony algorithm identification and calculation. The DC modulation sequence is obtained by sequencing, as shown in Table 4. According to the calculation in Table 4, when p > 1, the energy change Δ V dc t k used by DC to suppress oscillation is 1255 kW·h, and it is determined that DC a participates in modulation.
Taking the power disturbance of a DC as an input, and the optimization model was built in MATLAB combined with DDPG algorithm. The learning rate α = 0.00001, discount factor γ = 0.97, and the PI parameters of a DC FLC were iteratively obtained as K1 = 1.5213, T1 = 0.4205. When AC channel AC2 fails N − 1, the system frequency change curve is shown in Figure 5.
As can be seen from Figure 5a, compared with the system without DC modulation, when DC a participates in modulation, the system frequency oscillation is significantly suppressed, which verifies the effectiveness of the proposed strategy.
In order to further verify the correctness of analyzing the oscillation mechanism of DC modulation suppression system from the perspective of energy and the ordering of DC modulation sensitivity, the frequency variation curves of the system with three DCS participating in modulation were compared. As can be seen from Figure 5b, since the energy variation Δ V dc t k of the three DCS that can be used to suppress the oscillation is greater than the transient energy variation Δ V t k of the system, the oscillation of the system can be suppressed when the three DCS are modulated separately. At the same time, compared with c DC modulation, the amplitude of system frequency oscillation is smaller when a DC participates in modulation. Compared with b DC, it takes the shortest time for the system frequency to stabilize when a DC participates in modulation. Therefore, choosing a DC as the DC participating in modulation is the most effective for suppressing system oscillation.

6.2. Example 2—Summer Base

In summer base operation mode, in 0.062 Hz oscillation mode, the energy of some units of the power grid participates in weight η λ k i , and the transient energy change Δ V t k i of the units after being converted according to η λ k i is shown in Table 5. It is calculated that the transient energy change of the system under this oscillation mode is Δ V t k = 1614   kW h .
A 2% power disturbance is applied to three DCS at the same time, and the DC modulation sensitivity φm is obtained by Prony algorithm identification and calculation, and the DC modulation sequence is obtained by sequencing, as shown in Table 6. According to the calculation in Table 6, when p = 2, the energy change Δ V dc t k used by DC to suppress oscillation is 2876 kW·h, and it is determined that DC a and DC c participate in modulation.
Taking the power disturbance of a and c DC as input, the PI parameters of a DC FLC can be iteratively obtained as K1 = 1.0096, T1 = 1.0584, and PI parameters of c DC FLC as K1 = 1.1063, T1 = 0.5437 by using the same steps as example 1. When the N − 1 fault occurs in AC2, the system frequency change curve is shown in Figure 6.
It can be seen from Figure 6a that compared to the system without DC modulation, when DC a and DC c participate in the modulation at the same time, the system frequency oscillation is significantly suppressed, which verifies the effectiveness of the proposed strategy.
In order to further verify the correctness of analyzing the oscillation mechanism of the DC modulation suppression system from the perspective of energy and the ordering of DC modulation sensitivity, three kinds of a DC participate in modulation alone, a and b DC participate in modulation simultaneously, and a and c DC participate in modulation simultaneously are simulated and compared. The frequency change curve of the system is shown in Figure 6b. As can be seen from the figure, when DC a participates in modulation alone, the system oscillation amplitude decreases but the system frequency still oscillates. As a and b DC participate in modulation at the same time, and a and c DC participate in modulation at the same time, the system frequency can be effectively suppressed, but the system frequency oscillation is most effective when a and c DC participate in modulation at the same time.

7. Conclusions

With the rapid development and wide application of HVDC transmission technology, the number of DC drops is gradually increasing in China, and the intensity and transmission scale are rare in the world. Based on the system transient energy and DC modulation sensitivity ranking, this paper proposes a multi-DC modulation coordination optimization strategy, which is verified by numerical simulation. The following conclusions are obtained:
(1) Using the transient energy function, the mechanism of DC modulation to suppress the frequency oscillation of the system can be analyzed from the point of view of energy, and the oscillation energy of the system can be calculated so as to determine the amount of modulation required to be involved by DC.
(2) For a multi-DC system, analyzing the dc modulation sensitivity based on the state-space description of the system, the multiple dc’s involved in the modulation can be ranked so that the dc with the optimal modulation effect can be selected.

Author Contributions

Conceptualization, C.X. and M.L.; methodology, J.P.; software, X.W.; validation, Y.W., S.G. and Z.Z.; formal analysis, J.L.; investigation, C.X.; resources, M.L.; data curation, J.P.; writing—original draft preparation, J.P.; writing—review and editing, C.X.; visualization, C.X.; supervision, M.L.; project administration, M.L.; funding acquisition, J.P. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by the National Natural Science Foundation of China (62101362, 52307127), the Project of State Key Laboratory of Power System Operation and Control (SKLD23KZ07), and the Fundamental Research Funds for the Central Universities (YJ202141, YJ202316).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

Authors Chao Xing, Mingqun Liu and Junzhen Peng were employed by the company Yunnan Power Grid Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. He, J.; Zhang, J.; Li, M.; Li, W.F.; Chang, Q. Frequency domain analysis and control for governor stability problem in islanded HVDC sending systems. Zhongguo Dianji Gongcheng Xuebao/Proc. Chin. Soc. Electr. Eng. 2013, 33, 137–143. [Google Scholar]
  2. Wang, G.; Yu, Z.; Zhang, Y.; Li, Y.; Tao, X.; Li, W. Troubleshooting and analysis of ultra-low frequency oscillation mode in power system. Dianwang Jishu/Power Syst. Technol. 2016, 40, 2324–2329. [Google Scholar]
  3. Xiao, H.; Sun, K.; Pan, J.; Li, Y.; Liu, Y. Review of hybrid HVDC systems combining line communicated converter and voltage source converter. Int. J. Electr. Power Energy Syst. 2021, 129, 106713. [Google Scholar] [CrossRef]
  4. Wang, Y.; Tai, K.; Song, Y.; Kou, R.; Zheng, Z.; Zeng, Q. Research on Double-Deck Traceability Identification Method of Commutation Failure in HVDC System. IEEE Access 2021, 9, 108392–108401. [Google Scholar] [CrossRef]
  5. Zheng, Z.; Xu, Y.; Mili, L.; Liu, Z.; Korkali, M.; Wang, Y. Observability analysis of a power system stochastic dynamic model using a derivative-free approach. IEEE Trans. Power Syst. 2021, 36, 5834–5845. [Google Scholar] [CrossRef]
  6. Xu, Y.; Wang, Q.; Mili, L.; Zheng, Z.; Gu, W.; Lu, S.; Wu, Z. A data-driven koopman approach for power system nonlinear dynamic observability analysis. IEEE Trans. Power Syst. 2023, 39, 4090–4104. [Google Scholar] [CrossRef]
  7. Liu, Z.; Song, H.; Rong, S.; Liu, J.; Zheng, J.; Song, B. Analysis of power system low frequency oscillation using improved prony algorithm. In Proceedings of the International Conference on Information Systems and Computer Aided Education (ICISCAE), Dalian, China, 23–25 September 2022. [Google Scholar]
  8. Xu, Y.; Mili, L.; Sandu, A.; von Spakovsky, M.R.; Zhao, J. Propagating uncertainty in power system dynamic simulations using polynomial chaos. IEEE Trans. Power Syst. 2018, 34, 338–348. [Google Scholar] [CrossRef]
  9. Gao, S.; Song, Y.; Chen, Y.; Yu, Z.; Zhang, R. Fast Simulation Model of Voltage Source Converters With Arbitrary Topology Using Switch-State Prediction. IEEE Trans. Power Electron. 2022, 37, 12167–12181. [Google Scholar] [CrossRef]
  10. Gao, S.; Tan, Z.; Song, Y.; Chen, Y.; Shen, C.; Yu, Z. Accuracy Enhancement of Shifted Frequency-Based Simulation Using Root Matching and Embedded Small-Step. IEEE Trans. Power Syst. 2022, 38, 3345–3357. [Google Scholar] [CrossRef]
  11. Li, Y.; Rehtanz, C.; Ruberg, S.; Luo, L.; Cao, Y. Wide-area robust coordination approach of HVDC and FACTS controllers for damping multiple interarea oscillations. IEEE Trans. Power Deliv. 2012, 27, 1096–1105. [Google Scholar] [CrossRef]
  12. Mao, X.; Zhnag, Y.; Guan, L.; Wu, X. Coordinated control of interarea oscillation in the China Southern power grid. IEEE Trans. Power Syst. 2006, 21, 845–852. [Google Scholar]
  13. Fuchs, A.; Imhof, M.; Demiray, T.; Morari, M. Stabilization of large power systems using vsc-hvdc and model predictive control. IEEE Trans. Power Deliv. 2014, 29, 480–488. [Google Scholar] [CrossRef]
  14. Eriksson, R.; Soder, L. Coordinated control design of multiple HVDC links based on model identification. Comput. Math. Appl. 2010, 60, 944–953. [Google Scholar] [CrossRef]
  15. Pipelzadeh, Y.; Chaudhuri, B.; Green, T.C. Green, Control coordination within a VSC HVDC link for power oscillation damping: A robust decentralized approach using homotopy. IEEE Trans. Control. Syst. Technol. 2013, 21, 1270–1279. [Google Scholar] [CrossRef]
  16. Eriksson, R.; Soder, L. On the coordinated control of multiple HVDC links using input-output exact linearization in large power systems. Int. J. Electr. Power Energy Syst. 2012, 43, 118–125. [Google Scholar] [CrossRef]
  17. Jiang, C.; Zhou, J.; Shi, P.; Huang, W.; Gan, D. Ultra-low frequency oscillation analysis and robust fixed order control design. Int. J. Electr. Power Energy Syst. 2019, 104, 269–278. [Google Scholar] [CrossRef]
  18. Wang, G.; Tang, Y.; Li, Y.; Ai, D.; Chen, G.; Wei, W. Control Method for Additional Damper in Hydro-turbine Speed Governor of Hydro-dominant Power Systems. CSEE J. Power Energy Syst. 2023, 9, 589–598. [Google Scholar]
  19. Stanton, S.E.; Waggoner, D.R. Center-of-inertia transform applied to transient responses of nonlinear power systems. In Proceedings of the Twenty First Annual North American Power Symposium, Rolla, MO, USA, 9–10 October 1989. [Google Scholar]
  20. Kundur, P. Power System Stability and Control; McGraw-Hill: New York, NY, USA, 1994; pp. 699–822. [Google Scholar]
  21. Far, H.G.; Luo, C.; Banakar, H.; Li, P.; Ooi, B.T. Damping interarea oscillations by multiple modal selectivity method. IEEE Trans. Power Syst. 2009, 24, 766–775. [Google Scholar] [CrossRef]
  22. Rakhshani, E.; Sadeh, J. Reduced-order observer control for two-Area LFC system after deregulation. Control. Intell. Syst. 2010, 38, 185–193. [Google Scholar] [CrossRef]
  23. Rakhshani, E.; Sadeh, J. Practical viewpoints on load frequency control problem in a deregulated power system. Energy Convers. Manag. 2010, 51, 1148–1156. [Google Scholar] [CrossRef]
  24. Liang, Y.; Guo, C.; Ding, Z.; Hua, H. Agent-Based Modeling in Electricity Market Using Deep Deterministic Policy Gradient Algorithm. IEEE Trans. Power Syst. 2020, 35, 4180–4192. [Google Scholar] [CrossRef]
  25. Sehgal, A.; La, H.; Louis, S.; Nguyen, H. Deep Reinforcement Learning Using Genetic Algorithm for Parameter Optimization. In Proceedings of the 2019 Third IEEE International Conference on Robotic Computing, Naples, Italy, 26 March 2019. [Google Scholar]
Figure 1. Block diagram of FLC.
Figure 1. Block diagram of FLC.
Applsci 14 02519 g001
Figure 2. DDPG algorithm framework.
Figure 2. DDPG algorithm framework.
Applsci 14 02519 g002
Figure 3. Coordination and optimization strategy of multiple DC modulation.
Figure 3. Coordination and optimization strategy of multiple DC modulation.
Applsci 14 02519 g003
Figure 4. Structure of the power system.
Figure 4. Structure of the power system.
Applsci 14 02519 g004
Figure 5. Frequency curve of power system after an N − 1 fault occurred in the AC2 channel (summer peak case). (a) a DC modulation; (b) a, b, c DC modulation.
Figure 5. Frequency curve of power system after an N − 1 fault occurred in the AC2 channel (summer peak case). (a) a DC modulation; (b) a, b, c DC modulation.
Applsci 14 02519 g005
Figure 6. Frequency curve of power system after a N − 1 fault occurred in AC2 channel (summer base case).(a) a, c DC modulation; (b) Two-channel DC modulation.
Figure 6. Frequency curve of power system after a N − 1 fault occurred in AC2 channel (summer base case).(a) a, c DC modulation; (b) Two-channel DC modulation.
Applsci 14 02519 g006
Table 1. DC modulation sequence table.
Table 1. DC modulation sequence table.
DC Modulation Sequence1ml
name1ml
DC modulation sensitivity φ max φ m φ min
Table 2. Critical oscillation modes of the power grid in summer peak and summer base cases.
Table 2. Critical oscillation modes of the power grid in summer peak and summer base cases.
Mode of OperationReal PartImaginary PartFrequency/HzAmplitude
summer peak0.005040.449650.0730.269
summer base0.000930.384000.0600.443
Table 3. η λ k i and Δ V t k i of some units in critical oscillation mode (summer peak case).
Table 3. η λ k i and Δ V t k i of some units in critical oscillation mode (summer peak case).
Unit i η λ k i Δ V t k i /(kW·h)Unit i η λ k i Δ V t k i /(kW·h)
1167.790.21213.8
20.41228.1100.21213.8
30.38324.5110.21513.6
40.36424.4120.20613.5
50.35524.3130.18512.2
60.31314.8140.18311.5
70.31519.7150.16610.6
80.21314.7
Table 4. DC modulation sequence table (summer peak case).
Table 4. DC modulation sequence table (summer peak case).
DC Modulation Sequence123
Nameabc
DC modulation sensitivity0.6490.6320.317
Table 5. η λ k i and Δ V t k i of some units in critical oscillation mode (summer base case).
Table 5. η λ k i and Δ V t k i of some units in critical oscillation mode (summer base case).
Unit i η λ k i Δ V t k i /(kW·h)Unit i η λ k i Δ V t k i /(kW·h)
1185.390.67455.5
20.98585.4100.67255.6
30.84873.5110.67256.6
40.79662.2120.67155.7
50.78566.3130.65356.4
60.77462.5140.58447.3
70.75360.3150.54533.3
80.67355.3
Table 6. DC modulation sequence table (summer base case).
Table 6. DC modulation sequence table (summer base case).
DC Modulation Sequence123
Nameacb
DC modulation sensitivity0.6040.1240.044
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xing, C.; Liu, M.; Peng, J.; Wang, X.; Wang, Y.; Zheng, Z.; Gao, S.; Liao, J. Multiple DC Modulation Coordination Strategies Based on Transient Energy Function and Deep Deterministic Policy Gradient Algorithm. Appl. Sci. 2024, 14, 2519. https://doi.org/10.3390/app14062519

AMA Style

Xing C, Liu M, Peng J, Wang X, Wang Y, Zheng Z, Gao S, Liao J. Multiple DC Modulation Coordination Strategies Based on Transient Energy Function and Deep Deterministic Policy Gradient Algorithm. Applied Sciences. 2024; 14(6):2519. https://doi.org/10.3390/app14062519

Chicago/Turabian Style

Xing, Chao, Mingqun Liu, Junzhen Peng, Xueke Wang, Yuhong Wang, Zongsheng Zheng, Shilin Gao, and Jianquan Liao. 2024. "Multiple DC Modulation Coordination Strategies Based on Transient Energy Function and Deep Deterministic Policy Gradient Algorithm" Applied Sciences 14, no. 6: 2519. https://doi.org/10.3390/app14062519

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop