Next Article in Journal
A Control-Oriented Model for Predicting Variations in Membrane Water Content of an Open-Cathode Proton Exchange Membrane Fuel Cell
Previous Article in Journal
Multi-Fracture Propagation Considering Perforation Erosion with Respect to Multi-Stage Fracturing in Shale Reservoirs
Previous Article in Special Issue
Optimal Capacity Configuration of Pumped-Storage Units Used to Retrofit Cascaded Hydropower Stations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification of Distribution Network Topology and Line Parameter Based on Smart Meter Measurements

1
Information and Communication Branch, State Grid Jiangsu Electric Power Co., Ltd., Nanjing 210024, China
2
School of Electrical Engineering, Southeast University, Nanjing 210096, China
3
State Grid Suzhou Power Supply Company, Suzhou 215004, China
*
Author to whom correspondence should be addressed.
Energies 2024, 17(4), 830; https://doi.org/10.3390/en17040830
Submission received: 1 December 2023 / Revised: 28 January 2024 / Accepted: 5 February 2024 / Published: 9 February 2024
(This article belongs to the Special Issue Advances and Optimization of Electric Energy System)

Abstract

:
Accurate line parameters are the basis for the optimal control and safety analysis of distribution networks. The lack of real-time monitoring equipment in grids has meant that data-driven identification methods have become the main tool to estimate line parameters. However, frequent network reconfigurations increase the uncertainty of distribution network topologies, creating challenges in the data-driven identification of line parameters. In this paper, a line parameter identification method compatible with an uncertain topology is proposed, which simplifies the model complexity of the joint identification of topology and line parameters by removing the unconnected branches through noise reduction. In order to improve the solving accuracy and efficiency of the identification model, a two-stage identification method is proposed. First, the initial values of the topology and line parameters are quickly obtained using a linear power flow model. Then, the identification results are modified iteratively based on the classical power flow model to achieve a more accurate estimation of the grid topology and line parameters. Finally, a simulation analysis based on IEEE 33- and 118-bus distribution systems demonstrated that the proposed method can effectively realize the estimation of topology and line parameters, and is robust with regard to both measurement errors and grid structures.

1. Introduction

Accurate line parameters are the basis of state estimation, security control and other advanced applications in energy management systems (EMS). With the continuous expansion of power grids, the operation of distribution networks is complex and changeable; meanwhile, dynamic network reconfiguration is frequent [1,2]. Unlike transmission networks, distribution networks have not realized the real-time monitoring of topology changes and periodic checks of line parameters; therefore, EMS can only rely on the topology and line impedance data recorded in grid-planning files. However, adjustments to the operation structure of a distribution network and changes in the environments around grid lines can lead to deviations between recorded data and actual parameters, which seriously affects the accuracy of EMS safety analysis and control decision results. Therefore, for a distribution network with missing or outdated topology information, it is of great significance to propose an effective line parameter identification method to improve its operation stability and management level [3,4].
Fortunately, the rapid deployment of advanced metering infrastructure (AMI) in distribution networks has enabled high-density historical data to be obtained for line parameter identification [5,6]. In [7,8,9], line parameters were estimated based on nonlinear least squares (NLS), reweighted NLS and augmented state estimation, respectively. In [10], a graphical learning algorithm based on physical parameters was proposed, and the stochastic gradient descent method was used to estimate line parameters; however, the identification efficiency was low in networks with a large number of lines; In [11], a fast graphical learning method was proposed for a large-scale distribution network, which improved the efficiency of parameter estimation. In [12,13], line parameters were identified using particle swarm optimization and a machine learning algorithm, respectively; however, these had little physical interpretability and could not guarantee the generalization of an identification model in different distribution networks. In [14], a hierarchical estimation strategy for line parameters was proposed based on the generalized equations of line voltage drops, and reinforcement learning was used to improve the robustness of the line impedance estimation method to measurement errors. Although the above methods can identify line parameters, they all require prior information for an accurate network topology.
The uncertainty of dynamic distribution network topologies limits the use of the above identification methods in engineering applications. In this regard, some scholars have proposed a joint identification method for distribution network topologies and line parameters. In [15,16], topology identification was transformed into a generalized low-rank approximation problem, and the error-in-variables (EIV) model was used to realize the joint identification of topology and line impedance in a maximum-likelihood estimation framework. In [17], the distribution network topology and line impedance were estimated through the iteration of the Kalman filtering method and Newton–Raphson method. In [18], a deep–shallow neural network was proposed, and the network topology and parameters were identified based on reinforcement learning. In [19], a complex recursive grouping algorithm for the unsupervised identification of topology and line parameters was adopted, which is applicable in distribution networks with latent nodes. The methods proposed in [15,16,17,18,19] are based on the data collected by phasor measurement units (PMU), which synchronously sample the power and voltage measurement data of buses based on global positioning system (GPS) time references.
However, due to the limitations of economic cost, PMU devices have not been fully deployed in many distribution networks [20], and smart meters are generally used as alternative measurement equipment. Compared with PMU, smart meters cannot obtain voltage angle measurement information. There are several identification methods based on smart meter data. In [21], a linear regression model was established to construct the network topology and calculate the line parameters sequentially from the leaf nodes to the root nodes; however, the calculation capacity required by the algorithm increased rapidly with large-scale distribution networks. In [22], the measurement data of voltage magnitudes and power injections were used to estimate the impedance distance between observed buses, and the operation topology and line parameters were iteratively identified. In [23], a topology label matrix was constructed based on the LinDistFlow model, and the topology and line parameters were estimated by feature clustering. Nevertheless, the methods proposed in [22,23] require a large number of samples, thus the effectiveness of algorithm cannot be guaranteed in a distribution network with frequent topology changes, and they solely take the radial grid structure into consideration.
As above, existing parameter identification methods generally require prior information on grid topology, PMU devices, or months of measurement data. To overcome these limitations, we propose a method to identify topology and line parameters simultaneously that does not rely on prior information of the grid topology or parameters. Consequently, it addresses uncertainty in the topology, which may change flexibly and frequently in distribution networks. First, a linear power flow model is established according to the operating characteristics of the distribution network and the voltage angle information is ignored temporarily. The initial identification results of distribution network topology and line admittance are quickly obtained through the simplified model. Then, the identification results are modified through decoupling iterative optimization among variables, aiming to reduce the influence of modeling errors on the accuracy of the parameter estimation. Finally, we demonstrate the effectiveness of our method on IEEE test cases, and the robustness of proposed algorithm is verified through a sensitivity analysis.
The main contributions of this paper are given as follows:
(1) All the measurement data are provided by smart meters and do not rely on expensive equipment such as PMU. Additionally, the samples required by the algorithm ars quite reasonable, only one day of historical data is needed in a five-minute sampling period, which can avoid topology changes during data collection to a large extent;
(2) The method accurately identifies the topology and effectively estimates the line parameters; moreover, it is suitable for large-scale distribution networks and superior to the method that can only estimate the topology or line parameters;
(3) The proposed identification algorithm is robust to measurement noises, and is applicable for distribution grids with different structures, including weakly meshed grids.
The rest of this paper is organized as follows. Section 2 provides the mechanism analysis of line parameter identification and demonstrates the overall flow of the proposed method. Section 3 and Section 4 concretely introduce the initial identification strategy and enhanced identification strategy, respectively. Section 5 validates the performance of our method on IEEE 33- and 118-bus distribution systems. Section 6 concludes the paper.

2. Mechanism Analysis of Line Parameter Identification in Dynamic Distribution Network

The admittance matrix contains both the topology and line parameter information of the distribution network, so this paper solves the estimated admittance matrix G^ and B^ based on the power and voltage data collected by smart meters. Similar to state estimation, the objective function is established according to the minimum variance between actual power injection and the power calculated by G^, B^, which can be expressed as follows.
min i = 1 N ( p i ^ p i ) 2 + ( q i ^ q i ) 2
where N is the total number of buses in the distribution network, pi and qi are the active and reactive power injection of bus i, respectively, p i ^ and q i ^ are the estimated active and reactive power injection of bus i, respectively.
According to the classical power flow equations, there are constraints between the estimated power injection and the estimated admittance matrix as follows.
p i ^ = k = 1 N v i v k ( G i k ^ cos θ i k + B i k ^ sin θ i k )
q i ^ = k = 1 N v i v k ( G i k ^ sin θ i k B i k ^ cos θ i k )
where v i and v k are the voltage magnitude of bus i and bus k, respectively, θ i k is the voltage angle difference between bus i and bus k, G i k ^ and B i k ^ are the estimated conductance and susceptance between bus i and bus k, respectively. In order to simplify the symbolic representation without losing its generality, the following will uniformly use v to represent |v|.
In addition, according to the characteristics of admittance matrix, the optimization model needs to meet the following constraints.
(1) Where there is no connecting branch between bus i and bus k, it has G i k ^ = B i k ^ = 0.
(2) For the non-diagonal element of the admittance matrix, it always has:
i k ,   G i k ^ 0
i k ,   B i k ^ 0
(3) Since the shunt resistance in distribution network could be neglected [15], the diagonal element of the admittance matrix always has:
G i i ^ + k = 1 , k i N G i k ^ = 0
B i i ^ + k = 1 , k i N B i k ^ = 0
(4) The admittance matrix is symmetrical as follows.
G ^ = [ G ^ ] T
B ^ = [ B ^ ] T
where []T represents matrix transpose.
With the premise that the topology information of the dynamic distribution grid is unavailable, we can construct a mixed-integer programming problem by introducing a binary variable representing the line connection status to jointly identify the grid topology and line parameters, whereas there is barely an effective method for solving this problem. Furthermore, for the two buses without connecting branches, it is difficult to optimize the corresponding element in G^ and B^ to 0. In this paper, the corresponding value of the unconnected line in the estimated admittance matrix is set to 0 by noise reduction, which can effectively imply the topological information and reduce the number of optimization variables. On this basis, a two-stage identification method for jointly estimating the distribution network topology and line parameters is proposed to ensure the accuracy of the identification model.
The principle of noise reduction is that due to the constraints of power flow and the characteristics of the admittance matrix, if there is no connecting branch between bus i and k, then G i k ^ and B i k ^ will be gradually optimized to a number close to 0; consequently, the value of | G i k ^ |/| G i i ^ | would be sufficiently small. Therefore, we can calculate | G i k ^ |/| G i i ^ | for each non-diagonal element G i k ^ in G^; if the value is less than ωg, the noise reduction threshold and corresponding branch should be eliminated, as well as G i k ^ and B i k ^ should be set to 0. Owing to the fact that the distribution network usually adopts a radial or weakly meshed grid structure, the number of branches connecting bus i is much lower than N − 1; therefore, ωg can be properly set to 1/(N − 1).
In order to further verify the rationality of the proposed criterion, we use four IEEE test grids to calculate the |Gik|/|Gii| of each branch. As shown in Table 1, for all branches in the four grids, the values of |Gik|/|Gii| are greater than ωg. Accordingly, we can effectively remove the unconnected branches by noise reduction.
The overall flow of the proposed two-stage identification method is shown in Figure 1. In stage 1, the approximate topology and line parameters are quickly obtained based on linear regression, and the number of optimization variables is reduced significantly. On this basis, the identification results are modified by decoupling the iterative optimization between variables in stage 2. At the same time, the unconnected branches are further eliminated by noise reduction. The final identification result is outputted until the iteration converges.

3. Initial Identification Strategy Based on Linear Regression

The nonlinearity of power flow equations may lead the identification model to become a non-convex optimization problem, which is difficult to solve; its linearization can improve the convergence and robustness of the identification algorithm [24,25]. Therefore, a linear power flow model is established in this stage, and we propose an initial identification strategy based on linear regression. Although the simplification of power flow equations and the neglect of the voltage angle affect the identification accuracy of line parameters to a certain extent, the main purpose of this stage is to quickly remove most of the unconnected branches and reduce the number of optimization variables, as well as to provide the basic topology and initial values of line parameters for stage 2. In that stage, the estimated admittance matrix will be further modified based on a classical power flow model to improve the accuracy of the identification results.
According to the characteristics of a normally operating distribution gird, we can obtain the following approximations [24,25,26]:
(1) The voltage magnitude of bus i is close to 1 p.u., which implies Δ v i = v i − 1 is a small number.
(2) The voltage angle difference θ i k is usually less than 5°, so we have:
sin θ i k θ i k
cos θ i k 1
With the approximations discussed above, the active power flow equation can be expressed as follows.
p i v i = k = 1 N ( G i k v k cos θ i k + B i k v k sin θ i k )
Replacing v k with 1 + Δ v k , and combining Equations (10) and (11), we obtain:
p i v i k = 1 N G i k ( 1 + Δ v k ) + k = 1 N B i k ( 1 + Δ v k ) θ i k = k = 1 N G i k ( a ) + k = 1 N G i k Δ v k ( b ) + k = 1 N B i k θ i ( c ) k = 1 N B i k θ k ( d ) + k = 1 N B i k Δ v k θ i k ( e )
Equations (6) and (7) imply that term (a) and (c) in Equation (13) are 0; furthermore, Δ v k multiplied by θ i k yields the value of term (e) to be much smaller than term (b) and term (d); therefore, we can omit term (e) and obtain:
p i v i k = 1 N G i k Δ v k k = 1 N B i k θ k
Analogically, we can obtain the linear equation of reactive power flow as follows.
q i v i k = 1 N B i k Δ v k k = 1 N G i k θ k
Equations (14) and (15) can be rewritten in matrix form, and we can obtain the linear power flow model, as follows.
p v q v = G B B G Δ v θ
where [p/v] = [p1/v1pN/vN]T, [q/v] = [q1/v1qN/vN]T, Δv = [Δv1 … ΔvN]T, θ = [θ1θN]T are vectors containing the measurement data of all buses.
Assuming that we obtain M1 independent samples without voltage angle information, it is necessary to further simplify Equation (16). Considering that the voltage angle difference between the slack bus and the other bus is typically small, Equation (16) is simplified in stage 1, as follows.
P V = G Δ V
Q V = B Δ V
where [P/V] = [[p/v]1 … [p/v]M1], [Q/V] = [[q/v]1 … [q/v]M1], ΔV = [[Δv]1 … [Δv]M1].
The solutions of G^ and B^ are:
G ^ = P V Δ V T ( Δ V Δ V T ) 1
B ^ = Q V Δ V T ( Δ V Δ V T ) 1
In order to eliminate the unconnected branches, G^ and B^ should be modified by noise reduction. That is, traversing G^, and setting G i k ^ and B i k ^ to 0 if the value of | G i k ^ |/| G i i ^ | is less than ωg.
Additionally, G^ and B^ should hold the symmetry of the admittance matrix, thus we symmetrize G^ and B^ to obtain G Sym ^ and B Sym ^ as follows:
G Sym ^ = G ^ + G ^ T 2
B Sym ^ = B ^ + B ^ T 2
for convenience, we denote G Sym ^ and B Sym ^ as G^, B^, respectively.
Moreover, for the purpose of eliminating the influence of removed branches on regression results, we should renew the remaining non-zero elements in G^ and B^ as follows:
G i ^ = P V i Δ V i T ( Δ V i Δ V i T ) 1
B i ^ = Q V i Δ V i T ( Δ V i Δ V i T ) 1
where G i ^ and B i ^ are the non-zero elements in the i-th row of G^ and B^, respectively, [P/V]i and [Q/V]i are vectors corresponding to the measurement data of bus i, ΔVi is a voltage matrix corresponding to the column indexes of non-zero elements in the i-th row.
Repeat the steps of noise reduction, symmetrization, and element renewing until G^ and B^ remain unchanged before and after iteration; we can then obtain the initial identification results of the topology and line parameters.

4. Enhanced Identification Strategy Based on Decoupling Iterative Optimization

In stage 1, the influence of the voltage angle on line parameter identification accuracy is temporarily ignored, and there are certain modeling errors in the linear regression model. Hence, in this stage, we will take the change of voltage angle into consideration, and iteratively modify G^ and B^, i.e., outputs in stage 1, based on the classical power flow model, so as to obtain more accurate identification results.
In stage 2, the variables of the optimization problem include branch admittance and the voltage angle. Simultaneously optimizing all variables together may slow down the execution speed of the algorithm, or even make the iterative process difficult to converge. In order to improve the robustness of optimization algorithm, we take the branch admittance as the main optimization variable and propose a decoupling iterative optimization method between the voltage angle and branch admittance. In each iteration, we first obtain the estimated voltage angle corresponding to each data sample through a pseudo power flow calculation; on this basis, the adaptive ridge regression model is established to modify G^ and B^, and the topology is further corrected by noise reduction. The iterative process ends until the convergence condition is satisfied.
In the pseudo power flow calculation, if we use the existing power system simulation tools, such as MATPOWER (version 7.1) [27], to build the distribution network model, and take the measurement data of power injection and G^, B^ as inputs, the voltage angle information can be obtained through the power flow calculation.
Once the estimation value of the voltage angle is acquired, we are able to establish a more accurate mathematical model to solve the line parameters.

4.1. Mathematical Optimization Model and Its Solving Algorithm

Taking line admittance as variables, the power flow equations can be rewritten as follows.
p i = l i j E ( ( v i 2 v i v j cos θ i j ) g i j v i v j sin θ i j b i j )
q i = l i j E ( v i v j sin θ i j g i j ( v i 2 v i v j cos θ i j ) b i j )
where g i j and b i j are the conductance and susceptance of line lij, respectively, E is the set of connected lines.
We take Equation (25) as an example to illustrate the establishment process of the mathematical optimization model. Equation (25) suggests that there is a linear relationship between p i and g i j , b i j . We denote the set of connected lines obtained in stage 1 as E^ with the basis M^, then the matrix form of Equation (25) can be expressed as follows:
p = a g a b g b
where p = [p1pN]T, is the active power injection measurement, g = [g1gM^]T, b = [b1bM^]T, are matrices containing the line conductance and susceptance in E^, a g , a b   ϵ   R N × M ^ , their elements can be expressed as follows.
a g ( i , m ) = v i 2 v i v j cos θ i j ( τ ( m , 1 ) = = i τ ( m , 2 ) = = j ) ( τ ( m , 2 ) = = i τ ( m , 1 ) = = j ) 0 otherwise
a b ( i , m ) = v i v j sin θ i j ( τ ( m , 1 ) = = i τ ( m , 2 ) = = j ) ( τ ( m , 2 ) = = i τ ( m , 1 ) = = j ) 0 otherwise
where τ   ϵ   R M ^ × 2 , its first and second columns store the start bus and end bus index of each line in E^, respectively. We denote a p = [ a g   a b ], x = [g; b], then Equation (27) can be expressed as follows:
p = a p x
A minimized loss function model for solving line parameters can be built based on Equation (30). We denote the number of data samples in stage 2 as M2, and the optimization model based on square loss function can be expressed as follows:
min A 1 x y 1 2 2
where A 1 = [ap1; …; apM2], y 1 = [p1; …; pM2].
In order to prevent overfitting and enhance the stability of the solution, the L2 regularization term is added to the loss function to construct the ridge regression model as follows:
min A 1 x y 1 2 2 + λ x 2
where λ is the regularization parameter.
The regularization term of the ridge regression model equally penalizes all parameters of x; consequently, biased estimates may occur when the distribution grid contains several lines with relatively large admittance values. Therefore, an adaptive ridge regression model is established as follows:
min A 1 x y 1 2 2 + λ w x 2
where ‘·’ is the matrix dot product operator, w   ϵ   R M ^ × 2 as follows:
w = ( x 0 γ + δ γ ) 1 γ
where both δ and γ are constant parameters, referring to [28], we select δ = 10−5, γ = 2, x0 is the initial estimated value of x, which can be obtained by least square method as follows:
x 0 = ( A 1 T A 1 ) 1 A 1 T y .
Equations (33)–(35) indicate that the adaptive ridge regression model adds a penalty weight coefficient to each variable according to the initial estimated value of the corresponding line parameter. The weight coefficient is smaller when the admittance of the corresponding line is larger; as a result, the degree of shrinkage is reduced.
Analogically, we can obtain (36) from Equation (26):
min A 2 x y 2 2 2 + λ w x 2
where A 2 = [aq1; …; aqM2], y 2 = [q1; …; qM2], aq = [ab -ag], q = [q1qN]T.
Based on the above analysis, the flow chat of the proposed enhanced identification algorithm is shown in Figure 2. In the figure, ξ is the empirical threshold for deciding whether the iteration converges or not, which is set to 0.02. Appendix A specifically introduces the process of determining the value of ξ. It is worth noting that the adaptive ridge regression model in (33) and (36) are convex optimization problems, and stage 1 provides an appropriate initial value for the estimation of branch admittance; as a result, the enhanced identification algorithm has a fast solution speed and good convergence performance.

4.2. Parameter Selection

4.2.1. The Number of Measurement Samples

In order to improve the accuracy of the initial identification results, the measurement data of stage 1 should have a certain degree of redundancy. Fortunately, the linear regression problem is easy to solve, and the identification algorithm with redundant measurements still has a fast execution speed. The number of optimization variables is lower in stage 2 than in stage 1; as a consequence, we can reduce the samples used in stage 2 to save computing resources. In order to ensure the robustness of the algorithm, the number of independent equations M2 × N in Equation (33) should not be less than 2 × M^. Therefore, M2 should not be less than 2M^/N.

4.2.2. The Value of λ

λ directly affects the performance of the algorithm. When the value of λ is too small, the effect of the penalty term is not obvious; when the value of λ is too large, there is a high risk of eliminating too many lines, which makes it difficult to converge in a power flow calculation. Generalized cross-validation (GCV) selects the appropriate value of λ by minimizing V(λ), defined as follows [29]:
V ( λ ) = ( M 2 × N ) 1 ( I K 1 ) y 1 2 2 ( M 2 × N ) 1 t r ( I K 1 ) 2
where I is the identity matrix, tr() represents the trace of matrix, K 1 is defined as follows:
K 1 = A 1 ( A 1 T A 1 + ( M 2 × N ) λ w w I 2 M ^ ) 1 A 1 T .
Figure 3 shows the V(λ) curve of an IEEE 33-bus system in one iteration. The blue dot in the figure is the minimum point, and the curve changes gently near the minimum value. In order to better reflect the advantage of regularization, we select λ where the curve begins to rise significantly; that is, the λ corresponding to the red star point in the figure.

5. Results

5.1. Simulation Parameter Setting

The identification algorithm was executed in MATLAB, and the convex optimization model was solved by CVX toolbox. IEEE 33- and 118-bus systems (case 1 and case 2 respectively for short) were used to verify the effectiveness of the proposed identification method. The grid structure of the test cases is shown in Figure 4. We chose bus 1 as the slack bus and assumed that all buses in the grids were equipped with smart meters.
The one-day active power injection measurement data of 1200 users in China were collected as the original sample sets, and 50–150 users were randomly assigned to each bus except the slack bus to simulate the power consumption characteristics of the distribution network. We emulated the corresponding reactive power injection according to a random lagging power factor cos φ, e.g., cos φ~Unif (0.85, 0.95) [30], and the voltage magnitude data were simulated by MATPOWER. The sampling period of the original datasets was 1 min. With the intention of alleviating the data redundancy of the regression model and reducing the consumption of computational resources, we set the sampling interval as 5 min. Affected by the accuracy of the measuring equipment, the sample data were contaminated with noise. In order to verify the practical applicability of the proposed method in real scenarios, noise was added to the measurement data. In most cases, the accuracy level of existing smart meters deployed in distribution networks is 0.2 s, so we set 0.2% additional error ε for the power injection data. Furthermore, we set ωg of case 1 and case 2 as 0.03 and 0.008, respectively, and the M2 of case 1 and case 2 as 10 and 20, respectively.

5.2. Performance Evaluation Index

We selected the F1 score as the index to evaluate the accuracy of the topology identification results. The F1 score was defined as follows:
F 1 = 2 × f P × f R f P + f R
where f P and f R are, respectively, the index of precision and recall as follows:
f P = T P T P + F P
f R = T P T P + F N
where TP, FP and FN, respectively, denote the number of branches that are correctly identified, the number of branches that are mistakenly identified as connected, and the number of branches that are mistakenly identified as unconnected.
We selected the mean absolute percentage error (MAPE) as the index to evaluate the accuracy of line parameter identification results, which is defined as follows:
I M A P E ( g ) = 1 m l i j E g i j ^ g i j g i j × 100 %
I M A P E ( b ) = 1 m l i j E b i j ^ b i j b i j × 100 %
where m is the number of lines, g i j ^ and b i j ^ are the estimated conductance and susceptance of lij, respectively.

5.3. Identification Results and Comparative Analysis

5.3.1. Identification Results

The identification results of test cases are shown in Table 2. The F1 scores of two cases both exceed 0.9 after stage 1, indicating that the initial identification algorithm can effectively remove most of the unconnected lines. However, due to the simplification of the power flow model and the neglect of the voltage angle information, there is still a fraction of misidentified elements in E^ and certain errors in the estimated value of the line admittance. After stage 2, two cases both achieved correct identification of the grid topology, and the MAPE of line parameters was between 0.26% and 0.65%. Considering that we set ε as 0.2%, the line parameter identification accuracy of the two cases is within the acceptable range. Figure 5 and Figure 6 show the parameter estimation errors of each line after stage 1 and stage 2, respectively. After modifying the identification results in stage 2, the estimation accuracy of all line parameters has been significantly improved.

5.3.2. Comparative Analysis of Linear Models

In order to verify the superiority of the proposed linear power flow model in topology identification, we select two typical linear power flow models for comparative testing with different levels of measurement errors. The corresponding mathematical model is shown in Table 3.
The initial identification results of case 2 with three linear models are shown in Table 4. For all levels of measurement errors, the topology identification accuracy of proposed model is higher than that of other two comparison models, and the proposed model is robust to data noise.

5.3.3. Comparative Analysis of Optimization Models

In order to verify the performance superiority of the adaptive ridge regression model, we selected the minimum loss function model (model A for short), the ridge regression model (model B for short), and the adaptive lasso regression model based on L1 regularization (model C for short) [31] for comparison. The test results are shown in Figure 7.
All models except model B can correctly identify the topology—model B has the misidentified lines in case 2. The reason for this is that, for case 2, there are more line parameters to be estimated, and stage 1 can only output rough admittance estimates. Due to model B adding the same penalty weight to all variables, branches are more likely to be deleted incorrectly, resulting in errors in topology identification. Compared with the adaptive ridge regression model, the number of algorithm iterations based on model A is higher and the estimation error of the line admittance is relatively higher. The IMAPE(g) in case 2 increases by 0.72%, indicating that the over-fitting can be effectively solved by regularization, so that the proposed model has better regression performance. Among the four optimization models, the identification algorithm based on model B has the most iterations and the lowest accuracy with respect to line parameter estimation. The IMAPE(b) in case 1 reaches 5.69%. This is because the ridge regression model excessively penalizes the elements with large admittance values, which leads to suboptimal estimation. The iterations of the identification algorithm based on model C are fewer, indicating that, compared with L2 regularization, L1 regularization has a faster convergence speed. However, the model proposed in this paper still has higher accuracy with regard to line parameter estimation. This is because L1 regularization can not only constrain parameters and prevent overfitting, but also constrains some line parameters with small admittances to zero, thus is more suitable for feature selection. While the constraint characteristic of L2 regularization can prevent a variable from being deleted directly, if the weight is too large in the optimization process, the optimization model is not particularly sensitive to data noise; therefore, the identification performance is superior.

5.4. Sensitivity Analysis

In this subsection, we tested the influence of measurement errors and various grid structures on the identification results to verify the robustness of proposed identification method.

5.4.1. Sensitivity to Measurement Errors

Data samples inevitably have measurement errors. The identification results of proposed method with different measurement errors are shown in Figure 8. For all levels of errors, the proposed method can correctly identify the grid topology, and the number of iterations is relatively stable for the enhancement identification algorithm. With the increase of measurement errors, the estimation accuracy of line parameters decreases accordingly. When ε is 2%, IMAPE(g) and IMAPE(b) in case 1 are 3.66% and 4.80% respectively, which are less than 2.5 times of ε, IMAPE(g) and IMAPE(b) in case 2 are 2.57% and 5.96% respectively, which are less than 3 times of ε. Therefore, even if the data accuracy is low, the estimation error of line parameters is still within the normal range.

5.4.2. Sensitivity to Various Grid Structures

Distribution networks have various topology structures. By changing the state of line switches in case 1, we generated 9 comparative cases (called T1–T9, respectively). The specific modification scheme is shown in Table 5. Notably, the structures of T8 and T9 are weakly meshed.
The identification results are shown in Table 6. For various grid structures, the proposed method can correctly identify the topology, and the estimation accuracy of line parameters has strong stability. However, compared with others, there are more iterations of the enhanced identification algorithm in T9 due to the robustness of the ridge regression. Therefore, the proposed identification method is effective for various grid structures; however, for some weakly meshed grids, the time cost of the identification algorithm may be relatively high. If real-time application is strictly required, the iterative convergence condition can be relaxed to improve the solving speed, which may sacrifice some accuracy.

6. Conclusions

Accurate topology and line parameters are prerequisites for the safe and efficient operation of a distribution network. Based on the measurement data collected by smart meters, a method for the joint identification of grid topologies and line parameters was proposed in this paper. First, a linear power flow model was established according to the operation characteristics of the distribution network, and the initial identification results were quickly obtained based on linear regression. On this basis, a decoupling iterative optimization algorithm was proposed to modify the identification results. The superiority and robustness of the proposed method were verified in a case study. The method is suitable for distribution networks without PMU equipment. Moreover, our method was also validated for both radial and weakly meshed grids.

Author Contributions

Conceptualization, M.L.; methodology, M.L.; software, C.W.; validation, C.W. and M.L.; formal analysis, C.W., Z.L. and M.L.; investigation, Z.L.; resources, C.Z.; data curation, D.J.; writing—original draft preparation, M.L.; writing—review and editing, M.L.; visualization, C.W., M.L. and C.Z.; supervision, D.J.; project administration, Z.L.; funding acquisition, Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Research on Key Technologies for Constructing the ‘Integrated Map of Operation and Distribution’ Based on Multi-source Data Fusion, grant number J2023062.

Data Availability Statement

Restrictions apply to the availability of these data. The data are not publicly available due to confidentiality agreements.

Conflicts of Interest

Authors Chong Wang and Zheng Lou were employed by the Information and Communication Branch, State Grid Jiangsu Electric Power Co., Ltd., Authors Zhaoyang Zhu and Dongsheng Jing were employed by the State Grid Suzhou Power Supply Company. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A

ξ is an empirical threshold used to judge whether the iterative process of enhanced identification algorithm converges or not. Due to Δx^ containing the change information of the estimated line admittance, we can observe the change of Δx^ in the iterative process, when the absolute value of the Δx^ difference before and after iteration is lower than the convergence threshold ξ, it is considered that stable values for estimated line parameters have been obtained and the algorithm ends its iterations and outputs the final identification results.
When we select the value of ξ, the IEEE-33 bus system is first used to execute the enhanced identification algorithm without setting ξ. The changes in Δx^, IMAPE(g) and IMAPE(b) in the iterative process are shown in Figure A1. Δx^ is basically stable after the 15th iteration, and the variation range of Δx^ before and after iteration is 0.017, which is lower than 0.02. As the iteration continues, the variation range of Δx^ is very small, suggesting that the estimated values of line parameters will not change greatly, and we can observe that both IMAPE(g) and IMAPE(b) do not decrease obviously. As a result, we can end the algorithm after the 15th iteration; thus ξ is set as 0.02.
Figure A1. Convergence characteristics of IEEE 33-bus distribution system in stage 2.
Figure A1. Convergence characteristics of IEEE 33-bus distribution system in stage 2.
Energies 17 00830 g0a1
Furthermore, for the purpose of verifying the effectiveness of the selected ξ in a large-scale distribution network, the IEEE-118 bus system was used to execute the same algorithm with ξ = 0.02. The results show that the algorithm ends after 19 iterations. In order to confirm the validity of ξ, we continued to execute the algorithm artificially, and the changes in Δx^, IMAPE(g) and IMAPE(b) in the iterative process are shown in Figure A2. It can be observed that Δx^, IMAPE(g) and IMAPE(b) are basically stable after the 19th iteration, after which the estimation errors of the line parameters will not decrease obviously. It is thus appropriate to end the algorithm after the 19th iteration; consequently, 0.02 is a suitable value for ξ.
Figure A2. Convergence characteristics of IEEE 118-bus distribution system in stage 2.
Figure A2. Convergence characteristics of IEEE 118-bus distribution system in stage 2.
Energies 17 00830 g0a2

References

  1. Zhao, J.; Li, L.; Xu, Z.; Wang, X.; Wang, H.; Shao, X. Full-scale distribution system topology identification using Markov random field. IEEE Trans. Smart Grid 2020, 11, 4714–4726. [Google Scholar] [CrossRef]
  2. Yu, X.; Zhao, J.; Zhang, H.; Wang, X.; Bian, X. Data-driven distributed grid topology identification using backtracking Jacobian matrix approach. IEEE Trans. Ind. Inform. 2024, 20, 1711–1720. [Google Scholar] [CrossRef]
  3. Lv, X.; Yuan, L.; Cheng, Z.; He, Y.; Yin, B.; Ding, C. Sparse Bayesian learning-based topology reconstruction under measurement perturbation for fault location. IEEE Trans. Instrum. Meas. 2024, 73, 1–9. [Google Scholar] [CrossRef]
  4. Wang, N.; Dimarogonas, D.V. Finite-time topology identification for complex dynamical networks. In Proceedings of the 2023 62nd IEEE Conference on Decision and Control, Singapore, 13–15 December 2023; pp. 425–430. [Google Scholar]
  5. Manbachi, M.; Ordonez, M. AMI-based energy management for islanded AC/DC microgrids utilizing energy conservation and optimization. IEEE Trans. Smart Grid 2019, 10, 293–304. [Google Scholar] [CrossRef]
  6. Bi, J.; Luo, F.; Liang, G.; Yang, X.; He, S.; Dong, Z.Y. Impact assessment and defense for smart grids with FDIA against AMI. IEEE Trans. Netw. Sci. Eng. 2023, 10, 578–591. [Google Scholar] [CrossRef]
  7. Jiang, W.; Lin, X.; Yang, Z.; Xu, D.; Chen, J.; Zhang, K. Line parameters identification in LV distribution stations based on non-liner least squares. In Proceedings of the 2023 5th International Conference on Power and Energy Technology, Tianjin, China, 27–30 July 2023; pp. 666–670. [Google Scholar]
  8. Dutta, R.; Patel, V.S.; Chakrabarti, S.; Sharma, A.; Das, R.K.; Mondal, S. Parameter estimation of distribution lines using SCADA measurements. IEEE Trans. Instrum. Meas. 2021, 70, 1–11. [Google Scholar] [CrossRef]
  9. Wang, Y.; Xia, M.; Yang, Q.; Song, Y.; Chen, Q.; Chen, Y. Augmented state estimation of line parameters in active power distribution systems with phasor measurement units. IEEE Trans. Power Del. 2022, 37, 3835–3845. [Google Scholar] [CrossRef]
  10. Wang, W.; Yu, N. Estimate three-phase distribution line parameters with physics-informed graphical learning method. IEEE Trans. Power Syst. 2022, 37, 3577–3591. [Google Scholar] [CrossRef]
  11. Wang, W.; Yu, N.; Zhao, Y. Fast graphical learning method for parameter estimation in large-scale distribution networks. In Proceedings of the 2022 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids, Singapore, 25–28 October 2022; pp. 27–33. [Google Scholar]
  12. Wang, J.; Xie, D.; Peng, W.; Ding, C.; Wang, H. Identification of distribution network parameters based on particle swarm optimization method. In Proceedings of the 2023 4th International Conference on Information Science, Parallel and Distributed Systems, Guangzhou, China, 14–16 July 2023; pp. 422–427. [Google Scholar]
  13. Qin, C.; Vyakaranam, B.; Etingov, P.; Venetos, M.; Backhaus, S. Machine learning based network parameter estimation using AMI data. In Proceedings of the 2022 IEEE Power & Energy Society General Meeting, Denver, CO, USA, 17–21 July 2022; pp. 1–5. [Google Scholar]
  14. Ban, J.; Park, J.-Y.; Kim, Y.-J.; Catalão, J.P.S. AMI data-driven strategy for hierarchical estimation of distribution line impedances. IEEE Trans. Power Del. 2023, 38, 513–527. [Google Scholar] [CrossRef]
  15. Yu, J.; Weng, Y.; Rajagopal, R. PaToPa: A data-driven parameter and topology joint estimation framework in distribution grids. IEEE Trans. Power Syst. 2018, 33, 4335–4347. [Google Scholar] [CrossRef]
  16. Yu, J.; Weng, Y.; Rajagopal, R. PaToPaEM: A data-driven parameter and topology joint estimation framework for time-varying system in distribution grids. IEEE Trans. Power Syst. 2019, 34, 1682–1692. [Google Scholar] [CrossRef]
  17. Srinivas, V.L.; Wu, J. Topology and parameter identification of distribution network using smart meter and µPMU measurements. IEEE Trans. Instrum. Meas. 2022, 71, 1–14. [Google Scholar] [CrossRef]
  18. Li, H.; Weng, Y.; Vittal, V.; Blasch, E. Distribution grid topology and parameter estimation using deep-shallow neural network with physical consistency. IEEE Trans. Smart Grid 2024, 15, 655–666. [Google Scholar] [CrossRef]
  19. Moffat, K.; Bariya, M.; Von Meier, A. Unsupervised impedance and topology estimation of distribution networks—Limitations and tools. IEEE Trans. Smart Grid 2020, 11, 846–856. [Google Scholar] [CrossRef]
  20. Chang, J.-W.; Lee, G.-S.; Oh, S. Bus impedance matrix estimation using non-PMU measurements. IEEE Trans. Power Syst. 2023, 38, 1743–1746. [Google Scholar] [CrossRef]
  21. Lave, M.; Reno, M.J.; Peppanen, J. Distribution system parameter and topology estimation applied to resolve low-voltage circuits on three real distribution feeders. IEEE Trans. Sustain. Energy 2019, 10, 1585–1592. [Google Scholar] [CrossRef]
  22. Park, S.; Deka, D.; Backhaus, S.; Chertkov, M. Learning with end-users in distribution grids: Topology and parameter estimation. IEEE Trans. Control Netw. Syst. 2020, 7, 1428–1440. [Google Scholar] [CrossRef]
  23. Wang, X.; Zhao, Y.; Zhou, Y. A data-driven topology and parameter joint estimation method in non-PMU distribution networks. IEEE Trans. Power Syst. 2024, 39, 1681–1692. [Google Scholar] [CrossRef]
  24. Li, Z.; Yu, J.; Wu, Q.H. Approximate linear power flow using logarithmic transform of voltage magnitudes with reactive power and transmission loss consideration. IEEE Trans. Power Syst. 2018, 33, 4593–4603. [Google Scholar] [CrossRef]
  25. Yang, Z.; Zhong, H.; Bose, A.; Zheng, T.; Xia, Q.; Kang, C. A linearized OPF model with reactive power and voltage magnitude: A pathway to improve the MW-only DC OPF. IEEE Trans. Power Syst. 2018, 33, 1734–1745. [Google Scholar] [CrossRef]
  26. Xu, Z.; Jiang, W.; Xu, J.; Wang, D.; Wang, Y.; Ou, Z. Distribution network topology identification using asynchronous transformer monitoring data. IEEE Trans. Ind. Appl. 2023, 59, 323–331. [Google Scholar] [CrossRef]
  27. Zimmerman, R.D.; Murillo-Sánchez, C.E.; Thomas, R.J. MATPOWER: Steady-state operations, planning, and analysis tools for power systems research and education. IEEE Trans. Power Syst. 2011, 26, 12–19. [Google Scholar] [CrossRef]
  28. Frommlet, F.; Nuel, G. An adaptive ridge procedure for L0 regularization. PLoS ONE 2016, 11, e0148620. [Google Scholar] [CrossRef]
  29. Golub, G.H.; Heath, M.; Wahba, G. Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics 1979, 21, 215–223. [Google Scholar] [CrossRef]
  30. Liao, Y.; Weng, Y.; Liu, G.; Rajagopal, R. Urban MV and LV distribution grid topology estimation via group Lasso. IEEE Trans. Power Syst. 2019, 34, 12–27. [Google Scholar] [CrossRef]
  31. Al-Khowarizmi; Efendi, S.; Nasution, M.K.; Herman, M. The role of detection rate in MAPE to improve measurement accuracy for predicting FinTech data in various regressions. In Proceedings of the 2023 International Conference on Computer Science, Information Technology and Engineering, Jakarta, Indonesia, 16–17 February 2023; pp. 874–879. [Google Scholar]
Figure 1. Overall flow of the identification method.
Figure 1. Overall flow of the identification method.
Energies 17 00830 g001
Figure 2. Flow chart of enhanced identification algorithm.
Figure 2. Flow chart of enhanced identification algorithm.
Energies 17 00830 g002
Figure 3. V(λ) curve of IEEE 33-bus distribution system in one iteration.
Figure 3. V(λ) curve of IEEE 33-bus distribution system in one iteration.
Energies 17 00830 g003
Figure 4. Grid structure of test cases: (a) IEEE 33-bus distribution system; and (b) IEEE 118-bus distribution system.
Figure 4. Grid structure of test cases: (a) IEEE 33-bus distribution system; and (b) IEEE 118-bus distribution system.
Energies 17 00830 g004aEnergies 17 00830 g004b
Figure 5. Relative errors of line parameters after stage 1. (a) IEEE 33-bus distribution system; (b) IEEE 118-bus distribution system. (where g represents line conductance, b represents line admittance).
Figure 5. Relative errors of line parameters after stage 1. (a) IEEE 33-bus distribution system; (b) IEEE 118-bus distribution system. (where g represents line conductance, b represents line admittance).
Energies 17 00830 g005
Figure 6. Relative errors of line parameters after stage 2: (a) IEEE 33-bus distribution system; and (b) IEEE 118-bus distribution system.
Figure 6. Relative errors of line parameters after stage 2: (a) IEEE 33-bus distribution system; and (b) IEEE 118-bus distribution system.
Energies 17 00830 g006aEnergies 17 00830 g006b
Figure 7. Identification results under different optimization models: (a) F1 score and number of enhanced identification algorithm iterations; (b) IMAPE(g); and (c) IMAPE(b).
Figure 7. Identification results under different optimization models: (a) F1 score and number of enhanced identification algorithm iterations; (b) IMAPE(g); and (c) IMAPE(b).
Energies 17 00830 g007
Figure 8. Identification results under different measurement errors: (a) F1 score and number of enhanced identification algorithm iterations; (b) IMAPE(g); and (c) IMAPE(b).
Figure 8. Identification results under different measurement errors: (a) F1 score and number of enhanced identification algorithm iterations; (b) IMAPE(g); and (c) IMAPE(b).
Energies 17 00830 g008
Table 1. |Gik|/|Gii| of distribution network branches.
Table 1. |Gik|/|Gii| of distribution network branches.
Distribution NetworkThe Mean of |Gik|/|Gii|The Minimum of |Gik|/|Gii|ωg
IEEE 15-bus system0.50.20.07
IEEE 33-bus system0.50.090.03
IEEE 74-bus system0.50.090.01
IEEE 118-bus system0.50.050.008
Table 2. Identification results of test cases.
Table 2. Identification results of test cases.
IndexCase 1Case 2
Actual number of lines32117
Number of identified lines after stage 133133
F1 score after stage 10.980.94
IMAPE(g)/% after stage 128.8033.02
IMAPE(b)/% after stage 126.9027.76
Number of identified lines after stage 232117
F1 score after stage 211
IMAPE(g)/% after stage 20.350.26
IMAPE(b)/% after stage 20.540.65
The number of enhanced identification algorithm iterations1519
Table 3. Typical linear power flow model.
Table 3. Typical linear power flow model.
ModelLinear Power Flow Equations
[24] p i ( 1 ln v i ) k = 1 N G i k ( 1 + ln v k ) k = 1 N B i k θ k
q i ( 1 2 ln v i ) k = 1 N B i k ( 1 ln v i + ln v k ) k = 1 N G i k θ k
[25] p i k = 1 N G i k v k 2 2 k = 1 N B i k θ k
q i k = 1 N B i k v k 2 2 k = 1 N G i k θ k
Table 4. Initial identification results under different linear power flow models.
Table 4. Initial identification results under different linear power flow models.
ModelF1 Score after Stage 1
ε = 0.1%ε = 0.2%ε = 0.5%ε = 1%ε = 2%
Proposed0.940.940.940.920.92
[24]0.810.820.800.820.81
[25]0.930.930.930.910.92
Table 5. Topology structure modification scheme.
Table 5. Topology structure modification scheme.
IndexLine Switch OffLine Switch On
T1l7,8l8,21
T2l11,12l12,22
T3l28,29l25,29
T4l7,8, l14,15l8,21, l9,15
T5l11,12, l28,29l12,22, l25,29
T6l7,8, l14,15, l11,12l8,21, l9,15, l12,22
T7l7,8, l14,15, l11,12, l28,29l8,21, l9,15, l12,22, l25,29
T8 l18,33
T9 l25,29
Table 6. Identification results under different topology structures.
Table 6. Identification results under different topology structures.
IndexF1 Score
after Stage 2
IMAPE(g) after Stage 2IMAPE(b) after Stage 2Number of Enhanced Identification Algorithm Iterations
T0 110.350.5415
T110.330.5915
T210.240.4619
T310.320.5515
T410.360.4916
T510.380.5217
T610.360.4715
T710.350.5716
T810.380.4716
T910.400.5540
1 T0 is the original topology of case 1.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, C.; Lou, Z.; Li, M.; Zhu, C.; Jing, D. Identification of Distribution Network Topology and Line Parameter Based on Smart Meter Measurements. Energies 2024, 17, 830. https://doi.org/10.3390/en17040830

AMA Style

Wang C, Lou Z, Li M, Zhu C, Jing D. Identification of Distribution Network Topology and Line Parameter Based on Smart Meter Measurements. Energies. 2024; 17(4):830. https://doi.org/10.3390/en17040830

Chicago/Turabian Style

Wang, Chong, Zheng Lou, Ming Li, Chaoyang Zhu, and Dongsheng Jing. 2024. "Identification of Distribution Network Topology and Line Parameter Based on Smart Meter Measurements" Energies 17, no. 4: 830. https://doi.org/10.3390/en17040830

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop