Next Article in Journal
Method for Switching between Traction and Brake Control for Speed Profile Optimization in Mountainous Situations
Next Article in Special Issue
Intelligent Regulation Method for a Controllable Load Used for Improving Wind Power Integration
Previous Article in Journal
Hybrid Approach for Detecting and Classifying Power Quality Disturbances Based on the Variational Mode Decomposition and Deep Stochastic Configuration Network
Previous Article in Special Issue
Hybrid Imperialist Competitive and Grey Wolf Algorithm to Solve Multiobjective Optimal Power Flow with Wind and Solar Units
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Integrated Method to Diagnose Faults in Power Transformers

1
School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, China
2
China Electric Power Research Institute, Beijing 100192, China
*
Author to whom correspondence should be addressed.
Energies 2018, 11(11), 3041; https://doi.org/10.3390/en11113041
Submission received: 10 September 2018 / Revised: 6 October 2018 / Accepted: 15 October 2018 / Published: 5 November 2018
(This article belongs to the Special Issue Optimization Methods Applied to Power Systems)

Abstract

:
In a smart grid, many transformers are equipped for both power transmission and conversion. Because a stable operation of transformers is essential to maintain grid security, studying the fault diagnosis method of transformers can improve both fault detection and fault prevention. In this paper, a data-driven method, which uses a combination of Principal Component Analysis (PCA), Particle Swarm Optimization (PSO), and Support Vector Machines (SVM) to enable a better fault diagnosis of transformers, is proposed and investigated. PCA is used to reduce the dimension of transformer fault state data, and an improved PSO algorithm is used to obtain the optimal parameters for the SVM model. SVM, which is optimized using PSO, is used for the transformer-fault diagnosis. The diagnostic-results of the actual transformers confirm that the new method is effective. We also verified the importance of data richness with respect to the accuracy of the transformer-fault diagnosis.

1. Introduction

Several advanced technologies can be used to monitor power-equipment, and the large amount of status data of used equipment helps make the power grid “smarter”. Power transformers are expensive and important components of the smart grid, and hub devices for power transformation and transmission [1,2,3,4]. Because various faults, such as discharge and overheating, can occur during the operation of transformers, many characteristics corresponding to the faults can be affected like dissolved gases (H2, CH4, C2H6, C2H4, C2H2, CO, etc.), organic compounds (methanol, ethanol and 2-furfural), as well as the current and power of the transformers [5]. The dissolved gas analysis (DGA) is a common tool for monitoring and identifying transformer’s faults. IEEE C57.104. and IEC 60599 provide different methods such as key gases, Doernenburg Ratio, Rogers, three basic gas ratio, Duval triangle, and so on. However, due to the complexity of the working environment and the process structure of the transformers, these methods are not enough to make a right judgement and cannot judge fault fuzzy boundary. According to [6], their accuracy rates are about in 60%, which means the ratio methods cannot account for the diagnostic criteria completely [7]. In addition, the concentrations of cellulose chemical markers in oil, such as methanol, ethanol and 2-furfural, are used as a determination mark for diagnosing transformer insulation failure, which still present many challenges for an accurate interpretation in real transformers [8].
To improve the accuracy of fault diagnosis, artificial intelligence and machine learning algorithm were added to the field of transformer-fault diagnosis (TFD), including fuzzy sets [9], artificial neural networks (ANN) [10], artificial immune networks [11], probabilistic neural networks [12], rough sets [13], and support vector machines (SVM) [14]. These algorithms provide ways to develop new TFD technologies. However, these algorithms have some disadvantages. For example, it is difficult to determine the selection of parameters of fuzzy sets, artificial immune networks and probabilistic neural networks, ANNs are easier to fall into local minimum, and the fault-tolerant ability and generalization ability of rough set are weak.
SVM is usually used as a classification tool. From early 2-category techniques, multi-class SVM have been developed and are more suitable for TFD. The accuracy of multi-class SVM is determined by the parameters of its kernel function and penalty factor. In order to improve the efficiency of SVM in processing large amounts of input fault data, principal component analysis (PCA) will be used. Moreover, to reduce the influence of human experience and subjective judgment on these parameters, a new Particle Swarm Optimization (PSO) is borrowed to search the optimized parameters. This way, the most suitable SVM parameters within the effective input data reflecting the transformer’s fault will be found. SVM integrated with PCA and PSO can improves the speed and accuracy of TFD considerably. This paper is organized as follows. Section 2 introduces the complete TFD procedure implemented by improved SVM; In Section 3, we compare the accuracy of transformer-fault diagnosis using different methods. We then verify the effectiveness of the proposed method, and analyze the effect of data richness on the accuracy of the fault diagnosis. Section 4 summarizes all results.

2. TFD Model Based on SVM Integrated with PCA and PSO

TFD model based on SVM integrated with PCA and PSO is shown in Figure 1. It includes two main parts. One is that a set of transformer fault data (Data set) such as the densities of the dissolved gases is preprocessed by PCA. The other is that the parameters of SVM model are searched and optimized by PSO.

2.1. Data Set Preprocessed by PCA

TFD is a complicated task. In order to improve the operating efficiency of the SVM when there are many transformer fault data, the data needs to be pre-processed before they are used to train the SVM model. PCA aims to reduce the dimensions of fault data and replaces them with fewer uncorrelated and unoverlapped data (called principal components). The number of principal components is selected by variance contribution rate indicating how much information is included.
Suppose the data set X has n groups and each group has p fault data and they construct an original data observation matrix:
X n × p = ( x 11 x 12 x 1 p x 21 x 22 x 2 p x n 1 x n 2 x n p )
To solve the principal components, it needs to find i ( i p ) linear functions: y j i = A i T X j , where j = 1 , , n , X j = ( x j 1 , x j 2 , , x j p ) T , and A i = ( a 1 i , a 2 i , , a p i ) T is unknown. The amount of information of x is proportional to its variance. Letting Y j i = ( y 1 i , y 2 i , , y n i ) T and to avoid var ( Y j i ) , A i 2 = 1 Therefore, to obtain the maximum variance, the following equations of conditional extremes are formed:
{ max var ( Y j i ) = max A i T A i A i T A i = 1
where represents the covariance matrix.
Here Lagrange multiplier method is used to solve (2). The Lagrangian objective function is expressed as:
Q ( A i ) = A i T A i λ j ( A i T A i 1 )
where the Lagrange multiplier λ j is the characteristic root of and A i is the corresponding eigenvector. Because A i 0 and A i T A i = var ( A i T X j ) > 0 , is positive definite and all characteristic roots are positive. Assuming that:
λ ` j 1 λ ` j 2 λ ` j p 0
In the practical applications, only p principal components will be selected, which satisfies k = 1 p λ j k / i = 1 p λ j i 0.85 . The k-th principal component for the j-th group is y j k = A k T X j . All the principal components form a vecor Y j = ( y j 1 , y j 2 , , y j p ) T .

2.2. Support Vector Machine

Suppose the j-th group of principal component Y j reflects the fault type z j . We divide n groups of fault data into two sets. One set is the training set including l groups and the other set is the testing set including (n-l) groups. The training set is used to solve the parameters of SVM.
TFD is usually a multi-class problem to classify the categories in d ( d 2 ). The one-versus-one (OVO) method is adopted to extend 2-category SVM to multi-class SVM in this paper. This means it need to build SVM classifiers for any two different fault types F 1 and F 2 ( F 1 , F 2 { z j } ), and there are a total of d ( d 1 ) / 2 classifiers. Assume a hyperplane function ω T φ ( Y ) + b = 0 can accurately separate F 1 and F 2 whose category labels are marked in −1 and 1. Here ω is the normal vector of the hyperplane, b is the offset, and φ ( y ) is nonlinear transformation function. For the optimal classification hyperplane, the following conditions should be satisfied:
{ ω T φ ( Y j ) + b 1 , H j = 1 ω T φ ( Y j ) + b 1 , H j = 1
and H j { F 1 , F 2 } is the classification of Y j . In this case, Y j is mapped into a high-dimensional space.
The maximum margin between the plane and the nearest data is 1 / ω . The greater it is, the better the classification confidence is. To increase the misclassification tolerance of SVM, a non-negative variable e j is introduced. Then the problem can be described as:
{ min ( 1 2 ω 2 + C j = 1 l e j ) H j ( ω T φ ( Y j ) + b ) 1 e j e j 0
where C is a constant named penalty factor and controls the punishment degree for misclassified data. Lagrange multiplier method is also used to solve (6). The corresponding Lagrangian function is:
L ( ω , σ , e j , α j , β j ) = 1 2 ω 2 + C j = 1 l e j j = 1 l α j [ H j ( ω T φ ( Y j ) + b ) 1 + e j ] j = 1 l β j e j
where α j > 0 and β j > 0 are the Lagrangian multipliers. After α j ( j = 1 , , l ) , ω and b are solved, the final SVM classification function is:
f ( Y r ) = sgn ( j = 1 l α j H j K ( Y r , Y j ) + b )
where Y r ( r = l + 1 , , n ) is the r-th group data in testing set, K ( Y r , Y j ) = φ ( Y r ) φ ( Y j ) is the kernel function and we choose Gaussian radial basis function:
K ( Y r , Y j ) = exp ( Y r Y j 2 σ 2 )
where σ is the parameter of kernel function.

2.3. Parameter Optimization in SVM Using Improved PSO

As mentioned before, when using SVM for fault diagnosis, we first need to determine the parameter σ in kernel function (9) and penalty factor C in (6). σ affects the optimal classification performance and generalization ability of the SVM. C is required to balance the learning machine’s complexity and empirical risk when determining the minimization of the objective function. Therefore, σ and C should be optimized. We use an improved PSO algorithm for optimization.
Assuming that in a 2-dimensional search space, there is a swarm including S particles, q s = ( q s 1 , q s 2 )   ( s = 1 , , S ) . Each particle represents a potential solution and corresponds to a point in the 2-dimensional search space. Its velocity is v s = ( v s 1 , v s 2 ) T and optimal position is P s = ( P s 1 , P s 2 ) T . The optimal position within the S-particle population represents the global extremum, and it is set to P g = ( P g 1 , P g 2 ) T . The position-updating method for the particle’s velocity is expressed as:
v s d ( t + 1 ) = w v ( t ) v s d ( t ) + c 1 ( t ) r 1 ( t ) ( P s d ( t ) q s d ( t ) ) + c 2 ( t ) r 2 ( t ) ( P g d ( t ) q s d ( t ) ) , d = 1 , 2
q s d ( t + 1 ) = q s d ( t ) + v s d ( t + 1 )
where c 1 ( t ) and c 2 ( t ) are acceleration constants, r 1 ( t ) and r 2 ( t ) obey the (0,1)-uniform distribution, w v ( t ) is the speed update inertia weight representing the effect of the previous generation’s particles on the next generation particles’ velocity during the particle updating process.
Generally, the algorithm has relatively strong global optimization capability when w v ( t ) is large, and a relatively strong local optimization capability when w v ( t ) is small. However, the linear weight-adjustment method is single, and thus limits the optimization of the search ability. Aiming to change to single adjustment mode and better adapt to the complex environment, we present a new scheme for the stochastic inertia weight:
{ w v ( t ) = α 1 + ε / 2.0 , k 0.05 w v ( t ) = α 2 + ε / 2.0 , k < 0.05
k = fit ( t ) fit ( t - 10 ) fit ( t - 10 )
where fit ( t ) represents the optimal fitness value of the t-th generation and fit ( t - 10 ) is the optimal fitness value of the (t-10)-th generation, α 1 and α 2 are set to 0.5 and 0.4, respectively, reflecting the search ability in different situations, and ε is a random value between 0 and 1. The acceleration constants c 1 ( t ) and c 2 are modified in:
{ c 1 ( t ) = c 1 1 + ( c 12 c 1 1 ) t T c 2 ( t ) = c 21 + ( c 22 c 21 ) t T
where c 1 ( t ) decreases linearly from the initial value c 11 to the final value c 12 , while c 2 ( t ) increases linearly from c 21 to c 22 .

3. Verification and Discussion

Based on the above mentioned SVM-diagnosis model, optimized using PSO, a code is made in MATLAB in which SVM algorithm is implemented directly by MATLAB toolkit [15]. Some real TFD examples are analyzed.

3.1. TFD Example 1

We analyze the dissolved-gas data for the existing 157 groups of transformers under normal and other fault conditions. The dissolved-gas data were detected from 6 types of real transformer faults: low-energy discharge fault (LE-D), high-energy discharge fault (HE-D), high temperature overheat fault (HT), medium temperature overheating fault (MT), medium and low temperature overheating failure (ML-T), and low temperature overheating fault (LT). 112 groups of data were selected as training samples, and the remaining 45 groups were used for testing. The distribution of the various faults and normal state samples are shown in Table 1.
In this analysis, the particle swarm number is 20, the maximum iteration number is 200, and the search intervals for parameters C and σ are [0.01, 1000] and [0.01, 1000], respectively. Furthermore, C = 15.8823 and σ = 50.1658.
The fault diagnostic results of both the training set and the testing set are shown in Figure 2. Only one group of samples shows diagnosis errors among 112 groups of training samples (LE-D fault is diagnosed as HT fault), while 3 out of 45 samples yield diagnosis errors which are marked by circles. The accuracy reaches 93.33%. Diagnostic errors are either normal (diagnosed as HE-D), or LE-D fault (diagnosed as HE-D), or MT fault (diagnosed as HE-D). The results indicate that this method is relatively accurate and capable of realizing the aim of TFD.
We adopt the three-ratio method, Duval triangle method, back propagation neural network (BPNN), and SVM methods to diagnose the testing data set for comparison. The same set of data was used for all methods. During the test, BPNN selected a network structure with 13 hidden nodes.
Table 2 shows the fault-diagnosis accuracy for different methods, when testing the same sample of transformer. The Duval triangle method shows the lowest accuracy. The three ratio method’s accuracy is better than The Duval triangle method, however, worse than other methods. Both of three-ratio and Duval triangle methods are obtained from typical accidents, and they will fail when dealing with some complicated faults. The accuracy of the neural-network algorithm (BPNN) is 60% and it will be improved if there are a lot of data. Compared with the BPNN and IEC methods, the SVM method shows a relatively good diagnosis. When the SVM parameters are optimized, the accuracy of the fault diagnosis improves substantially.

3.2. TFD Example 2

This section uses SVM optimized by PSO to analyze the fault and normal states from the 132 groups of data detected from real transformers. The data were from the oil-dissolved gas and SCADA. We also verified the impact of data richness on the results. The dissolved gases in the oil include C2H2, C2H4, C2H6, CH4, CO, CO2, H2 and total hydrocarbon. The SCADA data include maximum current, minimum current, average current, maximum active power, minimum active power, average active power, maximum reactive power, minimum reactive power, and average reactive power. SVM optimized by PSO is used to diagnose the faults for three kinds of data: using only the dissolved gas data in oil, using only the SCADA data, and using all data. We used 112 groups as the training set and 20 groups as the testing set, and then judged the effect of data types on fault diagnosis.
In this experiment, the number for the particle swarm is 20, the maximum iteration number is 200, and the search interval of parameters C and σ are [0.01, 1000] and [0.01, 1000], respectively. The optimized parameter values and accuracy rate of different data types are shown in Table 3.
For TFD results using only the dissolved gas in the oil, four out of 20 groups of normal data are misdiagnosed as fault state. However, the fault state is diagnosed correctly, and so the accuracy can reach 80% in general. The TFD results of using only SCADA data, as shown in Table 3, has many errors in the non-fault diagnosis. 13 out of 20 groups were correctly evaluated, and the accuracy is as high as 65%. For the TFD results using the dissolved gas in oil and the SCADA data as input data of SVM optimized by PSO, only 1 out of 20 groups of test data is erroneous (the non-fault condition is misdiagnosed as fault state). Hence, the accuracy of the fault diagnosis is 95%.
Comparing the diagnostic results for the above three data types, type 1 and type 2 can be used to diagnose most of the transformer states, but there errors are different, when diagnosing non-fault states. This is because each data type can reflect the fault states of the transformer to some extent, including part of the transformers’ state changes—but not completely. By combining these two kinds of data, more state information can be considered. The SVM optimized by PSO can learn the rich information from the data, thus improving the accuracy of the transformer-fault diagnosis.

4. Conclusions

A new fault-diagnosis technology for transformers, SVM optimized using PSO, was proposed in this paper, and it fully combines the advantages of the SVM and PSO. We used the dissolved-gas data in oil as the characteristic quantity and compared it with traditional methods as three-ratio, Duval triangle, and the artificial intelligence and machine learning algorithms as BPNN and SVM. The results showed that the proposed technology greatly improves the accuracy of the fault diagnosis. We also analyzed the effect of different data types of transformer on the TFD results. The data types included the dissolved-gas data in the transformer oil only, the SCADA data only, and both of them. The combination of the dissolved-gas data and SCADA data can improve the accuracy of the new technology substantially. This also testifies the importance of having sufficient data to perform an accurate transformer fault diagnosis.
The proposed method can be applied to prognostics and health management system in smart substations. It is noted that, if the amount of fault data is very big, it will take a long time to perform the fault diagnosis and it is inappropriate for on-line diagnosis; if the amount of fault data is not enough, its accuracy will be limited. The balance between the data amount and accuracy in its real applications needs further discussion.

Author Contributions

Conceptualization, K.L., J.S., J.W. and L.X.; Methodology, K.L., J.W., J.S. and L.X.; Software, K.L.; Validation, K.L. and J.W.; Data resources, L.X.; Writing—original draft preparation, K.L.; Supervision, J.W.; Project administration, J.S. and J.W.; All authors have read and approved the final manuscript.

Funding

This work was supported in part by National Natural Science Foundation of China, grant number 51207004.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhao, Z.Y.; Tang, C.; Zhou, Q. Identification of Power Transformer Winding Mechanical Fault Types Based on Online IFRA by Support Vector Machine. Energies 2017, 10, 2022. [Google Scholar] [CrossRef]
  2. Kari, T.; Gao, W.S.; Zhao, D.B.; Zhang, Z.W.; Mo, W.X. An Integrated Method of ANFIS and Dempster-Shafer Theory for Fault Diagnosis of Power Transformer. IEEE Trans. Dielectr. Electr. 2018, 25, 360–371. [Google Scholar] [CrossRef]
  3. Wang, Y.Y.; Gong, S.L.; Grzybowski, S. Reliability evaluation method for oil-paper insulation in power transformers. Energies 2011, 4, 1362–1375. [Google Scholar] [CrossRef]
  4. Tenbohlen, S.; Coenen, S.; Djamali, M.; Müller, A.; Samimi, M.H.; Siegel, M. Diagnostic measurements for power transformers. Energies 2016, 9, 347. [Google Scholar] [CrossRef]
  5. Zheng, H.B.; Liao, R.J.; Grzybowski, S.; Yang, L.J. Fault diagnosis of power transformers using multi-class least square support vector machines classifiers with particle swarm optimization. IET Electr. Power Appl. 2011, 5, 691–696. [Google Scholar] [CrossRef]
  6. Zhao, A.X.; Zhang, C.T. DGA fault diagnosis based on the counter propagation neural network optimized by parallel genetic algorithm. In Proceedings of the 2013 IEEE International Conference of IEEE Region 10 (TENCON 2013), Xi’an, China, 22–25 October 2013; pp. 1–5. [Google Scholar]
  7. Ghoneim, S.S.M.; Taha, I.B.M.; Elkalashy, N.I. Integrated ANN-based proactive fault diagnostic scheme for power transformers using dissolved gas analysis. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 1838–1845. [Google Scholar] [CrossRef]
  8. Jalbert, J.; Lessard, M.C.; Ryadi, M. Cellulose chemical markers in transformer oil insulation part 1: Temperature correction factors. IEEE Trans. Dielectr. Electr. Insul. 2013, 2013. 20, 2287–2291. [Google Scholar] [CrossRef]
  9. Arispe, J.C.G.; Mombello, E.E. Power transformer diagnosis using FRA and Fuzzy Sets. IEEE Latin Am. Trans. 2015, 13, 2991–2997. [Google Scholar] [CrossRef]
  10. Seifeddine, S.; Khmais, B.; Abdelkader, C. Power transformer fault diagnosis based on dissolved gas analysis by artificial neural network. In Proceedings of the 2012 First International Conference on Renewable Energies and Vehicular Technology, Hammamet, Tunisia, 26–28 March 2012; pp. 230–236. [Google Scholar]
  11. Xiong, H.; Sun, C.X. Artificial immune network classification algorithm for fault diagnosis of power transformer. IEEE Trans. Power Deliv. 2006, 22, 930–935. [Google Scholar]
  12. Li, S.; Li, X.Y.; Wang, W.X. Fault Diagnosis of Transformer Based on Probabilistic Neural Network. In Proceedings of the Fourth International Conference on Intelligent Computation Technology and Automation, Shenzhen, China, 28–29 March 2011; Volume 1, pp. 128–131. [Google Scholar]
  13. Su, P.; Yuan, J.S.; An, X.L.; Li, Z.; Shen, T. Diagnosis method of transformer faults based on rough set theory. Electr. Power Sci. Eng. 2008, 24, 56–59. [Google Scholar]
  14. Bigdeli, M.; Vakilian, M.; Rahimpour, E. Transformer winding faults classification based on transfer function analysis by support vector machine. IET Electr. Power Appl. 2012, 6, 268–276. [Google Scholar] [CrossRef]
  15. Hsu, C.W.; Lin, C.J. A comparison of methods for multiclass support vector machines. IEEE Trans. Neural Netw. 2002, 13, 415–425. [Google Scholar] [PubMed] [Green Version]
Figure 1. Transformer-fault diagnosis (TFD) model based on support vector machines (SVM) integrated with principal component analysis (PCA) and particle swarm optimization (PSO).
Figure 1. Transformer-fault diagnosis (TFD) model based on support vector machines (SVM) integrated with principal component analysis (PCA) and particle swarm optimization (PSO).
Energies 11 03041 g001
Figure 2. Results of the transformer-failure diagnosis: (a) training sets; (b) testing sets.
Figure 2. Results of the transformer-failure diagnosis: (a) training sets; (b) testing sets.
Energies 11 03041 g002
Table 1. Statistics of samples for training and testing, corresponding to various types of real faults.
Table 1. Statistics of samples for training and testing, corresponding to various types of real faults.
Fault TypeTraining SampleTest SampleTotal
Normal17724
LE-D231033
HE-D20828
HT231033
MT729
ML-T13518
LT9312
Total11245157
Table 2. Accuracy rate for the different diagnostic methods of transformer.
Table 2. Accuracy rate for the different diagnostic methods of transformer.
MethodThree-RatioDuval TriangleBPNNSVMThis Paper
Accuracy rate51.111%42.222%60.000%75.556%93.333%
Table 3. Fault-diagnostic results of transformer of different methods.
Table 3. Fault-diagnostic results of transformer of different methods.
TypesDataC σ Accuracy Rate
1Dissolved gas data in oil only5.0230.70993180%
2SCADA data only26.56317.378765%
3Both above data36.69180.07458195%

Share and Cite

MDPI and ACS Style

Wu, J.; Li, K.; Sun, J.; Xie, L. A Novel Integrated Method to Diagnose Faults in Power Transformers. Energies 2018, 11, 3041. https://doi.org/10.3390/en11113041

AMA Style

Wu J, Li K, Sun J, Xie L. A Novel Integrated Method to Diagnose Faults in Power Transformers. Energies. 2018; 11(11):3041. https://doi.org/10.3390/en11113041

Chicago/Turabian Style

Wu, Jing, Kun Li, Jing Sun, and Li Xie. 2018. "A Novel Integrated Method to Diagnose Faults in Power Transformers" Energies 11, no. 11: 3041. https://doi.org/10.3390/en11113041

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop