An Integrated Model for Transformer Fault Diagnosis to Improve Sample Classification near Decision Boundary of Support Vector Machine

Zhang, Yiyi; Wang, Yuxuan; Fan, Xianhao; Zhang, Wei; Zhuo, Ran; Hao, Jian; Shi, Zhen

doi:10.3390/en13246678

Open AccessArticle

An Integrated Model for Transformer Fault Diagnosis to Improve Sample Classification near Decision Boundary of Support Vector Machine

by

Yiyi Zhang

¹

,

Yuxuan Wang

¹,

Xianhao Fan

¹,

Wei Zhang

²,

Ran Zhuo

³,

Jian Hao

⁴

and

Zhen Shi

^1,*

¹

Guangxi Key Laboratory of Power System Optimization and Energy Technology, Guangxi University, Nanning 530004, China

²

Guangxi Power Grid Co., Ltd. Electric Power Research Institute, Nanning 530000, China

³

Electric Power Research Institute of China Southern Power Grid Company Limited, Guangzhou 510080, China

⁴

School of Electrical Engineering, Chongqing University, Chongqing 400030, China

^*

Author to whom correspondence should be addressed.

Energies 2020, 13(24), 6678; https://doi.org/10.3390/en13246678

Submission received: 18 November 2020 / Revised: 8 December 2020 / Accepted: 12 December 2020 / Published: 17 December 2020

(This article belongs to the Special Issue Artificial Intelligence Technologies for Electric Power Systems)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Support vector machine (SVM), which serves as one kind of artificial intelligence technique, has been widely employed in transformer fault diagnosis when involving dissolved gas analysis (DGA). However, when using SVM, it is easy to misclassify samples which are located near the decision boundary, resulting in a decrease in the accuracy of fault diagnosis. Given this issue, this paper proposed a genetic algorithm (GA) optimized probabilistic SVM (GAPSVM) integrated with the fuzzy three-ratio (FTR) method, in which the GAPSVM can judge whether a sample is near the decision boundary according to its output probabilities and diagnose the samples which are not near the decision boundary. Then, FTR is used to diagnose the samples which are near the decision boundary. Combining GAPSVM and FTR, the integrated model can accurately diagnose samples near the decision boundary of SVM. In addition, to avoid redundant and erroneous features, this paper also used GA to select the optimal DGA features. The diagnostic accuracy of the proposed GAPSVM integrated with the FTR fault diagnosis method reached 86.80% after 10 repeated calculations using 118 groups of IEC technical committee (TC) 10 samples. Moreover, the robustness is also proven through 30 groups of DGA samples from the State Grid Co. of China and 15 practical cases with missing values.

Keywords:

power transformer; fault diagnosis; probabilistic support vector machine; expert experience; dissolved gas analysis feature; genetic algorithm

1. Introduction

Oil-immersed power transformers are important pieces of power transmission equipment in the power system. Transformer failure causes widespread power blackout, resulting in economic losses that cannot be estimated [1,2,3]. Therefore, the safe and stable operation of the transformer is important to the power system, and it is of great importance to diagnose transformer faults such as over-heating and discharges in time and correctly.

In the existing research, dissolved gas analysis (DGA) has been widely used as the on-line fault monitoring approach for power transformer fault diagnosis. The gases dissolved in the transformer oil mainly include hydrogen (H₂), methane (CH₄), acetylene (C₂H₂), ethylene (C₂H₄), and ethane (C₂H₆) from oil decomposition, in conjunction with carbon monoxide (CO) and carbon dioxide (CO₂) from paper decomposition. Currently, the commonly used DGA fault diagnosis methods, such as the Roger ratio [4], International Electrotechnical Commission (IEC) three ratios [5], and Doernerburg ratio [6], are based on experimental experiences, which results in many problems in application. For example, the Rogers ratio reflects the thermal decomposition temperature range only, and IEC three ratios has incomplete coding [7,8,9].

With the rise of artificial intelligence (AI) technology, the fault types of transformers can be diagnosed by complex function mapping based on the DGA data [10,11,12,13]. However, DGA benchmarking samples are difficult to acquire. Large-scale models like deep learning are challenging to apply to DGA-based fault diagnosis. Support vector machine (SVM) performs well with small samples and has a strong generalization ability [14,15]. Hence, SVM is widely used in transformer fault diagnosis based on DGA. Previous research [16] proposed an SVM-based transformer fault diagnosis method. Using the characteristic gas of DGA as the input feature of SVM, the fault type of the transformer can be diagnosed through the trained model. In order to reduce redundancy and incorrect input features, the authors of [17] proposed a genetic programming method to select effective DGA features to improve diagnostic accuracy. Moreover, the authors of [18] proposed a genetic algorithm (GA) combined with SVM to select the optimal DGA gas ratios to improve the diagnostic accuracy. Additionally, the kernel parameter and slack variable of SVM should be set manually, as inappropriate parameter settings reduce the accuracy of fault diagnosis. Thus, combining feature selection and SVM parameter optimization, the authors of [19] proposed improved krill herd (IKH) optimized SVM (IKHSVM), in which IKH can optimize the internal parameters of SVM. To avoid the noise and outliers affecting the diagnostic accuracy, the authors of [20] proposed fuzzy SVM, which can reflect the impact of different samples on SVM by assigning weights to them. The weights assigned to noise and outliers are reduced, which has little impact on the model. Moreover, to avoid the limitations of single SVM, the authors of [21,22] introduced SVM and another three classifiers combined into an ensemble classifier, and this method could always select the most accurate classifier using multi-objective Particle Swarm Optimization algorithm. In addition, the authors of [23] also proposed an association rule mining method, which can select the most appropriate fault diagnosis method from two empirical rules and three AI-based classifiers. These integrated methods have significantly improved the diagnostic accuracy. However, these methods only combine several classifiers and select the most effective classifier for diagnosing transformer fault types according to certain rules or optimization algorithms, resulting in a large time complexity in the calculation process. In addition, these studies have not pointed out the defects of each classifier.

According to previous research [24], SVM is prone to misclassifying certain samples located near the decision boundaries. Therefore, the key to improving the classification performance of SVM is to effectively classify the samples near the decision boundary. The authors of [25] proposed the probabilistic SVM (PSVM), which provides the probability of each class. It can be judged whether the sample is near the decision boundary according to the output probability of PSVM. To effectively diagnose the samples near decision boundaries and reduce the complexity of the integrated model, this paper introduced the expert experiment-based fuzzy three-ratio (FTR) model [26], which is not influenced by whether a sample is near the decision boundary of the SVM. Combining PSVM and FTR, a transformer fault diagnosis approach based on GA optimized probabilistic SVM (GAPSVM) integrated with FTR is achieved. The integrated model improves the diagnostic accuracy by effectively diagnosing the samples near the decision boundaries of SVM. Thus, the proposed approach has not only the superiority of AI-based algorithms but also combines with the expert experience to eliminate the impact of data quality on AI-based algorithms. Moreover, taking into account the redundant or wrong features, this paper also uses GA to screen the optimal DGA features (ODF) from 36 groups of generated features.

2. A Fault Diagnosis Approach Based on GAPSVM Integrated with Expert Experience

2.1. Optimization of Transformer DGA Features Based on GA and SVM

2.1.1. Gas Features Dissolved in Oil

The conventional DGA features mainly include H₂, CH₄, C₂H₂, C₂H₄, C₂H₆, CO, CO₂, and total hydrocarbon (TH). To find the ODF, contents of the above gases and the ratio of every two gas contents formed all DGA features to be selected. The corresponding DGA features are numbered in Table 1. No.1–No.8 are the conventional DGA features, and No.9–No.36 are the ratios of every two gas contents. ODF is selected from the above DGA features.

2.1.2. DGA Feature Selection Based on GA Combined with SVM

Feature engineering is an important procedure in machine learning. Redundant features will reduce the calculation speed of the algorithm, and incorrect features may reduce the accuracy of the algorithm [27]. The feature selection method based on GA and SVM proposed in [28] is improved and used in this work for ODF selection; the binary encoding of chromosomes is shown in Figure 1.

The chromosomes of GA are generated by binary coding. Each chromosome consists of three genes. The first two genes are penalty factor c and σ of SVM; the third gene is the 36 sets of DGA features in order. Moreover, the corresponding relationship is shown in Figure 1. The encoding “1” represents the DGA feature that has been selected, while “0” represents the one which has not been selected. The parameter settings of GA are shown in Table 2. The ODF can be obtained by GA iterations using k-fold cross-validation (CV) accuracy as the fitness function.

2.1.3. Nonlinear Support Vector Machine

The conventional SVM is a linear and two-class classifier which must be upgraded as the transformer fault diagnosis is a nonlinear and multi-classification problem. The nonlinear SVM model and its flowchart are shown in Figure 1.

\begin{array}{l} \min Φ (ω, ξ) = \frac{1}{2} {‖ ω ‖}^{2} + C \sum_{i = 1}^{l} ξ_{i} \\ s . t . {\begin{cases} y_{i} [ω^{T} φ (x_{i}) + λ] \geq 1 - ξ_{i} \\ ξ_{i} \geq 0, i = 1, 2, \dots, l \end{cases} \end{array}

(1)

where ξ_i is a slack variable and C is a penalty factor. The Lagrange function is presented as follows:

\begin{array}{l} L (ω, λ, ξ, α, β) = Φ (ω, ξ) - \\ \sum_{i = 1}^{l} α_{i} {y_{i} [ω^{T} φ (x_{i}) + λ] - 1 + ξ_{i}} - \sum_{i = 1}^{l} β_{i} ξ_{i} \end{array}

(2)

Additionally, the decision function is:

y = s i g n [\sum_{i = 1}^{l} α_{i} y_{i} K (x, x_{i}) + λ]

(3)

where K(x_i,x_j) is the kernel function which maps low-dimensional space to high-dimensional space. The commonly used kernel functions are Gaussian radial basis functions (RBF), polynomial functions, etc. There is only one parameter to be fitted in RBF function. Therefore, RBF is used as the kernel function of SVM:

K (x_{i}, x_{j}) = \exp (- \frac{| | x_{i} - x_{j} | |^{2}}{2 σ^{2}})

(4)

2.1.4. Probabilistic SVM

To output the probability of each class, Platt [25] proposed a sigmoid-fitting method to obtain probabilistic outputs for SVM instead of uncalibrated values.

p_{i} = \frac{1}{1 + e x p (A f_{i} + B)}

(5)

where f_i is the sample’s unthresholded output, y_i is the sample’s label, A and B are the parameters to be fitted by minimizing a cross-entropy function of p_i and t_i, which is shown in Equation (6). t_i is the target probabilities, which is defined as Equation (7).

\min - \sum_{i} t_{i} l o g (p_{i}) + (1 - t_{i}) l o g (1 - p_{i})

(6)

t_{i} = \frac{y_{i} + 1}{2}

(7)

2.2. GAPSVM Integrated with FTR Model

2.2.1. Fuzzy Three-Ratio Model

For conventional IEC three ratios for transformer fault diagnosis, ratios of C₂H₂/C₂H₄, CH₄/H₂, and C₂H₄/C₂H₆ are respectively encoded in a certain interval; the coding rule of the three-ratio method is shown in Table 3. The fault types can be recognized according to the corresponding codes in Table 4.

However, the coding boundaries are too clear and depend heavily on the experience; a very small increase in the gas ratio may sharply change the codes. In fact, the boundaries of each code should be fuzzy [29].

In the FTR model, IEC codes 0, 1, 2 are replaced by ZERO, ONE, TWO; each gas ratio can be represented by a fuzzy vector. [u_ZERO(r_i), u_ONE(r_i), u_THREE(r_i)] is used to replace the IEC codes to obtain the fuzzy boundaries, where r₁ = C₂H₂/C₂H₄, r₂ = CH₄/H₂ and r₃ = C₂H₄/C₂H₆. u_ZERO(r_i), u_ZERO(r_i), u_ZERO(r_i) are membership functions. In previous studies on the fuzzy three-ratio model, the triangular membership function is often used, because the triangular membership function has fewer parameter settings and the sine curve transition is relatively smooth. Therefore, the triangular membership function is also chosen in this paper. Replace the conventional logic “AND” by “min”, “OR” by “max”, then calculate the fuzzy fault diagnosis vector f(i) [30]. To make the sum of f(i) equal to one, the normalization is shown as Equation (8).

f^{'} (i) = \frac{f (i)}{\sum_{j = 1}^{5} f (j)}

(8)

According to Equation (8), if f′(i) is the maximum, it can be considered that the transformer has No.i fault. If the second maximum f′(j) is very close to f′(i), the transformer is considered to have both No.i and No.j fault.

2.2.2. Analysis of PSVM and the Combination Method of GAPSVM and FTR

The outputs of the GAPSVM are the probabilities of each fault type of a sample; p_i represents the probability of the No. i fault. Thus, there might be the following conditions:

If p_i > 0.5, the SVM has high confidence that the sample belongs to the corresponding fault type.
If p_i ≤ 0.5, the sample is near the decision boundary of the SVM which carries out the classification of the fault in this situation. SVM has low confidence to classify the samples, and misclassification usually occurs in this situation.
The sample is more likely to be divided into the class with higher probability.

Based on the above theory, if the maximum probability of each classification does not reach 0.5, it is considered that SVM is not sufficient for the sample and the FTR model will be chosen for fault diagnosis.

The flowchart of the GAPSVM integrated with FTR model is shown in Figure 2. The ODF is selected by GA combined with SVM, and DGA samples are divided into a training set and testing set. Afterwards, the GAPSVM gives the probabilities of each fault type. If the maximum probability exceeds 0.5, the diagnosis result will be given by GAPSVM; otherwise, the sample will be diagnosed by FTR. The total equation for the integrated model is given as Equation (9). P_i is the max GAPSVM output probability of a certain sample. When P_i is less than or equal to 0.5, the sample is considered to be near the decision boundary of SVM, so it is diagnosed by FTR; u(r_i) is the fuzzy vector of FTR. FTR calculates the fuzzy vectors according to max and min and finally diagnoses the fault type of the sample. When P_i is more than 0.5, the sample is considered not near the decision boundary of SVM, and the sample is diagnosed by GAPSVM. Where ξ_i is a slack variable and C is a penalty factor, and K(x_i,x_j) is the kernel function, L(ω, λ, ξ, α, β) is the Lagrange function.

{\begin{matrix} {\begin{matrix} f (1) = & \max {\min [u_{O N E} (r_{1}), u_{Z E R O} (r_{2}), u_{O N E} (r_{3})], \\ \min [u_{O N E} (r_{1}), u_{Z E R O} (r_{2}), u_{T W O} (r_{3})], \\ \min [u_{T W O} (r_{1}), u_{Z E R O} (r_{2}), u_{O N E} (r_{3})], \\ \min [u_{T W O} (r_{1}), u_{Z E R O} (r_{2}), u_{T W O} (r_{3})]} \\ f (2) = & \min [u_{O N E} (r_{1}), u_{Z E R O} (r_{2}), u_{T W O} (r_{3})] \\ f (3) = & \max {\min [u_{Z E R O} (r_{1}), u_{Z E R O} (r_{2}), u_{O N E} (r_{3})] \\ \min [u_{Z E R O} (r_{1}), u_{Z E R O} (r_{2}), u_{T W O} (r_{3})], \\ \min [u_{Z E R O} (r_{1}), u_{T W O} (r_{2}), u_{O N E} (r_{3})], \\ \min [u_{Z E R O} (r_{1}), u_{T W O} (r_{2}), u_{T W O} (r_{3})]} \\ f (4) = & \max {\min [u_{Z E R O} (r_{1}), u_{T W O} (r_{2}), u_{O N E} (r_{3})], \\ \min [u_{Z E R O} (r_{1}), u_{T W O} (r_{2}), u_{O N E} (r_{3})]} \\ f (5) = & \min [u_{Z E R O} (r_{1}), u_{Z E R O} (r_{2}), u_{Z E R O} (r_{3})] \\ f^{'} (i) = \frac{f (i)}{\sum_{j = 1}^{5} f (j)}, i = 0 ~ 5 \\ y = \arg \max (f^{'} (i)) \end{matrix}} p_{i} \leq 0.5 \\ {\begin{matrix} \min Φ (ω, ξ) = \frac{1}{2} {‖ ω ‖}^{2} + C \sum_{i = 1}^{l} ξ_{i} \\ s . t . {\begin{cases} y_{i} [ω^{T} φ (x_{i}) + λ] \geq 1 - ξ_{i} \\ ξ_{i} \geq 0, i = 1, 2, \dots, l \end{cases} \\ L (ω, λ, ξ, α, β) = Φ (ω, ξ) - \\ \sum_{i = 1}^{l} α_{i} {y_{i} [ω^{T} φ (x_{i}) + λ] - 1 + ξ_{i}} - \sum_{i = 1}^{l} β_{i} ξ_{i} \\ y = s i g n [\sum_{i = 1}^{l} α_{i} y_{i} K (x, x_{i}) + λ] \end{matrix}} p_{i} > 0.5 \end{matrix}

(9)

3. Result Analysis

3.1. Fault Sample Data Source and Data Preprocessing

IEC TC 10 is a standard benchmarking dataset for power transformer diagnosis. In total, 118 samples of IEC TC 10 fault data have been randomly divided into training and testing datasets in each computation. The training set includes 93 samples of fault data and the testing set contains 25 samples of fault data. The information of the 118 samples is shown in Table 5.

LE-D, HE-D, LM-T, H-T, and N-C, respectively, represent low energy discharge, high energy discharge, low and medium temperature fault, and normal condition. In order to eliminate the error caused by large data variation, the DGA data are normalized by the following equation:

x_{n i} = \frac{x_{i} - x_{i \min}}{x_{i \max} - x_{i \min}}

(10)

where x_i is the i-th sample to be normalized, x_i_max, x_i_min is the maximum and minimum of values before normalization. x_ni is the normalized value.

3.2. DGA Feature Optimization Result Analysis

After 50-time GA optimal feature selection, DGA features are screened according to CV accuracy. The best three sets of DGA features and their CV accuracy are shown in Table 6. The CV accuracy of No.1 DGA feature (89.83%) is higher than those of No.2 (88.98%) and No.3 (88.14%), so the No.1 DGA feature is considered as the ODF.

In the process of GA optimal feature selection, the fitness curve of GA is shown in Figure 3, and accuracy for all sample points and the optimal point found by GA is shown in Figure 4. Figure 4a shows the training accuracy of different c and σ. Figure 4b is the top view of Figure 4a, and the optimal point found by GA is marked in this figure.

In order to compare the accuracy between different DGA features, the input features of GAPSVM are divided into three categories: (1) the DGA full data, including H₂, CH₄, C₂H₂, C₂H₄, C₂H₆, CO, CO₂, and TH; (2) the IEC three-ratio feature including CH₄/H₂, C₂H₄/C₂H₆, and C₂H₂/C₂H₄; (3) the ODF including H₂/CH₄, H₂/C₂H₆, H₂/TH, CH₄/C₂H₂, CH₄/C₂H₆, C₂H₂/C₂H₄, C₂H₄/TH C₂H₆/TH, CO/CO₂. After 30 repeated genetic algorithm optimized SVM (GASVM) calculations, the accuracy of the training and testing sets of the three DGA features is shown in Table 7.

Both the training and testing accuracy of ODF are higher than those of the other two DGA features, which indicates that the ODF significantly improves the training and testing accuracy of fault diagnosis. Moreover, ODF did not significantly increase the time complexity.

3.3. Analysis of the Output of GAPSVM

3.3.1. Threshold Optimization of the Integrated Model

The authors of [24] proposed that when the output probability of PSVM is approximately 0.5, then the sample is near the decision boundary. For the research in this paper, the question of how to find the optimal threshold to determine whether to choose GAPSVM or FTR for diagnosis is of great importance. When the threshold selected is larger, most of the samples will be diagnosed by FTR; when the threshold selected is smaller, most of the samples will be diagnosed by GAPSVM. Hence, choosing the right threshold is essential to the accuracy of the model. Thus, this paper selects nine values from 0.3 to 0.7 in steps of 0.05 as the thresholds to be selected. The training set and the testing set are randomly selected for 10 repeated calculations; the threshold with the highest average fault diagnosis accuracy of the testing set in the 10 repeated calculations is the optimal threshold. The average diagnostic accuracy of each threshold is shown in Figure 5.

It can be seen from the figure that when the threshold is 0.5, the testing accuracy is the highest, because when the threshold is too small, a large number of samples near the decision boundary still choose GAPSVM for diagnosis, but GAPSVM has a lower diagnostic accuracy for samples at the decision boundary; when the threshold selected is too large, a large number of samples that are not near the decision boundary are diagnosed by FTR. For samples that are not near the decision boundary, the diagnostic accuracy of FTR is lower than that of GAPSVM. Moreover, the balance is reached when the threshold is selected as 0.5, so the optimal threshold is chosen as 0.5.

3.3.2. Analysis of Accuracy of GAPSVM

The training and testing accuracy of the maximum output probability which are larger and equal to 0.5 or smaller after 30 repeated GAPSVM calculations, with the ODF feature as input, are listed in Table 8.

It can be identified from Table 8 that the training and testing accuracy of the maximum probability >0.5 is much higher than that of the maximum probability ≤0.5, which reflects the deficiency of the GAPSVM when a sample’s maximum probability ≤0.5. As described in Section 2, the FTR model will be applied to diagnose the samples of max probability ≤0.5; the training set and testing set accuracy of the FTR model is 76.76% and 75.25%, respectively. The FTR model significantly improves the accuracy of the testing set and does not reduce the accuracy of the training set, although its accuracy is less than that of the SVM in the training samples with the max probability ≤0.5, because the average number of samples in the training set with a max probability ≤0.5 is only 1.7 in 30 calculations.

3.4. Comparisons with Other Diagnosis Methods

Back propagation neural network (BPNN), K-Nearest Neighbor (kNN), and GASVM are usually used in traditional power transformer fault diagnosis, when ODF is adopted as the input feature of these methods. The testing accuracy of the above methods and two published studies is listed in Table 8 and the accuracy of 10-time computation of different methods is shown in Figure 6.

As shown in Table 9, the testing accuracy of the GAPSVM integrated with the FTR model proposed in this paper reaches 86.80%, which is higher than that of kNN (64.00%), BPNN (81.60%), and GASVM (82.00%), the method in [18] (83.60%) and the method in [19] (84.40%). It can be also seen from Figure 6 that, in most cases, this model performs better than the traditional methods.

3.5. Model Evaluation

To verify the validity and generalization ability of the proposed model, 30 groups of DGA fault samples from the State Grid Co. of China are used as testing samples of the trained model in 3.4; the diagnostic results are shown in Table 10.

From the diagnostic results of the 30 DGA samples, the proposed model is able to correctly classify 26 samples, and the accuracy can reach 86.67%. Furthermore, confusion matrix, F-measure, precision, and recall are introduced to examine the performance of the proposed model. The confusion matrix illustrates the relationship between predicted fault types and true fault types. Precision indicates the percentage of the samples that are identified as positive categories which are indeed positive categories, while the recall indicates the percentage of the positive examples which are predicted correctly in the dataset. On the other hand, F-measure is a weighted harmonic average of precision and recall, which provides a single score that balances both the concerns of precision and recall in one number. Equations of each measure index are shown as follows.

p r e c i s o n = \frac{T P}{T P + F P}

(11)

r e c a l l = \frac{T P}{T P + F N}

(12)

F_{m e a s u r e} = 2 \times (\frac{p r e c i s i o n \times r e c a l l}{p r e c i s i o n + r e c a l l})

(13)

It can by identified from Table 11 that the model can effectively diagnose most of the fault types. Precision, recall, and F-measure are 0.875, 0.874, and 0.859, respectively. The above measure indexes and confusion matrix proved the validity and generalization of proposed model.

3.6. Model Validation Using Practical Dataset

In order to verify the performance of the method proposed in this article in practical applications and other datasets, the dataset of [18] is cited. The lack of some DGA data in the actual operation of the transformers is considered in this dataset, in which one or two gases are null. The information of the dataset is shown in Table 12. Firstly, the missing dissolved gas is replaced by the average value of the gas corresponding to the fault type. Because C₂H₆ in HE-D are all missing values, the C₂H₆ value of HE-D is replaced by the average value of C₂H₆ gas corresponding to the fault type in the IEC TC 10 database. Then, kNN, BPNN, GASVM, the method in [18], the method in [19], and the model proposed in this paper are used to diagnose the fault types of DGA samples. The fault types of DGA samples diagnosed by this method are shown in Table 12. Moreover, the diagnostic accuracy of the different methods is shown in Table 13.

It can be identified from the diagnostic results that the integrated model proposed in this paper is able to correctly diagnose 13 of 15 DGA fault samples. The fault diagnosis accuracy reached 86.67%, which is higher than kNN (66.67%), BPNN (73.33%), GASVM (73.33%), the method in [18] (80%), and the method in [19] (80%). The diagnostic results proved the superiority and robustness of the integrated model.

4. Conclusions

In this paper, GA combined with SVM is used to select the ODF, which is adopted as the input feature of the proposed fault diagnosis model. Aiming at eliminating the insufficiency of GASVM in some samples which are located near the decision boundary, an AI and expert experience combined model based on the GAPSVM integrated with FTR is proposed, which is the main innovation of this paper. The conclusions are as follows:

The ODF is selected from 36 DGA features by the GA and SVM, and the average testing accuracy of GASVM is 82.96%, which is higher than that of the IEC three-ratio feature (75.41%) and DGA full data (57.53%). The ODF is more suitable as the input feature of the power transformer fault diagnosis model.
The AI and expert experience combined model is established based on the IEC TC 10 dataset, and the average testing accuracy is 86.80% after 10-time computation, which is higher than kNN (64.00%), BPNN (81.60%), GASVM (82.00%), the method in [18] (83.60%), and the method in [19] (84.4%). Specifically, this model avoids misclassification efficiently when a sample is near the decision boundary of GAPSVM. Moreover, when 30 groups of DGA data from the State Grid Co. of China are diagnosed by the proposed model trained by 118 groups of IEC TC 10 DGA data, diagnostic accuracy is 86.67%. Additionally, the validity and generalization are verified by measure indexes of classification.
A total of 15 real cases with missing values are tested by six methods. GAPSVM integrated with the FTR model correctly diagnosed the fault types of the 13 cases, which proves that AI-based algorithms integrated with expert experience have great robustness.

Author Contributions

Conceptualization, Y.Z. and Y.W.; methodology, Y.Z.; software, Y.W.; validation, X.F.; formal analysis, X.F.; investigation, Z.S.; resources, Z.S.; data curation, Y.Z.; writing—original draft preparation, Y.W.; writing—review and editing, Y.Z. and Z.S.; visualization, W.Z.; supervision, R.Z. and J.H.; project administration, Y.Z.; funding acquisition, Y.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded part by the National Natural Science Foundation of China under Grant 61473272 and Grant 51867003, the Natural Science Foundation of Guangxi (2018JJB160056; 2018JJB160064; 2018JJA160176), the Guangxi Thousand Backbone Teachers Training Program, The Boshike Award Scheme For Young Innovative Talents, the Basic Ability Promotion Project for Yong Teachers in Universities of Guangxi (2019KY0046; 2019KY0022) and the Guangxi Bagui Young Scholars Special Funding in support of this work.

Conflicts of Interest

The authors declare no conflict of interest.

References

Liu, J.; Zhang, H.; Fan, X.; Zhang, Y.; Zhang, C. Aging evaluation for transformer oil-immersed cellulose insulation by using frequency dependent dielectric modulus technique. Cellulose 2020. [Google Scholar] [CrossRef]
Liu, J.F.; Fan, X.H.; Zhang, Y.Y.; Lai, B.H.; Jiao, J. Analysis of low-frequency polarization behavior for oil-paper insulation using logarithmic-derivative spectroscopy. High Volt. 2020. [Google Scholar] [CrossRef]
Liu, J.F.; Fan, X.H.; Zhang, Y.Y.; Zheng, H.B.; Jiao, J. Temperature correction to frequency dielectric modulus and activation energy prediction of immersed cellulose insulation. IEEE Trans. Dielectr. Electr. Insul. 2020, 27, 956–963. [Google Scholar] [CrossRef]
Li, E.; Wang, L.; Song, B. Fault Diagnosis of Power Transformers with Membership Degree. IEEE Access 2019, 7, 28791–28798. [Google Scholar] [CrossRef]
Mineral Oil-Impregnated Electrical Equipment in Service–Guide to the Interpretation of Dissolved and Free Gases Analysis. Standard IEC 60599, AMD. 2007, pp. 1–8. Available online: https://standards.iteh.ai/catalog/standards/clc/7f653c70-c287-4344-a82d-ea9729a1b4f6/en-60599-2016 (accessed on 15 October 2020).
Engineers, E.E.; Board, I.S. IEEE Guide for the Interpretation of Gases Generated in Oil-Immersed Transformers; IEEE: Piscataway, NJ, USA, 2009. [Google Scholar]
Shang, H.; Xu, J.; Zheng, Z.; Qi, B.; Zhang, L. A Novel Fault Diagnosis Method for Power Transformer Based on Dissolved Gas Analysis Using Hypersphere Multiclass Support Vector Machine and Improved D–S Evidence Theory. Energies 2019, 12, 4017. [Google Scholar] [CrossRef] [Green Version]
Zeng, B.; Guo, J.; Zhu, W.; Xiao, Z.; Yuan, F.; Huang, S. A Transformer Fault Diagnosis Model Based on Hybrid Grey Wolf Optimizer and LS-SVM. Energies 2019, 12, 4170. [Google Scholar] [CrossRef] [Green Version]
Huang, X.; Zhang, Y.; Liu, J.; Zheng, H.; Wang, K. A Novel Fault Diagnosis System on Polymer Insulation of Power Transformers Based on 3-stage GA–SA–SVM OFC Selection and ABC–SVM Classifier. Polymers 2018, 10, 1096. [Google Scholar] [CrossRef] [Green Version]
Liao, R.; Zheng, H.; Grzybowski, S.; Yang, L. Particle swarm optimization-least squares support vector regression based forecasting model on dissolved gases in oil-filled power transformers. Electr. Power Syst. Res. 2011, 81, 2074–2080. [Google Scholar] [CrossRef]
Benmahamed, Y.; Teguar, M.; Boubakeur, A. Application of SVM and KNN to Duval Pentagon 1 for Transformer Oil Diagnosis. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 3443–3451. [Google Scholar] [CrossRef]
Xiao, Y.; Pan, W.; Guo, X.; Bi, S.; Feng, D.; Lin, S. Fault Diagnosis of Traction Transformer Based on Bayesian Network. Energies 2020, 13, 4966. [Google Scholar] [CrossRef]
Ghoneim, S.S.M.; Taha, I.B.M.; Elkalashy, N.I. Integrated ANN-Based Proactive Fault Diagnostic Scheme for Power Transformers Using Dissolved Gas Analysis. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 1838–1845. [Google Scholar] [CrossRef]
Siva Sarma, S.S.; Kalyani, G. ANN approach for condition monitoring of power transformers using DGA. In Proceedings of the TENCON IEEE Region 10 Conference Proceedings, Chiang Mai, Thailand, 21–24 November 2004; IEEE: New York, NY, USA, 2004; pp. C444–C447. [Google Scholar]
Yin, Z.; Hou, J. Recent advances on SVM based fault diagnosis and process monitoring in complicated industrial processes. Neurocomputing 2016, 174, 643–650. [Google Scholar] [CrossRef]
Bacha, K.; Souahlia, S.; Gossa, M. Power transformer fault diagnosis based on dissolved gas analysis by support vector machine. Electr. Power Syst. Res. 2012, 83, 73–79. [Google Scholar] [CrossRef]
Shintemirov, A.; Tang, W.; Wu, Q.H. Power Transformer Fault Classification Based on Dissolved Gas Analysis by Implementing Bootstrap and Genetic Programming. IEEE Trans. Syst. Man Cybern. C 2009, 39, 69–79. [Google Scholar] [CrossRef]
Li, J.; Zhang, Q.; Wang, K.; Wang, J.; Zhou, T.; Zhang, Y. Optimal Dissolved Gas Ratios Selected by Genetic Algorithm for Power Transformer Fault Diagnosis Based on Support Vector Machine. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 1198–1206. [Google Scholar] [CrossRef]
Zhang, Y.; Li, X.; Zheng, H.; Yao, H.; Liu, J.; Zhang, C.; Peng, H.; Jiao, J. A Fault Diagnosis Model of Power Transformers Based on Dissolved Gas Analysis Features Selection and Improved Krill Herd Algorithm Optimized Support Vector Machine. IEEE Access 2019, 7, 102803–102811. [Google Scholar] [CrossRef]
Ashkezari, A.D.; Hui, M.; Saha, T.K.; Ekanayake, C. Application of fuzzy support vector machine for determining the health index of the insulation system of in-service power transformers. IEEE Trans. Dielect. Electr. Insul. 2013, 20, 965–973. [Google Scholar] [CrossRef] [Green Version]
Peimankar, A.; Weddell, S.J.; Jalal, T.; Lapthorn, A.C. Ensemble classifier selection using multi-objective PSO for fault diagnosis of power transformers. In Proceedings of the 2016 IEEE Congress on Evolutionary Computation (CEC), Vancouver, BC, Canada, 24–29 July 2016; IEEE: Vancouver, BC, Canada, 2016; pp. 3622–3629. [Google Scholar]
Peimankar, A.; Weddell, S.J.; Jalal, T.; Lapthorn, A.C. Evolutionary multi-objective fault diagnosis of power transformers. Swarm Evol. Comput. 2017, 36, 62–75. [Google Scholar] [CrossRef]
Yang, Z.; Tang, W.H.; Shintemirov, A.; Wu, Q.H. Association Rule Mining-Based Dissolved Gas Analysis for Fault Diagnosis of Power Transformers. IEEE Trans. Syst. Man Cybern. C 2009, 39, 597–610. [Google Scholar] [CrossRef]
Wu, T.F.; Lin, C.J.; Weng, R.C. Probability estimates for multi-class classification by pairwise coupling. J. Mach. Learn. Res. 2004, 5, 975–1005. [Google Scholar]
Platt, J. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Adv. Large Margin Classif. 1999, 10, 61–74. [Google Scholar]
Nemeth, B.; Laboncz, S.; Kiss, I. Condition Monitoring of Power Transformers using DGA and Fuzzy Logic. In Proceedings of the 2009 IEEE Electrical Insulation Conference, Montreal, QC, Canada, 31 May–3 June 2009; IEEE: New York, NY, USA, 2009; p. 373. [Google Scholar]
Fang, J.; Zheng, H.; Liu, J.; Zhao, J.; Zhang, Y.; Wang, K. A Transformer Fault Diagnosis Model Using an Optimal Hybrid Dissolved Gas Analysis Features Subset with Improved Social Group Optimization-Support Vector Machine Classifier. Energies 2018, 11, 1922. [Google Scholar] [CrossRef] [Green Version]
Frohlich, H.; Chapelle, O.; Scholkopf, B. Feature selection for support vector machines by means of genetic algorithm. In Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence, Sacramento, CA, USA, 3–5 November 2003; IEEE: Piscataway, NJ, USA, 2003; pp. 142–148. [Google Scholar]
Su, Q.; Mi, C.; Lai, L.L.; Austin, P. A fuzzy dissolved gas analysis method for the diagnosis of multiple incipient faults in a transformer. IEEE Trans. Power Syst. 2000, 15, 593–598. [Google Scholar] [CrossRef]
Ma, H.; Li, Z.; Ju, P.; Han, J.; Zhang, L. Diagnosis of power transformer faults on fuzzy three-ratio method. In Proceedings of the 2005 International Power Engineering Conference, Singapore, 29 November–2 December 2005; IEEE: Piscataway, NJ, USA, 2005. [Google Scholar]

Figure 1. The binary encoding of chromosomes.

Figure 2. Flowchart of GAPSVM integrated with FTR.

Figure 3. Fitness curves of genetic algorithm.

Figure 4. Testing accuracy for all c and σ points. (a) 3D visualization of all c and σ and its corresponding testing accuracy. (b) Top view of (a).

Figure 5. Testing accuracy of different thresholds.

Figure 6. Testing accuracy of different methods.

Table 1. DGA features to be selected.

No	DGA Feature	No	DGA Feature	No	DGA Feature
1	H₂	13	H₂/CO	25	C₂H₂/CO₂
2	CH₄	14	H₂/CO₂	26	C₂H₂/TH
3	C₂H₂	15	H₂/TH	27	C₂H₄/C₂H₆
4	C₂H₄	16	CH₄/C₂H₂	28	C₂H₄/CO
5	C₂H₆	17	CH₄/C₂H₄	29	C₂H₄/CO₂
6	CO	18	CH₄/C₂H₆	30	C₂H₄/TH
7	CO₂	19	CH₄/CO	31	C₂H₆/CO
8	TH	20	CH₄/CO₂	32	C₂H₆/CO₂
9	H₂/CH₄	21	CH₄/TH	33	C₂H₆/TH
10	H₂/C₂H₂	22	C₂H₂/C₂H₄	34	CO/CO₂
11	H₂/C₂H₄	23	C₂H₂/C₂H₆	35	CO/TH
12	H₂/C₂H₆	24	C₂H₂/CO	36	CO₂/TH

Table 2. Parameter settings of GA.

Parameters	Settings
Maximum iteration	200
Population size	100
Crossover probability	0.9
Mutation probability	0.01
Range of C	[0, 200]
Range of σ	[0, 100]

Table 3. Coding rule of three-ratio method.

Ranges of Gas Ratios	Codes of Different Gas Ratios
Ranges of Gas Ratios	C₂H₂/C₂H₄	CH₄/H₂	C₂H₄/C₂H₆
<0.1	0	1	0
0.1–1	1	0	0
1–3	1	2	1
>3	2	2	2

Table 4. Fault types of DGA codes.

No	Fault Type	Code of the Ratios
No	Fault Type	C₂H₂/C₂H₄	CH₄/H₂	C₂H₄/C₂H₆
1	Discharge of low energy density	1 or 2	0	1 or 2
2	Discharge of high energy density	1	0	2
3	Thermal fault of low temperature < 300 °C	0	0 or 2	1 or 2
4	Thermal fault of high temperature ≥ 300 °C	0	2	1 or 2
5	No fault	0	0	0

Table 5. Transformer fault sample information.

Fault Type	LE-D	HE-D	LM-T	H-T	N-C
Sample quantity	23	45	10	14	26

Table 6. Best three sets of DGA feature.

DGA Feature	1	2	3
DGA ratios	H₂/CH₄	H₂/C₂H₄	H₂/C₂H₆
	H₂/C₂H₆	H₂/C₂H₆	CH₄/C₂H₂
	H₂/TH	H₂/TH	CH₄/C₂H₆
	CH₄/C₂H₂	CH₄/CO	C₂H₂/C₂H₄
	CH₄/C₂H₆	CH₄/CO₂	C₂H₂/CO
	C₂H₂/C₂H₄	C₂H₂/C₂H₄	C₂H₄/TH
	C₂H₄/TH	C₂H₂/C₂H₆	C₂H₆/TH
	C₂H₆/TH	CO/CO₂	CO/CO₂
	CO/CO₂	C₂H₄/TH	--
	--	C₂H₆/TH	--
CV accuracy	89.83%	88.98%	88.14%

Table 7. Accuracy and computing time of each DGA feature.

Features	Average Accuracy (%)		Computing Time (s)
Features	Training	Testing	Training	Testing
DGA full data	89.71	57.53	37.1847	2.19 × 10⁻⁴
Three-ratio feature	91.25	75.41	36.6727	1.44 × 10⁻⁴
ODF	94.84	82.96	37.7412	2.65 × 10⁻⁴

Table 8. Accuracy of max probability >0.5 and max probability ≤0.5 samples.

Max Probability		Training Accuracy	Testing Accuracy
>0.5	Max	100%	88.64%
	Min	97.08%	82.96%
	Mean	97.53%	86.85%
≤0.5	Max	92.00%	85.71%
	Min	85.00%	50.00%
	Mean	89.83%	56.72%

Table 9. Average testing accuracy of different methods.

Diagnosis Method	Testing Accuracy (%)
kNN	64.00
BPNN	81.60
GASVM	82.00
Method in [18]	83.60
Method in [19]	84.40
This Paper	86.80

Table 10. Diagnostic results of 30 groups of DGA fault data.

Fault Type	LE-D	HE-D	LM-T	H-T	N-C
True samples	6	6	5	5	7
Predicted samples	6	6	7	6	4

Table 11. Confusion matrix of the diagnostic result.

		Predicted by the Proposed Model
		LE-D	HE-D	LM-T	H-T	N-C
Actual	LE-D	6	0	0	0	0
	HE-D	0	7	0	0	0
	LM-T	0	0	4	1	0
	H-T	0	0	0	5	0
	N-C	0	1	2	0	4

Table 12. DGA data information of [18].

Actual Fault	H₂	CH₄	C₂H₂	C₂H₄	C₂H₆	CO	CO₂	TH	Diagnostic Result
LE-D	78	20	28	13	11	/	784	72	LE-D
	95	10	39	11	/	122	467	60	LE-D
	8	/	101	43	/	192	4067	144	LE-D
HE-D	7020	1850	4410	2960	/	2140	1000	9220	HE-D
	120	31	94	66	/	48	271	191	HE-D
	5100	1430	1010	1140	/	117	197	3580	HE-D
LM-T	48	610	/	10	29	1900	970	649	HE-D
	12	18	/	4	4	559	1710	26	LM-T
	66	60	/	7	2	76	90	69	LM-T
H-T	8800	64,064	/	95,650	72,128	290	90,300	231,842	H-T
	1100	1600	26	2010	221	/	1430	3857	H-T
	1860	4980	1600	10,700	/	158	1300	17,280	LM-T
N-C	134	134	/	45	157	1008	10,528	336	H-T
	/	225	3	110	225	785	4500	563	N-C
	200	3	/	200	50	1000	20,000	253	N-C

Table 13. Fault types diagnosed by proposed model and other algorithms.

Algorithms	kNN	BPNN	GASVM	Method in [18]	Method in [19]	This Paper
Accuracy	66.67%	73.33%	73.33%	80.00%	80.00%	86.67%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, Y.; Wang, Y.; Fan, X.; Zhang, W.; Zhuo, R.; Hao, J.; Shi, Z. An Integrated Model for Transformer Fault Diagnosis to Improve Sample Classification near Decision Boundary of Support Vector Machine. Energies 2020, 13, 6678. https://doi.org/10.3390/en13246678

AMA Style

Zhang Y, Wang Y, Fan X, Zhang W, Zhuo R, Hao J, Shi Z. An Integrated Model for Transformer Fault Diagnosis to Improve Sample Classification near Decision Boundary of Support Vector Machine. Energies. 2020; 13(24):6678. https://doi.org/10.3390/en13246678

Chicago/Turabian Style

Zhang, Yiyi, Yuxuan Wang, Xianhao Fan, Wei Zhang, Ran Zhuo, Jian Hao, and Zhen Shi. 2020. "An Integrated Model for Transformer Fault Diagnosis to Improve Sample Classification near Decision Boundary of Support Vector Machine" Energies 13, no. 24: 6678. https://doi.org/10.3390/en13246678

APA Style

Zhang, Y., Wang, Y., Fan, X., Zhang, W., Zhuo, R., Hao, J., & Shi, Z. (2020). An Integrated Model for Transformer Fault Diagnosis to Improve Sample Classification near Decision Boundary of Support Vector Machine. Energies, 13(24), 6678. https://doi.org/10.3390/en13246678

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Integrated Model for Transformer Fault Diagnosis to Improve Sample Classification near Decision Boundary of Support Vector Machine

Abstract

1. Introduction

2. A Fault Diagnosis Approach Based on GAPSVM Integrated with Expert Experience

2.1. Optimization of Transformer DGA Features Based on GA and SVM

2.1.1. Gas Features Dissolved in Oil

2.1.2. DGA Feature Selection Based on GA Combined with SVM

2.1.3. Nonlinear Support Vector Machine

2.1.4. Probabilistic SVM

2.2. GAPSVM Integrated with FTR Model

2.2.1. Fuzzy Three-Ratio Model

2.2.2. Analysis of PSVM and the Combination Method of GAPSVM and FTR

3. Result Analysis

3.1. Fault Sample Data Source and Data Preprocessing

3.2. DGA Feature Optimization Result Analysis

3.3. Analysis of the Output of GAPSVM

3.3.1. Threshold Optimization of the Integrated Model

3.3.2. Analysis of Accuracy of GAPSVM

3.4. Comparisons with Other Diagnosis Methods

3.5. Model Evaluation

3.6. Model Validation Using Practical Dataset

4. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI