An Unsupervised Mutual Information Feature Selection Method Based on SVM for Main Transformer Condition Diagnosis in Nuclear Power Plants

Yu, Wenmin; Yu, Ren; Tao, Jun

doi:10.3390/su14052700

Open AccessArticle

An Unsupervised Mutual Information Feature Selection Method Based on SVM for Main Transformer Condition Diagnosis in Nuclear Power Plants

by

Wenmin Yu

¹

,

Ren Yu

^1,* and

Jun Tao

²

¹

School of Nuclear Science and Techniques, Naval University of Engineering, Wuhan 430034, China

²

China Nuclear Power Operation Management Co., Ltd., Jiaxing 314300, China

^*

Author to whom correspondence should be addressed.

Sustainability 2022, 14(5), 2700; https://doi.org/10.3390/su14052700

Submission received: 25 January 2022 / Revised: 11 February 2022 / Accepted: 16 February 2022 / Published: 25 February 2022

(This article belongs to the Topic Artificial Intelligence and Sustainable Energy Systems)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Dissolved gas in oil (DGA) is a common means of monitoring the condition of an oil-immersed transformer. The concentration of dissolved gas and the ratio of different gases are important indexes to judge the condition of power transformers. Monitoring devices for dissolved gas in oil are widely installed in main transformers, but there are few recorded fault data of main transformers. The special operation and maintenance modes of main transformers leads to the fault modes particularity of main transformers. In order to solve the problem of insufficient samples and the feature uncertainty, this paper puts forward an unsupervised mutual information method to select the feature verified by the optimized support vector machine (SVM) model of particle swarm optimization (PSO) method and tries to find the feature sequence with better performance. The methos is validated by data from nuclear power transformers.

Keywords:

main transformer; condition monitoring; unsupervised mutual information; feature selection; DGA

1. Introduction

Power transformers that work under harsh environments would experience thermal decomposition of oil and cellulose insulation materials, such as arcing, corona discharge, low energy sparks, severe overloading, overheating of insulation systems and pump motor failures. These conditions alone or in combination can produce combustible and noncombustible gases [1] Detection of anomalies requires an assessment of the amount of gas produced. Gas in oil-immersed transformers can be used to identify fault types, including thermal and electrical interference. Gases obtained from chromatographic analysis of insulating oils may contain dissolved carbon monoxide (

C O

), carbon dioxide (

C O_{2}

), nitrogen (

N_{2}

), hydrogen (

H_{2}

), methane (

C H_{4}

), acetylene (

C_{2} H_{2}

), ethylene (

C_{2} H_{4}

), and ethane (

C_{2} H_{6}

). The composition, formation rate and specific content ratio of dissolved gas can be used to indicate transformer condition.

The composition and content of dissolved gases in oil of transformer insulation can reflect the operation condition of transformer to a great extent thus dissolved gas analysis (DGA) has become an effective method for fault diagnosis of oil-immersed transformers [2].

Organizations such as the Institute of Electrical and Electronics Engineers (IEEE) and the International Electrotechnical Commission (IEC) recommend a variety of diagnostic techniques [3], depending on the type of transformer and operating conditions. Some of the most commonly used techniques include Doernenburg ratio, Rogers ratio, Duval triangle model, etc. These classical diagnostic methods mostly take the ratio of different gases as the characteristic input and then judge the actual operating condition of the transformer by the threshold value formed by experience or statistical methods. Fuzzy network, support vector machine, artificial neural network, and other commonly used artificial intelligence methods are also generally introduced into the field of power transformer fault diagnosis [4,5,6,7]. However, in the studies of different scholars, the features used as the basis of intelligent diagnosis are often different.

In previous studies, in addition to problems in diagnostic methods, there are the following phenomena in monitoring data: less test data, less available data sets and unbalanced data type distribution, which bring great problems to algorithm verification [8].

Main transformers are important equipment for power generation of nuclear facilities. They are in a high-load long-term condition and are more prone to failure caused by aging [9]. Meanwhile, due to the particularity of nuclear power refueling overhaul and the conservative culture of nuclear power [10], the maintenance strategy of nuclear equipment is more rigorous and conservative, and the failure modes of main transformers may be slightly different. The data of the main transformers are classified separately in the IEC database [2], which shows transformer performance difference in nuclear industry.

Due to the particularity of nuclear power transformers, there are less marked data and more constraints on the monitoring data that can be used for research. The features are important inputs of the diagnostic algorithm. High-dimensional features bring high computational cost and the risk of “over-fitting”. Dimensionality reduction or selection is an important research direction.

In this paper, an unsupervised mutual information feature algorithm is proposed for feature selection of different features proposed in the current classical algorithm and intelligent algorithm, as a pattern recognition method, SVM can construct the optimal classification hyperplane under the condition of small sample learning and distinguish transformer conditions according to the input features. The main transformer condition diagnosis model based on support vector machine is adopted for diagnosis in this paper, and the case data of main transformer is verified.

2. Framework of the Feature Selection Method Based on the SVM Model for Main Transformers

The research framework for the feature selection method based on the architecture of the SVM model for the main transformers is shown in Figure 1.

The gas concentration values measured in the continuous operation process of nuclear power transformer are obtained, and there is almost no-fault data.

The features used in various power transformer condition diagnosis methods based on dissolved gas in oil are extensively studied. On this basis, the initial feature set is formed.

The unsupervised mutual information feature extraction algorithm is adopted to extract features, and the set of sequence features is obtained according to the weight coefficient from high to low.

In the feature set, different number of feature sets are selected sequentially and verified by optimized SVM model for transformer fault diagnosis.

In order to reduce the contingency of the experiment, the 5-fold verification method is used to process the training samples and test samples to verify the validity of the selection feature in the diagnosis of the nuclear power transformer condition diagnosis.

Based on the accuracy of diagnosis, the feasibility of different feature extraction algorithms in the condition diagnosis of main transformers is analyzed.

3. Condition Diagnosis Model for Main Transformer

Condition diagnosis model is important to verify the feature selection algorithm and to determine the accuracy and rapidity of transformer condition diagnosis. SVM is a machine learning method based on statistical learning theory, compared with other algorithms, it can well solve practical problems such as small sample, nonlinear [11], the PSO algorithm can converge fast in the parameter optimization. The SVM optimized with PSO in the field of power transformer fault diagnosis has further application [12].

3.1. Support Vector Machine

State diagnosis of main transformers, As the case of a typical nonlinear classification problem, the overall plan of SVM is the first use of a nonlinear transform the input space data is mapped to a high-dimensional feature vector space, and then in the feature space of the optimal separating hyperplane is constructed, linear classification, after the last map back to the original space Became a nonlinear classification of input space [13].

SVM settings

At present, the commonly used kernel functions are mainly polynomial kernel function, radial basis (RBF) kernel function, hyperbolic tangent (sigmoid) kernel function, and so on. This paper mainly uses the RBF kernel function to the apply to SVM model.

3.2. PSO for Optimal Parameters

PSO is a kind of evolutionary computation, the basic idea of which is to find the optimal solution through the cooperation and information sharing between individuals in the group. It mimics a bird in a flock by designing a massless particle with just two properties: speed, which represents how fast it is moving, and position, which represents the direction it is moving. Each particle separately searches for the optimal solution in the search space, and records it as the current individual extreme value, and shares the individual extreme value with other particles in the whole particle swarm and finds the optimal individual extreme value as the current global optimal solution of the whole particle swarm. All particles in a swarm adjust their speed and position based on the current individual extremum they find, and the current global optimal solution shared by the whole swarm [14].

PSO-SVM Parameter Settings

C_{1}

: the initial value is 1.5, local search capability of PSO parameters

C_{2}

: 1.7 initially, PSO parameter global search capability

Maxgen: The initial value is 200, the maximum number of evolutions

Sizepop: the initial value is 20 and the maximum size of the population

K: initial 0.6 (k belongs to [0.1, 1.0]), the relationship between the speed and x (

V = K X

)

WV: The initial value is 1 (wV best belongs to [0.8, 1.2]), and the rate updates the elasti coefficient before the speed in the formula

WP: The initial value is 1, the elastic coefficient in front of the velocity in the population renewal formula

V: Initial 5, SVM Cross Validation parameter

Popcmax: the maximum value of the change in the SVM parameter C, initially 100.

Popcmin: the initial value is 0.1, the minimum change of SVM parameter C.

Popgmax: the initial value is 1000, the maximum value of the change of THE SVM parameter G.

Popgmin: the initial value is 0.01, the minimum change value of the SVM parameter C.

4. Feature Selection Algorithms for Main Transformer Condition

This chapter introduces the unsupervised mutual information filtering feature sorting method used in feature selection. In feature selection, the relevance of each feature is calculated first, the importance of the feature is evaluated by the forward sequential search, and finally an ordered feature sequence is output.

4.1. Stepwise Feature Selection Process

The process of stepwise feature selection is to select a feature from the unselected feature set each time and add the feature set S. In accordance with the selection order, the feature set outputs an ordered feature sequence.

When initializing, the feature set is empty. The unselected feature set is the complete set of all known features.

After each step selection, the feature set increases the feature set selected in this step, while the feature set not selected reduces the feature set selected in this step. Until the unselected feature set is empty.

4.2. Selection Principle

The principle of ‘’minimum redundancy—maximum correlation’’ which is similar to the famous supervised feature selection method is adopted [15], and the selection of the mth feature is based on:

l_{m} = a r g m a x_{f_{i} \in U_{m}} {R e l (f_{i}) - \frac{1}{m - 1} \sum_{f_{t} \in S_{m - 1}} R e d (f_{i}, f_{t})}

(1)

where

U_{m}

represents the set of unselected features in the current step

f_{i}

represents a feature in the unselected feature set in the current step;

R e l (f_{i})

represents Relevance of feature

f_{i}

, which is the average mutual information between feature

f_{i}

and any other one in the whole feature set is defined as

R e l (f_{i}) .

R e l (f_{i})

can be calculated with Formula (2).

R e l (f_{i}) = \frac{1}{n} \sum_{t = 1}^{n} I (f_{i}; f_{t}) = \frac{1}{n} (H (f_{i}) + \sum_{1 \leq t \leq n, t \neq i} I (f_{i}; f_{t}))

(2)

S_{m - 1}

is the selected feature set in the current step;

R e d (f_{i}, f_{t})

is the redundancy of feature

f_{i}

relative to selected feature

f_{t}

.

R e d {(f_{i}, f)}_{t}

can be calculated with Formula (3).

R e d (f_{i}, f_{t}) = R e l (f_{t}) - Rel (f_{t} | f_{i})

(3)

Rel (f_{t} | f_{i})

is conditional relevance of

f_{t}

with

f_{i}

,

Rel (f_{t} | f_{i})

can be calculated with Formula (4).

Rel (f_{t} | f_{i}) = \frac{H (f_{t} | f_{i})}{H (f_{t})} \times R e l (f_{t})

(4)

4.3. Relationship with Supervised Algorithms

When the data type is supervised, the labels of the class can represent the information of the whole feature set.

Then relevance of feature

f_{i} can be defined as

R e l (f_{i}) = I (f_{i}, c)

(5)

where c in Formula (5) is the class label

[16]

.

Redundancy between feature

f_{i}

and the selected feature

f_{t}

is defined as

R e d (f_{i}, f_{t}) = I (f_{i}, f_{t})

(6)

According to the principle of mathematics [17], relevance in an unsupervised algorithm is the lower bound of relevance in a supervised algorithm, and redundancy in an unsupervised algorithm is proportional to the redundancy in a supervised algorithm. When the initial feature set is approximately equal to the labels of the class, the sequence features obtained by the unsupervised algorithm are highly correlated with the sequence features obtained by the supervised algorithm.

5. Experiment and Validation

5.1. Experiment Description

The internal fault mode of power transformer is mainly mechanical fault, thermal fault and electrical fault, the latter two types of faults is the major issues, and mechanical fault is often shown in the form of thermal fault or electrical fault [18]. General power transformers are often subdivided into fault modes according to the degree of heating or arcing.

Due to safety culture of nuclear power plants, maintenance strategy for the main transformers tends to be conservative and strict; The failure of nuclear power transformers is rare to happen, and the failure data that can be accessed to publicly is very few. Available fault data cannot cover all the modes. Therefore, the condition of the main transformers are divided into the following three types in this paper, and only two summative failure modes are reserved and the corresponding as illustrated in Table 1.

The fault data used in the experiment in this paper are reactor-related transformer data obtained from IEC TC database, and the feature selection data and normal data are the monitoring values of a nuclear power main transformer under normal operation. The data can be obtained in the Supplementary Material. A nuclear power plant generator, 24 kV voltage, is stepped up to 500 kV and connected to the 500 KV power grid through the main transformer. The main transformer is a three single-phase transformer, each phase capacity 410 MVA. The neutral points on the high voltage side are connected and directly grounded. Oil is regularly sampled and analyzed once every 3 months manually. Sampling intervals can sometimes be uneven, depending on special focus judged by sampling staff or adjustment by work schedules.

5.1.1. Sample Data

Gas concentration of DGA method is analyzed in one part per million (PPM). To facilitate the presentation and analysis of the characteristics in the figure, logarithmic processing is performed for each monitoring value in Figure 2 and Figure 3. The box diagram of the monitored data value in the sample data is shown in Figure 2. Sample type distribution and monitoring value distribution of each basic feature are presented in Figure 3.

5.1.2. Basic Features by DGA Condition Monitoring

Typical gases measured in the DGA method of the main transformers include

H_{2}

,

C H_{4}

,

C_{2} H_{2}

,

C_{2} H_{4}

,

C_{2} H_{6}

,

C O

,

C O_{2}

as illustrated in Figure 2 and Figure 3.

T H = C H_{4} + C_{2} H_{2} + C_{2} H_{4} + C_{2} H_{6}

Other features commonly used in transformer condition diagnosis methods are shown in Table 2.

The symbols in the Duval triangle method shown in Table 1 are denoted as

% C_{2} H_{2} = 100 x / (x + y + z)

;

% C_{2} H_{4} = 100 y / (x + y + z)

;

% C H_{4} = 100 z / (x + y + z)

; with

x = (C_{2} H_{2})

;

y = (C_{2} H_{4})

;

z = (C H_{4})

in PPM.

φ (H_{2})

,

φ (C H_{4})

,

φ (C_{2} H_{6})

,

φ (C_{2} H_{4})

and

φ (C_{2} H_{2})

in Table 1 represent the contents of five characteristic gases, respectively, and Total Combustion Gases (TCG) as in: TCG =

H_{2} + C H_{4} + + C_{2} H_{4} + C_{2} H_{6} + C_{2} H_{2}

;

φ (H_{2}) = H_{2} / T C G

;

φ (C H_{4}) = C H_{4} / T C G

;

φ (C_{2} H_{6}) = C_{2} H_{6} / T C G

;

φ (C_{2} H_{4}) = C_{2} H_{4} / T C G

;

φ (C_{2} H_{2}) = C_{2} H_{2} / T C G

;

And

% H_{2} = 100 * H_{2} / (H_{2} + C_{2} H_{6} + C O + C O_{2})

.

As shown in Figure 2 and Figure 3, the class of the sample is not balanced and several basic features are similar in distribution. Unsupervised feature extraction is adopted to obtain the feature set with maximum correlation and minimum redundancy.

5.2. Extracted Feature Sequence by Unsupervised MI V.S. Supervised MI

Table 3 shows the comparison of feature sequences extracted by the unsupervised mutual information method and supervised mutual information method and their corresponding weights. As obtained from Table 3.

Table 3 Extracted feature sequence by unsupervised MI V.S. supervised MI the mutual information of unsupervised feature extraction method selects the same first feature from cases of nuclear power transformer DGA with that of supervised method, while the weight value, generated by supervised feature extraction algorithm is greater than the corresponding values in the unsupervised algorithm, which means supervised method has more dynamic to choose the first feature; In other steps, selected feature is not the same, but the unsupervised algorithm has a strong dynamic at each step.

5.3. Results of the Diagnosis by Optimized SVM

5.3.1. Diagnostic Precision with Supervised Mutual Information Feature Selection Method versus Unsupervised Approach

According to the framework described in Section 2, a PSO-optimized SVM model is applied to classify cases to reflect the fitness of the selected features.

The feature sequences selected by the supervised mutual information feature selection algorithm and the unsupervised mutual information feature selection algorithm are applied, respectively. Different numbers of features are selected from feature sequences obtained by both methods, and the diagnostic accuracy of the both method is shown in Figure 4, in which the red point is where the best fitness is obtained.

As can be seen from Figure 4, features selected from both unsupervised and supervised feature selection methods have good performance as input to diagnostic accuracy in cases, and both of the models achieve greatly increased diagnostic accuracy in the second feature.

5.3.2. Diagnostic Precision by Features of the Unsupervised Approach with Best Fitness vs. Other Classical Feature Set

Feature set with the highest diagnostic accuracy obtained by unsupervised mutual information feature selection method is used as input to optimized SVM diagnosis model for main transformer cases, other typical feature sets are used in contrast as shown in Table 4.

As shown in Table 4, the feature set obtained by the unsupervised mutual information feature selection algorithm, is used as the input of the optimized SVM diagnosis model and performs better than other inputs of the feature set in the case of diagnosis of the main transformer condition diagnosis. Therefore, the algorithm has high applicability.

6. Conclusions and Analysis

Fault data of main transformer lacks. In addition, the fault mode of the main transformer is different from that of other power transformers. This paper proposes an unsupervised mutual information feature selection method to calculate DGA monitoring data of main transformer and output feature selection sequence. Compared with the supervised mutual information feature selection algorithm, the unsupervised mutual information feature selection algorithm is highly correlated with the sequence features output by the supervised feature selection algorithm in feature selection. In the samples, the training samples and test samples were designed by five-fold method based on the appropriate feature set obtained by the unsupervised mutual information feature selection algorithm. The PSO optimized support vector machine model was used to verify the main transformer fault diagnostic, and the diagnosis accuracy was high. This method is suitable for feature extraction in main transformer fault diagnosis. However, the feature extraction method based on unsupervised mutual information is essentially an embedded feature extraction method with some significant advantages and disadvantages at the same time. The redundancy between features in the selected feature set is minimized, and its limitations depend on the evaluation of candidate solutions by the classification algorithm, which is computationally more expensive. Therefore, the offline data set can be used for training and verification in practical application, and the obtained feature set can be used to judge the condition of nuclear power transformers online.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/su14052700/s1, the experimental data of DGA samples used in Section 5.1 of this paper.

Author Contributions

Conceptualization, R.Y.; data collection, J.T.; methodology and the other, W.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable for studies not involving humans or animals.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data can be obtained in the Supplementary Materials of the paper.

Conflicts of Interest

Jun Tao is an employer from CNNC Nuclear Power Operation Management Co., Ltd. The other authors declare that they have no conflict of interest.

References

Bustamante, S.; Manana, M.; Arroyo, A.; Castro, P.; Laso, A.; Martinez, R. Dissolved Gas Analysis Equipment for Online Monitoring of Transformer Oil: A Review. Sensors 2019, 19, 4057–4078. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Duval, M. A review of faults detectable by gas-in-oil analysis in transformers. IEEE Electr. Insul. Mag. 2002, 18, 8–17. [Google Scholar] [CrossRef] [Green Version]
Duval, M.; Depabla, A. Interpretation of gas-in-oil analysis using new IEC publication 60599 and IEC TC 10 data-bases. Electr. Insul. Mag. IEEE 2002, 17, 31–41. [Google Scholar] [CrossRef]
Liao, R.; Zheng, H.; Grzybowski, S.; Yang, L.; Zhang, Y.; Liao, Y. An integrated decision-making model for condition assessment of power transformers using fuzzy approach and evidential reasoning. IEEE Trans. Power Deliv. 2011, 26, 1111–1118. [Google Scholar] [CrossRef]
Kari, T.; Gao, W.; Zhao, D.; Abiderexiti, K.; Mo, W.; Wang, Y.; Luan, L. Hybrid feature selection approach for power transformer fault diagnosis based on support vector machine and genetic algorithm. IET Gener. Transm. Distrib. 2018, 12, 5672–5680. [Google Scholar] [CrossRef]
Miranda, V.; Castro, A.R.G.; Lima, S. Diagnosing Faults in Power Transformers with Autoassociative Neural Networks and Mean Shift. IEEE Trans. Power Deliv. 2012, 27, 1350–1357. [Google Scholar] [CrossRef] [Green Version]
Amora, M.; Braga, A.; Lisboa, L.; Pontes, R.; Almeida, O.; Barbosa, F. Improved DGA method based on rules extracted from high-dimension input space. Electron. Lett. 2012, 48, 1048–1049. [Google Scholar] [CrossRef]
Mirowski, P.; LeCun, Y. Statistical Machine Learning and Dissolved Gas Analysis: A Review. IEEE Trans. Power Deliv. 2012, 27, 1791–1799. [Google Scholar] [CrossRef]
Alexander, G.W.; Corbin, S.L.; Mcnutt, W.J. Influence of design and operating practices on excitation of generator step-up transformers. IEEE Trans. Power Appar. Syst. 1966, 8, 901–909. [Google Scholar] [CrossRef]
Agarwal, V.; Lybeck, N.J.; Pham, B.T. Diagnostic and Prognostic Models for Generator Step-Up Transformers; Idaho National Lab. (INL): Idaho Falls, ID, USA, 2014. [Google Scholar]
Guo, H.; Wang, W. An active learning-based SVM multi-class classification model. Pattern Recognit. 2015, 48, 1577–1597. [Google Scholar] [CrossRef]
Illias, H.A.; Zhao Liang, W. Identification of transformer fault based on dissolved gas analysis using hybrid support vector machine-modified evolutionary particle swarm optimisation. PLoS ONE 2018, 13, e0191366. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Baesens, B.; Viaene, S.; Van Gestel, T.; Suykens, J.; Dedene, G.; De Moor, B.; Vanthienen, J. Least Squares Support Vector Machine Classifiers: An Empirical Evaluation; Access & Download Statistics; Departement Toegepaste Economische Wetenschappen, K.U. Leuven: Leuven, Belgium, 2000. [Google Scholar]
Akhtar, M.S.; Gupta, D.; Ekbal, A.; Bhattacharyya, P. Feature selection and ensemble construction: A two-step method for aspect based sentiment analysis. Knowl.-Based Syst. 2017, 125, 116–135. [Google Scholar] [CrossRef]
Peng, H.; Long, F.; Ding, C. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 1226–1238. [Google Scholar] [CrossRef] [PubMed]
Pohjalainen, J.; Räsänen, O.; Kadioglu, S. Feature selection methods and their combinations in high-dimensional classification of speaker likability, intelligibility and personality traits. Comput. Speech Lang. 2015, 29, 145–171. [Google Scholar] [CrossRef]
Xu, J.; Zhou, Y.; Chen, L.; Xu, B. An unsupervised feature selection approach based on mutual information. J. Comput. Res. Dev. 2012, 49, 372–382. [Google Scholar]
Christina, A.; Salam, M.A.; Rahman, Q.M.; Wen, F.; Ang, S.P.; Voon, W. Causes of transformer failures and diagnostic methods–A review. Renew. Sustain. Energy Rev. 2018, 82 Pt 1, 1442–1456. [Google Scholar]
Singh, J.; Sood, Y.R.; Verma, P. Dissolved Gas Analysis for Transformers. Electr. India 2007, 47, 72–74, 76, 78, 82. [Google Scholar]
Ghoneim, S.S.; Taha, I.B. A new approach of DGA interpretation technique for transformer fault diagnosis. Int. J. Electr. Power Energy Syst. 2016, 81, 265–274. [Google Scholar] [CrossRef]
Li, X.; Wu, H.; Wu, D. DGA Interpretation Scheme Derived from Case Study. IEEE Trans. Power Deliv. 2010, 26, 1292–1293. [Google Scholar] [CrossRef]

Figure 1. Framework of Feature Selection method based on SVM model for main transformers.

Figure 2. Box diagram of the monitored data value in the sample data.

Figure 3. Sample type distribution and monitoring value distribution of each basic feature.

Figure 4. Diagnostic precision with supervised mutual information feature selection vs. unsupervised approach.

Table 1. Code of power transformer operation condition.

NO.	FAILURE MODE	CONDITION TYPE	CODE
1	Partial discharge	Electrical fault	1
2	Low-energy discharge
3	High-energy discharge
4	Thermal fault < 300 °C	Thermal fault	2
5	Thermal fault 300 to 700 °C
6	Thermal fault > 700 °C
7		Normal	3

Table 2. Common features of transformer DGA condition diagnosis methods.

NO.	FEATURE	DIAGNOSIS METHOD
1	$% C_{2} H_{2}$	The Duval Triangle method [2]
2	$% C_{2} H_{4}$
3	$% C H_{4}$
4	$C H_{4} / H_{2}$	The three basic gas ratios of IEC 599/IEC 60599 [19]
5	$C_{2} H_{4} / C_{2} H_{6}$
6	$C_{2} H_{2} / C_{2} H_{4}$
7	$C_{2} H_{2} / C H_{4}$	Doernenberg Ratios [19]
8	$C_{2} H_{6} / C_{2} H_{2}$	Doernenberg Ratios [19]
9	$C_{2} H_{2} / H_{2}$	Two new Gas ratios in IEC 60599
10	$C O_{2} / C O$	Two new Gas ratios in IEC 60599
11	$φ (H_{2})$	Other approach [20]
12	$φ (C H_{4})$
13	$φ (C_{2} H_{6})$
14	$φ (C_{2} H_{4})$
15	$φ (C_{2} H_{2})$
16	$% H_{2}$	fourth % ratio [21]

Table 3. Extracted feature sequence by unsupervised MI V.S. supervised MI.

No.	Supervised MI		Unsupervised MI
No.	Feature	Weight	Feature	Weight
1	$% C_{2} H_{4}$	0.380665	$% C_{2} H_{4}$	0.31706
2	$% C_{2} H_{2}$	0.021415	$% H_{2}$	0.276004
3	$φ (C_{2} H_{4})$	0.095868	$φ (C_{2} H_{2})$	0.302481
4	$C_{2} H_{4} / C_{2} H_{6}$	0.052677	$φ (C_{2} H_{4})$	0.306476
5	$C_{2} H_{2} / C H_{4}$	0.058468	$φ (C_{2} H_{6})$	0.299351
6	$C_{2} H_{6}$	0.066181	$φ (C H_{4})$	0.29588
7	$% C H_{4}$	0.082497	$φ (H_{2})$	0.292157
8	$φ (C_{2} H_{2})$	0.067574	$C O_{2} / C O$	0.298659
9	$C O_{2}$	0.066806	$C_{2} H_{2} / H_{2}$	0.304623
10	$C_{2} H_{6} / C_{2} H_{2}$	0.068373	$C_{2} H_{6} / C_{2} H_{2}$	0.305679
11	$C_{2} H_{2}$	0.059614	$C_{2} H_{2} / C H_{4}$	0.30906
12	$C H_{4} / H_{2}$	0.055447	$C_{2} H_{2} / C_{2} H_{4}$	0.301607
13	$C_{2} H_{4}$	0.057832	$C_{2} H_{4} / C_{2} H_{6}$	0.306031
14	$φ (C H_{4})$	0.064196	$C H_{4} / H_{2}$	0.301547
15	$% H_{2}$	0.032973	$% C H_{4}$	0.303705
16	$φ (C_{2} H_{6})$	−0.00899	$% C_{2} H_{2}$	0.279383
17	$C H_{4}$	0.002819	$T H$	0.26508
18	$C O$	−0.00451	$C O_{2}$	0.243314
19	$C_{2} H_{2} / C_{2} H_{4}$	−0.01682	$C O$	0.245554
20	$T H$	−0.00641	$C_{2} H_{6}$	0.246708
21	$C_{2} H_{2} / H_{2}$	−0.02399	$C_{2} H_{4}$	0.229841
22	$% H_{2}$	−0.02539	$C_{2} H_{2}$	0.213115
23	$φ (H_{2})$	−0.0789	$C H_{4}$	0.202028
24	$C O_{2} / C O$	−0.08773	$H_{2}$	0.187682

Table 4. Diagnostic precision by features of the unsupervised approach with best fitness vs. other classic feature set (percent).

TYPE	1	2	3	4
5-FOLD-1	100.00	89.29	89.29	96.43
5-FOLD-2	100.00	100.00	100.00	100.00
5-FOLD-3	100.00	96.43	96.43	100.00
5-FOLD-4	100.00	92.86	100.00	100.00
5-FOLD-5	92.86	96.43	92.86	92.86
AVERAGE	98.57	95.00	95.71	97.86

Note: 1 refers to the features of the unsupervised approach with the best fitness obtained as shown in Figure 4 as input to the optimized svm model; 2 refers to the features used in the three ratios method [1]; 3 refers to the features used in some intelligent methods [16]; 4 refers to the features used in the Duval Triangle method [2].

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yu, W.; Yu, R.; Tao, J. An Unsupervised Mutual Information Feature Selection Method Based on SVM for Main Transformer Condition Diagnosis in Nuclear Power Plants. Sustainability 2022, 14, 2700. https://doi.org/10.3390/su14052700

AMA Style

Yu W, Yu R, Tao J. An Unsupervised Mutual Information Feature Selection Method Based on SVM for Main Transformer Condition Diagnosis in Nuclear Power Plants. Sustainability. 2022; 14(5):2700. https://doi.org/10.3390/su14052700

Chicago/Turabian Style

Yu, Wenmin, Ren Yu, and Jun Tao. 2022. "An Unsupervised Mutual Information Feature Selection Method Based on SVM for Main Transformer Condition Diagnosis in Nuclear Power Plants" Sustainability 14, no. 5: 2700. https://doi.org/10.3390/su14052700

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Unsupervised Mutual Information Feature Selection Method Based on SVM for Main Transformer Condition Diagnosis in Nuclear Power Plants

Abstract

1. Introduction

2. Framework of the Feature Selection Method Based on the SVM Model for Main Transformers

3. Condition Diagnosis Model for Main Transformer

3.1. Support Vector Machine

3.2. PSO for Optimal Parameters

4. Feature Selection Algorithms for Main Transformer Condition

4.1. Stepwise Feature Selection Process

4.2. Selection Principle

4.3. Relationship with Supervised Algorithms

5. Experiment and Validation

5.1. Experiment Description

5.1.1. Sample Data

5.1.2. Basic Features by DGA Condition Monitoring

5.2. Extracted Feature Sequence by Unsupervised MI V.S. Supervised MI

5.3. Results of the Diagnosis by Optimized SVM

5.3.1. Diagnostic Precision with Supervised Mutual Information Feature Selection Method versus Unsupervised Approach

5.3.2. Diagnostic Precision by Features of the Unsupervised Approach with Best Fitness vs. Other Classical Feature Set

6. Conclusions and Analysis

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI