1. Introduction
According to the EU study on electricity supply disruptions, in the period 2010–2014, up to 850 GWh of electricity annually is not supplied to the consumers, which caused a lost value up to EUR 25 billion per year to the commercial users [
1]. The vast majority of these disruptions occur due to the problems in distribution grids that are caused by the failure of insulation of the conductors, such as presence of voids and cracks. This leads to a partial discharge (PD), which can be defined as a localized breakdown of the insulation under the stress of high voltage [
2]. In case of PD, the bridge at the certain part of insulation is created which transports the electric charge between two electrodes. PD tend to develop over time, which causes progressive damage to high voltage (HV) assets and leads to false operation of the distribution grid. Therefore, early-stage PD detection, localization and monitoring are significant tasks to ensure safe and reliable supply of the electric power.
As the location and type of discharge may impact HV assets differently, the positioning of the PD source is of considerable practical interest. In the event of the PD, the energy is transmitted into the surrounded media as electromagnetic waves, heat, light and acoustic waves, so it can be detected utilizing various sensing technologies [
3,
4]. Acoustic measurements have shown several advantages in PD detection over the existing techniques, such as immunity to electromagnetic interference, ability to localize discharge sources, easy and non-invasive deployment on-site. Ultrasonic PD localization technologies use sensor arrays of different configurations to detect sound waves generated by discharge and measure time of flight between subsequent channels [
5,
6,
7]. In such a way, an approximate position of the source can be determined. Various sensor array configurations, such as circular and cross-shaped, have been investigated by different research groups, showing PD source angular positioning errors of approximately 5° [
8,
9,
10]. Among the signal processing approaches, the wavelet transform, Hilber-Huang transform and empirical mode decomposition are frequently used to describe PD signals [
11,
12,
13]. In most of the researches, the spectral signal features are exploited for PD detection, however other works show that time domain analysis can be beneficial for PD detection, especially when the repetition rate and signal reverberations within the substation are important [
14]. For the PD classification tasks, recent advances in pattern recognition techniques based on artificial intelligence have been widely implemented [
15,
16]. Both unsupervised [
17,
18,
19] and supervised [
20,
21,
22] learning techniques are used to group PD signals according to their statistical similarities. For example, Lewin et al. [
23] used DBSCAN and t-SNE clustering along with wavelet denoising to discriminate PD signals arriving from multiple sources. Contin et al. [
24] presented K-means based clustering method to separate PD signals in case of multiple active sources. Li et al. [
25] used two back propagation neural networks for partial discharge recognition in gas insulated switchgear. Choi et al. [
26] tested various ML methods like bagging, k-nearest neighbor, support vector machines and linear discriminant analysis to detect cap damage of porcelain insulators using frequency response functions. An increasing number of researches that use deep learning (DL) neural networks for PD classification can be noticed recently. Florkowski [
27] used convolutional neural networks (CNN) to detect deterioration of electrical insulation from the phased resolved PD images. In contrast to machine learning methods, DL do not require feature extraction, can handle large datasets and provide better accuracy [
28,
29].
Most of the aforementioned advances in sensing and PD classification are applied to detect, de-noise and localize discharges in oil-filled transformer tanks, where discharge-induced acoustic signals propagate in oil with relatively low attenuation. For example, recent research of Hamidreza et al. [
30] show that by using the time reversal approach, the PD source can be localized with superior resolution of λ/10 using one to few sensors. On the other hand, its estimated that up to 17% of total transformer failures are caused by faulty bushings in air power lines [
31]. These can be produced using resin-bonded paper, resin-integrated paper or resin-impregnated synthetic technologies, which can be susceptible to PD. Ultrasonic assessment of PD in transformer bushings can be completed by using contactless techniques in open-air. This introduces additional challenges, such as high transmission losses of acoustic signals, noise and multiple reflections within transformer substation, relatively short inspection distances and increased discharge source positioning errors. Among the attempts to detect and localize corona discharge, Dong et al. [
32] used a fusion of ultrasonic measurement and ultra-violet imaging to visualize the corona source locations in optical views with an angular error of 5.32% at 30 m distance.
In this research, hybrid an approach to detect, localize and visualize corona discharge source is presented. The technique proposed in this study is based on the fusion of ultrasonic and optical data using machine learning and deep learning methods, which enable intuitive discharge visualization in a real-scene environment. The technique uses ultrasonic measurements to detect and localize the source of discharge, while the source itself is identified and emphasized with optical camera by using deep learning methods. The proposed methods approach demonstrates increased sensitivity to PD and low lateral source positioning errors up to 0.1 m. In comparison with similar work conducted by Dong et al. [
23], the proposed technique offers a cost-effective way to detect PD without using an expensive ultra-violet imaging equipment employing only three acoustic sensors for PD localization instead of 31. The obtained solution provides lower positioning errors, due to specific arrangement of ultrasonic sensors with optimized inter-element distance, successful implementation of machine learning models to filter random noise signals and precise time of flight (ToF) measurements. The workflow of the proposed approach can be described as follows. At first, the proposed technology identifies discharge signals and differentiates them from random noise using different machine learning methods like Support Vector Machines (SVM), Naïve Bayes (NB) and Linear Discriminant Analysis (LDA) classifiers. A comprehensive study based on Pearson correlation,
t-test analysis and Mahalanobis distance evaluation is performed here in order to extract features representing discharge signals. The identified discharge signals are then analyzed with ultrasonic signal processing methods to estimate the spatial source position in open-air. The proposed solution uses only three ultrasonic sensors arranged in an L shape to detect and localize discharge location, while the inter-element distance is optimized to achieve the desired accuracy and compact design of the system. The detected position of discharge source is provided to pan-tilt servo motors which control the movement of an optical camera. Then, deep learning networks are trained to detect bushing insulators in optical images. The final output of the proposed technique is an optical image with detected suspicious bushing elements.
This paper is organized as follows: In the first chapter, the ML methods and discharge feature extraction are described for discriminating discharge signals from noise. Then the ultrasonic technique to detect source angular position in two orthogonal planes is presented and verified with appropriate experiments. Finally, DL methods are described for transformer bushing detection in optical image at the direction of PD source.
2. Architecture of Proposed Discharge Detection Technique
The architecture of proposed PD detection system is presented in
Figure 1. It consists of hardware parts that detect, digitize and process PD signals and signal processing—part which is responsible for discharge signal identification—source localization and recognition.
The system was designed for discharge signal detection in open-air at 40 kHz frequency with 3 sensors arranged in an L shape (see
Figure 2). Such frequency characteristics of the system was selected taking into the account attenuation of sound in air (−2.6 dB/m, at 60% relative humidity and 40 kHz) and ensuring operating distance of the device up to 30 m. Such design of the system ensures non-invasive inspection of transformer bushings and reduces noise that is created by HV assets since all measurements are taken at least from a distance of 5 m. Commercially available open-structure air coupled 40 kHz ultrasonic transducers (MA40S4R produced by Murata Manufacturing Co., Ltd. (Kyoto, Japan) with directivity of 80°, sensitivity of −63 dB and bandwidth of 5 kHz at −6 dB level) were selected for discharge signal acquisition. The amplification circuit was manufactured using low noise operational amplifier which provides 50 dB gain (at 40 kHz) and input voltage noise of
. The 20 MHz 12bit 4-channel analogue to digital converter was used to digitize the PD signals. The front view of the proposed PD detector is presented in
Figure 2.
The signal processing part of the system implements ML algorithms to discriminate signals originating from noise. These ML algorithms act as firewall, so further signal processing is skipped if the signal is found to be originating from surrounding noise. If the received signal is originated from the PD, then the ultrasonic signal processing methods are used to determine spatial location of PD source. Finally, to recognize discharging asset, DL convolutional neural networks are implemented, which detect suspicious discharging asset in optical images. In the few following sections, the discharge signal identification, localization and recognition techniques will be discussed in further details.
3. A technique to Identify Discharge Signals
To complete the purpose of discriminating PD signals from surrounding noise, three machine learning classifiers were implemented and tested, named Support Vector Machines (SVM) with Radial Basis Function (RBF) kernel, Naïve Bayes (NB) and Linear Discriminant Analysis (LDA). The dataset of discharge-induced signals was acquired using laboratory corona discharge simulator. The PD source was a needle–needle electrode separated at 230 mm distance. The emitted ultrasound signals were collected at 40 kHz with sampling of 1 MS/s, maintaining 10 m distance from source and sensors. In total, 150 signals were acquired that represent the PD data. The noise signals were recorded from the surroundings of the transformer sub-station, registering signals every 12 min for period of 24 h. This resulted in 120 signals representing acoustical noise in the sub-station. To represent each of the dataset, nine quantitative ultrasonic time domain features were evaluated: mean absolute value, variance, simple square integral, kurtosis, root mean square, average amplitude change, difference absolute standard deviation, modified mean absolute value and maximum fractal length [
33,
34]. In this research, time domain features were investigated only, as the proposed system incorporates narrowband sensors, limiting the spectrum of the received signals. The mean absolute value can be defined as the average of absolute value of the signal:
where
—is the magnitude of the signal at discrete time instance,
—is the length of the signal. Variance is expressed as squared sum of instantaneous values:
Simple squared integral is the summation of signal square values, without consideration of signal length:
Kurtosis define distribution of the signal relative to normal distribution as:
where
is mean value of
,
—is the standard deviation. Root mean square is another feature that defines square root of mean square as:
Average amplitude change is the average magnitude change between neighboring instantaneous values of the signal expressed as:
Difference absolute standard deviation is an average amplitude change related parameter that is defined as the square root of magnitude change square:
Modified mean absolute value uses a weight window function in addition to Equation (1) as:
here
is the weight coefficient. Finally, maximum fractal length is defined as:
The aforementioned quantitative measures have been applied to datasets representing discharge and noise signals. All features were standardized, making the mean value of each feature equal to 0 and variance of 1 as:
where
is the normalized feature vector,
is the initial feature vector,
—is the mean value of the feature vector
,
—is the standard deviation of feature vector
,
, where
is total number of available features.
As the large number of features can reduce the classification accuracy, redundant features were removed using Mahalanobis distance evaluation,
t-test, Pearson correlation and principal component analysis. A two sample
t-test can be formulated as follows:
where
,
—are sample means,
—is the sample variance,
—are the sample sizes,
—is a Student
quantile with
degrees of freedom. The Mahalanobis distance is expressed as a distance from vector to a distribution with mean and covariance as:
where
—is a vector from which the distance is being evaluated,
—is the distribution mean.
The statistical evaluation of each of the abovementioned features is summarized in
Table 1. The aim of such evaluation was to extract statistically significant features that can be further used for training and testing of machine learning models. The null hypothesis
in the
t-test was formulated as: “there is no significant difference between the discharge induced signal and random noise”. The
p-value was calculated with the confidence interval of 95%, meaning that
p-values higher than 5% will reject the null hypothesis. In
Table 1, the
means that the null hypothesis was rejected during the
t-test analysis. The last row of the
Table 1 represents the Pearson correlation value between discharge and noise dataset.
The results presented in
Table 1 demonstrate that there are six out of nine features that are statistically significant according to the
t-test analysis named: variance (
v); simple squared integral (
S); Kurtosis (
K); root mean square (
R); difference absolute standard deviation (
D); maximum fractal length (
MF). In all cases, there was no significant correlation between feature vectors of noise and discharge datasets (normalized correlation around 0.2 was considered as very low), hence only the
t-test and
p-values were used as criteria to select significant features. The normalized Mahalanobis distance evaluation demonstrated six significant features at the pre-defined threshold of 0.7. The results of Mahalanobis distance evaluation correlated well with the
t-test analysis, showing the same significant feature vectors. The normalized values of the Mahalanobis distance for each statistical feature are illustrated in
Figure 3.
In order to further minimize the number of statistical features, principal component analysi (PCA) was performed with an intrinsic dimension of 2, indicating following most significant features for classification task: kurtosis (K) and difference absolute standard deviation (D).
The abovementioned classifiers (SVM, NB, LDA) were trained and tested for both cases: using all six statistical features identified with Mahalanobis distance evaluation and only two features, identified with PCA. The results demonstrated that specificity, sensitivity and area under the receiver operating curve (ROC) were similar for all classifiers either using all six or only two feature vectors. This means that kurtosis and difference absolute standard deviation holds most of the variance in the entire dataset. The ROC curves in the case of six features and two features (kurtosis and difference absolute standard deviation) for all three classifiers are illustrated at
Figure 4. The comparison between the sensitivity and specificity of three selected classifiers in the case of six and two feature vectors can be seen in
Table 2.
The results presented both in
Figure 4 and
Table 2 indicate that classification is accurate in both cases, either using six or two feature vectors. In case of six feature vectors, the area under the ROC curve (AUC) is 99.7% for SVM, 99.1% for NB and 99.6% for LDA classifiers. In contrast, the two-feature vector provides AUC of 98.8% for SVM, 98.7% for NB and 98.9% for LDA. The results in
Table 2 indicate, that the RBF SVM classifier provides the best performance among all the classifiers in terms of sensitivity and specificity. The classification boundaries for all three classifiers in the case of the two-feature vector space is presented in
Figure 5.
The results presented here demonstrate that the classification between the discharge signal and surrounding noise can be performed quite accurately. These results need to be further verified, as the discharge signals were collected using a laboratory discharge emulator. It is expected that the performance of the classifiers will slightly degrade going into the in-situ discharge measurements. The RBF SVM classifier overall demonstrated best performance, since classification boundary for the dataset appeared to be non-linear. According to the architecture of the proposed discharge detector, the ML methods presented in this chapter act as an outlier detector with the aim to filter random noise signals from further processing. If the signal is identified as a discharge, the ultrasonic localization algorithms presented in following chapter are used then to localize the source position in open space.
5. Recognition of Faulty HV Asset
The proposed discharge detection and localization system incorporates an optical camera which is controlled from ultrasonic measurement data by means of pan-tilt servo manipulator (see
Figure 1 and
Figure 2). The output of ultrasonic signal processing algorithms presented in previous chapter is an angular position of the PD source in horizontal and vertical planes. Such data can be fed to pan-tilt motor, to steer the camera to the desired spatial position. As the camera is positioned in front of the detected discharge source, an optical image is being recorded for further processing. Typically, in such image, many bushing insulators are present, depending on the mounting position of the device in the transformer substation. So it’s important to distinguish a single bushing insulator that possess a risk of discharge and mark it to the end user. In order to detect and emphasize single suspicious discharging insulator, deep learning convolutional neural networks (CNN) are applied and embedded to the system. Usually, if the spatial location of acoustic source is correctly determined by ultrasonic methods described in previous chapter, the discharging source should be located somewhere at the center of an image. So, it is assumed here that the central area of the image most likely will contain the view of discharging asset. Hence, the convolutional neural networks are used to process the area around center point of the image and detect the assets of bushing insulators. To achieve this purpose, as a first step. a small area around the center of an image is cropped and provided to a CNN object detector. If none of the objects are detected, then the size of an area is increased until the object detector finds at least one insulator. The final output is the original image captured by optical camera with boundary of single insulator. The CNN object detector is pre-trained to detect high voltage transformer bushings. The detector is implemented by using Faster R-CNN convolutional neural network, which uses a region proposal network (RPN) for feature map estimation and Fast R-CNN as a detector [
35]. Faster R-CNN is an evolution of its predecessor, Fast R-CNN, and proposes anchors that most likely contains objects to generate region proposals instead of selective object search. The CNN model was trained using TensorFlow GPU v1.9 object detection API on NVIDIA Geforce GTX 1060 with CUDA v9.0.176 and cuDNN v7.0. In total, 430 images of transformer bushing insulators were labelled and used in training dataset, while the testing dataset was compiled from images that were acquired from different transformer stations across Lithuania. The model was trained in 150k epochs (batch size 1) achieving average classification loss of 0.04 and mean average precision (mAP) of 91.31%. The example of single discharging bushing insulator which was detected using Faster R-CNN is presented in
Figure 14.
Object detection with CNN is used here for two purposes: first, the single discharging asset is recognized in the image, which serves as an output of the detection device; second, if none of the assets are recognized in the image, the measurement is repeated as this is treated as the measurement error. Hence CNN acts as an additional evaluation loop to avoid false alarms.
6. Conclusions
The ultrasonic non-invasive approach for detection, localization of the partial discharge (PD) that is designed to assess dielectric conditions of connectors of bushing insulators was proposed. The proposed solution uses machine learning, ultrasonic signal processing and deep learning methods to detect and localize the source of PD. At the first stage ML methods act as a firewall to filter acoustic noise signals that are captured by the system and differentiate between actual PD signals and surrounding noise. Then the binaural methods are used for PD source localization which exploits the time difference between arrival time of the same discharge signal received by different ultrasonic sensors. Based on these methods, the angular direction of PD source is reconstructed in 2D and 3D space with an average lateral positioning error of 0.1 m. Finally, ultrasonically estimated positions of PD source are fed to pan-tilt servo manipulators to steer the optical camera to the detected PD source position. Then the recognition of the discharging asset is performed by using deep learning convolutional neural networks which identifies single discharging HV assets.
The proposed ultrasonic PD localization and detection technique was verified and optimized with the appropriate experiments. At first, ML methods were evaluated for separation between discharge signals and surrounding noise. It was estimated, that two signal features (kurtosis and difference absolute standard deviation) holds the most variance and are most appropriate for determination of the noise origin. Among the different ML classifiers, it was found that RBF-SVM is the one that shows the best sensitivity and specificity as it provides a non-linear classification boundary. A separate set of the experiments were performed with the different positions of the discharge source and inter-element distances between sensors R1, R2, R3 with the aim to investigate the detectability, lateral error and optimal spacing between sensors. The investigations demonstrated a strong correlation between PD signals and the lateral error of source positioning was estimated to be up to 0.1 m. The inter-element spacing optimization experiment demonstrated that 20 cm pitch between the sensors is optimal in order to reliably detect PD source at 8 m distance as it provides 2.7 µs delay between neighboring channels. It was shown that the inter-element spacing plays a significant role for sources that are positioned in front of the detector, while the sources at some angles are easier to detect due to longer signal propagation paths. As a result, a smaller pitch such as 10 cm may be insufficient to detect the PD at certain positions. Finally, the deep learning neural networks was implemented which are capable to detect single suspicious discharging assets in optical images based on the spatial PD positions determined by ultrasonic methods. As a result, the output of the proposed system is an intuitive discharge visualization in a real-scene environment.
As the results presented in this article showed quite good performance both in classification of discharge source and localization of it in 3D space, a logical next step would be to apply these methods in an actual substation with pre-identified faulty insulators. This would include additional challenges as more significant signal reverberations, changing environmental conditions, increased noise levels and reduced PD signal energies. As a result, one might expect to obtain larger positioning errors and a lower classification accuracy. On the other hand, the ML techniques used in this approach can improve on each measurement, while receiving more and more data describing the PD signals. With the advent of ML methods, better understanding of the PD signals may subsequently lead to an increased overall detection and localization accuracy on the long-term perspective.