3.1. Overview
Before presenting the methodology, it is important to review some terminology defined in our previous study [18]. The noise gap is the horizontal band around amplitude zero without PDs. ANGPD discharges (an acronym for “Adjacent to Noise Gap PDs”) are close to the noise gap, and the other discharges are called N-ANGPD (Non-Adjacent to Noise Gap PDs). ANGPD clouds are candidates for the PD sources InV, InD, DCI, slot, and corona, whereas N-ANGPD discharges are potential surface tracking and gap sources. PDs termed higher or lower have higher or lower absolute amplitudes than others. Those terms are illustrated in
Figure 3.
The common PD sources in rotating machines produce PRPDs typically characterized according to
Table 1 [
15,
20]. Following the categorization of
Table 1, it is possible to classify PD sources using a two-step process. First, a preliminary separation can be performed based on the symmetry between positive and negative PDs, and then the PD clouds’ shape is considered to resolve between sources with the same symmetry information. The proposed PD recognition methodology was structured with that same reasoning.
This study proposes a PD-recognition methodology that is able to identify one or multiple typical PD sources in rotating machines. It is capable of identifying up to three simultaneous PD types: the dominant ANGPD source plus surface tracking and/or gap discharges. The dominant ANGPD source is the one associated with the pair of positive and negative ANGPD clouds with the highest number and density of PDs. Gap-discharge clouds are detected even if they are partially superposed onto the ANGPD clouds. The absence of ANGPD and/or N-ANGPD sources can also be detected. The technique is an extension of those previously proposed by the authors in [
18,
22]; it operates entirely on PRPD patterns, which are treated as images.
During this research, we did not find a single feature capable of accurately differentiating between all the typical PD sources mentioned in IEC 60034-27-2 [
15]. Therefore, the recognition task was divided into smaller problems. This approach usually results in shorter training times and better generalization due to the simpler individual classifiers [
23].
The proposed methodology is based primarily on fully-connected feedforward artificial neural networks. It consists of training, validation, and testing stages. In the first two phases, different ANNs are trained so that each solves a smaller problem of the classification task in an isolated manner. In the testing stage, the previously trained ANNs are combined hierarchically to perform PD recognition on unknown samples.
The training and validation stages are illustrated by the block diagram of
Figure 4. First, the input pattern is subjected to a PRPD image-denoising algorithm for suppressing spurious discharges that are not related to the underlying PD source(s). The algorithm is based on our proposal of [
18] but with some modifications to address the multiple-source case (
Section 3.2). Next, a novel image-processing technique is applied to separate horizontal N-ANGPD clouds partially superposed onto the ANGPD discharges (
Section 3.3). The image denoising yields groups (clusters) of denoised N-ANGPD and ANGPD discharges. From these groups, two sub-PRPDs are generated: one containing only N-ANGPD PDs and the other with only ANGPD clusters.
From the patterns known to have surface tracking and/or gap activity (as labeled by the specialist), the clouds of their N-ANGPD sub-PRPDs were used to optimize the thresholds of a manually predetermined set of rules (
Section 3.5). In the testing stage, those rules are used to identify the presence of such PD sources by analyzing the dimensions and positions of a PRPD’s N-ANGPD clouds.
The dominant ANGPD PD source is identified by the cooperation of four types of neural networks: ANNs-0, ANNs-1, ANNs-2, and ANNs-3. Some of their hyperparameters are listed in
Table 2; the topology (number of hidden layers and hidden neurons) is determined heuristically in
Section 4.2. As detailed in
Section 3.8 and
Section 3.9, the networks were trained and tested with several combinations of mutually exclusive subsets of training, validation, and test data. The numbers of PRPDs forming those subsets are also listed in
Table 2 for each network type.
From the ANGPD sub-PRPD, neural network ANN-0 detects whether the pattern’s ANGPD clouds are noisy by analyzing simple cloud characteristics such as height and density (
Section 3.7). Next, the ANGPD sub-PRPDs with a valid ANGPD PD source (as labeled by the specialist) are partitioned into different subsets (
Section 3.8) to train three other types of neural networks, each fed with one of two input features: amplitude histograms [
18] and the novel contour attribute, described in
Section 3.6. The first neural network (ANN-1) uses histograms to perform preliminary recognition, by merging InV with InD patterns (class InV/InD) and slot with corona (Slot/Corona) into two different groups of samples. Those mergers are necessary because the histogram does not accurately capture the differences in cloud shape that differentiate those classes [
15,
18]. The other neural networks (ANN-2 and ANN-3) solve these ambiguities with contour features, which are more sensitive to the cloud shape.
The testing stage is illustrated in
Figure 5. An input pattern is subjected to the algorithm of noise removal and multiple source separation (
Section 3.2 and
Section 3.3). In the N-ANGPD sub-PRPD, surface tracking and gap PDs are sought by comparing the dimensions and positions of its clouds against predetermined thresholds. From the ANGPD sub-PRPD, several metrics are extracted and used by the ANN-0 network to estimate whether the ANGPD clouds are noisy or valid. If both clouds are deemed noisy, the pattern is considered to have no ANGPD PD source. If at least one cloud is judged valid, the pattern is classified by a cascade of the trained networks ANN-1, ANN-2, and ANN-3. ANN-1 identifies the sample based on its amplitude histogram. If ANN-1’s output is DCI, this is already the system’s classification for the ANGPD PD source, since this class is characterized solely by the symmetry between positive and negative PDs (
Table 1) [
15,
20]. If ANN-1’s output is InV/InD or slot/corona, contour features are calculated and passed to the corresponding auxiliary neural network (ANN-2 or ANN-3) to solve the ambiguity.
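The decision flow of the testing stage can be summarized with a minimal sketch. The function name, the string labels, and the stand-in classifiers passed as callables are hypothetical; the assignment of ANN-2 to the InV/InD group and of ANN-3 to the slot/corona group is an assumption based on the order in which they are mentioned, not something stated explicitly in the text.

```python
from typing import Callable, Sequence

Classifier = Callable[[Sequence[float]], str]


def classify_angpd_source(
    cloud_metrics: Sequence[Sequence[float]],
    histogram: Sequence[float],
    contour_features: Callable[[], Sequence[float]],
    ann0: Classifier,
    ann1: Classifier,
    ann2: Classifier,
    ann3: Classifier,
) -> str:
    """Hierarchical recognition of the dominant ANGPD source (illustrative)."""
    # ANN-0 judges each ANGPD cloud; if every cloud is noisy, no source exists.
    if all(ann0(metrics) == "noisy" for metrics in cloud_metrics):
        return "no ANGPD source"

    # ANN-1 performs the preliminary recognition from the amplitude histogram.
    label = ann1(histogram)
    if label == "DCI":
        return label  # DCI needs no further disambiguation

    # Contour features are computed only when an ambiguity must be resolved.
    features = contour_features()
    if label == "InV/InD":
        return ann2(features)  # assumed mapping: ANN-2 resolves InV vs. InD
    if label == "Slot/Corona":
        return ann3(features)  # assumed mapping: ANN-3 resolves slot vs. corona
    raise ValueError(f"unexpected ANN-1 output: {label!r}")
```

Passing the contour computation as a callable mirrors the description above, in which contour features are calculated only after ANN-1 returns one of the merged classes.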
Keeping the practice of [
18] of describing the algorithms independently from the PRPD dimensions, in this study, the parameter values of the denoising and multiple-source separation algorithms are described as functions of the number of pixels
A forming the width/height of a PRPD pattern. In the case of this study,
.
The following subsections provide more detailed descriptions of the stages forming the methodology, especially the novel contributions (shown as boxes in blue in
Figure 4,
Figure 5 and
Figure 6).
3.3. Separation of Superposed N-ANGPDs
In this stage, illustrated by the namesake block in blue in
Figure 6 and detailed in
Figure 7, partially superposed horizontal N-ANGPD clouds (potential gap discharges) and ANGPD clouds in the PRPD image were separated. Such superposition is a common situation in the database. The separation was based on the fact that N-ANGPD clouds usually have PD counts higher than the ANGPD clouds at the intersection region.
After the removal of non-dominant ANGPDs (
Figure 6), PDs were grouped by means of nearest-neighbor clustering [
24] using a 0.05A × 0.05A pixel submatrix.
Figure 8b shows the results of clustering applied to the pattern of
Figure 8a, which in turn is the PRPD of
Figure 3d after the removal of non-dominant ANGPDs. Each cluster is indicated with its PDs in a different color.
The following algorithm was applied to each ANGPD cluster separately. From the PRPD, a matrix containing only the PDs of the cluster being analyzed was formed. This matrix was divided into a grid of
cells. One calculates matrix
M, whose elements are equal to the sum of PD counts within each cell of the grid. Matrix
D was then obtained by normalizing
M by its maximum non-zero element excluding outliers.
Figure 8c, for example, shows the matrix
D relative to the positive ANGPD cluster of
Figure 8b.
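The construction of the two matrices can be sketched as follows. This is only an illustration: the cell side `cell` is a hypothetical parameter (the grid size is not reproduced here), and the outlier rule, which takes a high quantile of the non-zero cell sums as the reference maximum and clips larger values to 1, is an assumption, since the text does not specify how outliers are excluded.

```python
def cell_count_matrix(prpd, cell):
    """Sum the PD counts of `prpd` (a list of rows) inside square cells."""
    rows, cols = len(prpd), len(prpd[0])
    n_rows = -(-rows // cell)  # ceiling division
    n_cols = -(-cols // cell)
    m = [[0] * n_cols for _ in range(n_rows)]
    for r in range(rows):
        for c in range(cols):
            m[r // cell][c // cell] += prpd[r][c]
    return m


def normalize_counts(m, outlier_quantile=0.99):
    """Normalize `m` by its largest non-zero element below a quantile cut.

    The quantile cut is an assumed way of "excluding outliers"; values
    above the reference are clipped to 1.0.
    """
    nonzero = sorted(v for row in m for v in row if v > 0)
    if not nonzero:
        return [[0.0] * len(row) for row in m]
    idx = min(len(nonzero) - 1, int(outlier_quantile * (len(nonzero) - 1)))
    reference = nonzero[idx]
    return [[min(v / reference, 1.0) for v in row] for row in m]
```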
In order to remove as much superposition as possible, it is necessary to estimate the actual dimensions of the superposed N-ANGPD clouds (if any). For that, we exploited the fact that PD clouds are usually formed by higher-count discharges in the central region, surrounded by smaller-count discharges. If only the high-count PDs are considered, one obtains smaller versions of the superposed N-ANGPD clouds whose boundaries (formed by the furthermost discharges from the cloud’s center) are farther from one another, facilitating their identification as separate groups. From the identified higher-count portion of the superposed N-ANGPD clouds, it is easier to estimate their actual dimensions. The separation algorithm developed in this study follows such reasoning.
It is convenient to introduce the
matrix
, given by
where
L is a real-valued number from 0 to 1 that sets the threshold of the minimum normalized PD count considered. For a given value of
L, the matrix
is formed, and its non-zero pixels of
are grouped with nearest-neighbor clustering [
24] using a 0.0195A × 0.0195A submatrix. One determines the bounding box (BB) of each cluster, which is the smallest imaginary rectangle (height is
H and width is
W pixels) containing all the cluster’s discharges. In
Figure 8d, for example, the BB of the cluster in red is shown.
’s N-ANGPD clusters are scanned, and those meeting
and
are considered horizontal. This is repeated iteratively varying
L in the sequence {0.70, 0.50, 0.40, 0.30, 0.20, 0.15, 0.10, 0.08, 0.05, 0.02, 0.00}, in a process called the loop of thresholds. Over the course of this loop,
gradually rebuilds matrix
D, from the highest-count pixels to those of smallest-count, as seen in
Figure 8d–f. In the initial iterations (high
L), there are only the innermost high-count PDs of each cloud, and thus those groups tend to be separated (
Figure 8d). As
L decreases (
Figure 8e–f), the groups’ boundaries progressively approach each other, eventually forming a single cluster again. For each horizontal N-ANGPD cloud detected during the loop, let
be the corresponding cluster in matrix
for the smallest
L at which the N-ANGPD and ANGPD clusters are still separated. Based on the
clusters, one separates N-ANGPD clouds from the ANGPD cloud with the procedure explained below, illustrated in
Figure 9,
Figure 10 and
Figure 11. If no horizontal cloud is found at any of the
L thresholds, it is assumed that there are no superpositions, and no separation is performed for the current ANGPD cloud. The loop of thresholds is then repeated for the other ANGPD clusters of the PRPD, one at a time.
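A simplified sketch of the loop of thresholds is given below. It assumes that the thresholded matrix keeps the elements of D that are greater than or equal to L and zeroes the rest, replaces the nearest-neighbor clustering with a flood fill over a square neighborhood of hypothetical radius `reach`, and takes the horizontality criterion as a caller-supplied predicate `is_horizontal(H, W)`. It only records the horizontal bounding boxes found at each threshold and does not track the smallest L at which the clusters remain separated.

```python
from collections import deque

THRESHOLDS = [0.70, 0.50, 0.40, 0.30, 0.20, 0.15, 0.10, 0.08, 0.05, 0.02, 0.00]


def threshold_matrix(d, level):
    """Keep the elements of `d` that reach `level` (assumed form)."""
    return [[v if v >= level else 0.0 for v in row] for row in d]


def cluster_bounding_boxes(mat, reach=1):
    """Group non-zero pixels and return boxes as (row, col, H, W)."""
    rows, cols = len(mat), len(mat[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r0 in range(rows):
        for c0 in range(cols):
            if mat[r0][c0] <= 0 or seen[r0][c0]:
                continue
            seen[r0][c0] = True
            queue = deque([(r0, c0)])
            top, bottom, left, right = r0, r0, c0, c0
            while queue:
                r, c = queue.popleft()
                top, bottom = min(top, r), max(bottom, r)
                left, right = min(left, c), max(right, c)
                # Visit every pixel inside the (2*reach+1)-wide neighborhood.
                for rr in range(max(0, r - reach), min(rows, r + reach + 1)):
                    for cc in range(max(0, c - reach), min(cols, c + reach + 1)):
                        if mat[rr][cc] > 0 and not seen[rr][cc]:
                            seen[rr][cc] = True
                            queue.append((rr, cc))
            boxes.append((top, left, bottom - top + 1, right - left + 1))
    return boxes


def loop_of_thresholds(d, is_horizontal, reach=1):
    """Return, for each threshold L, the boxes judged horizontal."""
    found = {}
    for level in THRESHOLDS:
        boxes = cluster_bounding_boxes(threshold_matrix(d, level), reach)
        found[level] = [b for b in boxes if is_horizontal(b[2], b[3])]
    return found
```

In this sketch, a wide cloud that is detected at high thresholds and disappears at low ones (because it merges with a taller cluster) behaves like the superposed clouds described above.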
The horizontal clusters
detected in the loop of thresholds underestimate the dimensions of superposed N-ANGPD clouds. For a more accurate estimation, the bounding box of each
cluster was “expanded” using the following procedure, illustrated in
Figure 9 for the pattern of
Figure 8. Such expansion occurs iteratively in matrix
D, in the vertical and horizontal directions, while the PDs within the expanded BB are distributed similarly to the PDs bounded by
’s original BB. For each column of
’s original BB, one calculates the proportion of the non-zero elements (
) relative to the total number of pixels in the column and the average of the row coordinates of the non-zero elements weighted by its values in matrix
D (
), as shown in
Figure 9b. The averages of those metrics over all the columns of the original BB are
and
. In the horizontal direction, the BB’s left bound is shifted to the left, one column at a time, while the relative differences between the
and
values in the new column and
and
are less than 75% and 40%, respectively. The same is done for the BB’s right bound, which is shifted to the right. An analogous procedure is then applied in the vertical direction. The pixels within the expanded BB are taken as the estimate of the actual N-ANGPD cluster (pixels in green in
Figure 9a).
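The horizontal part of the expansion can be illustrated with the sketch below. The function names are hypothetical, rows are measured from the top of the bounding box starting at 1 (an assumption made so that the relative difference is well defined), and columns without non-zero elements are assumed to stop the expansion. It also assumes that the original bounding box contains at least one non-zero pixel.

```python
def column_metrics(d, col, top, bottom):
    """Return (fill ratio, count-weighted mean row) for one column of the BB."""
    cells = [(r - top + 1, d[r][col]) for r in range(top, bottom + 1)]
    nonzero = [(y, v) for y, v in cells if v > 0]
    if not nonzero:
        return 0.0, None
    fill = len(nonzero) / len(cells)
    mean_row = sum(y * v for y, v in nonzero) / sum(v for _, v in nonzero)
    return fill, mean_row


def rel_diff(value, reference):
    """Relative difference of `value` with respect to `reference`."""
    return abs(value - reference) / abs(reference)


def expand_horizontally(d, top, bottom, left, right, fill_tol=0.75, row_tol=0.40):
    """Shift the left and right bounds while new columns resemble the BB."""
    metrics = [column_metrics(d, c, top, bottom) for c in range(left, right + 1)]
    metrics = [m for m in metrics if m[1] is not None]
    fill_ref = sum(m[0] for m in metrics) / len(metrics)
    row_ref = sum(m[1] for m in metrics) / len(metrics)

    def similar(col):
        fill, mean_row = column_metrics(d, col, top, bottom)
        return (
            mean_row is not None
            and rel_diff(fill, fill_ref) < fill_tol
            and rel_diff(mean_row, row_ref) < row_tol
        )

    while left > 0 and similar(left - 1):
        left -= 1
    while right < len(d[0]) - 1 and similar(right + 1):
        right += 1
    return left, right
```

The vertical expansion would follow the same pattern with the roles of rows and columns exchanged.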
The proposed separation algorithm deals with two types of superposition. Case 1, common in the database and illustrated in
Figure 10a (original PRPD shown in
Figure 3e), consists of a wide horizontal group above the ANGPD cloud, both connected by noisy pixels of low/intermediate density. In case 2 (
Figure 8a), there is an actual superposition between the horizontal N-ANGPD and ANGPD clouds.
Each case is treated differently, so it is necessary to identify the category to which the superposition between the ANGPD and N-ANGPD clouds belongs. In the region bounded by the superposed cluster’s BB, 0.08A equally spaced columns of elements of matrix
D (vertical profiles) were sampled along the phase axis, as shown in
Figure 10b. The point-wise average of those 0.08A curves resulted in the average vertical profile, illustrated in
Figure 10c. The average vertical profile can be seen as a function
describing the average distribution of PD counts across amplitudes. For case 1, this function should present, in the direction from the ANGPD cloud to the N-ANGPD discharges, a sequence of a local maximum, a local minimum, and a local maximum, associated with the ANGPD cloud, the noise, and the horizontal PDs, respectively (
Figure 10c). In this study, the local minima and maxima were found by iteratively sliding a 0.02A-element-wide window along the average vertical profile. The central point is a local maximum if it is the largest element inside the window, or it is a local minimum if it is the smallest. Moreover, since horizontal PDs span a phase interval comparable to the ANGPD cloud’s width in case 1, little variation between the individual vertical profiles was expected. This variation was quantified inversely by the similarity
, calculated as the average of the cross-correlations between all the pairwise combinations of vertical profiles. Based on this information, the superposition was considered to belong to case 1 if all the following criteria were met: (i) N-ANGPD cloud is horizontal; (ii)
; and (iii) the average vertical profile presents exactly three local extrema (one minimum between two maxima), and the value at the minimum is less than 1% of the value at the first maximum. Otherwise, the superposition was considered to be of case 2.
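Criterion (iii) can be sketched as follows, assuming that the vertical profiles are lists of equal length ordered from the ANGPD cloud toward the N-ANGPD discharges. The function names are hypothetical, the window is given directly in elements, and runs of consecutive extrema of the same kind (for example, a flat valley) are collapsed into a single extremum, which is a simplification not stated in the text. The similarity criterion (ii) is not covered here.

```python
def average_profile(profiles):
    """Point-wise average of equally long vertical profiles."""
    n = len(profiles)
    return [sum(p[i] for p in profiles) / n for i in range(len(profiles[0]))]


def local_extrema(profile, window):
    """Slide a window along `profile` and label its central points."""
    half = window // 2
    extrema = []
    for i in range(half, len(profile) - half):
        segment = profile[i - half:i + half + 1]
        if max(segment) == min(segment):
            continue  # flat window: neither a maximum nor a minimum
        if profile[i] == max(segment):
            kind = "max"
        elif profile[i] == min(segment):
            kind = "min"
        else:
            continue
        # Collapse consecutive extrema of the same kind into the first one.
        if not extrema or extrema[-1][1] != kind:
            extrema.append((i, kind))
    return extrema


def has_case1_shape(profile, window, ratio=0.01):
    """Check for a max-min-max sequence with a sufficiently deep minimum."""
    extrema = local_extrema(profile, window)
    if [kind for _, kind in extrema] != ["max", "min", "max"]:
        return False
    first_max, minimum = extrema[0][0], extrema[1][0]
    return profile[minimum] < ratio * profile[first_max]
```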
The following explanations refer to the
x and
y axes (
Figure 10 and
Figure 11). The
x-axis is horizontal, oriented left-to-right in the PRPD. The
y-axis is vertical, oriented from the bottom to the top of the superposed cluster.
In the case-1 superposition, let
and
be the
y coordinates of the upper and lower bounds of the intermediate noisy region, respectively (
Figure 10b). Due to the small number of discharges in this zone, the average vertical profile function
takes lower values in this region (rows
) than in the other two regions, which are associated with the ANGPD and horizontal clouds (
Figure 10c). It is considered that
matches the lower bound of the horizontal cloud’s expanded BB. Moving in the direction of decreasing
y, the first row at which
was estimated as the coordinate
. The coordinates
and
are illustrated in
Figure 10b,c. Additionally, let
and
be the
x coordinates of the left and right bounds of the ANGPD cloud, and let
be the
y coordinate of this cluster’s lower bound (
Figure 10b).
Next, one calculates the vectors
and
, containing the
x coordinates of the cluster’s leftmost and rightmost non-zero pixels in each row, respectively (red and green curves in
Figure 10b). The left and right contours of the ANGPD cloud were estimated by fitting parabolas to certain points of the curves
and
using the least squares method. Such parabolas must be convergent—that is, they must intersect at a point whose
x coordinate is within
—and the intersection point determines the ANGPD cloud’s upper bound. It is estimated that the ANGPD cloud’s peak lies next to the central point
of the noisy region, where
and
. Parabolas were fitted to the points of
and
corresponding to the set of rows
, where
was initially equal to
. If parabolas are not convergent, new fitting attempts are performed by iteratively decreasing
, one row at a time, until the convergence of parabolas is achieved. If there is no
such that the parabolas intersect at a point above the expanded horizontal cloud’s lower bound, the fitting is considered inadequate, because it violates the hypothesis of no superposition between the clouds. In this case, new parabolas are fitted to the points of
and
corresponding to the set of rows
, and also to the point
. The addition of this point tends to induce the expected convergence of parabolas. In the example of
Figure 10d, the parabolas were obtained in the first fitting attempt to the points of
and
of the rows
. Separation was performed by assigning to the ANGPD cloud the PDs below the fitted parabolas, and the horizontal cloud was composed of the PDs within its expanded bounding box (
Figure 10e).
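The fitting and the convergence test can be illustrated with a minimal sketch. It models each contour as x = a·y² + b·y + c, solves the least-squares normal equations directly, and treats the parabolas as convergent when an intersection point has its x coordinate between two bounds supplied by the caller. The function names and the choice of the bounds are assumptions, and at least three distinct y values are needed for the fit.

```python
import math


def fit_parabola(ys, xs):
    """Least-squares fit of x = a*y**2 + b*y + c; returns (a, b, c)."""
    s = [sum(y ** k for y in ys) for k in range(5)]
    t = [sum(x * y ** k for x, y in zip(xs, ys)) for k in range(3)]
    # Augmented matrix of the normal equations.
    aug = [
        [s[4], s[3], s[2], t[2]],
        [s[3], s[2], s[1], t[1]],
        [s[2], s[1], s[0], t[0]],
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for i in range(3):
        pivot = max(range(i, 3), key=lambda r: abs(aug[r][i]))
        aug[i], aug[pivot] = aug[pivot], aug[i]
        for r in range(3):
            if r != i:
                factor = aug[r][i] / aug[i][i]
                aug[r] = [vr - factor * vi for vr, vi in zip(aug[r], aug[i])]
    return tuple(aug[i][3] / aug[i][i] for i in range(3))


def intersections(p, q):
    """Return the (x, y) points where two fitted parabolas meet."""
    a, b, c = (pi - qi for pi, qi in zip(p, q))
    if abs(a) < 1e-12:
        if abs(b) < 1e-12:
            return []
        roots = [-c / b]
    else:
        disc = b * b - 4 * a * c
        if disc < 0:
            return []
        root = math.sqrt(disc)
        roots = [(-b - root) / (2 * a), (-b + root) / (2 * a)]
    return [(p[0] * y * y + p[1] * y + p[2], y) for y in roots]


def is_convergent(left, right, x_min, x_max):
    """True if some intersection lies within the given x bounds (assumed test)."""
    return any(x_min <= x <= x_max for x, _ in intersections(left, right))
```

In the procedure described above, a failed convergence test would trigger a new fit on a reduced set of rows.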
For case-2 superpositions, the
and
curves of the cluster were also calculated in matrix
D (
Figure 11a). In the absence of disturbances,
and
tend to vary monotonically along the
x-axis. When there are superposed horizontal clouds, however, those curves experience sharp variations along
x in the opposite direction to what is expected (“divergences”), as shown in
Figure 11b. Disregarding the points of
and
associated with the divergences caused by superposed horizontal clouds (diverging points), it is possible to estimate the ANGPD cloud’s left and right contours without superpositions.
Figure 11 illustrates the separation of case-2 superpositions for the positive ANGPD cluster of the pattern of
Figure 8a. One obtains the local maxima (tops) and minima (bottoms) of
and
curves (
Figure 11b). In the
and
curves, the local maxima (tops) and minima (bottoms) are searched within a 2-point neighborhood. For greater robustness against noise, one considers only the pairs of consecutive local minima and maxima whose
x values differ by at least nine pixels. Each sequence of three local extrema (top-bottom-top for
, or bottom-top-bottom for
) was assumed to be related to the superposition with a horizontal cloud. The superposed horizontal clouds may have counts different or similar to the ANGPD cloud’s in the region of intersection. In
Figure 11a, for example, the superposed horizontal PDs to the right (purple arrow) and to the left (yellow arrow) were of the first and the second type, respectively. The diverging points of
and
were determined for each triplet of local extrema.
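One possible reading of the triplet search is sketched below. It assumes that a contour curve is stored as a list whose index is the row and whose value is the x coordinate, that the 2-point distance is a neighborhood of two points on each side, and that both consecutive pairs of a triplet must differ in x by at least nine pixels. The function names and the `pattern` argument are hypothetical.

```python
def tops_and_bottoms(curve, reach=2):
    """Return [(row, x, kind)] for the local extrema of a contour curve."""
    extrema = []
    for i in range(reach, len(curve) - reach):
        segment = curve[i - reach:i + reach + 1]
        if max(segment) == min(segment):
            continue  # flat neighborhood carries no extremum
        if curve[i] == max(segment):
            kind = "top"
        elif curve[i] == min(segment):
            kind = "bottom"
        else:
            continue
        # Keep only the first point of a run of same-kind extrema.
        if not extrema or extrema[-1][2] != kind:
            extrema.append((i, curve[i], kind))
    return extrema


def divergence_triplets(curve, pattern, min_dx=9, reach=2):
    """Find consecutive extrema matching `pattern` with large x excursions."""
    extrema = tops_and_bottoms(curve, reach)
    triplets = []
    for a, b, c in zip(extrema, extrema[1:], extrema[2:]):
        kinds = (a[2], b[2], c[2])
        if (
            kinds == tuple(pattern)
            and abs(a[1] - b[1]) >= min_dx
            and abs(b[1] - c[1]) >= min_dx
        ):
            triplets.append((a, b, c))
    return triplets
```

Small oscillations of one or two pixels are rejected by the `min_dx` test, which reflects the noise-robustness requirement described above.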
For each horizontal cloud of distinctive count (expanded cluster
), detected during the loop of thresholds, one verifies to which sides it causes divergence in the ANGPD cloud. Let
and
be the
y coordinates of the upper and lower bounds of
’s BB (
Figure 11a). In
and
, one checks whether there is a triplet whose first and third local extrema have
y coordinates less than
and greater than
, respectively. In
Figure 11c, for example, this applies to the triplet in
. If such a triplet exists in
, and if the element
of
immediately below the horizontal cloud has
x value (
) between the left and right bounds of
, the points of
corresponding to the rows
are divergent (
Figure 11c). Analogous logic applies to
.
The other triplets of local extrema may be related to superpositions of horizontal clouds not detected during the loop of thresholds. The following procedure was performed for each of those triplets. Let
F be the curve
or
to which the triplet in question belongs, and let
be the coordinates of the first local extremum in
F (
Figure 11c). Starting from this extremum and moving in the direction of increasing
y, one looks for the first point
of
F such that
if
, or
if
. The point
is shown in
Figure 11c. The determination of this point follows the assumption that, in the direction of increasing
y, once one gets past the divergence,
’s
x values are again greater than (or less than if
)
, an indication that the ANGPD cloud resumed its natural trend of narrowing in the phase axis. In order to reduce the influence of noise, the points of
F in the rows
were considered divergent only if all the following criteria were met: (i) there are at least two points of
F between the first and third local extrema of the triplet; and (ii) the magnitude of the difference between
and the median of the
x values of
F in the interval
is greater than or equal to 12. In
Figure 11c, the triplet of local extrema of
meets these conditions.
Using least squares, parabolas were fitted to the
and
curves disregarding the divergent points, as shown in
Figure 11c. For each group of divergent points of the curve
F (
or
), the PDs of
y coordinates equal to the divergent points and not contained by the parabola fitted to
F (to the left of the parabola if
, or to the right if
) were removed from the ANGPD cluster and assigned to a new N-ANGPD cluster. The hatched areas in
Figure 11c indicate the regions in which PDs were separated from the ANGPD cloud.
Figure 11d shows the separation of superposed horizontal and ANGPD clouds.
At this point, the N-ANGPD clouds superposed onto a single ANGPD cluster have been separated. The entire separation process (the calculation of matrix D, the loop of thresholds, and so on) was repeated for the PRPD’s other ANGPD clusters, one at a time. Finally, the pixel coordinates of all the separated clusters were mapped back to the PRPD’s original dimensions. The threshold values described in this subsection were defined empirically so as to separate most of the superposed patterns in the database.
Superposed tracking PDs could be separated in a way analogous to the case-2 superposition. Instead of the curves of the leftmost and rightmost non-zero pixels in each row, the procedure would be based on curves containing the y coordinates of the cluster’s uppermost and lowermost non-zero pixels at each column. However, because this type of superposition occurs rarely in the database, the separation of tracking clouds was not implemented in this study.