Thermal Defect Detection for Substation Equipment Based on Infrared Image Using Convolutional Neural Network

Wang, Kaixuan; Zhang, Jiaqiao; Ni, Hongjun; Ren, Fuji

doi:10.3390/electronics10161986

¹ School of Mechanical Engineering, Nantong University, Nantong 226019, China
² Graduate School of Advanced Technology and Science, Tokushima University, Tokushima 770-8506, Japan
* Author to whom correspondence should be addressed.

Electronics 2021, 10(16), 1986; https://doi.org/10.3390/electronics10161986

Submission received: 24 July 2021 / Revised: 12 August 2021 / Accepted: 16 August 2021 / Published: 18 August 2021

(This article belongs to the Special Issue Advances in Machine Condition Monitoring and Fault Diagnosis)

Abstract

Thermal defects in substation equipment have a great impact on the stability of power systems, and temperature is the key quantity for detecting them in infrared images. Traditional detection methods record the temperature shown in infrared images manually, which is inefficient and inaccurate. In this study, a thermal defect detection method based on infrared images using a convolutional neural network (CNN) is proposed. First, an improved pre-processing method is applied to reduce background information, and the region of interest is located according to contour and position information, which improves image quality. Then, the temperature values are segmented to establish a dataset (T-IR11) containing 11 labels. Finally, a CNN model is constructed to extract features, and a support vector machine is trained for classification. To verify the effectiveness of the proposed method, precision, recall, and F₁ score are adopted and 10-fold cross-validation is employed on the T-IR11 dataset. The results demonstrate that the accuracy of the proposed method is 99.50% and that it outperforms previous methods on infrared images. The proposed method realizes automatic temperature recognition, so that equipment with thermal defects can be recorded systematically, which has significant practical value for defect detection in substation equipment.

Keywords:

infrared image; substation equipment; thermal defect detection; adaptive binarization; character recognition; convolutional neural network

1. Introduction

Substation equipment is an important part of power transmission, and its safe operation directly affects the stability of the power system. Owing to long-term exposure to the natural environment, substation equipment is prone to thermal defects caused by corrosion, oxidation, and aging [1,2,3], which result in an abnormal temperature rise. In recent years, non-destructive testing technologies such as X-rays [4], ultrasound [5,6], photoacoustics [7], and eddy currents [8] have developed rapidly because they are fast and efficient. However, applying them to substation equipment poses potential safety hazards, which can lead to false and missed detections. Infrared thermography can quickly detect the state of a device based on the principle of thermal radiation [9,10,11], and it offers high sensitivity and immunity to electromagnetic interference. At present, to improve detection efficiency, infrared thermal imagers usually render a temperature map on the right side of the image and mark the maximum and minimum temperature values to make temperature matching convenient [12,13]. The temperature in an infrared image can be recorded manually or automatically. Traditional manual recording is inefficient and error-prone, whereas automatic recording suffers from low accuracy and poor stability because of the complex background and the unstable chromaticity caused by illumination [14,15]. Therefore, research on temperature value recognition algorithms supports the rapid screening of thermal defects by improving the accuracy and reliability of infrared image temperature recording. Once the temperature value can be recognized reliably, thermal defects can be identified effectively and the stable operation of the power system can be ensured.

Recognizing the temperature values in infrared images of substation equipment is essentially a character recognition problem. In recent years, with the rapid development of machine vision technology, character recognition has garnered increasing attention [16]. Previous research has mainly focused on character location, character segmentation, and character recognition. Han et al. [17] employed a method based on the generative adversarial network to read vehicle registration plates and achieved an overall accuracy of 98.72%. Huang et al. [18] proposed a license plate character recognition method based on the local histogram of oriented gradient (HOG) and layered feature fusion, covering image pre-processing, license location, character segmentation, and recognition. Husnain et al. [19] established a dataset of Urdu characters and achieved an accuracy of 98.16% using a one-dimensional long short-term memory method. Sun et al. [20] proposed a graph-matching-based method to recognize characters in historical Chinese seal images, which achieves better results, particularly when samples are limited. Naiemi et al. [21] proposed an enhanced HOG feature extraction method that uses a support vector machine (SVM) for classification. The method is robust to variations in character scale and translation and is computationally inexpensive.

Although extensive research has been conducted on character recognition, numerous deficiencies remain, particularly for infrared images. Compared with visible images, infrared images have low resolution, which makes it difficult to distinguish details clearly [22,23]. A growing body of research addresses recognizing and recording the temperature in infrared images. Lin et al. [24] adopted the PCANet deep learning network to reconstruct the temperature matrix. By recognizing the characters, the temperature range was determined, and the fault area was indicated by temperature. Although the accuracy reached 99.7% in training, it was only 92.61% in testing. Deeprasertkul et al. [25] developed a text image recognition program that identifies the date and time in images in order to collect and arrange radar images in an archive, which is significant for flood monitoring systems. However, these studies usually recognize the entire image without considering the position distribution of the characters in infrared images, resulting in low accuracy. When the background is close to, or overlaps, the temperature value, feature extraction becomes difficult and the recognition accuracy suffers. Therefore, location, segmentation, and recognition methods that account for character distribution and background in infrared images are a promising research direction. In addition, deep learning is now widely used in many fields, and combining such methods with deep learning models can be expected to benefit defect detection.

In this study, a thermal defect detection method for substation equipment based on the convolutional neural network (CNN) was designed for infrared images. First, according to the characteristics of infrared images, an improved adaptive binarization method based on the infrared image histogram was proposed to overcome the influence of the complex background. Second, based on contour and position information, a pixel accumulation method was adopted to locate the temperature value, and the region of interest (RoI) was obtained. The characters were then segmented using the vertical projection method, and the resulting images were normalized in size. In the recognition stage, a CNN model was constructed to extract the image features, and an SVM was trained instead of Softmax for classification. Because no dataset of infrared image temperature values was available, the T-IR11 dataset containing 11 labels was established for training and testing. To evaluate the performance of the proposed method, three indices (precision, recall, and F₁ score) were used, and 10-fold cross-validation was adopted on the dataset. Additionally, previous methods were compared with the proposed method to verify its effectiveness and accuracy. On this basis, substation equipment with thermal defects was identified according to the recognized temperature, which can improve the efficiency and accuracy of thermal defect detection.

The rest of the paper is arranged as follows: Section 2 introduces the proposed method for the infrared image of substation equipment. Section 3 presents the experiment and results. The discussion for the proposed method is illustrated in Section 4. Section 5 concludes this paper.

2. Proposed Methods

The block diagram of the proposed method is illustrated in Figure 1, which can be divided into three stages: image pre-processing, temperature segmentation, and temperature recognition.

2.1. Improved Image Pre-Processing Method

Infrared images of substation equipment usually contain complex backgrounds, such as trees, nests, high-voltage lines, and buildings, which are significantly influenced by light and environmental factors [26]. Image pre-processing usually includes gray transformation and binarization, which prepare the infrared image for recognition by removing irrelevant information [27,28,29]. By analyzing the infrared images, it can be found that green accounts for the smallest proportion of the color content, followed by red, whereas blue accounts for the largest; a weighted method is therefore used for the gray transformation. Because the resulting gray image has poor contrast and loses details, gamma correction is applied to normalize it [30,31,32], as expressed in Equation (1):

I_{b}(x, y) = c \times I(x, y)^{\gamma}, \quad (1)

where I_b(x,y) is the corrected image, I(x,y) is the original image, γ is the correction parameter, and c is the constant.
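As a minimal sketch of the gray transformation and Equation (1), the following pure-Python snippet works on a tiny image stored as nested lists. The channel weights (0.299, 0.587, 0.114), the value γ = 0.5, c = 1, and the scaling of intensities to [0, 1] before correction are all assumptions made for illustration; the text does not state the values used in the study.

```python
def to_gray(rgb_image, weights=(0.299, 0.587, 0.114)):
    """Weighted gray transformation of an image given as rows of (R, G, B) tuples.

    The default weights are the common luma weights, assumed here for illustration.
    """
    wr, wg, wb = weights
    return [[wr * r + wg * g + wb * b for (r, g, b) in row] for row in rgb_image]


def gamma_correct(gray_image, gamma=0.5, c=1.0, max_value=255.0):
    """Apply I_b(x, y) = c * I(x, y) ** gamma on intensities scaled to [0, 1]."""
    corrected = []
    for row in gray_image:
        new_row = []
        for value in row:
            normalized = value / max_value          # scale to [0, 1]
            out = c * normalized ** gamma           # Equation (1)
            out = min(max(out, 0.0), 1.0)           # clip to the valid range
            new_row.append(out * max_value)         # back to [0, max_value]
        corrected.append(new_row)
    return corrected


# Hypothetical 2 x 2 test image.
rgb = [[(0, 0, 0), (64, 64, 64)], [(128, 128, 128), (255, 255, 255)]]
gray = to_gray(rgb)
result = gamma_correct(gray, gamma=0.5)
```

With γ < 1 the mid-tones are brightened while black and white stay fixed, which is the contrast-stretching effect gamma correction is generally used for.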

Binarization is carried out on the basis of the histogram, which directly reflects the brightness of the image and represents its distribution characteristics [33]. To explain the difference between image types accurately and to eliminate accidental background influence, 100 infrared and visible images were randomly selected for the analysis. The statistical results of the histograms are shown in Figure 2. It can be observed from Figure 2a that the pixels of the visible image are evenly distributed; a peak appears at very low pixel values, but the trough is not obvious. Therefore, it is difficult to select an appropriate threshold to distinguish between the background and the RoI [34,35]. In contrast, the histogram of the infrared images has two large peaks and a clearly visible trough, as observed in Figure 2b. Taking the gray values on the two sides of the trough as thresholds gives the binarization results shown in Figure 3.

Figure 3 shows that when the threshold is 105, although the binarization effect on the temperature value is poor, the contour of the device is preserved. When the threshold is 210, the contour of the device is invisible, and the region of temperature value is well preserved. There are no salt and pepper noise points in the maximum and minimum areas. Therefore, an improved adaptive binarization method based on the infrared image histogram is proposed. The threshold is determined adaptively by the infrared image histogram after gray transformation and gamma correction, and then the ideal binarization results can be obtained.
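One plausible reading of a histogram-driven threshold is sketched below: smooth the gray-level histogram, find the two dominant peaks, and take the lowest bin between them. This is an illustrative valley-seeking rule, not the exact selection rule of the study (which the text does not spell out); the smoothing radius, the minimum peak separation, and the toy image are assumptions.

```python
def histogram(gray_image, bins=256):
    """Count how many pixels fall on each integer gray level."""
    counts = [0] * bins
    for row in gray_image:
        for value in row:
            counts[int(value)] += 1
    return counts


def smooth(counts, radius=2):
    """Moving average over a window of 2 * radius + 1 bins (shorter at the ends)."""
    n = len(counts)
    out = []
    for i in range(n):
        lo, hi = max(0, i - radius), min(n, i + radius + 1)
        out.append(sum(counts[lo:hi]) / (hi - lo))
    return out


def valley_threshold(counts, min_separation=30):
    """Gray level of the lowest smoothed bin between the two dominant peaks.

    If the valley is flat, the leftmost of its lowest bins is returned.
    """
    smoothed = smooth(counts)
    levels = range(len(smoothed))
    first = max(levels, key=lambda i: smoothed[i])
    candidates = [i for i in levels if abs(i - first) >= min_separation]
    second = max(candidates, key=lambda i: smoothed[i])
    left, right = sorted((first, second))
    return min(range(left, right + 1), key=lambda i: smoothed[i])


def binarize(gray_image, threshold):
    """Map pixels above the threshold to 1 and all others to 0."""
    return [[1 if value > threshold else 0 for value in row] for row in gray_image]


# Hypothetical image: a dark background cluster and a small bright cluster.
image = [[40] * 6 + [230] * 2, [42] * 6 + [228] * 2]
threshold = valley_threshold(histogram(image))
binary = binarize(image, threshold)
```

On a real infrared image the rule would be applied to the histogram obtained after gray transformation and gamma correction, and the separation parameter would need tuning.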

2.2. RoI Extraction Based on Contour Information

After pre-processing, the image of the substation equipment still contains irrelevant information, such as time and watermark. The processing speed, efficiency, and accuracy will be affected due to the image containing dense pixel information [36,37]. Therefore, it is necessary to locate and segment the RoI from the background. By analyzing the images, it can be found that the rectangular box of the temperature map in the binary image is completely preserved, and the position is relatively fixed with the maximum and minimum temperatures. Therefore, a pixel accumulation method based on the contour and position information for the location is proposed to obtain the RoI. The location flowchart is shown in Figure 4.

According to the number of column pixels of the image, continuous pixels are accumulated in the direction of the length of the rectangular box, which is selected to be equal to the number of columns with the continuous pixels. The short edge of the rectangular box was used as a reference to locate the pixel coordinates of the four corners. The region of the maximum and minimum temperatures is located according to the relative position relationship between the rectangular box and the temperature value.
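The column-wise accumulation can be sketched as follows, under the assumption that the long edges of the rectangular box show up as columns with a long unbroken vertical run of foreground pixels. The function names, the `min_height` parameter, and the toy image are hypothetical, and the step that maps the box to the maximum and minimum temperature regions is left out because it depends on the imager's layout.

```python
def longest_run(column):
    """Return (length, start_row) of the longest run of foreground pixels."""
    best_len, best_start = 0, 0
    run_len, run_start = 0, 0
    for row_index, pixel in enumerate(column):
        if pixel:
            if run_len == 0:
                run_start = row_index
            run_len += 1
            if run_len > best_len:
                best_len, best_start = run_len, run_start
        else:
            run_len = 0
    return best_len, best_start


def locate_box(binary_image, min_height):
    """Bounding box (top, bottom, left, right) of columns with long vertical runs."""
    height = len(binary_image)
    width = len(binary_image[0])
    hits = []
    for x in range(width):
        column = [binary_image[y][x] for y in range(height)]
        length, start = longest_run(column)
        if length >= min_height:
            hits.append((x, start, start + length - 1))
    if not hits:
        return None
    left = min(x for x, _, _ in hits)
    right = max(x for x, _, _ in hits)
    top = min(t for _, t, _ in hits)
    bottom = max(b for _, _, b in hits)
    return top, bottom, left, right


# Hypothetical 8 x 8 binary image with a hollow box in columns 5..7, rows 1..6.
image = [[0] * 8 for _ in range(8)]
for y in range(1, 7):
    image[y][5] = 1
    image[y][7] = 1
for x in range(5, 8):
    image[1][x] = 1
    image[6][x] = 1
box = locate_box(image, min_height=5)
```

The four corner coordinates follow directly from the returned tuple, and the temperature regions can then be cropped at fixed offsets relative to the box.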

To accurately segment the characters, the vertical projection method is used to project the RoI in the vertical direction, and is expressed in Equation (2) as follows:

V_{x} = \sum_{m=1}^{n} f(x, m), \quad 0 \leq x \leq I_{x}, \quad (2)

where V_x is the vertical projection of the image at column x, f(x,m) is the pixel value at column x and row m of the RoI, and n and I_x are the numbers of rows and columns in the RoI, respectively.

Figure 5 shows the vertical projection results for the RoI. The image was first scanned from left to right, and thereafter the pixel value of each column was accumulated [38]. By counting the number of pixels in each column, it can be observed that the cumulative value of pixels undergoes a sudden change at the junction of two characters, where the pixel cumulative value was the minimum value. In Figure 5, two peaks correspond to the boundary area of the characters, which shows that there are two characters in the region. Subsequently, the location of the characters was determined according to their characteristics. By selecting the sudden change as the segment point, the temperature values were segmented.
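A minimal sketch of Equation (2) and the resulting split is given below. It assumes a binary RoI in which characters are foreground (1) and treats columns whose projection does not exceed `gap_value` as gaps between characters; the parameter and the toy RoI are illustrative.

```python
def vertical_projection(roi):
    """V_x = sum over rows m of f(x, m), computed for every column x of the RoI."""
    height = len(roi)
    width = len(roi[0])
    return [sum(roi[m][x] for m in range(height)) for x in range(width)]


def segment_columns(projection, gap_value=0):
    """Return (start, end) column spans where the projection exceeds gap_value."""
    spans = []
    start = None
    for x, value in enumerate(projection):
        if value > gap_value and start is None:
            start = x                       # a character begins
        elif value <= gap_value and start is not None:
            spans.append((start, x - 1))    # the character ended at the previous column
            start = None
    if start is not None:
        spans.append((start, len(projection) - 1))
    return spans


# Hypothetical 3 x 6 RoI containing two characters separated by empty columns.
roi = [
    [1, 1, 0, 0, 1, 0],
    [1, 0, 0, 0, 1, 0],
    [1, 1, 0, 0, 1, 0],
]
projection = vertical_projection(roi)
spans = segment_columns(projection)
```

Each span can then be cropped from the RoI and resized to the common input size expected by the recognizer.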

2.3. Recognition Based on CNN

In the recognition stage, a CNN is designed according to the characteristics of the temperature values, as shown in Figure 6. C1 and C2 are convolution layers, P1 and P2 are pooling layers, and FC is the fully connected layer.

Both convolution layers use a 5 × 5 kernel with a stride of 1 to extract features. Max-pooling is adopted in the pooling layer P1, with a kernel size of 1 × 1 and a stride of 1. In the pooling layer P2, a 2 × 2 kernel with a stride of 2 is used to further extract the image features, and the resulting feature map has a size of 4 × 4 × 12. Finally, the feature map is flattened to a size of 1 × 192 in the fully connected layer (FC). To improve recognition accuracy, an SVM, which offers nonlinear mapping and good generalization, is employed instead of Softmax as the classifier on the FC features. The temperature images obtained by pre-processing are used for training, and the network weights are updated by the adaptive moment estimation (Adam) method. In use, the collected infrared images of substation equipment are input into the trained CNN model to recognize the temperature values automatically, and the abnormal infrared images with thermal defects are selected according to the temperature.
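The layer sizes can be checked with a short shape calculation. The sketch assumes convolutions and pooling without padding and a 16 × 16 input; the input size is not stated in the text and is inferred here only because, with the kernel sizes and strides given, it is the size that yields the stated 4 × 4 × 12 feature map.

```python
def conv_output(size, kernel, stride=1):
    """Spatial size after an unpadded convolution or pooling window."""
    return (size - kernel) // stride + 1


def trace_shapes(input_size, c2_channels=12):
    """Follow the spatial size through C1, P1, C2, P2 and the flattened FC input."""
    c1 = conv_output(input_size, kernel=5, stride=1)  # C1: 5 x 5 kernel, stride 1
    p1 = conv_output(c1, kernel=1, stride=1)          # P1: 1 x 1 kernel, stride 1
    c2 = conv_output(p1, kernel=5, stride=1)          # C2: 5 x 5 kernel, stride 1
    p2 = conv_output(c2, kernel=2, stride=2)          # P2: 2 x 2 kernel, stride 2
    return {"C1": c1, "P1": p1, "C2": c2, "P2": p2, "FC": p2 * p2 * c2_channels}


# 16 is an assumed input size (see the note above), not a value from the study.
shapes = trace_shapes(16)
```

Under these assumptions the sizes run 16 → 12 → 12 → 8 → 4, and 4 × 4 × 12 = 192 matches the 1 × 192 vector passed to the classifier.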

2.4. Process of the Proposed Method

Based on the above algorithms, the process of the proposed method is displayed in Figure 7.

(1): Image acquisition. The infrared images of substation equipment are acquired by the infrared thermal imager, including insulator, high voltage bushing, transfer switch, etc.
(2): Image pre-processing. Gray transformation and gamma correction are firstly carried out, and the improved adaptive binarization method is used to remove the complex background.
(3): Image segmentation. The RoI is located based on contour and relative position information and then segmented. Then, the temperature values are carefully separated by the vertical projection method.
(4): Dataset establishment. The dataset of temperature values is established with 11 labels after location and segmentation and is divided into a training dataset and test dataset.
(5): Temperature values recognition. The CNN model is constructed to extract features of temperature values, and the SVM is used for classification. The test images are recognized by the trained CNN model, and the temperature values are recognized automatically to select the images with abnormal temperatures.
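The final screening step can be sketched as follows: the recognized character labels are joined into a number and compared with a limit. The function names and the limit of 80 are hypothetical; the text does not state which temperature marks a defect.

```python
def labels_to_temperature(labels):
    """Join recognized character labels such as ['-', '5'] into an integer."""
    text = "".join(labels)
    # A minus sign is only valid as the first character of a non-empty number.
    if not text or text == "-" or "-" in text[1:]:
        raise ValueError(f"not a valid temperature reading: {text!r}")
    return int(text)


def is_thermal_defect(max_labels, limit=80):
    """Flag an image whose recognized maximum temperature exceeds the limit."""
    return labels_to_temperature(max_labels) > limit


# Hypothetical recognizer outputs for the maximum-temperature region.
flag_hot = is_thermal_defect(["8", "1"])
flag_normal = is_thermal_defect(["3", "8"])
```

Rejecting malformed label sequences before conversion keeps a segmentation error from being silently recorded as a valid temperature.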

3. Experiment and Results

The hardware configuration is an Intel(R) Core(TM) i5-10400F @ 2.90 GHz with 16.0 GB of RAM and an NVIDIA GTX 2060, and the software is MATLAB 2019b.

3.1. T-IR11 Dataset

The experimental images are collected by FLIR infrared thermal imager (T420, FLIR Systems, Inc., Wilsonville, OR, USA) from the substation equipment in the Jiangsu area. The measuring equipment is shown in Figure 8a. The infrared images have a resolution of 320 × 240 pixels and consist of six types of substation equipment, including insulators, bushings, transfer switches, lightning arresters, circuit breakers, and transformers. Some of the images are shown in Figure 8b.

Based on 600 infrared images, a temperature value dataset with 2200 images was established, which is evenly distributed to 11 labels such as “0–9” and “-”, as shown in Figure 8c. According to the ratio of 8:2, the dataset was divided into training and testing sets, containing 1760 and 440 images, respectively. Four hundred images were used to test the proposed temperature value recognition method.
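The split arithmetic can be reproduced with a small per-label (stratified) split. Whether the study split each label separately is not stated, so the stratification, the random seed, and the sample names are assumptions for illustration.

```python
import random


def stratified_split(samples_by_label, train_ratio=0.8, seed=0):
    """Shuffle each label's samples and split them at the given ratio."""
    rng = random.Random(seed)
    train, test = [], []
    for label, samples in samples_by_label.items():
        shuffled = list(samples)
        rng.shuffle(shuffled)
        cut = round(len(shuffled) * train_ratio)
        train.extend((label, s) for s in shuffled[:cut])
        test.extend((label, s) for s in shuffled[cut:])
    return train, test


# 11 labels ("0" to "9" and "-") with 200 hypothetical sample names each.
labels = [str(d) for d in range(10)] + ["-"]
dataset = {label: [f"{label}_{i:03d}" for i in range(200)] for label in labels}
train_set, test_set = stratified_split(dataset)
```

With 2200 images spread evenly over 11 labels, each label contributes 160 training and 40 testing images, giving the 1760 and 440 totals quoted above.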

3.2. Evaluation Method

Precision, recall, and F₁ score were adopted to evaluate the performance of the proposed method. The recognition results can be divided into true positives (TP), false positives (FP), true negatives (TN), and false negatives (FN) [39,40]. Precision is the percentage of correctly predicted positive samples among all samples predicted as positive. Recall is the percentage of correctly predicted positive samples among all actual positive samples. The F₁ score considers both precision and recall, which makes it a good comprehensive evaluation index. Based on the confusion matrix of the experiment, the three indices are expressed as follows:

\mathrm{Precision} = \frac{TP}{TP + FP}, \quad (3)

\mathrm{Recall} = \frac{TP}{TP + FN}, \quad (4)

F_{1} = \frac{2 \times \mathrm{Precision} \times \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}}. \quad (5)
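Equations (3) to (5) can be evaluated per label from a multi-class confusion matrix, as in the sketch below. The three-label matrix uses illustrative counts only and is not a copy of any table in this paper; rows are taken as true labels and columns as predicted labels, which is an assumed convention.

```python
def per_class_metrics(confusion, labels):
    """Return {label: (precision, recall, f1)}.

    confusion[i][j] is the number of samples with true label i predicted as label j.
    """
    metrics = {}
    n = len(labels)
    for k, label in enumerate(labels):
        tp = confusion[k][k]
        fp = sum(confusion[i][k] for i in range(n)) - tp   # predicted k, truly other
        fn = sum(confusion[k][j] for j in range(n)) - tp   # truly k, predicted other
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            f1 = 2 * precision * recall / (precision + recall)
        else:
            f1 = 0.0
        metrics[label] = (precision, recall, f1)
    return metrics


# Illustrative counts: one "3" predicted as "2" and one "8" predicted as "3".
labels = ["2", "3", "8"]
confusion = [
    [40, 0, 0],
    [1, 39, 0],
    [0, 1, 39],
]
metrics = per_class_metrics(confusion, labels)
```

In this toy case label "3" has one false positive and one false negative out of 40 samples, so its precision, recall, and F₁ score all equal 39/40 = 0.975.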

3.3. Training Process

The parameters of the CNN model are set as follows: the learning rate is 0.5, the decay rate is 0.99, the loss function coefficient is 0.01, the batch size is 55, and the number of iterations is 1600. In addition, the parameters of the SVM are set as follows: the kernel function is the radial basis function (RBF), the kernel parameter is 0.09, and the penalty parameter is 1. The loss and accuracy curves are shown in Figure 9. It can be observed that the loss decreases rapidly during the first 300 iterations and then converges gradually until 600 iterations. Thereafter, the loss tends to zero until the end of training. The recognition accuracy increases gradually after 600 iterations and then rises sharply to 99.55% after 650 iterations.
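For reference, the RBF kernel in its common form K(x, y) = exp(−γ‖x − y‖²) is sketched below. Reading the stated kernel parameter 0.09 as γ is an assumption; some toolboxes parameterize the same kernel by a width σ instead, and the text does not say which convention applies.

```python
import math


def rbf_kernel(x, y, gamma=0.09):
    """RBF kernel exp(-gamma * ||x - y||^2) for two equal-length feature vectors."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


# Hypothetical two-dimensional feature vectors.
same = rbf_kernel([1.0, 2.0], [1.0, 2.0])
near = rbf_kernel([0.0, 0.0], [1.0, 0.0])
far = rbf_kernel([0.0, 0.0], [5.0, 0.0])
```

The kernel equals 1 for identical vectors and decays toward 0 as they move apart, which is what lets the SVM draw nonlinear boundaries over the 192-dimensional FC features.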

3.4. Experiment Results

The infrared images were firstly pre-processed. The results of the gray transformation and gamma correction are shown in Figure 10a,b. Moreover, as shown in Figure 10c, the threshold 218 is selected adaptively according to the histogram of the infrared image. The binarized image is shown in Figure 10d, and the temperature value region is independent of the background color, watermark, and brightness.

The segmentation results are shown in Figure 11. The rectangular box and temperature values “81” and “38” are accurately located based on contour and relative position information, as shown in Figure 11a. Figure 11b shows the RoI is clearly extracted from the background without salt and pepper noise points. In Figure 11c, the characters are effectively segmented by utilizing the peak feature of vertical projection, and the characters “8” and “1” are cropped to a uniform size.

After training, the overall recognition accuracy and the accuracy for each temperature value are shown in Table 1. It can be observed that 438 temperature value images are correctly recognized, and the overall accuracy is 99.55%. Only two images, one of "3" and one of "8", are recognized incorrectly, and the recognition accuracy of the other labels is 100%.

Four hundred images were tested to further verify the performance of the proposed method on infrared images. Figure 12a shows the field data acquisition. In addition, a thermal defect detection system for substation equipment was designed to apply the proposed method, as shown in Figure 12b. It is divided into three modules: (1) image loading; (2) image recognition; and (3) defect detection.

Finally, 398 images are correctly recognized with an accuracy of 99.50%, which is consistent with the test accuracy. The experimental results show that the proposed method has good generalization capability, and the temperature value region can be accurately extracted in the infrared image.
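The two reported accuracies follow directly from the counts given in this section, as the short calculation below shows; the subscripts are labels introduced here for the 440-image testing set and the 400-image verification run.

```latex
\mathrm{Accuracy}_{\text{test}} = \frac{438}{440} \approx 99.55\%,
\qquad
\mathrm{Accuracy}_{\text{verify}} = \frac{398}{400} = 99.50\%.
```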

4. Discussion

Presently, there are several binarization methods including the 2-model, p-quantile, Otsu, and maximum entropy thresholding methods [41]. The results of binarization through these methods are shown in Figure 13. It can be observed that the previous binarization methods weaken the foreground information and the binarization effect is not ideal. In addition, there are numerous salt and pepper noise points in the temperature region that affect the accuracy of recognition. This may be due to the difference in color composition between infrared and visible images, which makes some traditional methods unsuitable for processing infrared images.

To better analyze the recognition of labels in the testing set, a confusion matrix was produced. Table 2 shows that the misrecognized label "8" is confused with "3", and that label "3" is recognized as "2" in the test. The main reason is that the right sides of "3" and "8" are similar, whereas their left sides differ. In image segmentation, however, the boundary between the pixel peaks is not always distinct, so the left side of "8" is easily cut off when the character is segmented. This leads to confusion between "8" and "3", and the confusion between "3" and "2" arises in a similar way.

Table 3 shows the precision, recall, F₁ score, and 10-fold cross-validation results of the proposed method. As can be seen, the precision and recall of label "3" are the lowest, both 97.50%, because it is easily confused with "8" and "2". The F₁ scores are close to 1 and vary little across labels. The three indices demonstrate that the proposed method is suitable for temperature recognition.
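The fold bookkeeping behind 10-fold cross-validation can be sketched as below. Applying it to all 2200 dataset images and using contiguous, unshuffled folds are assumptions for illustration; in practice the indices would normally be shuffled first.

```python
def k_fold_indices(n_samples, k=10):
    """Yield (train_indices, validation_indices) for k contiguous folds."""
    # Distribute any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, validation
        start += size


folds = list(k_fold_indices(2200, k=10))
```

Each image serves as validation data exactly once, so the averaged score does not depend on a single train/test partition.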

Comparison experiments were conducted to verify the effectiveness of the proposed method. Figure 14 shows that the recognition accuracy obtained with the trained SVM in place of Softmax is better than that of the CNN alone. In addition, the training loss of the CNN+SVM model is smaller than that of the CNN throughout the whole process, while its test accuracy is higher at both the beginning and the end. The training process of the CNN+SVM model is more stable than that of the traditional CNN model and converges rapidly, which saves training time.

Table 4 compares the classic HOG+SVM, PCANet, the traditional CNN, and the proposed method. According to the experimental results, the training accuracies of the proposed method and the PCANet method are both higher than 99%. However, the testing accuracy of the proposed method reaches 99.50%, which is superior to that of the other methods. This shows that although PCANet achieves high training accuracy on images with low background interference, it is prone to overfitting during training and, when applied to infrared images, is more susceptible to interference from complex background information. The classic HOG+SVM method achieves the lowest training and testing accuracy because it is sensitive to noise and is not robust to scale variations. Compared with the traditional CNN model, replacing the Softmax layer with a trained SVM yields more accurate classification in both training and testing. The proposed method is therefore more effective, and its generalization capability is significantly improved.

5. Conclusions

This paper presents an efficient thermal defect detection method based on infrared images using a CNN. To overcome the problem of a complex background, an improved pre-processing method is proposed based on infrared image characteristics. In the segmentation stage, RoI is effectively extracted according to contour and position information. Then, the T-IR11 dataset of temperature values is established. Finally, a CNN+SVM model is constructed to extract features and trained for classification, thus realizing the thermal defect detection for substation equipment. The conclusions can be drawn as follows:

(1): Compared with other binarization methods, the proposed improved pre-processing method can accurately remove irrelevant information and retain effective regions by selecting the appropriate threshold adaptively. In addition, combined with contour information, the temperature values can be accurately located and segmented, solving the problem of temperatures overlapping the background.
(2): The T-IR11 dataset established in this study is crucial for thermal defect detection. The dataset, which contains 11 labels, is extracted from infrared images collected in the actual environment and provides the foundation for subsequent defect detection work.
(3): The CNN model is constructed to extract features, and the trained SVM is used to replace the Softmax layer for classification. Precision, recall, and F₁ score indices are used to evaluate the performance of the proposed method, and 10-fold cross-validation is employed on the dataset. The accuracy of the proposed method is 99.50%, which is the highest among the compared methods on infrared images.
(4): The proposed method realizes the rapid screening and recording of thermal defect images. It is beneficial for reducing the labor intensity of power grid inspectors and improving work efficiency. In the future, the speed of recognition needs to be further improved to realize real-time recognition and automatic recording. Moreover, the training samples will be augmented to improve the accuracy of the proposed method for engineering applications.

6. Patents

The authors of this paper have carried out research on infrared image recognition for many years. Three China invention patents have been published; the patent numbers are CN113052865A, CN111612784A, CN111627018A.

Author Contributions

Conceptualization, H.N. and K.W.; methodology, K.W. and F.R.; software, K.W.; validation, K.W. and J.Z.; formal analysis, K.W. and F.R.; investigation, J.Z.; resources, H.N.; data curation, J.Z.; writing—original draft preparation, K.W.; writing—review and editing, J.Z.; visualization, F.R.; supervision, F.R.; project administration, H.N.; funding acquisition, H.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions, grant number PAPD; Postgraduate Research & Practice Innovation Program of Jiangsu Province, grant number KYCX21_3078; The Research Clusters program of Tokushima University, grant number 2003002.

Conflicts of Interest

The authors declare no conflict of interest.

References

[1] Zhao, Z.; Liu, N.; Wang, L. Localization of multiple insulators by orientation angle detection and binary shape prior knowledge. IEEE Trans. Dielectr. Electr. Insul. 2015, 22, 3421–3428.
[2] Tao, X.; Zhang, D.; Wang, Z.; Liu, X.; Zhang, H.; Xu, D. Detection of power line insulator defects using aerial images analyzed with convolutional neural networks. IEEE Trans. Syst. Man Cybern. Syst. 2020, 50, 1486–1498.
[3] Lin, T.; Liu, X. An intelligent recognition system for insulator string defects based on dimension correction and optimized faster R-CNN. Electr. Eng. 2021, 103, 541–549.
[4] Akcay, S.; Kundegorski, M.E.; Willcocks, C.G.; Breckon, T.P. Using deep convolutional neural network architectures for object classification and detection within X-Ray baggage security imagery. IEEE Trans. Inf. Forensic Secur. 2018, 13, 2203–2215.
[5] Arbaoui, A.; Ouahabi, A.; Jacques, S.; Hamiane, M. Concrete cracks detection and monitoring using deep learning-based multiresolution analysis. Electronics 2021, 10, 1772.
[6] Ouahabi, A.; Taleb, A. Deep learning for real-time semantic segmentation: Application in ultrasound imaging. Pattern Recognit. Lett. 2021, 144, 27–34.
[7] Allman, D.; Reiter, A.; Bell, M. Photoacoustic source detection and reflection artifact removal enabled by deep learning. IEEE Trans. Med. Imaging 2018, 37, 1464–1477.
[8] Li, S.; Anees, A.; Zhong, Y.; Yang, Z.; Liu, Y.; Goh, R.; Liu, E. Crack profile reconstruction from eddy current signals with an encoder-decoder convolutional neural network. In Proceedings of the 2019 IEEE Asia-Pacific Microwave Conference (APMC), Singapore, 10–13 December 2019; pp. 96–98.
[9] Xu, Q.; Huang, H.; Zhou, C.; Zhang, X. Research on real-time infrared image fault detection of substation high-voltage lead connectors based on improved YOLOv3 network. Electronics 2021, 10, 544.
[10] Chen, J.; Liu, Z.; Wang, H.; Nunez, A.; Han, Z. Automatic defect detection of fasteners on the catenary support device using deep convolutional neural network. IEEE Trans. Instrum. Meas. 2018, 67, 257–269.
[11] Glowacz, A. Ventilation Diagnosis of Angle Grinder Using Thermal Imaging. Sensors 2021, 21, 2853.
[12] Jalil, B.; Leone, G.R.; Martinelli, M.; Moroni, D.; Pascali, M.A.; Berton, A. Fault detection in substation equipment via an unmanned aerial system using multi modal data. Sensors 2019, 19, 3014.
[13] Tan, P.; Li, X.; Xu, J.; Ma, J.; Wang, F.; Ding, J.; Fang, Y.; Ning, Y. Catenary insulator defect detection based on contour features and gray similarity matching. J. Zhejiang Univ. Sci. A 2020, 21, 64–73.
[14] Liu, S.; Piao, Y.; Tahir, M. Research on fusion technology based on low-light visible image and infrared image. Opt. Eng. 2016, 55, 123404.
[15] Arafat, S.Y.; Iqbal, M.J. Urdu-text detection and recognition in natural scene images using deep learning. IEEE Access 2020, 8, 96787–96803.
[16] Liu, X.; Yang, T.; Li, J. Real-Time ground vehicle detection in aerial infrared imagery based on convolutional neural network. Electronics 2018, 7, 78.
[17] Han, B.; Lee, J.T.; Lim, K.; Choi, D. License plate image generation using generative adversarial networks for end-to-end license plate character recognition from a small set of real images. Appl. Sci. 2020, 10, 2780.
[18] Huang, J. Research on license plate image segmentation and intelligent character recognition. Pattern Recognit. Artif. Intell. 2020, 34, 2050014.
[19] Husnain, M.; Missen, M.M.S.; Mumtaz, S.; Jhanidr, M.Z.; Coustaty, M.; Luqman, M.M.; Ogier, J.; Choi, G.S. Recognition of Urdu handwritten characters using convolutional neural network. Appl. Sci. 2019, 9, 2758.
[20] Sun, B.; Hua, S.; Li, S.; Sun, J. Graph-matching-based character recognition for Chinese seal images. Sci. China Inf. Sci. 2019, 62, 192102.
[21] Naiemi, F.; Ghods, V.; Khalesi, H. An efficient character recognition method using enhanced HOG for spam image detection. Soft Comput. 2019, 23, 11759–11774.
[22] Wenshan, D.; Duyan, B.; Linyuan, H.; Zunlin, F. Contrast-enhanced fusion of infrared and visible images. Opt. Eng. 2018, 57, 093111.
[23] Zuo, Y.; Liu, J.; Yang, M.; Wang, X.; Sun, M. Algorithm for unmanned aerial vehicle aerial different-source image matching. Opt. Eng. 2016, 55, 123111.
[24] Lin, Y.; Qin, J.; Zhang, W.; Zhang, H.; Bai, D.; Xu, R. PCANet based digital recognition for electrical equipment infrared images. J. Phys. Conf. Ser. 2018, 1098, 012033.
[25] Deeprasertkul, P.; Praikan, W. An application of numbers and characters recognition and classification on radar images using for flood monitoring. In Proceedings of the 2018 3rd International Conference on Computer and Communication Systems (ICCCS), Nanjing, China, 27–30 April 2018; pp. 22–226.
[26] Gan, W.; Wu, X.; Wu, W.; Yang, X.; Ren, C.; He, X.; Liu, K. Infrared and visible image fusion with the use of multi-scale edge-preserving decomposition and guided image filter. Infrared Phys. Technol. 2015, 7, 37–51.
[27] Du, W.; Shen, H.; Fu, J.; Zhang, H.; He, Q. Approaches for improvement of the X-ray image defect detection of automobile casting aluminum parts based on deep learning. NDT E Int. 2019, 107, 102144.
[28] Ren, F.; Liu, W.; Wu, G. Feature reuse residual networks for insect pest recognition. IEEE Access 2019, 7, 122758–122768.
[29] Kong, W.; Zhang, L.; Lei, Y. Novel fusion method for visible light and infrared images based on NSST-SF-PCNN. Infrared Phys. Technol. 2014, 65, 103–112.
Redouan, L.; Mohamed, E.A.; Ayoub, E. A new thermal infrared and visible spectrum images-based pedestrian detection system. Multimed. Tools Appl. 2019, 78, 15861–15885. [Google Scholar]
Liu, G.; Mao, S.; Kim, J.H. A mature-tomato detection algorithm using machine learning and color analysis. Sensors 2019, 19, 2023. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Cao, X.; Li, T.; Li, H. A robust parameter-free thresholding method for image segmentation. IEEE Access 2019, 7, 3448–3458. [Google Scholar] [CrossRef] [PubMed]
Michalak, H.; Okarma, K. Improvement of image binarization methods using image preprocessing with local entropy filtering for alphanumerical character recognition purposes. Entropy 2019, 21, 562. [Google Scholar] [CrossRef] [Green Version]
Jyothish, V.R.; Bindu, V.R.; Greeshma, M.S. An efficient image segmentation approach using superpixels with colorization. Procedia Comput. Sci. 2020, 171, 837–846. [Google Scholar]
He, Y.; Song, K.; Meng, Q.; Yan, Y. An end-to-end steel surface defect detection approach via fusing multiple hierarchical features. IEEE Trans. Instrum. Meas. 2020, 69, 1493–1504. [Google Scholar] [CrossRef]
Liang, H.; Zuo, C.; Wei, W. Detection and evaluation method of transmission line defects based on deep learning. IEEE Access 2020, 8, 38448–38458. [Google Scholar] [CrossRef]
Li, X.; Yang, X.; Chen, S.; Qi, N.; Huang, Y. Intensity image quality assessment based on multiscale gradient magnitude similarity deviation. Opt. Eng. 2020, 59, 103101. [Google Scholar] [CrossRef]
Wei, B.; Zuo, Y.; Liu, Y.; Luo, W.; Wen, K.; Deng, F. Novel MOA fault detection technology based on small sample infrared image. Electronics 2021, 10, 1748. [Google Scholar] [CrossRef]
Zhang, J.; Kang, X.; Ni, H.; Ren, F. Surface defect detection of steel strips based on classification priority YOLOv3-dense network. Ironmak. Steelmak. 2020, 48, 547–558. [Google Scholar] [CrossRef]
Ni, H.; Wang, K.; Lv, S.; Wang, X.; Zhang, J.; Zhuo, L.; Li, F. Effects of modified anodes on the performance and microbial community of microbial fuel cells using swine wastewater. Energies 2020, 13, 3980. [Google Scholar] [CrossRef]
Huang, H.; Lin, B.; Feng, L.; Lv, H. Hybrid indoor localization scheme with image sensor-based visible light positioning and pedestrian dead reckoning. Appl. Opt. 2019, 58, 3214–3221. [Google Scholar] [CrossRef] [PubMed]
Dalal, N.; Triggs, B. Histograms of oriented gradients for human detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), San Diego, CA, USA, 20–25 July 2005; pp. 886–893. [Google Scholar]
LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Block diagram of the proposed method.

Figure 2. Histogram statistics. (a) Histogram of visible images; (b) histogram of infrared images.

Figure 3. Binarization results for different thresholds. (a) Threshold of 105; (b) threshold of 210.

Figure 4. Flowchart of temperature value location.

Figure 5. Vertical projection results of RoI.

Figure 6. The structure of the CNN.

Figure 7. The process of the proposed method.

Figure 8. Establishment of the dataset. (a) Image acquisition; (b) infrared images for different types of substation equipment; (c) dataset of temperature values.

Figure 9. Training loss and accuracy curves.

Figure 10. Results of the pre-processing stage. (a) Gray transformation; (b) gamma correction; (c) infrared image histogram; (d) binarization.

Figure 11. Results of image segmentation. (a) Location of the rectangular box and temperature value; (b) RoI extraction; (c) character segmentation of “8”; (d) character segmentation of “1”.

Figure 12. Application of the proposed method. (a) Field data acquisition; (b) system interface.

Figure 13. Comparison of binarization methods. (a) 2-model; (b) P-quantile; (c) Otsu; (d) maximum entropy thresholding.

Figure 14. Comparison of the CNN and CNN+SVM models.

Table 1. Distribution and recognition of labels.

Label	-	0	1	2	3	4	5	6	7	8	9
Test number	40	40	40	40	40	40	40	40	40	40	40
Correct number	40	40	40	40	39	40	40	40	40	39	40
Accuracy (%)	100	100	100	100	97.5	100	100	100	100	97.5	100

Table 2. Confusion matrix of labels (rows: true label; columns: predicted label, as implied by the precision values in Table 3).

Label	-	0	1	2	3	4	5	6	7	8	9
-	40	0	0	0	0	0	0	0	0	0	0
0	0	40	0	0	0	0	0	0	0	0	0
1	0	0	40	0	0	0	0	0	0	0	0
2	0	0	0	40	0	0	0	0	0	0	0
3	0	0	0	1	39	0	0	0	0	0	0
4	0	0	0	0	0	40	0	0	0	0	0
5	0	0	0	0	0	0	40	0	0	0	0
6	0	0	0	0	0	0	0	40	0	0	0
7	0	0	0	0	0	0	0	0	40	0	0
8	0	0	0	0	1	0	0	0	0	39	0
9	0	0	0	0	0	0	0	0	0	0	40

Table 3. Performance of labels.

Label	Precision (%)	Recall (%)	F₁	10-Fold Cross-Validation (%)
-	100	100	1.000	100
0	100	100	1.000	100
1	100	100	1.000	100
2	97.56	100	0.988	99.25
3	97.50	97.50	0.975	98.25
4	100	100	1.000	100
5	100	100	1.000	100
6	100	100	1.000	100
7	100	100	1.000	100
8	100	97.50	0.987	98.75
9	100	100	1.000	100
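
The per-label precision, recall, and F₁ values in Table 3 can be reproduced from the counts in Table 2. The following is a minimal sketch, not the authors' code: it assumes that the rows of Table 2 are true labels and the columns are predicted labels (the orientation that yields the 97.56% precision reported for label 2), and the names `cm` and `per_label_metrics` are introduced here for illustration only.

```python
# Labels in the order used by Tables 1-3.
labels = ["-", "0", "1", "2", "3", "4", "5", "6", "7", "8", "9"]
n = len(labels)

# Confusion matrix from Table 2.
# Assumption: rows are true labels, columns are predicted labels.
cm = [[40 if i == j else 0 for j in range(n)] for i in range(n)]
cm[labels.index("3")][labels.index("3")] = 39
cm[labels.index("3")][labels.index("2")] = 1  # one "3" recognized as "2"
cm[labels.index("8")][labels.index("8")] = 39
cm[labels.index("8")][labels.index("3")] = 1  # one "8" recognized as "3"


def per_label_metrics(matrix):
    """Return a list of (precision, recall, f1) tuples, one per label."""
    size = len(matrix)
    results = []
    for k in range(size):
        tp = matrix[k][k]
        fn = sum(matrix[k]) - tp                            # rest of row k
        fp = sum(matrix[i][k] for i in range(size)) - tp    # rest of column k
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        results.append((precision, recall, f1))
    return results


metrics = per_label_metrics(cm)
for name, (p, r, f1) in zip(labels, metrics):
    print(f"{name}\t{100 * p:.2f}\t{100 * r:.2f}\t{f1:.3f}")
```

Under this assumption, the printed precision, recall, and F₁ columns agree with Table 3; the 10-fold cross-validation column cannot be derived from Table 2 and is not computed here.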

Table 4. Comparison results of different models.

Models	Training Accuracy (%)	Testing Accuracy (%)
PCANet [24]	99.7	92.61
Classic HOG+SVM [42]	97.2	92
CNN [43]	97.50	98.25
Our method	99.55	99.50

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, K.; Zhang, J.; Ni, H.; Ren, F. Thermal Defect Detection for Substation Equipment Based on Infrared Image Using Convolutional Neural Network. Electronics 2021, 10, 1986. https://doi.org/10.3390/electronics10161986


