1. Introduction
The concept of the Internet of Things is proposed, and the era of the Internet of Everything is coming. One of the keys to building the power Internet of Things is to realize online monitoring and analysis and evaluation of the operating status of various power equipment. Among various monitoring technologies, infrared monitoring technology has the characteristics of being long distance and non-contact and having high levels of accuracy and speed [
1,
2]. The extensive and effective installation of infrared sensors will be one of the key issues that need to be solved in the construction of the power Internet of Things. However, due to the limitation of equipment installation cost, data transmission, and storage capacity, it is obviously difficult to achieve the accuracy of mainstream infrared imagers at this stage using the online monitoring infrared sensor that can be installed on a large scale. Therefore, it is necessary to process infrared images collected by low-precision infrared sensors through background algorithms to enhance their visual effects and enrich their connotative information. The super-resolution technology that has emerged in recent years provides new ideas for solving this problem.
Super-resolution (SR) aims to reconstruct a high-quality image
from its degraded measurement
[
3]. SR is a typical ill-posed inverse problem, and it can be generally modelled as
where
is a blur kernel,
denotes the convolution operator, and
the down-sampling operator. According to the number of input images, SR technology is divided into single-image super-resolution (SISR) technology and multi-frame image super-resolution (MISR) technology. Due to data storage, transmission pressure and other issues, the current power industry does not have the conditions to adopt MISR technology, so we focus on the SISR technology.
The SISR method can be divided into three categories according to the principle. The first type is interpolation methods, which are often simple and easy to implement, but usually excessively smooth the high-frequency details of the image, resulting in poor visual quality of the reconstructed image [
4,
5,
6]. The second type of method is a learning-based method, which learns the correspondence between low-resolution (LR) and high-resolution (HR) image blocks from a given training sample [
7,
8,
9,
10]. This results in the effectiveness of the algorithm highly dependent on the selection of training samples. Additionally, when the application conditions change, such as the magnification and degradation information, the model needs to be retrained again which is often accompanied by high computational costs [
8]. Therefore, we focus on the third type of methods, namely, reconstruction-based methods. This type of method constructs the model based on the principle of image degradation, and realizes the SR reconstruction of the image by combining prior information in the Bayesian framework or introducing regularization in its inverse problem [
11,
12,
13,
14,
15,
16,
17]. Such methods are not limited by samples, are flexible in application and have good reconstruction effects, which are easy to be widely applied in the power grid.
In addition, the SISR method can also be divided into three categories according to the different problems it solves. The first type of method ignores the general blurring in the LR image formation process. These methods consider the LR image to be absolutely clear and only improve its resolution [
4,
8,
11,
12]. However, the research of Efrat et al. [
18] shows that the influence of blur kernel on the SISR problem is even greater than the influence of the selected SR model. The second type of method is the non-blind SR method, which does not study the solution of the blur kernel, but only focuses on how to reconstruct the HR image from the LR image when the blur kernel is known [
13,
14,
15,
16,
17]. For example, Glasner et al. [
13] used image self-repeatability to reconstruct HR images. Šroubek et al. [
14] proved that the degenerate operator can be implemented in the frequency domain and designed a fast-solving algorithm based on this. Dong et al. [
15] proposed the concept of sparse coding noise and achieved the goal of image restoration by suppressing sparse coding noise. The third type of method is the blind SR method, which simultaneously solves the problem of blur kernel estimation and HR image reconstruction. However, the joint restoration of blur kernel and HR image is usually difficult, and it is easy to produce sub-optimal reconstruction results [
19]. Therefore, there are few studies on blind SR methods [
20,
21,
22,
23]. Shao et al. [
20] proposed a non-parametric blind SR method based on an adaptive heavy-tail prior. Qian et al. [
21] proposed a blind SR restoration method based on frame-by-frame non-parametric blur estimation. Kim et al. [
22] proposed a single-image blind SR method with low computational complexity, and Michaeli et al. [
23] proposed a blind SR method based on the self-similarity of the spatial structure of image blocks.
In order to improve the quality of SR reconstructed images, to meet the actual needs of the power industry, we propose a blind SR method. Since the joint restoration of the blur kernel and HR image may produce sub-optimal reconstruction results, we chose to estimate the blur kernel first, and then reconstruct the LR infrared image through the non-blind SR method. For the blur kernel estimation, we improved the basic SR model of compressed sensing and introduced the image Extreme Channels Prior to the model; thus, we propose an LR image blur kernel estimation method based on the compressed sensing theory. For the non-blind reconstruction after blur kernel estimation, we propose an adaptive non-blind SR reconstruction algorithm. The algorithm uses adaptive control of the intensity coefficient of the regular term in the reconstruction process to suppress the generation of artifact ringing and improve the quality of the reconstructed image. The final experimental results show that our proposed blind SR reconstruction method for infrared images of power equipment can effectively reconstruct LR infrared images through successive blur kernel estimation and non-blind reconstruction. The reconstructed image has richer details and better visual effects, which can provide better conditions for the infrared diagnosis of the power system.
4. Model Solution
In
Section 2 and
Section 3, we, respectively, established the blur kernel estimation model and the non-blind SR reconstruction model as shown in Equations (10) and (13). This section will introduce their solution methods. In order to facilitate the solution, we use the semi-quadratic split method to introduce auxiliary variables for them, and then use the alternate minimization method to solve the unknown variables in the model. After introducing auxiliary variables, the blur kernel estimation model becomes:
where
and
are penalty parameters.
,
,
are the row and column difference operators, respectively. After the introduction of auxiliary variables, Equation (13) becomes:
Equations (16) and (17) both contain variables
and
, and we solve them by the same objective function, so they are solved in the same way. We solve for
by:
Equation (18) is a typical least squares problem, which can be solved by the gradient descent method. The derivative of Equation (18) with respect to
:
The number of iterations and step length are determined by the one-step steepest descent scheme introduced in [
27]. We solve for
by:
which can be solved by shrinking the soft threshold:
In addition to the common variables, Equations (16) and (17) also contain their own unique variables, and their solutions are introduced separately below. The variables
and
in Equation (16) are all constrained by the
norm, which can be solved by hard threshold shrinkage. We solve for them by:
The variables
and
in Equation (16) are both constrained by the
norm, which can be solved by the method of fast Fourier transform. We solve for
by:
In order to maintain the consistency between
and
to facilitate the solution, by the operational nature of the
and
functions,
can be equivalent to
. In addition, due to the non-linearity of the function
, the equivalent linear operator
is introduced for it.
is essentially a mapping matrix, and its construction method is:
The function of the
matrix is to transfer the minimum value in the image block centered on the
pixel (i.e., the value of the
pixel) to the
pixel. As the transposed matrix of
,
plays a role of reverse rearrangement during operation. Reverse rearrangement means that the pixel value at position
is used to reversely replace the pixel value at position
. Therefore, Equation (28) can be expressed as:
The solution of (30) can be obtained by FFT:
where
is
;
is
;
and
denote the fast Fourier transform and inverse fast Fourier transform, respectively;
is the complex conjugate operator;
denotes component multiplication, and the division in Formula (31) is component division. It should be noted that
and
, as linear operators, did not actually generate a matrix and perform matrix multiplication during the calculation process, but instead set up a lookup table according to its meaning. For example,
does not actually calculate the product of the matrix
and
. Instead, according to the relationship that
is approximately equal to
, the minimum value element in
is replaced with the element in
to obtain the result of
. This avoids the generation and calculation of large matrices in the algorithm, and significantly improves the running speed.
We estimate the blur kernel
by:
For the subproblem
, directly using the intermediate latent image to estimate the blur kernel is not accurate [
28]; therefore, the gradient image is used to estimate the blur kernel. Then, the solution of
can be obtained by solving the following:
The solution of (33) can be obtained by FFT:
Since the blur kernel
and
, after each iteration of the
subproblem, we set the negative elements of
to zero and normalize
at the end. The solution of all variables in the process of blur kernel estimation has been given, and Algorithm 1 shows the main steps for the blur kernel estimation algorithm. As suggested by [
25,
26,
29], we decrease
gradually to make more information available for kernel estimation.
Algorithm 1: Blur Kernel Estimation Algorithm |
Input: Blurred image generate the initial value of each variable do
repeat
solve for using the gradient descent method, . repeat solve for using (26), solve for using (27), . repeat solve for using (21), . repeat solve for using (25),solve for using (31), . until . until . and .
until solve for using (34). . end for
Output: blur kernel.
|
Finally, only the solution of
in Equation (17) has not been given yet. When the value of
is different, the solution method is different. When
, we solve for
by:
The purpose of initial reconstruction using Gaussian prior is only to extract the significant edges of the image. Therefore,
should be a larger value to suppress the generation of ringing artifacts,
. The solution of (35) can be obtained by FFT:
When
, the non-blind SR model is constructed by adaptive regularization, and we solve for
by:
where
and
are the number of pixels in the row and column direction of the image, respectively;
can be obtained according to Equation (15). In order to facilitate the solution, the auxiliary variable
is introduced by the semi-quadratic split method. Equation (37) can be expressed as:
Equation (38) can be divided into the
sub-problem and the
sub-problem to be solved separately. We solve the
sub-problem by:
The solution is the same as (35).
can be obtained by FFT:
After obtaining
, we solve the
sub-problem. Let
, then the objective function can be abbreviated as:
We set the derivative of Equation (41) to be 0:
We further transform Equation (42) into:
The root of Equation (43) is r. According to [
30], when
is between
and
, the solution of Equation (43) is
, otherwise
. However, since the solution of Equation (43) only depends on the ratio of
and
and the value of the variable
, it can be solved by a lookup table (LUT). Where
is an integer power of
between 1 and 256,
is
or
, and
is
different values between
and
. Solving Equation (43) in turn can form an offline lookup table. The LUT can give a solution to the objective function with an accuracy close to that of the analytical method at a faster speed [
30].
We, respectively, give the solution of each unknown variable in the objective function of blur kernel estimation and non-blind SR. When
takes different values, the non-blind SR reconstruction process is shown in
Figure 5.
5. Experiment and Result Analysis
Our test environment parameters were as follows: Intel(R) Core(TM)i5-9300H CPU @2.40 GHz; memory: 16.00 GB; operating system: Windows 10; MATLAB R2019a. We obtained the following fixed parameters through repeated experiments and adjustments: ; ; ; . The image block size used for the dark channel search was 35 × 35. The sparse base used the Daubechies 8 wavelet base.
In order to make the experimental results more convincing, in addition to comparing our method with the classical blind SR method, we also design comparative experiments for the blur kernel estimation and non-blind SR reconstruction in our method. In the blind SR comparison experiment, we compared our method with the methods proposed by Keys [
31], Shao [
20], Michaeli [
23], and Kim [
22]. Since the actual infrared image of the power equipment did not have the original, clear HR image, we adopted two other objective evaluation indicators: average gradient (AG) and information entropy (IE). The calculation method of AG is as follows:
where
and
are the image convolution results of the difference operator in the row and column directions, respectively. The larger the AG value is, the more drastically the grayscale changes in the image and the more the image levels, that is, the clearer the image is.
Entropy represents the uniformity of a system in physics. The more uniformly a system is distributed, the greater its information entropy is. The concept of image information entropy is derived from this, which can be defined as follows:
where
represents the frequency of the pixel point with the gray value of
in the image. The larger the IE value, the richer the information contained in the image.
In addition, in order to prove the effectiveness of the blur kernel estimation method in this paper, we compared the blur kernels estimated by our method and the algorithms proposed in [
20,
23,
32]. We used the sum of the squared differences error (SSDE) to evaluate the accuracy of the estimated blur kernel:
where
represents the estimated blur kernel and
represents the true blur kernel of the image.
Finally, in order to verify the performance of the non-blind SR reconstruction method in this paper, we use the known blur kernel to process the HR and clear infrared image according to Equation (1). Taking the synthetic infrared image and the known blur kernel as input, we compare our method with the existing non-blind SR methods. We select the methods proposed by Keys [
31], Glasner [
13], Dong [
15] and Zhao [
17] as the comparison method. Since the artificially synthesized infrared image had the original, clear HR image, the PSNR and SSIM evaluation indicators could be used to evaluate the reconstruction results.
5.1. Blind SR Comparison Experiment
First, we reconstruct the LR infrared images of the power equipment that are actually collected to verify the effectiveness of the blind SR method in the practical application. For the experiment, we used eleven infrared images taken on site with a resolution of 128 × 128 for SR reconstruction.
Figure 6 shows the images reconstructed using different methods for the 11th LR infrared image.
As shown in
Figure 6, the method proposed by Key does not consider the influence of the blur kernel when reconstructing the image, and there is an inherent smoothness benefit of interpolation algorithms. The reconstruction result obtained by this method has the worst visual effect, and there is no obvious difference from the LR image. Compared with the original low-resolution image, the visual quality of the reconstructed image by Shao’s method has been significantly improved, but the detailed texture is still not clear enough. Obvious artifacts and ringing appear in the reconstruction results of Michaeli’s method. This is a common problem caused by improper regularization intensity in the SR reconstruction process, and it is also a problem that this paper focuses on improving and solving. Although the image reconstructed by Kim has higher contrast and brighter colors, according to the enlarged part in the green box, the edges are too smooth. On the whole, the edge texture of the image reconstructed by our method is the clearest, and there are no artifacts and ringing. This shows that our method has certain performance advantages compared with the comparison method. The AG and IE values of the remaining 10 images reconstructed using different methods are given in
Figure 7. It can be seen that Kim and Shao’s methods are similar in performance. Michaeli’s method has a significantly higher reconstruction image index, which is due to improper control of the regularization intensity. Generally speaking, the infrared image reconstructed by our method has obvious advantages compared with the comparison methods in the evaluation index.
5.2. Experiment of Blur Kernel Estimation
In order to prove the effectiveness of the blur kernel estimation method in this paper, we used the six blur kernels shown in
Figure 8 to sequentially blur 100 infrared images and perform double downsampling. We used our method and the comparison methods to estimate the blur kernel based on the LR blurred image. For each blur kernel, the average SSDE parameters of the blur kernel estimated by the different methods on 100 synthetic blurred infrared images are shown in
Table 1. It can be seen from the data in
Table 1 that compared with the comparison methods, our blur kernel estimation method has achieved better results in accuracy.
5.3. Non-Blind SR Comparison Experiment
In this section, we use the six blur kernels in
Section 5.2 to process 10 HR and clear infrared images according to Equation (1) to obtain 60 artificially synthesized LR images.
Figure 9 shows the LR infrared image of the 10th HR image synthesized by BK6, as well as the reconstruction results of different non-blind SR methods.
It can be seen from
Figure 9 that the method proposed by Keys has the worst visual effect due to the inherent smoothing benefits of interpolation algorithms, and the transformer texture is almost invisible. The visual quality of Glasner and Dong’s methods are similar, but the texture is not as clear as our proposed method, especially in small local details. The reconstruction result of Zhao’s method is somewhat distorted, and the image is too sharp, resulting in too high brightness. This will have a very bad influence in infrared diagnosis, and it is easy to cause the operation and maintenance personnel to misjudge the operating temperature of the equipment. Additionally, according to the part marked in the red box, its ability to reconstruct small textures obviously has a certain gap compared with our method. Due to the large number of images used in the experiment, the reconstruction results of the remaining images are given in the form of evaluation parameters. The PSNR and SSIM values of the reconstructed image of different algorithms are shown in
Figure 10. Due to the large amount of data, for the reconstruction results of the same HR image processed by different blur kernels, the objective evaluation parameters are averaged and displayed. It can be seen from
Table 2 and
Table 3 that the performance of our method is significantly improved compared to the comparison methods.
5.4. Sensitivity Analysis
In this paper, the blur kernel estimation model involves many parameters. In this section, we analyze the influence of its value. The blur kernel estimation model involves five main parameters
and
. In order to analyze the influence of these parameters on the blur kernel estimation, we collect 10 blurred images for tests. For each parameter, we carry out experiments with different parameter settings by varying one and fixing the others with the kernel similarity metric to measure the accuracy of estimated kernels. For parameter
, we set its values from
to 0.01 with the step size of
.
Figure 10a demonstrates that blur kernels can be well estimated by a wide range of
, i.e., within
. Similarly, we set the values of
and
from 0 to 2 with the increment of 0.1, and the values of
and
from 0 to 0.01 with the increase of
. The experimental results of
and
parameters are shown in
Figure 10b,c. Since
in the actual calculation process, the result is displayed by one curve, as shown in
Figure 10d. The experimental results show that the proposed blur kernel estimation algorithm performs well with a wide range of parameter settings. In addition, when
, it can be seen from
Figure 10d that the blur kernel estimation effect is extremely poor, which also proves the necessity of introducing Extreme Channels Prior to the blur kernel estimation model in this paper.
5.5. Comparison with the Deep Learning Method
In order to reflect the superiority of this algorithm in practical applications, this section selects the advanced methods of deep learning algorithms [
33] and our super-resolution reconstruction method for comparison experiments. At this stage, deep learning-based super-resolution algorithms require a large number of high-definition images as training samples. When training resources are insufficient, the performance of the method will decrease significantly. The algorithm in this paper can achieve high-quality image reconstruction without training samples.
Figure 11 shows the comparison of the reconstruction results of the deep learning method with 500 and 2000 infrared images after training the model. It can be seen from
Figure 11 that the deep learning algorithm has an obvious grid phenomenon and color distortion when the training data are insufficient. When the training data are sufficient, the contrast of some edge detail textures is slightly higher than that of our method. However, the result of the learning method contains a certain amount of false texture, which has a bad influence on infrared diagnosis. It can be seen that our method does not require training samples, and the reconstruction results are more accurate, so it has better practical application value in the electric power field.