Blur Kernel Estimation and Non-Blind Super-Resolution for Power Equipment Infrared Images by Compressed Sensing and Adaptive Regularization

Zhao, Hongshan; Liu, Bingcong; Wang, Lingjie

doi:10.3390/s21144820


by Hongshan Zhao, Bingcong Liu * and Lingjie Wang

School of Electrical & Electronic Engineering, North China Electric Power University, Baoding 071003, China

* Author to whom correspondence should be addressed.

Sensors 2021, 21(14), 4820; https://doi.org/10.3390/s21144820

Submission received: 22 June 2021 / Revised: 6 July 2021 / Accepted: 13 July 2021 / Published: 14 July 2021

(This article belongs to the Section Physical Sensors)


Abstract:

Infrared sensing technology is increasingly used in the construction of the power Internet of Things. However, cost constraints make the large-scale installation of high-precision infrared sensors difficult. We therefore propose a blind super-resolution method for infrared images of power equipment to improve the imaging quality of low-cost infrared sensors. Performing blur kernel estimation and non-blind super-resolution at the same time easily produces sub-optimal results, so we divide blind super-resolution into two parts. First, we propose a blur kernel estimation method based on compressed sensing theory, which accurately estimates the blur kernel from low-resolution images. Then, with the blur kernel estimated, we propose an adaptive regularization non-blind super-resolution method to achieve high-quality reconstruction of high-resolution infrared images. Experiments show that the proposed blind super-resolution method can effectively reconstruct low-resolution infrared images of power equipment. The reconstructed images have richer details and better visual quality, which provides better conditions for the infrared diagnosis of the power system.

Keywords:

power equipment; infrared image; blur kernel estimation; non-blind super-resolution; compressed sensing

1. Introduction

With the Internet of Things now an established concept, the era of the Internet of Everything is approaching. One of the keys to building the power Internet of Things is the online monitoring, analysis and evaluation of the operating status of power equipment. Among the available monitoring technologies, infrared monitoring is long-distance and non-contact, and offers high accuracy and speed [1,2]. The extensive and effective installation of infrared sensors is therefore one of the key issues in the construction of the power Internet of Things. However, because of limits on installation cost, data transmission and storage capacity, online monitoring infrared sensors that can be installed on a large scale cannot at this stage match the accuracy of mainstream infrared imagers. It is therefore necessary to process the infrared images collected by low-precision sensors with back-end algorithms to enhance their visual quality and enrich the information they carry. Super-resolution technology, which has emerged in recent years, provides new ideas for solving this problem.

Super-resolution (SR) aims to reconstruct a high-quality image X from its degraded measurement Y [3]. SR is a typical ill-posed inverse problem, and it can be generally modelled as

Y = (k \otimes X) ↓

(1)

where k is a blur kernel, \otimes denotes the convolution operator, and ↓ denotes the down-sampling operator. According to the number of input images, SR technology is divided into single-image super-resolution (SISR) technology and multi-frame image super-resolution (MISR) technology. Due to data storage, transmission pressure and other issues, the current power industry does not have the conditions to adopt MISR technology, so we focus on the SISR technology.
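The degradation model of Equation (1) can be illustrated with a minimal sketch. It assumes a Gaussian kernel for k, circular (FFT-based) convolution, and plain decimation for the down-sampling operator; these are illustrative choices, and the names `gaussian_kernel` and `degrade` are hypothetical rather than taken from the paper.

```python
import numpy as np

def gaussian_kernel(size=5, sigma=1.0):
    """Normalized isotropic Gaussian blur kernel (an illustrative choice of k)."""
    ax = np.arange(size) - (size - 1) / 2.0
    g = np.exp(-(ax ** 2) / (2.0 * sigma ** 2))
    k = np.outer(g, g)
    return k / k.sum()

def degrade(X, k, scale=2):
    """Return Y = (k ⊗ X)↓ using circular convolution followed by decimation."""
    kh, kw = k.shape
    # Embed k in an image-sized array and shift its centre to index (0, 0)
    pad = np.zeros_like(X, dtype=float)
    pad[:kh, :kw] = k
    pad = np.roll(pad, (-(kh // 2), -(kw // 2)), axis=(0, 1))
    blurred = np.real(np.fft.ifft2(np.fft.fft2(X) * np.fft.fft2(pad)))
    # Assumption: keep every `scale`-th pixel instead of cubic-interpolation weights
    return blurred[::scale, ::scale]

X = np.random.default_rng(0).random((64, 64))   # stand-in for an HR image
Y = degrade(X, gaussian_kernel(5, 1.2), scale=2)
```

With a 64 × 64 input and `scale=2`, `Y` is 32 × 32; a blind SR method has to recover both `k` and `X` from `Y` alone.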

SISR methods can be divided into three categories according to their principle. The first category is interpolation methods, which are simple and easy to implement, but usually over-smooth the high-frequency details of the image, resulting in poor visual quality of the reconstructed image [4,5,6]. The second category is learning-based methods, which learn the correspondence between low-resolution (LR) and high-resolution (HR) image blocks from given training samples [7,8,9,10]. As a result, the effectiveness of these algorithms depends heavily on the selection of training samples. In addition, when the application conditions change, such as the magnification or the degradation information, the model needs to be retrained, which often incurs high computational costs [8]. Therefore, we focus on the third category, namely reconstruction-based methods. These methods construct the model from the principle of image degradation, and realize SR reconstruction by combining prior information in a Bayesian framework or by introducing regularization into the inverse problem [11,12,13,14,15,16,17]. Such methods are not limited by samples, are flexible in application and achieve good reconstruction results, so they are well suited to wide application in the power grid.

In addition, the SISR method can also be divided into three categories according to the different problems it solves. The first type of method ignores the general blurring in the LR image formation process. These methods consider the LR image to be absolutely clear and only improve its resolution [4,8,11,12]. However, the research of Efrat et al. [18] shows that the influence of blur kernel on the SISR problem is even greater than the influence of the selected SR model. The second type of method is the non-blind SR method, which does not study the solution of the blur kernel, but only focuses on how to reconstruct the HR image from the LR image when the blur kernel is known [13,14,15,16,17]. For example, Glasner et al. [13] used image self-repeatability to reconstruct HR images. Šroubek et al. [14] proved that the degenerate operator can be implemented in the frequency domain and designed a fast-solving algorithm based on this. Dong et al. [15] proposed the concept of sparse coding noise and achieved the goal of image restoration by suppressing sparse coding noise. The third type of method is the blind SR method, which simultaneously solves the problem of blur kernel estimation and HR image reconstruction. However, the joint restoration of blur kernel and HR image is usually difficult, and it is easy to produce sub-optimal reconstruction results [19]. Therefore, there are few studies on blind SR methods [20,21,22,23]. Shao et al. [20] proposed a non-parametric blind SR method based on an adaptive heavy-tail prior. Qian et al. [21] proposed a blind SR restoration method based on frame-by-frame non-parametric blur estimation. Kim et al. [22] proposed a single-image blind SR method with low computational complexity, and Michaeli et al. [23] proposed a blind SR method based on the self-similarity of the spatial structure of image blocks.

To improve the quality of SR reconstructed images and meet the practical needs of the power industry, we propose a blind SR method. Since the joint restoration of the blur kernel and HR image may produce sub-optimal reconstruction results, we chose to estimate the blur kernel first, and then reconstruct the LR infrared image with a non-blind SR method. For blur kernel estimation, we improved the basic SR model of compressed sensing and introduced the image Extreme Channels Prior into the model, yielding an LR image blur kernel estimation method based on compressed sensing theory. For the non-blind reconstruction after blur kernel estimation, we propose an adaptive non-blind SR reconstruction algorithm, which adaptively controls the intensity coefficient of the regular term during reconstruction to suppress ringing artifacts and improve the quality of the reconstructed image. The experimental results show that our proposed blind SR reconstruction method for infrared images of power equipment can effectively reconstruct LR infrared images through successive blur kernel estimation and non-blind reconstruction. The reconstructed images have richer details and better visual quality, which provides better conditions for the infrared diagnosis of the power system.

2. Blur Kernel Estimation Method

2.1. Basic SR Model of Compressed Sensing

Our blur kernel estimation model builds on the basic SR model of compressed sensing, so we first introduce it briefly. The model is:

y = S K Ψ \tilde{x}

(2)

where y is the one-dimensional vectorized LR image; S is the down-sampling matrix, which is generally generated according to the principle of cubic interpolation; K is the image degradation matrix, which is the cyclic Toeplitz matrix obtained from k; Ψ is a sparse basis, which can be generated from the Fourier transform, discrete cosine transform or wavelet transform, or can be an over-complete dictionary; and \tilde{x} is the vector of sparse coefficients. The one-dimensional vectorized HR image is x = Ψ \tilde{x}. The SR reconstruction can be completed by solving the following objective function:

{\tilde{x}}^{'} = \arg \min_{\tilde{x}} ‖ S K Ψ \tilde{x} - y ‖_{2}^{2} + ω ‖ \tilde{x} ‖_{0}

(3)

where ‖ \tilde{x} ‖_{0} represents the number of non-zero elements in \tilde{x}. However, Equation (3) is an NP-hard problem. Donoho [24] pointed out that an equivalent solution can be obtained by solving the L1 optimization problem instead, so the objective function can be expressed as:

{\tilde{x}}^{'} = \arg \min_{\tilde{x}} ‖ S K Ψ \tilde{x} - y ‖_{2}^{2} + ω ‖ \tilde{x} ‖_{1}

(4)

The sparsest \tilde{x} is obtained by solving the optimization problem of Equation (4), and the result of image SR reconstruction is then given by x = Ψ \tilde{x}. When K is also unknown, the optimization problem of Equation (4) is transformed into a blur kernel estimation problem:

({\tilde{x}}^{'}, K^{'}) = \arg \min_{\tilde{x}, K} ‖ S K Ψ \tilde{x} - y ‖_{2}^{2} + ω ‖ \tilde{x} ‖_{1}

(5)

Because the problem is underdetermined, the estimate of K cannot be obtained directly from Equation (5), so prior information must be introduced to constrain the choice of solutions. Infrared images produced with different pseudo-color conversion methods differ markedly in color and brightness. Therefore, we chose the image Extreme Channels Prior proposed in [25] as a constraint and introduced it into Equation (5) to improve the accuracy of blur kernel estimation.
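To make the L1 problem of Equation (4) concrete, the sketch below minimizes an objective of the same form with the iterative soft-thresholding algorithm (ISTA). ISTA is a generic solver used here for illustration, not the solution method adopted later in the paper, and the random matrix `A` is a hypothetical stand-in for the product S K Ψ.

```python
import numpy as np

def soft_threshold(v, t):
    """Elementwise proximal operator of t * |x|."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def ista(A, y, omega, n_iter=500):
    """Minimize ||A x - y||_2^2 + omega * ||x||_1 by iterative soft thresholding."""
    L = 2.0 * np.linalg.norm(A, 2) ** 2      # Lipschitz constant of the smooth term's gradient
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = 2.0 * A.T @ (A @ x - y)       # gradient of the data fidelity term
        x = soft_threshold(x - grad / L, omega / L)
    return x

rng = np.random.default_rng(0)
A = rng.standard_normal((40, 100)) / np.sqrt(40)   # hypothetical stand-in for S K Ψ
x_true = np.zeros(100)
x_true[[5, 37, 80]] = [1.5, -2.0, 1.0]             # sparse coefficient vector
y = A @ x_true
x_hat = ista(A, y, omega=0.01)
```

The step size 1/L guarantees that the objective never increases from one iteration to the next, which is why no line search is needed in this sketch.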

2.2. Extreme Channels Prior of Infrared Images of Power Equipment

In this section, we introduce the image Extreme Channels Prior. There are many types of prior information that can be used to estimate the image blur kernel, such as L0-regularized gradient prior, dark channel prior and Extreme Channels Prior. The reasons why we chose Extreme Channels Prior are as follows: In the infrared image of power equipment, the faulty heating part and the edge texture part of the equipment contain the most important information. The faulty heating part is the focus of infrared diagnosis, and it often appears as the brightest part in the image. The edge texture part is the main basis for image segmentation and target recognition, which is often the darkest part. The Extreme Channels Prior used in this article is calculated based on the local brightest and darkest pixel values in the infrared image before and after blurring. Therefore, Extreme Channels Prior pays more attention to the processing of local brightness extremes in the image and can use this as the focus to estimate the blur kernel, thereby improving the visual quality of the faulty heating part and the edge texture part of the final reconstructed image. This feature can make the infrared image of the power equipment reconstructed by the method more suitable for the needs of practical applications.

It should be noted that this paper does not process the infrared image directly in RGB mode, but converts it into YCbCr mode, where Y is the luminance component, Cb is the blue chrominance component, and Cr is the red chrominance component. Since the human eye is more sensitive to the Y component, and the visual difference caused by subtle changes in the other two components is extremely small, the following statistical analysis of the Extreme Channels Prior takes the Y component as an example. Specifically, the Extreme Channels consist of a bright channel component and a dark channel component, which are formed from the local maximum and minimum values of the Y component, respectively. For image X, the components of the bright channel and dark channel are:

B (X) (p) = \max_{q \in N (p)} (X (q))

(6)

D (X) (p) = \min_{q \in N (p)} (X (q))

(7)

where p and q denote pixel locations; N (p) is an image patch centered at p; B (\cdot) and D (\cdot) are functions for finding the local maximum and local minimum of the image, respectively. For a general image, after the pixel brightness values are normalized, most of the values of B (X) and D (X) should lie near the upper and lower ends of the interval [0, 1], respectively. However, convolving a clear image with a blur kernel changes the extreme value distribution of the image. Because the convolution operation is a weighted summation of the pixel values in the local neighborhood, it will generally cause the minimum pixel value in the neighborhood to become larger and the maximum value to decrease. This property is proved in [25,26]. Figure 1 shows a statistical result of the difference between the local maximum and the local minimum distributions of 100 infrared images of power equipment under clear and blurred conditions. It can be seen that the above rules also apply to infrared images of power equipment. Therefore, the introduction of the Extreme Channels Prior can effectively distinguish between clear and blurred infrared images, prompting the intermediate latent image to move closer to the clear image and thereby ensuring the accuracy of the blur kernel estimation.
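Equations (6) and (7) amount to sliding maximum and minimum filters over the Y component. The following sketch assumes a square patch of radius `radius` and edge replication at the image border; both choices, and the function names, are illustrative assumptions.

```python
import numpy as np

def local_extreme(X, radius, reduce_fn):
    """Apply reduce_fn (np.max or np.min) over a (2r+1) x (2r+1) patch around each pixel."""
    padded = np.pad(X, radius, mode="edge")
    h, w = X.shape
    size = 2 * radius + 1
    # Stack every shifted view of the image, then reduce across the stack
    shifts = [padded[i:i + h, j:j + w] for i in range(size) for j in range(size)]
    return reduce_fn(np.stack(shifts), axis=0)

def bright_channel(X, radius=2):
    """B(X)(p): maximum of X over the patch N(p), as in Eq. (6)."""
    return local_extreme(X, radius, np.max)

def dark_channel(X, radius=2):
    """D(X)(p): minimum of X over the patch N(p), as in Eq. (7)."""
    return local_extreme(X, radius, np.min)

X = np.random.default_rng(0).random((12, 10))   # stand-in for a normalized Y component
B, D = bright_channel(X), dark_channel(X)
```

By construction `D <= X <= B` holds at every pixel, and `1 - bright_channel(X)` equals `dark_channel(1 - X)`, the identity that Section 4 relies on when the two channels are handled by a single operator.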

2.3. Blur Kernel Estimation Model

In this section, we propose a blur kernel estimation model based on the maximum a posteriori (MAP) framework and the basic SR method of compressed sensing. The MAP framework is:

(k^{'}, X^{'}) = \arg \min_{k, X} X (k \otimes X, Y) + β_{k} p (k) + β_{X} p (X)

(8)

where p (k) and p (X) are the prior information of k and X, respectively. According to the statistics and analysis in Section 2.2, we use the L_{0} norm regular term based on the Extreme Channels Prior as the prior information of the image to be reconstructed:

p (X) = ‖ D (X) ‖_{0} + ‖ 1 - B (X) ‖_{0}

(9)

Combined with the SR model of compressed sensing, the objective function of blur kernel estimation can be constructed:

\begin{array}{l} (k^{'}, X^{'}) = \arg \min_{k, X} ‖ S_{l} (k \otimes X) S_{r} - Y ‖_{2}^{2} + δ ‖ \nabla X ‖_{0} + η ‖ k ‖_{2}^{2} \\ + γ ‖ D (X) ‖_{0} + μ ‖ 1 - B (X) ‖_{0} + ρ ‖ Ψ^{T} X ‖_{1} \end{array}

(10)

where δ, η, γ, μ and ρ are weight coefficients. The first term is the data fidelity term, which ensures that X and Y correspond to each other; we use the L_{2} norm to constrain their difference. The second term preserves the significant gradients of the image and removes the small ones, thereby improving the accuracy of the blur kernel estimation. The third term is the constraint on the sparseness of the blur kernel. The fourth term preserves the sparsity of the minimum values in the brightness component of the infrared image, and the fifth term does the same for the maximum values through 1 - B (X). The last term ensures that the coefficients obtained after the sparse transformation of the image are sparse enough and is combined with the first term to achieve the goal of SR. To keep the blur kernel estimation efficient, we do not stretch the image into a column vector before the calculation, as the original compressed sensing model does. If the image is not divided into blocks but stretched directly into a column vector, the downsampling matrix becomes too large and the calculation speed is greatly reduced. Therefore, we modified the original model slightly, constructing a row sampling matrix S_{l} and a column sampling matrix S_{r} according to the principle of cubic interpolation, so that the downsampling operation is performed in two passes. The size of each downsampling matrix and the positions of its non-zero elements are determined by the downsampling rate, and the element values are determined by the cubic interpolation downsampling function. The method of solving Equation (10) is introduced in Section 4.
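The saving from the separable form S_{l} X S_{r} can be checked numerically. The sketch below uses simple block-averaging weights as an assumption in place of the cubic interpolation weights described above, and compares the separable product with the single large matrix that would act on the vectorized image, using the identity vec(S_l X S_r) = (S_r^T ⊗ S_l) vec(X).

```python
import numpy as np

def averaging_matrix(n_out, factor):
    """(n_out x n_out*factor) matrix that averages each block of `factor` samples."""
    S = np.zeros((n_out, n_out * factor))
    for r in range(n_out):
        S[r, r * factor:(r + 1) * factor] = 1.0 / factor
    return S

factor, h, w = 2, 6, 8
S_l = averaging_matrix(h // factor, factor)       # row sampling, applied from the left
S_r = averaging_matrix(w // factor, factor).T     # column sampling, applied from the right

X = np.arange(h * w, dtype=float).reshape(h, w)
Y_sep = S_l @ X @ S_r                             # separable downsampling

# Equivalent single matrix acting on the column-stacked image
S_full = np.kron(S_r.T, S_l)
Y_vec = (S_full @ X.flatten(order="F")).reshape(Y_sep.shape, order="F")
```

Even for this 6 × 8 toy image, `S_full` has 576 entries while `S_l` and `S_r` together have 50; the gap grows with the square of the pixel count, which is the motivation for the two-pass form.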

3. Non-Blind SR Reconstruction Model

3.1. Non-Blind SR Objective Function

After completing the estimation of k, the basic non-blind SR model is:

X^{'} = \arg \min_{X} ‖ S_{l} (k \otimes X) S_{r} - Y ‖_{2}^{2} + ρ ‖ Ψ^{T} X ‖_{1}

(11)

Equation (11) does not contain any prior information except for the sparse constraint. Because the problem is ill-posed, an image reconstructed directly from Equation (11) rarely achieves the desired deconvolution and deblurring effect. Therefore, it is necessary to introduce other prior information as constraints according to the statistical properties of images.

Currently, the best deblurring method generally uses a set of filter output edge statistics to match the edge statistics of the clear image as a priori constraint for the deblurring problem. The Gaussian distribution, Laplacian distribution and Hyper-Laplacian distribution are commonly used to fit the image gradient. Assuming that the edge distribution is Gaussian, the deblurring problem has an analytical solution in the frequency domain, and the image can be restored efficiently through fast Fourier transform. However, a clear infrared image usually has a non-Gaussian edge, as shown in Figure 2, so the Gaussian distribution fitting the image gradient will lead to poor visual effects of the reconstruction result. Another common method is to assume that the edges of the image conform to the Laplacian distribution. However, due to the “heavy tail” characteristic of the edge distribution of the image, the effect of Laplace distribution fitting is also not good. Therefore, most of the methods at this stage adopt the Hyper-Laplacian distribution that can better fit the “heavy tail” characteristic of the image edge as the prior information, and then achieve the purpose of deconvolution and deblurring. Figure 2 shows the probability density curve obtained by statistically calculating the gradient of the infrared image and the fitting effect of different distributions.

It can be seen from the above discussion that a prior constraint on the image edge distribution needs to be introduced into Equation (11). Whatever distribution is used to fit the image edges, it can be written as p (X) \propto e^{- λ {|X|}^{α}}, where 0 < α \leq 2. The case 0 < α < 1 is the Hyper-Laplacian distribution, α = 1 is the Laplace distribution, and α = 2 is the Gaussian distribution. According to Bayes’ theorem, the maximum a posteriori solution of X is:

X^{'} = \arg \min_{X} X (k \otimes X, Y) + β_{X} p (X)

(12)

where p (X) = \sum_{i} λ ‖ f_{i} \otimes X ‖_{α}^{α}, λ is the strength coefficient of the prior constraint on the edge distribution, and f_{i} is a derivative filter of first or second order, i \in \{x, y, xx, yy, xy\}. The non-blind SR objective function becomes:

X^{'} = \arg \min_{X} ‖ S_{l} (k \otimes X) S_{r} - Y ‖_{2}^{2} + ρ ‖ Ψ^{T} X ‖_{1} + \sum_{i} λ ‖ f_{i} \otimes X ‖_{α}^{α}

(13)

3.2. Adaptive Regularization Intensity Adjustment Method

The non-blind super-resolution objective function was determined in Section 3.1, as shown in Equation (13). However, the model still needs to be improved, because the value of λ should reflect the differences in semantics between pixels within the image, and the same value should not be used to constrain the entire picture. With a single value, the edge texture of the reconstructed image is blurred when the regularization is strong, and ringing artifacts appear when the regularization is weak. Figure 3 shows the reconstruction results of infrared images processed with different regularization intensities and with our proposed adaptive method.

It can be seen from Figure 3c that when strong regularization is used to constrain the prior information of the image edge, that is, when λ = 0.1 \times 10^{- 1}, the reconstruction result does not contain ringing artifacts. However, the edge texture of the device is blurred, the image contrast is low, and the visual effect is poor. As shown in Figure 3d, when weak regularization is used, that is, when λ = 0.1 \times 10^{- 4}, the reconstructed image is clearer and the edge texture contrast of the device is high, but the smooth areas of the image show obvious ringing. In contrast, our method extracts the significant edge regions of the reconstructed image using a double-prior quadratic estimation method, distinguishes edge regions from smooth regions according to the generated label image, and adjusts the intensity of the regular term with different λ values. The reconstruction effect is therefore better: as shown in Figure 3b, the edge texture of the reconstructed image is clear and does not contain ringing.

Figure 3 shows the necessity of adaptive control of the regularization intensity in the reconstruction process. The following is an introduction to the regular term intensity adjustment method of the double-prior quadratic estimation that we adopted, and the flowchart is shown in Figure 4.

The adaptive regular term intensity adjustment method we adopt is to use different priors as constraints to reconstruct the image twice according to the model of Equation (13). In the first reconstruction, the model uses a Gaussian prior which is easy to solve, extracts the significant edges of the reconstructed image, and generates the label image. The secondary reconstruction adopts the Hyper-Laplacian prior as the constraint, and adaptively adjusts the regularization intensity of different pixels according to the label image. The specific steps in Figure 4 are described as follows:

Step 1: Let α = 2 in Equation (13), and solve it to obtain the preliminary reconstructed image X_{1}.

Step 2: Use the filter bank \{f_{x}, f_{y}, f_{xx}, f_{yy}, f_{xy}\} to filter X_{1} to obtain edge images in all directions.

Step 3: Perform threshold shrinkage on the edge images; the shrinking method is:

X_{i} = \frac{f_{i} \otimes X_{1}}{{(\frac{τ_{i}}{f_{i} \otimes X_{1}})}^{4} + 1}

(14)

where τ_{i} = σ \times \max (|f_{i} \otimes X_{1}|) is the shrinkage threshold; σ is the proportional coefficient; the shrinkage result is denoted as X_{i} = \{X_{x}, X_{y}, X_{xx}, X_{yy}, X_{xy}\}.

Step 4: Solve \nabla X = \sum_{i} X_{i}, and integrate the significant edges in all directions into the final image significant edge result \nabla X.

Step 5: Set the elements smaller than \sum_{i} τ_{i} / 10 in \nabla X to 0, and set the remaining elements to 1, thereby generating a binary image.

Step 6: Perform mathematical morphological processing on the binarized image. The opening and closing operations are performed once to remove the binarized image noise, and the final label image X_{lab} is obtained.

After X_{lab} is obtained, the value of λ can be adaptively controlled by:

λ_{(m, n)} = \begin{cases} 0.5 \times 10^{- 2}, & X_{lab (m, n)} = 0 \\ 2.5 \times 10^{- 4}, & X_{lab (m, n)} = 1 \end{cases}

(15)

Let α = 2 / 3 in Equation (13), and substitute λ_{(m, n)} into Equation (13) to complete image SR reconstruction. In addition, the method of solving Equation (13) under different values of α is introduced in Section 4.
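Steps 2 to 5 and Equation (15) can be sketched as follows. Several details are assumptions made for illustration: the specific derivative filters, the value σ = 0.1, circular boundary handling, comparing the absolute value of \nabla X with the threshold, and a small constant added to the denominator. The morphological opening and closing of Step 6 is omitted. Equation (14) is evaluated in the algebraically equivalent form e^5 / (τ^4 + e^4), which avoids dividing by zero where the filter response e vanishes.

```python
import numpy as np

def circ_filter(X, f):
    """Circular convolution of image X with a small filter f via the FFT."""
    pad = np.zeros_like(X, dtype=float)
    pad[:f.shape[0], :f.shape[1]] = f
    return np.real(np.fft.ifft2(np.fft.fft2(X) * np.fft.fft2(pad)))

# Illustrative choices for f_x, f_y, f_xx, f_yy, f_xy (not specified in the text)
FILTERS = [
    np.array([[1.0, -1.0]]),
    np.array([[1.0], [-1.0]]),
    np.array([[1.0, -2.0, 1.0]]),
    np.array([[1.0], [-2.0], [1.0]]),
    np.array([[1.0, -1.0], [-1.0, 1.0]]),
]

def lambda_map(X1, sigma=0.1, lam_smooth=0.5e-2, lam_edge=2.5e-4):
    """Return the per-pixel regularization strength and the binary label image."""
    edges = [circ_filter(X1, f) for f in FILTERS]                  # Step 2
    taus = [sigma * np.max(np.abs(e)) for e in edges]              # thresholds tau_i
    # Step 3, Eq. (14) rewritten as e^5 / (tau^4 + e^4); 1e-12 guards flat images
    shrunk = [e ** 5 / (t ** 4 + e ** 4 + 1e-12) for e, t in zip(edges, taus)]
    grad = np.sum(shrunk, axis=0)                                  # Step 4
    label = (np.abs(grad) >= sum(taus) / 10.0).astype(int)         # Step 5
    lam = np.where(label == 1, lam_edge, lam_smooth)               # Eq. (15)
    return lam, label

X1 = np.zeros((16, 16))
X1[:, 8:] = 1.0                    # toy image with one vertical step edge
lam, label = lambda_map(X1)
```

On this toy image the columns next to the step are labelled as edges and receive the weaker regularization 2.5 × 10^{-4}, while flat columns keep the stronger value 0.5 × 10^{-2}.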

4. Model Solution

In Section 2 and Section 3, we established the blur kernel estimation model and the non-blind SR reconstruction model, as shown in Equations (10) and (13), respectively. This section introduces their solution methods. To facilitate the solution, we use the half-quadratic splitting method to introduce auxiliary variables, and then use the alternating minimization method to solve for the unknown variables in the model. After introducing auxiliary variables, the blur kernel estimation model becomes:

\begin{array}{l} (k^{'}, X^{'}, G^{'}, a^{'}, b^{'}, c^{'}, {\tilde{X}}^{'}) = \arg \min_{k, X, G, a, b, c, \tilde{X}} ‖ S_{l} G S_{r} - Y ‖_{2}^{2} + ε ‖ k \otimes X - G ‖_{2}^{2} \\ + δ ‖ a ‖_{0} + δ^{'} ‖ a - \nabla X ‖_{2}^{2} + η ‖ k ‖_{2}^{2} + γ ‖ b ‖_{0} + μ ‖ c ‖_{0} + ρ ‖ \tilde{X} ‖_{1} \\ + γ^{'} ‖ D (X) - b ‖_{2}^{2} + μ^{'} ‖ 1 - B (X) - c ‖_{2}^{2} + ρ^{'} ‖ Ψ^{T} X - \tilde{X} ‖_{2}^{2} \end{array}

(16)

where ε, δ^{'}, γ^{'}, μ^{'} and ρ^{'} are penalty parameters; a = \{a_{h}, a_{v}\}; \nabla X = \{\nabla_{h} X, \nabla_{v} X\}; and \nabla_{h} = [1, - 1] and \nabla_{v} = {[1, - 1]}^{T} are the row and column difference operators, respectively. After the introduction of auxiliary variables, Equation (13) becomes:

\begin{array}{l} (X^{'}, G^{'}, {\tilde{X}}^{'}) = \arg \min_{X, G, \tilde{X}} ‖ S_{l} G S_{r} - Y ‖_{2}^{2} + ε ‖ k \otimes X - G ‖_{2}^{2} + \sum_{i} λ ‖ f_{i} \otimes X ‖_{α}^{α} \\ + ρ ‖ \tilde{X} ‖_{1} + ρ^{'} ‖ Ψ^{T} X - \tilde{X} ‖_{2}^{2} \end{array}

(17)

Equations (16) and (17) both contain the variables G and \tilde{X}, and the subproblems for these variables have the same form in both models, so they are solved in the same way. We solve for G by:

G^{'} = \arg \min_{G} ‖ S_{l} G S_{r} - Y ‖_{2}^{2} + ε ‖ k \otimes X - G ‖_{2}^{2}

(18)

Equation (18) is a typical least squares problem, which can be solved by the gradient descent method. The derivative of Equation (18) with respect to G is:

d_{G} = 2 S_{l}^{T} (S_{l} G S_{r} - Y) S_{r}^{T} - 2 ε (k \otimes X - G)

(19)
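The derivative of Equation (18) can be verified numerically. The sketch below derives the gradient directly from Equation (18), checks one entry against a central finite difference, and takes a single steepest-descent step. The exact step length for a quadratic objective is used here as an assumption; the scheme of [27] may choose it differently. `KX` is a hypothetical array standing for k ⊗ X, and all data are random.

```python
import numpy as np

def objective_G(G, S_l, S_r, Y, KX, eps):
    """Eq. (18): ||S_l G S_r - Y||^2 + eps * ||KX - G||^2."""
    return np.sum((S_l @ G @ S_r - Y) ** 2) + eps * np.sum((KX - G) ** 2)

def gradient_G(G, S_l, S_r, Y, KX, eps):
    """Derivative of Eq. (18) with respect to G."""
    return 2.0 * S_l.T @ (S_l @ G @ S_r - Y) @ S_r.T - 2.0 * eps * (KX - G)

def steepest_descent_step(G, S_l, S_r, Y, KX, eps):
    """One steepest-descent update with the exact step for a quadratic objective."""
    d = gradient_G(G, S_l, S_r, Y, KX, eps)
    curvature = np.sum((S_l @ d @ S_r) ** 2) + eps * np.sum(d ** 2)
    return G - np.sum(d ** 2) / (2.0 * curvature) * d

rng = np.random.default_rng(1)
S_l, S_r = rng.random((3, 6)), rng.random((8, 4))
Y, KX, G = rng.random((3, 4)), rng.random((6, 8)), rng.random((6, 8))
eps = 0.5

# Central finite-difference check of one gradient entry
E = np.zeros_like(G)
E[2, 3] = 1e-6
numeric = (objective_G(G + E, S_l, S_r, Y, KX, eps)
           - objective_G(G - E, S_l, S_r, Y, KX, eps)) / 2e-6
analytic = gradient_G(G, S_l, S_r, Y, KX, eps)[2, 3]

G_new = steepest_descent_step(G, S_l, S_r, Y, KX, eps)
```

Because the objective is quadratic in G, the central difference agrees with the analytic gradient up to rounding error, and the exact step is guaranteed to decrease the objective whenever the gradient is non-zero.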

The number of iterations and step length are determined by the one-step steepest descent scheme introduced in [27]. We solve for \tilde{X} by:

{\tilde{X}}^{'} = \arg \min_{\tilde{X}} ρ ‖ \tilde{X} ‖_{1} + ρ^{'} ‖ Ψ^{T} X - \tilde{X} ‖_{2}^{2}

(20)

which can be solved by soft-threshold shrinkage:

\tilde{X} = \max \{|Ψ^{T} X| - \frac{ρ}{2 ρ^{'}}, 0\} \circ sign (Ψ^{T} X)

(21)
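Equation (21) acts independently on each coefficient, so it is short to write down. In the sketch below the vector `v` is a hypothetical stand-in for the entries of Ψ^{T} X, and the parameter values are chosen only so that the threshold ρ / (2 ρ') equals 0.5.

```python
import numpy as np

def soft_threshold(v, rho, rho_p):
    """Elementwise minimizer of rho * |x| + rho_p * (v - x)^2, as in Eq. (21)."""
    return np.maximum(np.abs(v) - rho / (2.0 * rho_p), 0.0) * np.sign(v)

v = np.array([-1.0, -0.2, 0.0, 0.3, 2.0])    # stands in for the entries of Ψ^T X
x = soft_threshold(v, rho=1.0, rho_p=1.0)    # threshold rho / (2 rho') = 0.5
```

Entries whose magnitude is below 0.5 are set to zero and the rest are moved 0.5 towards zero, giving `[-0.5, 0, 0, 0, 1.5]` for this input.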

In addition to the common variables, Equations (16) and (17) also contain their own unique variables, and their solutions are introduced separately below. The variables a, b and c in Equation (16) are all constrained by the L_{0} norm, which can be solved by hard threshold shrinkage. We solve for them by:

a^{'} = \arg \min_{a} δ ‖ a ‖_{0} + δ^{'} ‖ a - \nabla X ‖_{2}^{2}

(22)

b^{'} = \arg \min_{b} γ ‖ b ‖_{0} + γ^{'} ‖ D (X) - b ‖_{2}^{2}

(23)

c^{'} = \arg \min_{c} μ ‖ c ‖_{0} + μ^{'} ‖ 1 - B (X) - c ‖_{2}^{2}

(24)

Their solutions are:

(a_{h}, a_{v}) = \begin{cases} \{\nabla_{h} X, \nabla_{v} X\}, & {(\nabla_{h} X)}^{2} + {(\nabla_{v} X)}^{2} \geq δ / δ^{'} \\ 0, & \text{otherwise} \end{cases}

(25)

b = \begin{cases} D (X), & {(D (X))}^{2} \geq γ / γ^{'} \\ 0, & \text{otherwise} \end{cases}

(26)

c = \begin{cases} 1 - B (X), & {(1 - B (X))}^{2} \geq μ / μ^{'} \\ 0, & \text{otherwise} \end{cases}

(27)
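The hard-threshold rules of Equations (25) to (27) share one pattern: a value is kept when its squared magnitude reaches the ratio of weight to penalty, and is set to zero otherwise. The sketch below is a minimal illustration with hypothetical function names and made-up input values.

```python
import numpy as np

def hard_threshold(v, weight, penalty):
    """Eqs. (26)-(27): keep v where v^2 >= weight / penalty, otherwise 0."""
    return np.where(v ** 2 >= weight / penalty, v, 0.0)

def hard_threshold_gradient(gh, gv, delta, delta_p):
    """Eq. (25): both gradient components are kept or zeroed together."""
    keep = gh ** 2 + gv ** 2 >= delta / delta_p
    return np.where(keep, gh, 0.0), np.where(keep, gv, 0.0)

dark = np.array([0.1, 0.3, 0.6])                        # stands in for D(X)
b = hard_threshold(dark, weight=0.04, penalty=1.0)      # threshold on v^2 is 0.04

gh = np.array([0.1, 0.3])                               # stands in for the row differences
gv = np.array([0.1, 0.0])                               # stands in for the column differences
a_h, a_v = hard_threshold_gradient(gh, gv, delta=0.05, delta_p=1.0)
```

Here `b` is `[0, 0.3, 0.6]`, and the first gradient pair is zeroed because 0.1² + 0.1² = 0.02 is below 0.05. Raising the penalty parameter lowers the threshold, so more entries survive as the outer loops of the algorithm progress.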

The variables X and k in Equation (16) are both constrained by the L_{2} norm, which can be solved by the method of fast Fourier transform. We solve for X by:

\begin{array}{l} X^{'} = \arg \min_{X} ε ‖ k \otimes X - G ‖_{2}^{2} + δ^{'} ‖ a - \nabla X ‖_{2}^{2} + ρ^{'} ‖ Ψ^{T} X - \tilde{X} ‖_{2}^{2} \\ + γ^{'} ‖ D (X) - b ‖_{2}^{2} + μ^{'} ‖ 1 - B (X) - c ‖_{2}^{2} \end{array}

(28)

To treat D (X) and 1 - B (X) consistently and simplify the solution, we use the property of the B (X) and D (X) functions that 1 - B (X) is equivalent to D (1 - X). In addition, because the function D (X) is non-linear, an equivalent linear operator M is introduced for it. M is essentially a mapping matrix, and its construction method is:

M (p, q) = \begin{cases} 1, & q = \arg \min_{q \in N (p)} X (q) \\ 0, & \text{otherwise} \end{cases}

(29)

The function of the M matrix is to transfer the minimum value in the image block centered on pixel p (i.e., the value of pixel q) to pixel p. As the transpose of M, M^{T} performs the reverse rearrangement: the pixel value at position p is used to replace the pixel value at position q. Therefore, Equation (28) can be expressed as:

\begin{array}{l} X^{'} = \arg \min_{X} ε ‖ k \otimes X - G ‖_{2}^{2} + δ^{'} ‖ a - \nabla X ‖_{2}^{2} + ρ^{'} ‖ Ψ^{T} X - \tilde{X} ‖_{2}^{2} \\ + γ^{'} ‖ M_{X} X - b ‖_{2}^{2} + μ^{'} ‖ M_{1 - X} (1 - X) - c ‖_{2}^{2} \end{array}

(30)

The solution of (30) can be obtained by FFT:

X = F^{- 1} (\frac{ε \bar{F (k)} \circ F (G) + δ^{'} F_{a} + γ^{'} F (M_{X}^{T} b) + μ^{'} F (M_{1 - X}^{T} c - 1) + ρ^{'} F (Ψ^{T} \tilde{X})}{ε \bar{F (k)} \circ F (k) + δ^{'} F_{\nabla} + γ^{'} + μ^{'} + ρ^{'}})

(31)

where F_{a} is \bar{F (\nabla_{h})} \circ F (a_{h}) + \bar{F (\nabla_{v})} \circ F (a_{v}); F_{\nabla} is \bar{F (\nabla_{h})} \circ F (\nabla_{h}) + \bar{F (\nabla_{v})} \circ F (\nabla_{v}); F (\cdot) and F^{- 1} (\cdot) denote the fast Fourier transform and inverse fast Fourier transform, respectively; \bar{F (\cdot)} is the complex conjugate operator; \circ denotes component-wise multiplication, and the division in Equation (31) is component-wise division. It should be noted that M and M^{T}, as linear operators, are never formed as explicit matrices or applied by matrix multiplication during the calculation; instead, a lookup table is set up according to their meaning. For example, M_{X}^{T} b is not computed as the product of the matrix M_{X}^{T} and b. Instead, according to the relationship that b is approximately equal to M_{X} X, the minimum value elements in X are replaced with the elements in b to obtain the result of M_{X}^{T} b. This avoids the generation and calculation of large matrices in the algorithm, and significantly improves the running speed.

We estimate the blur kernel k by:

k^{'} = \arg \min_{k} η ‖ k ‖_{2}^{2} + ε ‖ k \otimes X - G ‖_{2}^{2}

(32)

For the k subproblem, estimating the blur kernel directly from the intermediate latent image is not accurate [28]; therefore, the gradient image is used instead. The solution of k can then be obtained by solving the following:

k^{'} = \arg \min_{k} η ‖ k ‖_{2}^{2} + ε ‖ k \otimes \nabla X - \nabla G ‖_{2}^{2}

(33)

The solution of (33) can be obtained by FFT:

k = F^{- 1} (\frac{ε \bar{F (\nabla X)} F (\nabla G)}{ε \bar{F (\nabla X)} F (\nabla X) + η})

(34)

Since the blur kernel satisfies k \geq 0 and ‖ k ‖_{1} = 1, after each iteration of the k subproblem, we set the negative elements of k to zero and normalize k at the end. The solutions of all variables in the blur kernel estimation process have now been given, and Algorithm 1 shows the main steps of the blur kernel estimation algorithm. As suggested by [25,26,29], we decrease μ, γ, δ gradually to make more information available for kernel estimation.
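The closed-form update of Equation (34), followed by the projection onto k \geq 0 and ‖ k ‖_{1} = 1, can be sketched on synthetic data. The sketch assumes circular boundary conditions, a kernel stored with its origin at index (0, 0), and that the products in Equation (34) are summed over the horizontal and vertical gradients; the function names and test data are hypothetical.

```python
import numpy as np

def grad_h(X):
    """Circular horizontal difference."""
    return X - np.roll(X, 1, axis=1)

def grad_v(X):
    """Circular vertical difference."""
    return X - np.roll(X, 1, axis=0)

def estimate_kernel(X, G, ksize, eta, eps):
    """FFT solution of Eq. (33), then projection onto k >= 0 and sum(k) = 1."""
    num = np.zeros(X.shape, dtype=complex)
    den = np.zeros(X.shape)
    for d in (grad_h, grad_v):                  # sum over both gradient directions
        FX, FG = np.fft.fft2(d(X)), np.fft.fft2(d(G))
        num += eps * np.conj(FX) * FG
        den += eps * np.abs(FX) ** 2
    k_full = np.real(np.fft.ifft2(num / (den + eta)))
    k = np.maximum(k_full[:ksize, :ksize], 0.0) # crop the support, clip negatives
    return k / k.sum()                          # normalize so the entries sum to 1

rng = np.random.default_rng(0)
X = rng.random((32, 32))                        # stand-in for the latent image
k_true = rng.random((3, 3)) + 0.5
k_true /= k_true.sum()

# Synthesize G = k ⊗ X with circular convolution
k_pad = np.zeros_like(X)
k_pad[:3, :3] = k_true
G = np.real(np.fft.ifft2(np.fft.fft2(X) * np.fft.fft2(k_pad)))

k_est = estimate_kernel(X, G, ksize=3, eta=1e-8, eps=1.0)
```

Gradient images carry no information about the mean of k, so the raw FFT solution is only determined up to a constant offset; the final clipping and normalization fix the scale, and on this noise-free example `k_est` matches `k_true` closely.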

Algorithm 1: Blur Kernel Estimation Algorithm

Input: Blurred image Y
Generate the initial value of each variable.
for i = 1 : 5 do
    ε \leftarrow ε_{0}
    repeat
        solve for G using the gradient descent method; γ^{'} \leftarrow 2 γ, μ^{'} \leftarrow 2 μ
        repeat
            solve for b using (26); solve for c using (27); ρ^{'} \leftarrow 2 ρ
            repeat
                solve for \tilde{X} using (21); δ^{'} \leftarrow 2 δ
                repeat
                    solve for a using (25); solve for X using (31); δ^{'} \leftarrow 2 δ^{'}
                until δ^{'} > δ_{max}^{'}
                ρ^{'} \leftarrow 2 ρ^{'}
            until ρ^{'} > ρ_{max}^{'}
            γ^{'} \leftarrow 2 γ^{'}, μ^{'} \leftarrow 2 μ^{'}
        until γ^{'} > γ_{max}^{'} and μ^{'} > μ_{max}^{'}
        ε \leftarrow 4 ε
    until ε > ε_{max}
    solve for k using (34)
    μ \leftarrow 0.9 μ, γ \leftarrow 0.9 γ, δ \leftarrow 0.9 δ
end for
Output: blur kernel k
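The nested "repeat ... until" loops in Algorithm 1 follow a continuation pattern in which each penalty weight is doubled until it exceeds its maximum. The sketch below only illustrates this control flow for the two innermost loops; the helper names and the maximum values used in the usage line are hypothetical, since the upper bounds are not specified in this section.

```python
def repeat_until(start, maximum, factor=2.0):
    """Yield the values visited by a loop of the form
    'repeat ... value <- factor * value until value > maximum'."""
    value = start
    while True:
        yield value
        value *= factor
        if value > maximum:
            break

def count_inner_updates(delta, rho, delta_max, rho_max):
    """Count how often the innermost body (solving for a and X) would run."""
    updates = 0
    for rho_p in repeat_until(2 * rho, rho_max):
        for delta_p in repeat_until(2 * delta, delta_max):
            updates += 1  # here one would solve for a via (25) and X via (31)
    return updates

# Hypothetical bounds, chosen only to make the schedule visible.
print(list(repeat_until(0.008, 0.1)))
print(count_inner_updates(delta=0.004, rho=1.0, delta_max=0.1, rho_max=8.0))
```

Like a do-while loop, the body always runs at least once, and the number of passes grows only logarithmically with the ratio between the maximum and the starting value.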

Finally, only the solution of

X

in Equation (17) remains to be given. The solution method depends on the value of

α

. When

α = 2

, we solve for

X

by:

X^{'} = \arg \min_{X} ε ‖ k \otimes X - G ‖_{2}^{2} + ρ^{'} ‖ Ψ^{T} X - \tilde{X} ‖_{2}^{2} + \sum_{i} λ ‖ f_{i} \otimes X ‖_{2}^{2}

(35)

The initial reconstruction with the Gaussian prior serves only to extract the salient edges of the image. Therefore,

λ

should take a relatively large value to suppress ringing artifacts; we set

λ = 0.1 \times 10^{- 2}

. The solution of (35) can be obtained by FFT:

X = F^{- 1} (\frac{ε \bar{F (k)} \circ F (G) + ρ^{'} F (Ψ^{T} \tilde{X})}{ε \bar{F (k)} \circ F (k) + ρ^{'} + λ \sum_{i} \bar{F (f_{i})} \circ F (f_{i})})

(36)

When

α = 2 / 3

, the non-blind SR model is constructed by adaptive regularization, and we solve for

X

by:

\begin{array}{l} X^{'} = \arg \min_{X} ε ‖ k \otimes X - G ‖_{2}^{2} + ρ^{'} ‖ Ψ^{T} X - \tilde{X} ‖_{2}^{2} \\ + \sum_{m}^{M} \sum_{n}^{N} (\sum_{i} λ_{(m, n)} {|{(f_{i} \otimes X)}_{(m, n)}|}^{\frac{2}{3}}) \end{array}

(37)

where

M

and

N

are the number of pixels in the row and column direction of the image, respectively;

λ_{(m, n)}

can be obtained according to Equation (15). In order to facilitate the solution, the auxiliary variable

w_{i}

is introduced by the half-quadratic splitting method. Equation (37) can then be expressed as:

\begin{matrix} (X^{'}, w_{i}^{'}) = \arg \min_{X, w_{i}} ε ‖ k \otimes X - G ‖_{2}^{2} + ρ^{'} ‖ Ψ^{T} X - \tilde{X} ‖_{2}^{2} \\ + \sum_{i} (\sum_{m}^{M} \sum_{n}^{N} λ_{(m, n)} {|{(w_{i})}_{(m, n)}|}^{\frac{2}{3}} + ξ ‖ f_{i} \otimes X - w_{i} ‖_{2}^{2}) \end{matrix}

(38)

Equation (38) can be divided into the

X

sub-problem and the

w

sub-problem to be solved separately. We solve the

X

sub-problem by:

X^{'} = \arg \min_{X} ε ‖ k \otimes X - G ‖_{2}^{2} + ρ^{'} ‖ Ψ^{T} X - \tilde{X} ‖_{2}^{2} + \sum_{i} ξ ‖ f_{i} \otimes X - w_{i} ‖_{2}^{2}

(39)

The solution method is the same as that for (35).

X

can be obtained by FFT:

X = F^{- 1} (\frac{ε \bar{F (k)} \circ F (G) + ρ^{'} F (Ψ^{T} \tilde{X}) + ξ \sum_{i} \bar{F (f_{i})} \circ F (w_{i})}{ε \bar{F (k)} \circ F (k) + ρ^{'} + ξ \sum_{i} \bar{F (f_{i})} \circ F (f_{i})})

(40)

After obtaining

X

, we solve the

w

sub-problem. Let

f_{i} \otimes X = v

, and the objective function for each element can then be written as:

w^{'} = \arg \min_{w} λ {|w|}^{\frac{2}{3}} + ξ {(v - w)}^{2}

(41)

Setting the derivative of the objective in Equation (41) to zero gives:

\frac{2 λ}{3} {|w|}^{- \frac{1}{3}} \operatorname{sign} (w) + 2 ξ (w - v) = 0

(42)

Isolating the first term, cubing both sides, and multiplying by w transforms Equation (42) into:

w^{4} - 3 v w^{3} + 3 v^{2} w^{2} - v^{3} w + \frac{λ^{3}}{27 ξ^{3}} = 0

(43)

Let r denote a root of Equation (43). According to [30], when

r

is between

v / 2

and

v

, the solution of Equation (41) is

w^{'} = r

, otherwise

w^{'} = 0

. Moreover, since the solution of Equation (43) depends only on the ratio of

λ

and

ξ

and the value of the variable

v

, it can be obtained from a lookup table (LUT). Here,

ξ

is an integer power of

\sqrt{2}

between 1 and 256,

λ

is

0.5 \times 10^{- 2}

or

2.5 \times 10^{- 4}

, and

v

takes

15,000

different values between

- 0.9

and

0.9

. Solving Equation (43) for each combination in turn yields an offline lookup table. The LUT gives a solution to the objective function with an accuracy close to that of the analytical method, but at a faster speed [30].
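The per-element solution of the w sub-problem can be sketched by solving the quartic (43) numerically and applying the root-selection rule quoted from [30]. This is an illustrative sketch, not the authors' LUT implementation: the function names are hypothetical, `numpy.roots` stands in for an analytical quartic solver, and negative v is handled by symmetry, which is an assumption consistent with the even penalty in (41).

```python
import numpy as np

def objective(w, v, lam, xi):
    """Scalar objective of (41): lam * |w|^(2/3) + xi * (v - w)^2."""
    return lam * np.abs(w) ** (2.0 / 3.0) + xi * (v - w) ** 2

def solve_w(v, lam, xi):
    """Return the minimizer of (41) for one element using the roots of (43)."""
    if v == 0:
        return 0.0
    sign, a = (1.0, v) if v > 0 else (-1.0, -v)
    # Coefficients of w^4 - 3 a w^3 + 3 a^2 w^2 - a^3 w + lam^3 / (27 xi^3).
    coeffs = [1.0, -3.0 * a, 3.0 * a ** 2, -a ** 3, lam ** 3 / (27.0 * xi ** 3)]
    for r in np.roots(coeffs):
        # Keep a real root lying between a/2 and a; otherwise the answer is 0.
        if abs(r.imag) < 1e-8 and a / 2.0 < r.real < a:
            return sign * float(r.real)
    return 0.0

def build_lut(v_values, lam, xi):
    """Tabulate solve_w over a grid of v for one (lam, xi) pair."""
    return np.array([solve_w(v, lam, xi) for v in v_values])
```

A table built with `build_lut(np.linspace(-0.9, 0.9, 15000), lam, xi)` for each admissible pair of λ and ξ would play the role of the offline LUT; at run time, the value for a given v would be read from the nearest grid entry or interpolated.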

We have now given the solution of each unknown variable in the objective functions of blur kernel estimation and non-blind SR. The non-blind SR reconstruction process for the different values of

α

is shown in Figure 5.

5. Experiment and Result Analysis

Our test environment was as follows: CPU: Intel(R) Core(TM) i5-9300H @ 2.40 GHz; memory: 16.00 GB; operating system: Windows 10; software: MATLAB R2019a. We obtained the following fixed parameters through repeated experiments and adjustments:

δ = γ = μ = 0.004

;

ε_{0} = 0.25

;

η = ρ = ξ = 1

;

σ = 0.3

. The image block size used for the dark channel search was 35 × 35. The sparse basis

Ψ

was the Daubechies 8 wavelet basis.

In order to make the experimental results more convincing, in addition to comparing our method with classical blind SR methods, we also designed comparative experiments for the blur kernel estimation and non-blind SR reconstruction parts of our method. In the blind SR comparison experiment, we compared our method with the methods proposed by Keys [31], Shao [20], Michaeli [23], and Kim [22]. Since no original, clear HR image is available for the actual infrared images of the power equipment, full-reference indicators cannot be used, and we adopted two no-reference objective evaluation indicators: average gradient (AG) and information entropy (IE). The calculation method of AG is as follows:

A G = \frac{1}{M N} \sum_{i} \sum_{j} \sqrt{f_{x}^{2} (i, j) + f_{y}^{2} (i, j)}

(44)

where

f_{x} (i, j)

and

f_{y} (i, j)

are the results of convolving the image with the difference operator in the row and column directions, respectively. The larger the AG value, the more sharply the grayscale changes in the image and the richer its gradation levels; that is, the clearer the image.
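Equation (44) can be evaluated with a few lines of NumPy. The sketch below is an assumption-laden illustration: the function name is hypothetical, and simple forward differences with a zero difference at the last row and column are used as the difference operator, which the text does not specify.

```python
import numpy as np

def average_gradient(img):
    """Average gradient (AG) of Equation (44) using forward differences."""
    img = np.asarray(img, dtype=float)
    # Difference along each row and each column; the last column/row is
    # repeated so that the result keeps the M x N shape of the image.
    fx = np.diff(img, axis=1, append=img[:, -1:])
    fy = np.diff(img, axis=0, append=img[-1:, :])
    return float(np.mean(np.sqrt(fx ** 2 + fy ** 2)))
```

A constant image gives an AG of zero, and stronger gray-level transitions increase the value, which matches the interpretation of AG as a sharpness indicator.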

In physics, entropy describes the uniformity of a system: the more uniformly a system is distributed, the greater its entropy. The concept of image information entropy is derived from this and is defined as follows:

I E = - \sum_{i = 0}^{n} p (i) \log_{2} p (i)

(45)

where

p (i)

represents the relative frequency of pixels with the gray value

i

in the image. The larger the IE value, the richer the information contained in the image.
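A short sketch of Equation (45) follows. It assumes an integer-valued grayscale image with 256 gray levels (the function name and the `levels` parameter are hypothetical), and it skips gray values that do not occur, using the usual convention that 0 · log 0 = 0.

```python
import numpy as np

def information_entropy(img, levels=256):
    """Information entropy (IE) of Equation (45) for an integer-valued image."""
    img = np.asarray(img)
    counts = np.bincount(img.ravel(), minlength=levels)
    p = counts / counts.sum()   # relative frequency of each gray value
    p = p[p > 0]                # unused gray values contribute nothing
    return float(-np.sum(p * np.log2(p)))
```

For example, an image whose pixels are split evenly between two gray values has an IE of 1 bit, and an 8-bit image can reach at most 8 bits.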

In addition, in order to prove the effectiveness of the blur kernel estimation method in this paper, we compared the blur kernels estimated by our method and the algorithms proposed in [20,23,32]. We used the sum of the squared differences error (SSDE) to evaluate the accuracy of the estimated blur kernel:

S S D E = \sum_{i} \sum_{j} {(k_{e s t} (i, j) - k_{g t} (i, j))}^{2}

(46)

where

k_{e s t}

represents the estimated blur kernel and

k_{g t}

represents the true blur kernel of the image.
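Equation (46) translates directly into code. The following minimal sketch assumes that both kernels are given as arrays of the same size; the function name is hypothetical.

```python
import numpy as np

def ssde(k_est, k_gt):
    """Sum of squared differences error (SSDE) of Equation (46)."""
    k_est = np.asarray(k_est, dtype=float)
    k_gt = np.asarray(k_gt, dtype=float)
    if k_est.shape != k_gt.shape:
        raise ValueError("kernels must have the same shape")
    return float(np.sum((k_est - k_gt) ** 2))
```

A smaller SSDE indicates an estimated kernel that is closer to the true one, and a value of zero means the two kernels are identical.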

Finally, in order to verify the performance of the non-blind SR reconstruction method in this paper, we use known blur kernels to process clear HR infrared images according to Equation (1). Taking the synthetic infrared image and the known blur kernel as input, we compare our method with existing non-blind SR methods. We select the methods proposed by Keys [31], Glasner [13], Dong [15] and Zhao [17] as the comparison methods. Since the artificially synthesized infrared images have original, clear HR images, the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) can be used to evaluate the reconstruction results.
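The text does not restate the PSNR formula, so the sketch below uses the standard definition, 10 log10(peak² / MSE). The function name and the default peak value of 255 (8-bit images) are assumptions made for illustration.

```python
import numpy as np

def psnr(reference, reconstructed, peak=255.0):
    """Peak signal-to-noise ratio in dB between two images of equal shape."""
    ref = np.asarray(reference, dtype=float)
    rec = np.asarray(reconstructed, dtype=float)
    mse = np.mean((ref - rec) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(peak ** 2 / mse))
```

Higher PSNR values indicate a reconstruction that is closer to the reference HR image in the mean-squared-error sense.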

5.1. Blind SR Comparison Experiment

First, we reconstruct LR infrared images of power equipment collected in the field to verify the effectiveness of the blind SR method in practical application. For the experiment, we used eleven infrared images taken on site with a resolution of 128 × 128 for SR reconstruction. Figure 6 shows the images reconstructed using different methods for the 11th LR infrared image.

As shown in Figure 6, the method proposed by Keys does not consider the influence of the blur kernel when reconstructing the image, and interpolation algorithms have an inherent smoothing effect. The reconstruction result obtained by this method has the worst visual effect and shows no obvious difference from the LR image. Compared with the original LR image, the visual quality of the image reconstructed by Shao’s method is significantly improved, but the detailed texture is still not clear enough. Obvious artifacts and ringing appear in the reconstruction results of Michaeli’s method. This is a common problem caused by improper regularization intensity in the SR reconstruction process, and it is also the problem that this paper focuses on solving. Although the image reconstructed by Kim’s method has higher contrast and brighter colors, the enlarged part in the green box shows that the edges are too smooth. On the whole, the edge texture of the image reconstructed by our method is the clearest, and there are no artifacts or ringing. This shows that our method has certain performance advantages over the comparison methods. The AG and IE values of the remaining 10 images reconstructed using different methods are given in Figure 7. It can be seen that Kim’s and Shao’s methods are similar in performance. Michaeli’s method has significantly higher index values, which is due to the artifacts produced by improper control of the regularization intensity. Generally speaking, the infrared images reconstructed by our method have obvious advantages over the comparison methods in the evaluation indices.

5.2. Experiment of Blur Kernel Estimation

In order to prove the effectiveness of the blur kernel estimation method in this paper, we used each of the six blur kernels shown in Figure 8 in turn to blur 100 infrared images, and then downsampled the results by a factor of two. We used our method and the comparison methods to estimate the blur kernel from the LR blurred images. For each blur kernel, the average SSDE values of the blur kernels estimated by the different methods on the 100 synthetic blurred infrared images are shown in Table 1. The data in Table 1 show that our blur kernel estimation method achieves a lower SSDE, and thus higher accuracy, than the comparison methods for every blur kernel.

5.3. Non-Blind SR Comparison Experiment

In this section, we use the six blur kernels in Section 5.2 to process 10 HR and clear infrared images according to Equation (1) to obtain 60 artificially synthesized LR images. Figure 9 shows the LR infrared image of the 10th HR image synthesized by BK6, as well as the reconstruction results of different non-blind SR methods.

It can be seen from Figure 9 that the method proposed by Keys has the worst visual effect due to the inherent smoothing effect of interpolation algorithms, and the transformer texture is almost invisible. The visual quality of Glasner’s and Dong’s methods is similar, but the texture is not as clear as that of our proposed method, especially in small local details. The reconstruction result of Zhao’s method is somewhat distorted, and the image is over-sharpened, resulting in excessive brightness. This is very harmful in infrared diagnosis, because it can easily cause operation and maintenance personnel to misjudge the operating temperature of the equipment. Additionally, according to the part marked in the red box, its ability to reconstruct small textures clearly falls short of our method. Due to the large number of images used in the experiment, the reconstruction results of the remaining images are given in the form of evaluation parameters. The PSNR and SSIM values of the images reconstructed by the different algorithms are shown in Table 2 and Table 3, respectively. Due to the large amount of data, the objective evaluation parameters of the reconstruction results of the same HR image processed by different blur kernels are averaged before being displayed. It can be seen from Table 2 and Table 3 that the performance of our method is significantly improved compared with the comparison methods.

5.4. Sensitivity Analysis

The blur kernel estimation model in this paper involves many parameters, and in this section we analyze the influence of their values. The model involves five main parameters

δ, η, γ, μ

and

ρ

. In order to analyze the influence of these parameters on the blur kernel estimation, we collect 10 blurred images for testing. For each parameter, we vary its value while fixing the others, and use the kernel similarity metric to measure the accuracy of the estimated kernels. For parameter

δ

, we set its values from

10^{- 5}

to 0.01 with the step size of

5 \times 10^{- 4}

. Figure 10a demonstrates that blur kernels can be well estimated over a wide range of

δ

, i.e., within

[0.001, 0.01]

. Similarly, we set the values of

η

and

ρ

from 0 to 2 in increments of 0.1, and the values of

γ

and

μ

from 0 to 0.01 in increments of

5 \times 10^{- 4}

. The experimental results of

η

and

ρ

parameters are shown in Figure 10b,c. Since

γ = μ

in the actual calculation process, the result is displayed by one curve, as shown in Figure 10d. The experimental results show that the proposed blur kernel estimation algorithm performs well with a wide range of parameter settings. In addition, when

γ = μ = 0

, it can be seen from Figure 10d that the blur kernel estimation is extremely poor, which also demonstrates the necessity of introducing the extreme channels prior into the blur kernel estimation model in this paper.

5.5. Comparison with the Deep Learning Method

In order to reflect the advantages of this algorithm in practical applications, this section compares our super-resolution reconstruction method with an advanced deep learning method [33]. At this stage, deep learning-based super-resolution algorithms require a large number of high-definition images as training samples. When training resources are insufficient, the performance of such methods decreases significantly. The algorithm in this paper can achieve high-quality image reconstruction without training samples. Figure 11 compares our reconstruction results with those of the deep learning method after training the model with 500 and 2000 infrared images. It can be seen from Figure 11 that the deep learning algorithm exhibits obvious grid artifacts and color distortion when the training data are insufficient. When the training data are sufficient, the contrast of some edge detail textures is slightly higher than that of our method. However, the result of the learning method contains a certain amount of false texture, which has a bad influence on infrared diagnosis. In summary, our method does not require training samples, and its reconstruction results are more accurate, so it has better practical application value in the electric power field.

6. Conclusions

In order to improve the quality of SR reconstructed images, and thereby facilitate the status monitoring and fault analysis of power equipment, we propose a blind SR method based on compressed sensing. Our method is divided into two parts: blur kernel estimation and non-blind SR reconstruction. For the blur kernel estimation part, we improve the basic SR model of compressed sensing to obtain the basic blur kernel estimation model. In order to improve the estimation accuracy of the blur kernel, we introduce the extreme channels prior based on the color characteristics of the infrared image. For the non-blind SR reconstruction part, we propose an adaptive non-blind SR reconstruction algorithm. It adaptively controls the intensity coefficient of the regularization term during the reconstruction process to suppress artifacts and ringing and improve the quality of the reconstructed image. The above blur kernel estimation method and non-blind SR method are combined to form our blind SR method. In the experimental part, we compare the two parts of the blind SR method with their corresponding existing classical methods to illustrate the performance advantages of our method. The experimental results show that our method estimates the blur kernel more accurately and completes the non-blind SR reconstruction of LR infrared images with higher quality. The HR infrared images reconstructed by our method have more detailed textures and better visual effects, which can provide better conditions for the application of power system infrared diagnosis. With the power industry actively constructing the IoT, our method provides a feasible way to reduce the hardware cost of this construction. We believe that using low-cost sensors at the front end to collect information and algorithms at the back end to recover high-quality images has broad application prospects. This approach can effectively reduce the construction cost of the IoT and the cost of data transmission and storage. Because the construction of the power IoT is still in its infancy, we did not choose a data-driven method for infrared image super-resolution reconstruction. When data acquisition and storage systems become more standardized and mature in the future, it would also be interesting to train dictionaries according to the types of power equipment and update them online to achieve sparse representation of different images, which may achieve a better reconstruction effect while ensuring the accuracy of image information.

Author Contributions

The conceptualization, data curation, formal analysis, and methodology were performed by H.Z. The software, supervision, formal analysis, validation, and writing—original draft preparation, review, and editing were performed by L.W. and B.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Jalil, B.; Leone, G.R.; Martinelli, M.; Moroni, D.; Pascali, M.A.; Berton, A. Fault Detection in Power Equipment via an Unmanned Aerial System Using Multi Modal Data. Sensors 2019, 19, 3014.
2. Zhao, H.; Zhang, Z. Improving Neural Network Detection Accuracy of Electric Power Bushings in Infrared Images by Hough Transform. Sensors 2020, 20, 2931.
3. Pan, Z.; Yu, J.; Huang, H.; Hu, S.; Zhang, A.; Ma, H.; Sun, W. Super-resolution based on compressive sensing and structural self-similarity for remote sensing images. IEEE Trans. Geosci. Remote Sens. 2013, 51, 4864–4876.
4. Dong, W.; Zhang, L.; Lukac, R.; Shi, G. Sparse representation-based image interpolation with nonlocal autoregressive modeling. IEEE Trans. Image Process. 2013, 22, 1382–1394.
5. Wang, L.; Wu, H.; Pan, C. Fast image upsampling via the displacement field. IEEE Trans. Image Process. 2014, 23, 5123–5135.
6. Zhang, Y.; Fan, Q.; Bao, F.; Liu, Y.; Zhang, C. Single-image super-resolution based on rational fractal interpolation. IEEE Trans. Image Process. 2018, 27, 3782–3797.
7. Yang, J.; Wang, Z.; Lin, Z.; Cohen, S.; Thomas, H. Coupled dictionary training for image super-resolution. IEEE Trans. Image Process. 2012, 21, 3467–3478.
8. Dong, C.; Loy, C.C.; He, K.; Tang, X. Learning a deep convolutional network for image super-resolution. In Proceedings of the 13th European Conference on Computer Vision (ECCV 2014), Zurich, Switzerland, 6–12 September 2014; pp. 184–199.
9. Yang, W.; Feng, J.; Yang, J.; Zhao, F.; Liu, J.; Guo, Z.; Yan, S. Deep edge guided recurrent residual learning for image super-resolution. IEEE Trans. Image Process. 2017, 26, 5895–5907.
10. Schulter, S.; Leistner, C.; Bischof, H. Fast and accurate image upscaling with super-resolution forests. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), Boston, MA, USA, 7–12 June 2015; pp. 3791–3799.
11. Li, L.; Xie, Y.; Hu, W.; Zhang, W. Single image super-resolution using combined total variation regularization by split Bregman Iteration. Neurocomputing 2014, 142, 551–560.
12. Rasti, P.; Demirel, H.; Anbarjafari, G. Improved iterative back projection for video super-resolution. In Proceedings of the 22nd Signal Processing and Communications Applications Conference (SIU 2014), Trabzon, Turkey, 23–25 April 2014.
13. Glasner, D.; Bagon, S.; Irani, M. Super-resolution from a single image. In Proceedings of the IEEE International Conference on Computer Vision (ICCV 2009), Kyoto, Japan, 29 September–2 October 2009; pp. 349–356.
14. Šroubek, F.; Kamenický, J.; Milanfar, P. Superfast superresolution. In Proceedings of the IEEE International Conference on Image Processing (IEEE ICP 2011), Brussels, Belgium, 11–14 September 2011; pp. 1153–1156.
15. Dong, W.; Zhang, L.; Shi, G.; Li, X. Nonlocally centralized sparse representation for image restoration. IEEE Trans. Image Process. 2012, 22, 1620–1630.
16. Yanovsky, I.; Lambrigtsen, B.H.; Tanner, A.B.; Luminita, A.V. Efficient deconvolution and super-resolution methods in microwave imagery. IEEE J. Stars 2015, 8, 4273–4283.
17. Zhao, N.; Wei, Q.; Basarab, A.; Dobigeon, N.; Kouamé, D.; Tourneret, J.Y. Fast Single Image Super-Resolution Using a New Analytical Solution for l2–l2 Problems. IEEE Trans. Image Process. 2016, 25, 3683–3697.
18. Efrat, N.; Glasner, D.; Apartsin, A.; Nadler, B.; Levin, A. Accurate blur models vs. in image priors in single image super-resolution. In Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV 2013), Sydney, NSW, Australia, 1–8 December 2013; pp. 2832–2839.
19. Riegler, G.; Schulter, S.; Ruther, M.; Bischof, H. Conditioned regression models for non-blind single image super-resolution. In Proceedings of the IEEE International Conference on Computer Vision (ICCV 2015), Santiago, Chile, 7–13 December 2015; pp. 522–530.
20. Shao, W.Z.; Ge, Q.; Wang, L.Q.; Lin, Y.Z.; Deng, H.S.; Li, H.B. Nonparametric Blind Super-Resolution Using Adaptive Heavy-Tailed Priors. J. Math. Imaging Vis. 2019, 61, 885–917.
21. Qian, Q.; Gunturk, B.K. Blind super-resolution restoration with frame-by-frame nonparametric blur estimation. Multidimens Syst. Signal. Process. 2016, 27, 255–273.
22. Kim, W.H.; Lee, J.S. Blind single image super resolution with low computational complexity. Multimed. Tools. Appl. 2017, 76, 7235–7249.
23. Michaeli, T.; Irani, M. Nonparametric Blind Super-resolution. In Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV), Sydney, NSW, Australia, 1–8 December 2013; pp. 945–952.
24. Donoho, D.L. Compressed sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306.
25. Yan, Y.; Ren, W.; Guo, Y.; Rui, W.; Xiaochun, C. Image deblurring via extreme channels prior. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 4003–4011.
26. Pan, J.; Sun, D.; Pfister, H.; Yang, M.H. Blind image deblurring using dark channel prior. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 26 June–1 July 2016; pp. 1628–1636.
27. Li, C. An Efficient Algorithm for Total Variation Regularization with Applications to the Single Pixel Camera and Compressive Sensing. Master’s Thesis, Rice University, Houston, TX, USA, 2010.
28. Tang, S.; Zheng, W.; Xie, X.; He, T.; Yang, P.; Luo, L.; Zhao, H. Multi-regularization-constrained blur kernel estimation method for blind motion deblurring. IEEE Access 2019, 7, 5296–5311.
29. Cho, S.; Lee, S. Fast Motion Deblurring. ACM Trans. Graph. 2009, 28, 1–8.
30. Krishnan, D.; Fergus, R. Fast image deconvolution using hyper-Laplacian priors. Adv. Neural Inf. Process. Syst. 2009, 22, 1033–1041.
31. Keys, R.G. Cubic convolution interpolation for digital image processing. IEEE Trans. Acoust. Speech Signal Process. 2003, 29, 1153–1160.
32. Liang, J.; Zhang, K.; Gu, S.; Gool, L.V.; Timofte, R. Flow-based Kernel Prior with Application to Blind Super-Resolution. arXiv 2021, arXiv:2103.15977.
33. Zhang, K.; Gool, L.V.; Timofte, R. Deep unfolding network for image super-resolution. In Proceedings of the IEEE International Conference on Computer Vision (ICCV 2020), Seattle, WA, USA, 13–19 June 2020; pp. 3217–3226.

Figure 1. Extreme channels of infrared images before and after blurring: (a) dark channel of clear and blurred images; (b) bright channel of clear and blurred images.

Figure 2. Fitting results of infrared image gradient probability density curve and different distributions: (a) typical infrared image; (b) fitting effects of different distributions.

Figure 3. Infrared image reconstruction results under different regularization intensity: (a) LR image; (b) strongly regularized reconstruction results; (c) weak regularization reconstruction results; (d) results of the proposed method.

Figure 4. Flow chart of adaptive regularization intensity adjustment method.

Figure 5. Non-blind SR flow chart.

Figure 6. Reconstruction results of different methods: (a) LR infrared image; (b) reconstruction results using the method in [31] (AG = 32.169; IE = 5.932); (c) reconstruction results using the method in [20] (AG = 35.921; IE = 5.991); (d) reconstruction results using the method in [23] (AG = 40.862; IE = 6.073); (e) reconstruction results using the method in [22] (AG = 38.910; IE = 6.040); (f) reconstruction results using our method (AG = 43.280; IE = 6.161).

Figure 7. AG and IE parameter values of reconstruction results of different methods.

Figure 8. Six different blur kernels: (a) BK1; (b) BK2; (c) BK3; (d) BK4; (e) BK5; (f) BK6.

Figure 9. Reconstruction results of different methods when using BK6: (a) HR infrared image; (b) reconstruction results using the method in [31] (PSNR = 28.338 dB; SSIM = 0.911); (c) reconstruction results using the method in [13] (PSNR = 30.542 dB; SSIM = 0.927); (d) reconstruction results using the method in [15] (PSNR = 31.213 dB; SSIM = 0.952); (e) reconstruction results using the method in [17] (PSNR = 29.372 dB; SSIM = 0.940); (f) reconstruction results using our method (PSNR = 32.503 dB; SSIM = 0.963).

Figure 10. Sensitivity analysis of δ, η, γ, μ and ρ for the blur kernel estimation model: (a) δ parameter; (b) η parameter; (c) ρ parameter; (d) γ and μ parameters.

Figure 11. Comparison between the deep learning method and our reconstruction method: (a) synthetic LR image; (b) deep learning method with inadequate training; (c) deep learning method with adequate training; (d) our method; (e) original image.

Table 1. The mean SSDE of each method for each BK on all 100 synthetic blurred images.

Blur Kernel	Shao	Michaeli	Liang	Ours
BK1	0.0473	0.0485	0.0471	0.0464
BK2	0.0472	0.0438	0.0430	0.0419
BK3	0.0467	0.0444	0.0436	0.0423
BK4	0.0390	0.0379	0.0369	0.0351
BK5	0.0431	0.0429	0.0425	0.0415
BK6	0.0422	0.0406	0.0402	0.0395

Table 2. PSNR values (dB) of the reconstruction results of different methods.

Image Number	Keys	Glasner	Dong	Zhao	Ours
1	24.144	25.556	25.789	28.345	29.137
2	27.752	29.622	32.439	32.597	33.778
3	24.372	25.733	27.128	28.135	30.015
4	25.719	27.352	28.095	30.420	31.582
5	26.483	27.303	27.406	29.248	31.738
6	27.851	29.844	29.978	31.581	33.833
7	23.438	24.324	28.360	29.587	30.974
8	20.518	24.047	24.322	26.452	28.039
9	24.804	26.015	28.222	30.799	31.129
10	24.173	24.405	25.642	26.825	29.384

Table 3. SSIM values of the reconstruction results of different methods.

Image Number	Keys	Glasner	Dong	Zhao	Ours
1	0.787	0.839	0.873	0.884	0.904
2	0.881	0.920	0.954	0.948	0.967
3	0.883	0.903	0.959	0.949	0.973
4	0.832	0.864	0.877	0.927	0.931
5	0.828	0.856	0.865	0.899	0.925
6	0.818	0.827	0.849	0.858	0.905
7	0.798	0.833	0.888	0.902	0.931
8	0.797	0.829	0.866	0.898	0.908
9	0.800	0.826	0.846	0.918	0.922
10	0.899	0.919	0.926	0.931	0.952

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

