Article

Infrared Image Adaptive Enhancement Guided by Energy of Gradient Transformation and Multiscale Image Fusion

Feiran Chen, Jianlin Zhang, Jingju Cai, Tao Xu, Gang Lu and Xianrong Peng

1 Key Laboratory of Optical Engineering, Institute of Optics and Electronics, Chinese Academy of Sciences, No. 1, Optoelectronic Avenue, Wenxing Town, Shuangliu District, Chengdu 610209, China
2 University of Chinese Academy of Sciences, Beijing 100039, China
3 Systems Engineering Research Institute, Beijing 100036, China
* Author to whom correspondence should be addressed.
Appl. Sci. 2020, 10(18), 6262; https://doi.org/10.3390/app10186262
Submission received: 21 July 2020 / Revised: 24 August 2020 / Accepted: 26 August 2020 / Published: 9 September 2020
(This article belongs to the Special Issue Infrared Imaging and NDT)

Abstract: The detail enhancement and dynamic range compression of infrared (IR) images is an important and practically necessary topic in the domain of IR image processing. This paper provides a novel approach for displaying high dynamic range (HDR) infrared images on common display equipment with appropriate contrast and clear detail information. The steps are chiefly as follows. First, in order to preserve the weak global details in different regions of the image, we adjust the original normalized image into multiple brightness levels by an adaptive Gamma transformation. Second, each brightness image is decomposed into a base layer and several detail layers by a multiscale guided filter, and the details of each image are enhanced separately. Third, to obtain an image with the global details of the input image, the enhanced images at each brightness level are fused together. Last, we filter out the outliers and adjust the dynamic range before outputting the image. Experimental results demonstrate that, compared with conventional and state-of-the-art methods, the proposed approach is effective and robust in the dynamic range compression and detail enhancement of IR images.

1. Introduction

Infrared sensors capture the thermal radiation emitted by objects and are little affected by dark conditions. They are widely applied in detection, scene surveillance, reconnaissance, navigation, etc., due to their ability to operate 24 h a day. However, IR images have obvious shortcomings compared with visible images, including low contrast, weak details, and blurred edges, which hinder observation. Consequently, infrared sensors with a high dynamic range (HDR, >8 bit) have been widely adopted in practical applications in recent years to capture more details. If HDR images are displayed directly on standard 8 bit equipment, some information in the original image cannot be represented. A procedure for high-quality visualization of HDR infrared images must take the following problems into consideration. First and foremost, the dynamic range of the output should be mapped to be acceptable for the display device. Meanwhile, in order to take advantage of the HDR sensor and facilitate subsequent work, weak details should be enhanced. Last but not least, the output should be as visually pleasing as possible.
The core idea of many conventional approaches is adjusting the distribution of the gray levels. These methods include linear stretching, curve stretching (logarithmic, gamma, and sigmoid), methods based on histogram equalization (HE) [1], and gradient domain methods.
Linear stretching can compress the dynamic range to be acceptable for display equipment, but it leads to detail loss. Curve stretching such as the Gamma transformation can increase image contrast, but the fitting parameters change from image to image, so manual, experience-based selection of parameters is required for each image.
HE enlarges the contrast of the image by redistributing the image pixel values so that the number of pixels in each gray level is approximately equal. Researchers have made plenty of efforts based on HE [2,3,4,5,6,7,8], including global HE-based approaches and local HE-based approaches. Global HE-based approaches such as Plateau HE [3,4] and its improved variants [2,7] can make the gray levels distribute more reasonably, but their ability to preserve details is insufficient. Local HE-based approaches such as partially overlapped sub-block histogram equalization (POSHE) [8] and contrast-limited adaptive histogram equalization (CLAHE) [5] can generate more details, but they tend to induce artifacts, over-enhancement, and blocking effects. In short, most existing HE-based methods cannot avoid the contradiction between maintaining local details and keeping the global consistency of the entire image, because they only take histogram information into consideration.
The main idea of gradient domain methods [9,10] is to attenuate large gradients while expanding small gradients to produce a modified gradient field, and then reconstruct the result image by solving a Poisson equation. Generally, gradient domain operators are capable of achieving appropriate dynamic range compression while avoiding artifacts such as halos and gradient reversal. However, they may be limited in enhancing local details effectively, and careful selection of parameters is necessary, which limits their practical application.
Recently, multiscale decomposition methods have become prevalent in the domain of HDR image display. These methods use filters to decompose the image into several components, process each component separately, and then compose the processed components to yield the result image. F. Durand et al. [11] introduced a method to display HDR images in which the image is decomposed into a base layer and a detail layer by the bilateral filter; the contrast of the base layer is attenuated while the contrast of the detail layer is kept, so the details are preserved. Nowadays, an increasing number of researchers have adopted the guided filter [12,13] to process images for its simplicity and efficiency without gradient reversal artifacts. B. Gu et al. [14] presented an edge-preserving filter with a locally adaptive property, which is particularly effective in preserving or enhancing local details. Besides, researchers have also applied the top-hat transform [15,16] or the wavelet transform [17] to decompose the image and process the layers individually before composing them. Although multiscale decomposition methods are strong in detail enhancement, they sometimes trigger halo artifacts around strong edges.
Researchers have also presented various methods [20,21,22,23,24,25,26] based on the human visual system (HVS) and Retinex theory [18,19]. The critical challenge of these algorithms is the trade-off between calculation speed and processing effect: better results can be obtained by constructing more complex models, but the intricate structure of such models increases the computational complexity, which limits their widespread application. Meanwhile, HVS-based approaches are generally more appropriate for visible images with sufficient details than for infrared images, which lack details.
In conclusion, the inadequacies of the existing methods mainly include (1) the contradiction between maintaining local details and keeping the global consistency of the entire image, (2) too many parameters that need to be selected manually by experience, and (3) poor robustness for dim images lacking details.
Nowadays, infrared and visible image fusion [27,28] and multi-exposure image fusion [29,30] are both research hotspots in image processing. The visible light sensor mainly captures reflected light, so the visible light image has abundant background information. In contrast, the infrared sensor captures the thermal radiation emitted by objects and is little affected by dark conditions or dim weather. Therefore, the fusion of infrared and visible light images can provide richer and more detailed scene information. Similarly, each of a set of multi-exposure images has its own unique details; if these details are well fused into one image, a high-quality image with abundant details can be produced.
This paper presents a novel approach based on adaptive transformation and image fusion to overcome the problems above and display HDR infrared images on LDR display equipment with appropriate contrast and clear, abundant detail information. Inspired by the idea of image fusion, we transform the original image into multiple brightness levels by Gamma transformation, followed by multiscale guided filter enhancement, to keep and enhance the details across the entire image. In order to simplify the selection of parameters, we adopt the energy of gradient (EOG) to guide the transformation, and entropy is utilized to guide the multiscale guided filter enhancement. Experimental results show that our method achieves acceptable results with fixed parameters; for typical HDR infrared images of various scenes, the effect of our method is robust.
The rest of this paper is organized as follows. Section 2 describes the fundamental theory and the specific steps of the proposed method. In Section 3, our experimental comparison of the methods is described in detail. In Section 4, the results are discussed. Finally, the conclusions are presented in Section 5.

2. Proposed Theory

The proposed framework is shown in Figure 1. First, in order to keep the weak global details in different areas, we adopt an adaptive Gamma transformation to adjust the original normalized image into multiple brightness levels. Second, the multiscale guided filter is utilized to decompose the image at each brightness level into a base layer and detail layers, and the details of each image are enhanced separately. Third, to obtain an image with the global details of the original image, we fuse the enhanced images at each brightness level together. Last, we filter out the outliers and adjust the dynamic range before outputting the image.

2.1. EOG Guided Gray Distribution Adjustment

Generally, the dynamic range of an HDR IR image (14 bit, 16 bit, or more) far exceeds that of typical display equipment. Linear mapping is widely used due to its simplicity, but it is not suitable for most IR images, whose gray levels are unevenly distributed. Different gamma correction parameters have different stretching effects on the image: a smaller gamma value brightens the entire image and increases the contrast in darker areas, while a larger gamma value darkens the entire image and increases the contrast in brighter areas. In order to keep the weak global details in different areas, we adjust the original normalized image into multiple brightness levels. However, manual, experience-based selection of parameters is generally required for each image, which is inconvenient in application.

2.1.1. Energy of Gradient

The energy of gradient (EOG) is a well-established criterion for evaluating the clarity of an infrared image, due to its simplicity and accuracy, and we choose it as the evaluation criterion for the richness of image details. Let $f(x, y)$ be the value of the pixel $(x, y)$. The EOG can be calculated as follows,
$$\mathrm{EOG}(im) = \sum_x \sum_y \left( f_x^2 + f_y^2 \right)$$

where $im$ is an image, and

$$f_x = f(x+1, y) - f(x, y)$$

$$f_y = f(x, y+1) - f(x, y)$$
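For illustration, a minimal NumPy sketch of this computation follows (the paper's experiments were run in MATLAB; this Python helper and its name eog are ours):

```python
import numpy as np

def eog(im: np.ndarray) -> float:
    """Energy of gradient: sum of squared forward differences, per the equations above."""
    im = im.astype(np.float64)
    fx = im[1:, :-1] - im[:-1, :-1]   # f(x+1, y) - f(x, y), x taken as the row index
    fy = im[:-1, 1:] - im[:-1, :-1]   # f(x, y+1) - f(x, y)
    return float(np.sum(fx ** 2 + fy ** 2))
```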

2.1.2. EOG Guided Gray Distribution Adjustment

Let $S_i = \{S_{bright}, S_{moderate}, S_{dark}\}$ be the set of candidate intervals of $\gamma$, and use the EOG function to evaluate and select the optimal value of $\gamma$ for the image in each interval. Thus, the original image is adjusted to several images with rich details at multiple brightness levels.
Denote the original normalized image as $I_{input}$, and let $S_1$ (bright), $S_2$ (moderate), and $S_3$ (dim) be the three intervals:
$$\gamma_i = \arg\max_{\gamma \in S_i} \mathrm{EOG}\left( I_{input}^{\gamma} \right), \quad i = 1, 2, 3,$$

where $\mathrm{EOG}(\cdot)$ is the energy of gradient of the image.
Through this calculation, the value of $\gamma$ that produces the image with the maximum EOG in each brightness interval is picked out adaptively, and the details in different areas of the image are kept separately.
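A sketch of this interval-wise selection might look as follows, reusing the eog() helper from the previous sketch. The interval endpoints are those given in Section 3.1, while the candidate sampling density n_candidates is our assumption, since the paper does not state how the argmax is searched:

```python
import numpy as np

# Interval endpoints from Section 3.1; the search granularity is assumed.
GAMMA_INTERVALS = [(0.1, 0.7), (0.7, 1.5), (1.5, 8.0)]

def select_gammas(i_input: np.ndarray, n_candidates: int = 20) -> list:
    """Per interval, pick the gamma maximizing EOG(I_input ** gamma).

    i_input is the original image normalized to [0, 1]; eog() is the
    helper defined in the previous sketch.
    """
    gammas = []
    for lo, hi in GAMMA_INTERVALS:
        candidates = np.linspace(lo, hi, n_candidates)
        gammas.append(float(max(candidates, key=lambda g: eog(i_input ** g))))
    return gammas
```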

2.2. Multiscale Guided Filter Enhancement

2.2.1. Multiscale Guided Filter Decomposition

He et al. [12] presented the guided image filter (GF), which is both edge-preserving and computationally efficient; consequently, it is now widely applied in the domain of image processing. We adopt the guided filter to decompose the image, with the guide identical to the filtering input $I$. The critical assumption of the guided filter is a local linear model between the guide image $I$ and the filter output $Q$: $Q$ is a linear transformation of $I$ in a window $\omega_k$ centered at the pixel $k$.
$$Q_i = a_k I_i + b_k, \quad \forall i \in \omega_k$$

where

$$a_k = \frac{\sigma_k^2}{\sigma_k^2 + \varepsilon}$$

$$b_k = \bar{I}_k - a_k \mu_k$$

Here, $|\omega|$ is the number of pixels in $\omega_k$, $\mu_k$ is the mean of $I$ in $\omega_k$, $\sigma_k^2$ is the variance of $I$ in $\omega_k$, and $\bar{I}_k$ is the mean value of $I$ in $\omega_k$ (equal to $\mu_k$ here, since the guide is the input itself).
When the area has rich details, $\sigma_k^2$ is relatively large, $a_k$ approaches 1, and $b_k$ tends to 0, so the guided filter keeps the details in the local area. Conversely, when the area is flat, $\sigma_k^2$ is relatively small, $a_k$ approaches 0, and $b_k$ tends to $\bar{I}_k$, so the guided filter behaves as a weighted mean filter. $\varepsilon$ is a parameter that depends on the image content and determines whether an edge should be preserved.
Therefore, the guided filter behaves as an edge-preserving smoothing operator. For simplicity, we refer to it as $Q = GF(I)$. $Q$ can be regarded as the base layer of the input image $I$: it contains the low-frequency information of $I$ and reflects the intensity changes of the image on a large scale. Meanwhile, $(I - Q)$ can be regarded as the detail layer: it contains the high-frequency information of $I$ and reflects the details of the image on a small scale.
As introduced above, we can obtain a smoothed base layer and a detail layer with the guided filter. In order to obtain more complete details, we apply the guided filter iteratively to obtain multiscale smoothed images, and the multiscale detail images are generated at the same time. The specific procedure is as follows.
$$B_1 = GF(I), \quad D_1 = I - B_1$$

$$B_k = GF(B_{k-1}), \quad D_k = B_{k-1} - B_k, \quad k = 2, 3, \ldots$$
$B_i$ is the $i$-th base layer, and $D_i$ is the $i$-th detail layer. Then, we can decompose the original image as follows.
$$I = B_n + D_1 + D_2 + \cdots + D_n$$
Specifically, in our study and experiments, as shown in Figure 1, we decompose the image into three layers: one base layer and two detail layers. Therefore, the multiscale decomposition can be described as
$$I = B_2 + D_1 + D_2$$
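A compact self-guided filter and the three-layer decomposition can be sketched as follows. This is a minimal illustration, not the authors' implementation: a box filter via SciPy's uniform_filter stands in for the window mean, and the window radius and the concrete eps value are assumptions (the paper ties $\varepsilon$ to the image variance in Section 3.1):

```python
import numpy as np
from scipy.ndimage import uniform_filter

def guided_filter(I: np.ndarray, radius: int, eps: float) -> np.ndarray:
    """Self-guided filter (guide == input) following He et al. [12]."""
    size = 2 * radius + 1
    mean_I = uniform_filter(I, size)                 # mu_k over each window
    var_I = uniform_filter(I * I, size) - mean_I ** 2
    a = var_I / (var_I + eps)                        # a_k = sigma^2 / (sigma^2 + eps)
    b = mean_I - a * mean_I                          # b_k = I_bar - a_k * mu_k
    # Average the coefficients of all windows covering each pixel.
    return uniform_filter(a, size) * I + uniform_filter(b, size)

def decompose(I: np.ndarray, radius: int = 8, eps: float = 1e-2):
    """Three-layer decomposition: I = B2 + D1 + D2."""
    B1 = guided_filter(I, radius, eps)
    D1 = I - B1
    B2 = guided_filter(B1, radius, eps)
    D2 = B1 - B2
    return B2, D1, D2
```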

2.2.2. Adaptive Multiscale Guided Filter Composition

Each image can be decomposed into several layers, but the details in an infrared image are typically weak. As shown in Figure 2, where the input image is one of the EOG-guided transformed images, rich details captured by the HDR infrared sensor are present in the detail layers, but they are too weak to be observed.
The composition of the base layer and detail layers can be described as follows.
$$I_{enhanced} = B_n + \alpha_1 D_1 + \alpha_2 D_2 + \cdots + \alpha_n D_n$$
The layers are linearly accumulated, and the value of each coefficient $\alpha_i$ expresses the importance of the $i$-th detail layer: the more information a layer contains, the larger its $\alpha_i$. In order to choose the value of $\alpha_i$ adaptively, we adopt entropy to evaluate the richness of information in each layer:
$$\mathrm{Entropy}(im) = -\sum_{i} p_i \log p_i$$

where $p_i$ is the probability of gray level $i$ in the image.

$$\alpha_i = \frac{C \cdot \mathrm{Entropy}(D_i)}{\sum_{k=1}^{n} \mathrm{Entropy}(D_k) + z}, \quad i = 1, 2, \ldots, n$$

$C$ is a fixed coefficient, and $z$ is a very small number added to prevent the denominator from being 0.
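An entropy-weighted recomposition could be sketched as below, with C = 7 and z = 0.0001 as reported in Section 3.1; the histogram bin count and the base-2 logarithm are our assumptions:

```python
import numpy as np

def layer_entropy(layer: np.ndarray, bins: int = 256) -> float:
    """Shannon entropy of a layer's gray-level histogram."""
    hist, _ = np.histogram(layer, bins=bins)
    p = hist / hist.sum()
    p = p[p > 0]                       # convention: 0 * log 0 = 0
    return float(-np.sum(p * np.log2(p)))

def compose(base: np.ndarray, details: list, C: float = 7.0, z: float = 1e-4):
    """I_enhanced = B_n + sum_i alpha_i * D_i with entropy-driven alpha_i."""
    total = sum(layer_entropy(D) for D in details) + z
    out = base.copy()
    for D in details:
        out += C * layer_entropy(D) / total * D
    return out
```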
Figure 2 shows the effect of this step: the weak details are enhanced. The weak details in detail layers (b,c) become much clearer in (e,f). Note that the detail-layer figures are stretched 10 times for better visibility; the actual details are much weaker.
The effect of the proposed multiscale guided filter enhancement is also clearly reflected in Figure 3. There are two groups of images: panels (a,c) are results of the proposed method without the multiscale guided filter enhancement, in which the information is ambiguous, while panels (b,d) are the results of the full proposed method, which are much more visually comfortable.

2.3. Image Fusion

Inspired by the research hotspots of infrared and visible image fusion and multi-exposure image fusion, which aim at fusing the details of different images of the same scene into one image with rich information, we fuse the images generated by the previous steps. We adopt a method [29] with clear mathematical principles and high computational efficiency.
Through the steps above, a set of enhanced images at different brightness levels is generated from the original image. We regard these images as multi-exposure images. In order to maintain local details well, we process the images block-wise during fusion. Let $\{i_k^n \mid 1 \le k \le K\}$ be a set of column vectors of $N^2$ dimensions expanded from the blocks at the identical location of the $K$ multi-brightness source images, where $k$ indicates that the block comes from the $k$-th image of the set, $n$ corresponds to the position of the patch in the entire image, $N$ is the side length of the image block, and the elements of each vector are the values of the pixels in the block. In order to express, analyze, and process the features of a block, the vector $i_k^n$ can be decomposed into three components: signal strength $p_k^n$, signal structure $s_k^n$, and mean intensity $\mu_{i_k^n}$. The definitions of the components are as follows.
$$i_k^n = \left\| i_k^n - \mu_{i_k^n} \right\| \cdot \frac{i_k^n - \mu_{i_k^n}}{\left\| i_k^n - \mu_{i_k^n} \right\|} + \mu_{i_k^n} = p_k^n \cdot s_k^n + \mu_{i_k^n}$$
$\mu_{i_k^n}$ is a vector in which all the elements equal the mean value of $i_k^n$, and

$$p_k^n = \left\| i_k^n - \mu_{i_k^n} \right\|$$

$$s_k^n = \frac{i_k^n - \mu_{i_k^n}}{\left\| i_k^n - \mu_{i_k^n} \right\|}$$
Obviously, the contrast of an image block is directly reflected by the signal strength component $p_k^n = \left\| i_k^n - \mu_{i_k^n} \right\|$. Generally speaking, the higher the contrast, the clearer the block or image, although excessive contrast may produce an unrealistic scene. Considering that the input images (blocks) are undistorted, we assume that the block with the largest contrast corresponds to the optimal visibility. Therefore, we choose the highest signal strength of all source image blocks as the signal strength of the fused image block:
$$\hat{p}^n = \max_{1 \le k \le K} p_k^n$$
The structures of the set of image blocks form a series of unit-length vectors $s_k^n = \frac{i_k^n - \mu_{i_k^n}}{\left\| i_k^n - \mu_{i_k^n} \right\|}$ $(1 \le k \le K)$, each pointing in a direction in the vector space. The structure of the fused image block should represent the structures of this series of blocks. Specifically, the relationship between the structure of the fused block and the input blocks is defined in a simple but effective way:
$$\bar{s}^n = \frac{\sum_{k=1}^{K} \left( p_k^n \right)^{\rho} s_k^n}{\sum_{k=1}^{K} \left( p_k^n \right)^{\rho}}$$

$$\hat{s}^n = \frac{\bar{s}^n}{\left\| \bar{s}^n \right\|}$$
The mean intensity of each fused block is defined as
$$\hat{\mu}^n = \frac{\sum_{k=1}^{K} L\left( \mu_k, \mu_{i_k^n} \right) \mu_{i_k^n}}{\sum_{k=1}^{K} L\left( \mu_k, \mu_{i_k^n} \right)}$$
where $L\left( \mu_k, \mu_{i_k^n} \right)$ is a weighting function controlled by the mean value $\mu_k$ of the whole $k$-th image and the mean value $\mu_{i_k^n}$ of the current block in the $k$-th image. $L(\cdot)$ should be relatively large when the block $i_k^n$ is in a well-exposed region, and vice versa. To specify it, we adopt a two-dimensional Gaussian function:
$$L\left( \mu_k, \mu_{i_k^n} \right) = \exp\left[ -\frac{\left( \mu_k - 0.5 \right)^2}{2 \sigma_g^2} - \frac{\left( \mu_{i_k^n} - 0.5 \right)^2}{2 \sigma_l^2} \right]$$
When the signal strength $\hat{p}^n$, signal structure $\hat{s}^n$, and mean intensity $\hat{\mu}^n$ have been computed, the new vector $\hat{i}^n$, i.e., the vector of the fused image block, can be defined and the block reconstructed:
$$\hat{i}^n = \hat{p}^n \cdot \hat{s}^n + \hat{\mu}^n$$
The blocks are taken from the source sequence by a moving window with a fixed stride $D$. The pixels in overlapping blocks are averaged to produce the final output of this step.
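A per-location sketch of this patch fusion, following the structure decomposition of Ma et al. [29], is shown below. The sliding-window extraction with stride D and the averaging of overlaps are omitted, and the squared $\sigma$ terms in the Gaussian weight are our reading of Equation (21):

```python
import numpy as np

def fuse_blocks(blocks: np.ndarray, mu_images: np.ndarray,
                rho: float = 4.0, sigma_g: float = 0.2,
                sigma_l: float = 0.5) -> np.ndarray:
    """Fuse K co-located flattened patches (shape (K, N*N)) into one patch.

    mu_images holds the K global mean intensities of the source images.
    """
    mu = blocks.mean(axis=1, keepdims=True)            # block mean intensities
    centered = blocks - mu
    p = np.linalg.norm(centered, axis=1)               # signal strengths p_k
    s = centered / (p[:, None] + 1e-12)                # unit structures s_k
    p_hat = p.max()                                    # highest contrast wins
    s_bar = (p[:, None] ** rho * s).sum(axis=0) / (p ** rho).sum()
    s_hat = s_bar / (np.linalg.norm(s_bar) + 1e-12)
    L = np.exp(-(mu_images - 0.5) ** 2 / (2 * sigma_g ** 2)
               - (mu[:, 0] - 0.5) ** 2 / (2 * sigma_l ** 2))
    mu_hat = (L * mu[:, 0]).sum() / L.sum()            # weighted mean intensity
    return p_hat * s_hat + mu_hat                      # reconstructed patch
```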

2.4. Outlier Filtering

Generally, there are still some outliers in the image, which are usually the brightest or darkest pixels. Specifically, the maximum or minimum values in the image may be outliers, which affect the result of the dynamic range adjustment. To cope with this problem, we adopt a simple and effective method.
To avoid manual, experience-based selection of the parameters, we assume that there are two outliers in each row, or in each patch of fixed size a. Take an image of size a × b as an example. First, sort every pixel value in the whole image in descending order. Then, pick out the a-th value as the maximum value f_max and the a-th last value as the minimum value f_min. Finally, adjust the image according to these effective values.
$$I_{output} = 255 \cdot \frac{\hat{I} - f_{min}}{f_{max} - f_{min}}$$
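This step amounts to a percentile-style clip before rescaling; a minimal sketch under the stated assumption of a outliers at each end (for an a × b image) follows:

```python
import numpy as np

def clip_outliers_and_rescale(img: np.ndarray, a: int) -> np.ndarray:
    """Discard the a brightest and a darkest pixels, then map to 8 bit."""
    flat = np.sort(img.ravel())
    f_min = flat[a]              # a-th smallest effective value
    f_max = flat[-a - 1]         # a-th largest effective value
    out = 255.0 * (np.clip(img, f_min, f_max) - f_min) / (f_max - f_min)
    return out.astype(np.uint8)
```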

3. Experimental Results

3.1. Experimental Settings

In order to measure the effectiveness and efficiency of the proposed method, multiple 16 bit infrared images selected from typical scenes in the FLIR Thermal Starter Dataset Version 1.3 [31] and the LTIR Dataset Version 1.0 [32] were used for testing. The information of the images, including image size and dynamic range, is listed in Table 1. Meanwhile, four well-established methods (HE [1], CLAHE [5], MSR [23], and Reinhard [24]) and two recent approaches (AHPBC [6] and LEP [14]) were introduced for comparison. For those methods, we selected the parameters as the authors advised or by experience.
In Section 2.1.2 (Equation (4)), the three brightness intervals (bright $S_1$, moderate $S_2$, and dark $S_3$) are set as follows: $S_1 = [0.1, 0.7]$, $S_2 = (0.7, 1.5]$, and $S_3 = (1.5, 8]$.
In Section 2.2.1 (Equation (6)), we set the value of $\varepsilon$ in relation to the variance of the entire image, because $\varepsilon$ determines whether an edge should be preserved. In our experiment, $\varepsilon = \sigma_i / 3$, where $\sigma_i$ is the variance of the entire image.
In Section 2.2.2 (Equation (11)), the values of $\alpha_k$ determine the enhancement of the details. Throughout our experiments, we obtain two detail layers, and in Equation (13), $C = 7$ and $z = 0.0001$.
In Section 2.3 (Equation (18)), $\rho$ determines the contribution of each block to the fused block's structure; obviously, the contribution increases with the strength of the block. Theoretically, any $\rho > 0$ is feasible; we set $\rho = 4$ in our experiment. In Equation (21), $\sigma_g$ and $\sigma_l$ control the spread of the profile along $\mu_k$ and $\mu_{i_k^n}$; we set $\sigma_g = 0.2$ and $\sigma_l = 0.5$, as a smaller value of $\sigma_g$ relative to $\sigma_l$ is important for generating results with a good visual impression. Additionally, we set the block size and the moving window stride to $N = 11$ and $D = 2$, as the authors advised.
The parameters above are adopted throughout the paper for typical infrared images with different characteristics. The results demonstrate that the proposed method is capable of effectively enhancing IR images.

3.2. Visual Comparisons

To compare the effects of the methods intuitively, the enhanced results of the algorithms are given in Figure 4, Figure 5, Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10. We discuss the results in detail in Section 4.

3.3. Quantitative Comparison

Generally, good display performance means high clarity and an even gray level distribution. For quantitative comparison, the Tenengrad [33], entropy, Naturalness Image Quality Evaluator (NIQE) [34], and Perception-based Image Quality Evaluator (PIQE) [35] metrics are introduced; they are widely used in evaluating the quality of an image.
The Tenengrad is written as

$$G_x = \begin{bmatrix} -1 & 0 & 1 \\ -2 & 0 & 2 \\ -1 & 0 & 1 \end{bmatrix}, \quad G_y = \begin{bmatrix} -1 & -2 & -1 \\ 0 & 0 & 0 \\ 1 & 2 & 1 \end{bmatrix}$$

$$S(x, y) = \left| G_x * I(x, y) \right| + \left| G_y * I(x, y) \right|$$

$$Tenengrad = \frac{1}{n} \sum_x \sum_y S(x, y)$$

where $I(x, y)$ denotes the gray value of the pixel $(x, y)$, $*$ denotes convolution, and $n$ is the number of pixels in the image.
The Tenengrad is utilized to reflect the clarity of the whole image. Theoretically, the larger the Tenengrad value, the higher the contrast and the better the visibility of the details of the image. The calculated Tenengrad results are listed in Table 2.
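A direct NumPy/SciPy transcription of this metric might read as follows (our helper for illustration, not the evaluation code used in the paper):

```python
import numpy as np
from scipy.ndimage import convolve

def tenengrad(img: np.ndarray) -> float:
    """Mean of |Gx * I| + |Gy * I| over all pixels (Sobel kernels)."""
    img = img.astype(np.float64)
    gx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=np.float64)
    gy = gx.T                                   # the transposed Sobel kernel
    s = np.abs(convolve(img, gx)) + np.abs(convolve(img, gy))
    return float(s.mean())
```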
An even distribution of the pixel values is another goal of image enhancement, and the entropy of an image is a common way to reflect the pixel value distribution. Theoretically, the larger the entropy, the more evenly the gray levels are distributed. The entropy of an 8 bit image is written as
$$\mathrm{Entropy}(im) = -\sum_{i=0}^{255} p_i \log p_i$$

where $p_i$ is the probability of gray level $i$ in the image. The calculated entropy results are listed in Table 3.
NIQE measures the distance between NSS-based features calculated from the image and the features obtained from an image database used to train the model. The features are modeled as multidimensional Gaussian distributions. We calculate it with the Matlab function niqe(), which returns a non-negative scalar. Theoretically, the lower the NIQE value, the better the perceptual quality of the image. The results are listed in Table 4.
PIQE calculates a no-reference quality score for an image through block-wise distortion estimation. We calculate it with the Matlab function piqe(), which returns a non-negative scalar in the range [0, 100]. The PIQE score is inversely correlated with the perceptual quality of an image: a low PIQE value indicates high perceptual quality, and a high PIQE value indicates low perceptual quality. The results are listed in Table 5.

3.4. Running Time Comparison

In order to compare efficiency, the above-listed algorithms were tested using MATLAB R2018b on a personal computer (Intel Core i5-8250U CPU, 1.60 GHz; memory, 8 GB). The sizes of the tested images are listed in Table 1. The running time results are listed in Table 6.

4. Discussion

Image group Figure 4 is an example of infrared images with rich scene information, including humans, bicycles, benches, the ground, and so on. HE and CLAHE enhance the contrast, but a large amount of local detail is lost. AHPBC and MSR can enhance the details to some extent, but the dynamic range of the result image is so small that visibility is poor. The result of Reinhard is visually comfortable, but some texture information is still ambiguous. LEP enhances the image well in general but generates a halo. Compared with the other six approaches, our method achieves the best performance.
Image groups Figure 5 and Figure 6 are examples of low contrast images containing many texture details. The dynamic range of the original IR image is so narrow that HE and CLAHE fail to enhance the details, some regions in their results are over-enhanced, and some noise is generated. AHPBC, MSR, and Reinhard preserve the global contrast but are relatively weak at enhancing local details. LEP successfully enhances the edges of the humans in the image, but some tiny details, such as the texture of the road, remain dim. Our method yields the best enhancement results, producing global detail enhancement without generating noise.
Image groups Figure 7 and Figure 8 are examples of foggy images. Details such as the outlines of trees and humans are unobservable in the results of HE and CLAHE, and the backgrounds are distorted. Compared with the original linearly mapped image, the results of AHPBC and Reinhard are still blurred, even though the brightness changes greatly. MSR increases the contrast to some extent, but its effect on local detail enhancement is relatively weak. The noise in the result of LEP is obvious. The comparison of the results in Figure 7 and Figure 8 indicates that the proposed method creates the most visually comfortable results, which reveal the details most fully.
Image groups Figure 9 and Figure 10 are examples of images with blurred details. Due to the low contrast and weak details in the original image, HE and CLAHE not only fail to reproduce the details but also generate noise. Objects such as trees, buildings, and pedestrians are blurred in AHPBC's results. The results of MSR and Reinhard are too dark to observe the information. The results of LEP and the proposed method are relatively visually pleasing; compared with the results of LEP, the noise in the proposed results is weaker.
The Tenengrad results for the test images are shown in Table 2. In theory, the higher the Tenengrad value, the clearer the entire image. In accordance with the visual comparisons, the proposed method and LEP achieve higher Tenengrad values.
As reported in Table 3, comparing the entropy, the proposed method and LEP produce robust results, and our method obtains slightly better values than LEP. In practice, there are more details in the results of our proposed method.
As reported in Table 4, a lower NIQE value reflects better perceptual quality. The overall differences among AHPBC, MSR, Reinhard, LEP, and our proposed method are not obvious.
As reported in Table 5, a lower PIQE value reflects better perceptual quality. In general, the proposed method and LEP achieve better results, and the average result of our proposed method is the best.
As reported in Table 6, since our approach introduces multiscale analysis and image fusion, the running time of the proposed algorithm is much longer than that of the conventional and well-known methods HE, CLAHE, MSR, and Reinhard. Our method runs slower than LEP but faster than AHPBC. How to accelerate our algorithm is one of the key points of our future work; hardware acceleration is one option. After optimization, our method is likely to be able to process images in real-time applications.
All in all, the performance of the proposed algorithm is verified by experiments on images with various characteristics. The above analysis of the results shows that the proposed method has strength in the detail enhancement of HDR infrared images. The dynamic range compression and detail enhancement results are visually comfortable without excessively obvious noise.

5. Conclusions

In this paper, a novel high dynamic range infrared image enhancement method is introduced. This method is capable of compressing the dynamic range, adjusting the gray levels, and enhancing the details effectively. The proposed approach is mainly based on adaptive Gamma correction, the multiscale guided filter, and image fusion. First, in order to keep the weak global details in different areas, we adopt an EOG-guided Gamma transformation, which adaptively adjusts the original normalized image into multiple brightness levels. Second, the multiscale guided filter is applied iteratively to decompose each brightness image into a base layer and several detail layers; the details of each image are enhanced separately and composed adaptively. Third, to obtain an image with the global details of the input image, the enhanced images at each brightness level are fused together. Last, we filter out the outliers and adjust the dynamic range before outputting the image. Tested on HDR IR images of different scenes with sundry details and backgrounds, the experimental results indicate that the proposed method can compress the dynamic range while raising the contrast, enhance the details effectively, and generate visually pleasing results. It should be pointed out that, in the guided transformation step, the EOG function was chosen to guarantee the simplicity and correctness of the algorithm; the function could be replaced flexibly according to the case in future work. Meanwhile, the method for enhancing the decomposed layers could also be extended, which provides a new direction for research.

Author Contributions

F.C. proposed the original idea, performed the experiments, and wrote the original manuscript; J.Z. contributed to the direction and content, revised the manuscript, and acquired funding; J.C. administered the project; T.X. revised the manuscript; G.L. contributed to the content; X.P. contributed to the content, revised the manuscript, and administered the project. All authors have read and agreed to the published version of the manuscript.

Funding

We are grateful for the financial support of the National High Technology Research and Development Program of China (863 Program), grant number G158207.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Sanchez-Reillo, R.; Tamer, S.; Lu, G.; Duta, N.; Keck, M. Histogram Equalization; Springer: New York, NY, USA, 2009.
2. Song, Y.F.; Shao, X.P.; Xu, J. New enhancement algorithm for infrared image based on double plateaus histogram. Infrared Laser Eng. 2008, 2, 308–311.
3. Vickers, V.E. Plateau equalization algorithm for real-time display of high-quality infrared imagery. Opt. Eng. 1996, 35, 1921.
4. Wang, B.J. A real-time contrast enhancement algorithm for infrared images based on plateau histogram. Acta Photonica Sin. 2006, 48, 77–82.
5. Zuiderveld, K. Contrast Limited Adaptive Histogram Equalization. In Graphics Gems; Academic Press Professional, Inc.: San Diego, CA, USA, 1994; pp. 474–485.
6. Wan, M.; Gu, G.; Qian, W.; Ren, K.; Chen, Q.; Maldague, X. Infrared Image Enhancement Using Adaptive Histogram Partition and Brightness Correction. Remote Sens. 2018, 10, 682.
7. Liang, K.; Yong, M.; Yue, X.; Bo, Z.; Rui, W. A new adaptive contrast enhancement algorithm for infrared images based on double plateaus histogram equalization. Infrared Phys. Technol. 2012, 55, 309–315.
8. Kim, J.Y.; Kim, L.S.; Hwang, S.H. An advanced contrast enhancement using partially overlapped sub-block histogram equalization. IEEE Trans. Circuits Syst. Video Technol. 2001, 11, 475–484.
9. Fattal, R.; Lischinski, D.; Werman, M. Gradient Domain High Dynamic Range Compression. ACM Trans. Graph. 2002, 21.
10. Zhang, F.; Xie, W.; Ma, G.; Qin, Q. High dynamic range compression and detail enhancement of infrared images in the gradient domain. Infrared Phys. Technol. 2014, 67, 441–454.
11. Durand, F.; Dorsey, J. Fast bilateral filtering for the display of high-dynamic-range images. ACM Trans. Graph. 2002, 21, 257–266.
12. He, K.; Sun, J.; Tang, X. Guided Image Filtering. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 1397–1409.
13. Li, S.; Kang, X.; Hu, J. Image Fusion With Guided Filtering. IEEE Trans. Image Process. 2013, 22, 2864–2875.
14. Gu, B.; Li, W.; Zhu, M.; Wang, M. Local Edge-Preserving Multiscale Decomposition for High Dynamic Range Image Tone Mapping. IEEE Trans. Image Process. 2013, 22, 70–79.
15. Bai, X.; Zhou, F.; Xue, B. Image enhancement using multi scale image features extracted by top-hat transform. Opt. Laser Technol. 2011, 44, 328–336.
16. Bai, X.; Zhou, F.; Xue, B. Infrared image enhancement through contrast enhancement by using multiscale new top-hat transform. Infrared Phys. Technol. 2011, 54, 61–69.
17. Zhan, B.; Wu, Y. Infrared Image Enhancement Based on Wavelet Transformation and Retinex. In Proceedings of the 2010 Second International Conference on Intelligent Human-Machine Systems & Cybernetics, Nanjing, China, 26–28 August 2010.
18. McCann, J.J. Lightness and retinex theory. J. Opt. Soc. Am. 1970, 61, 1–11.
19. Land, E.H. The Retinex Theory of Color Vision. Sci. Am. 1977, 237, 108–129.
20. Rahman, Z.U.; Jobson, D.J.; Woodell, G.A. Retinex processing for automatic image enhancement. J. Electron. Imaging 2004, 13, 100–110.
21. Pu, Y.F.; Zhang, N.; Wang, Z.N.; Wang, J.; Yi, Z.; Wang, Y.; Zhou, J.L. Fractional-Order Retinex for Adaptive Contrast Enhancement of Under-Exposed Traffic Images. IEEE Intell. Transp. Syst. Mag. 2019.
22. Jobson, D.; Rahman, Z. Properties and performance of a center/surround retinex. IEEE Trans. Image Process. 1997, 6, 451–462.
23. Rahman, Z.; Jobson, D.J.; Woodell, G.A. Multi-scale retinex for color image enhancement. In Proceedings of the 3rd IEEE International Conference on Image Processing, Lausanne, Switzerland, 19 September 2002.
24. Reinhard, E.; Stark, M.; Shirley, P.; Ferwerda, J. Photographic tone reproduction for digital images. ACM Trans. Graph. 2002, 21, 267–276.
25. Reinhard, E.; Pouli, T.; Kunkel, T.; Long, B.; Ballestad, A.; Damberg, G. Calibrated image appearance reproduction. ACM Trans. Graph. 2012, 31, 1–11.
26. Abebe, M.A.; Pouli, T.; Larabi, M.C.; Reinhard, E. Perceptual Lightness Modeling for High Dynamic Range Imaging. ACM Trans. Appl. Percept. 2017, 15, 1.
27. Ma, J.; Zhou, Z.; Wang, B.; Zong, H. Infrared and visible image fusion based on visual saliency map and weighted least square optimization. Infrared Phys. Technol. 2017, 82, 8–17.
28. Saeedi, J.; Faez, K. Infrared and visible image fusion using fuzzy logic and population-based optimization. Appl. Soft Comput. 2012, 12, 1041–1054.
29. Ma, K.; Wang, Z. Multi-exposure image fusion: A patch-wise approach. In Proceedings of the 2015 IEEE International Conference on Image Processing, Quebec City, QC, Canada, 27–30 September 2015.
30. Ma, K.; Li, H.; Yong, H.; Wang, Z.; Meng, D.; Zhang, L. Robust Multi-Exposure Image Fusion: A Structural Patch Decomposition Approach. IEEE Trans. Image Process. 2017, 26, 2519–2532.
31. FLIR Thermal Starter Dataset Version 1.3. Available online: https://www.flir.com/oem/adas/adas-dataset-form/ (accessed on 16 August 2019).
32. Berg, A.; Ahlberg, J.; Felsberg, M. A Thermal Object Tracking Benchmark. In Proceedings of the 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Karlsruhe, Germany, 25–28 August 2015.
33. Tenenbaum, J.M. Accommodation in Computer Vision; Computer Science Department, Stanford University: Stanford, CA, USA, 1971.
34. Mittal, A.; Soundararajan, R.; Bovik, A.C. Making a "completely blind" image quality analyzer. IEEE Signal Process. Lett. 2012, 20, 209–212.
35. Venkatanath, N.; Praneeth, D.; Bh, M.C.; Channappayya, S.S.; Medasani, S.S. Blind image quality evaluation using perception based features. In Proceedings of the 2015 Twenty First National Conference on Communications (NCC), Mumbai, India, 27 February–1 March 2015; pp. 1–6.
Figure 1. The proposed framework.
Figure 2. (a) Input Image. (b) Detail Layer 1. (c) Detail Layer 2. (d) Base Layer. (e) Enhanced Detail Layer 1. (f) Enhanced Detail Layer 2.
Figure 3. (a) Result 1 without multiscale guided filter enhancement. (b) Result 1 with multiscale guided filter enhancement. (c) Result 2 without multiscale guided filter enhancement. (d) Result 2 with multiscale guided filter enhancement.
Figure 4. Comparison Results 1 (Image of Complex Scene). (a) Linear Mapping. (b) Histogram equalization (HE). (c) Contrast-limited adaptive histogram equalization (CLAHE). (d) AHPBC. (e) MSR. (f) Reinhard. (g) LEP. (h) Proposed result.
Figure 5. Comparison Results 2 (Low Contrast Image). (a) Linear Mapping. (b) HE. (c) CLAHE. (d) AHPBC. (e) MSR. (f) Reinhard. (g) LEP. (h) Proposed result.
Figure 6. Comparison Results 3 (Low Contrast Image). (a) Linear Mapping. (b) HE. (c) CLAHE. (d) AHPBC. (e) MSR. (f) Reinhard. (g) LEP. (h) Proposed result.
Figure 7. Comparison Results 4 (Foggy Image). (a) Linear Mapping. (b) HE. (c) CLAHE. (d) AHPBC. (e) MSR. (f) Reinhard. (g) LEP. (h) Proposed result.
Figure 8. Comparison Results 5 (Foggy Image). (a) Linear Mapping. (b) HE. (c) CLAHE. (d) AHPBC. (e) MSR. (f) Reinhard. (g) LEP. (h) Proposed result.
Figure 9. Comparison Results 6 (Image with Blurred Details). (a) Linear Mapping. (b) HE. (c) CLAHE. (d) AHPBC. (e) MSR. (f) Reinhard. (g) LEP. (h) Proposed result.
Figure 10. Comparison Results 7 (Image with Blurred Details). (a) Linear Mapping. (b) HE. (c) CLAHE. (d) AHPBC. (e) MSR. (f) Reinhard. (g) LEP. (h) Proposed result.
Table 1. Basic information of the test images.

| Image | Size | Min (Gray Value) | Max (Gray Value) | % of the Total Gray Levels |
|-------|-----------|-------|-------|--------|
| IM1 | 480 × 640 | 0 | 16961 | 25.880 |
| IM2 | 480 × 640 | 9893 | 16339 | 9.835 |
| IM3 | 512 × 640 | 1411 | 3575 | 3.302 |
| IM4 | 512 × 640 | 5683 | 8188 | 3.822 |
| IM5 | 512 × 640 | 6433 | 8653 | 3.387 |
| IM6 | 512 × 640 | 5984 | 11322 | 8.145 |
| IM7 | 512 × 640 | 6010 | 10000 | 6.088 |
Table 2. The Tenengrad values of the images.

| Image | Linear Mapping | HE | CLAHE | AHPBC | MSR | Reinhard | LEP | Proposed |
|-------|--------|---------|---------|---------|--------|---------|---------|---------|
| IM1 | 4.2562 | 18.8087 | 15.7123 | 10.5995 | 4.2137 | 11.1091 | 21.5385 | 22.3971 |
| IM2 | 4.9403 | 12.3800 | 7.4490 | 4.7770 | 4.8471 | 4.7601 | 11.1299 | 18.1054 |
| IM3 | 4.4459 | 8.9382 | 4.2856 | 4.7390 | 4.1821 | 4.2448 | 11.0855 | 18.5358 |
| IM4 | 3.1144 | 4.4115 | 2.0874 | 3.6231 | 3.1346 | 3.1205 | 5.9977 | 7.8913 |
| IM5 | 4.6191 | 5.1433 | 3.9676 | 4.6468 | 4.4070 | 4.6040 | 9.5314 | 14.9446 |
| IM6 | 1.4227 | 7.8863 | 2.9639 | 2.7526 | 1.4158 | 1.6008 | 8.4747 | 6.7918 |
| IM7 | 2.0778 | 8.7989 | 2.9121 | 2.9168 | 2.0738 | 2.2411 | 8.9327 | 6.8261 |
Table 3. The entropy of the images.

| Image | Linear Mapping | HE | CLAHE | AHPBC | MSR | Reinhard | LEP | Proposed |
|-------|--------|--------|--------|--------|--------|--------|--------|--------|
| IM1 | 5.5735 | 3.4553 | 7.0238 | 5.3511 | 5.4562 | 7.3206 | 7.6118 | 7.5752 |
| IM2 | 5.7655 | 2.5542 | 6.5089 | 5.6805 | 6.0352 | 5.6525 | 6.7636 | 7.1654 |
| IM3 | 6.3733 | 2.2374 | 5.1676 | 6.1961 | 6.6940 | 6.2290 | 7.0587 | 7.4246 |
| IM4 | 5.9289 | 1.7727 | 5.1401 | 5.7168 | 6.4312 | 5.9052 | 5.8555 | 6.6476 |
| IM5 | 7.5017 | 2.8809 | 5.8623 | 7.4927 | 7.4540 | 7.5118 | 7.4614 | 7.8229 |
| IM6 | 4.5174 | 1.5771 | 5.2951 | 4.3780 | 4.7277 | 4.6765 | 6.1891 | 6.3499 |
| IM7 | 4.5504 | 1.1825 | 4.5859 | 4.5556 | 4.7223 | 4.6450 | 5.8029 | 5.8444 |
Table 4. The Naturalness Image Quality Evaluator (NIQE) values of the images.

| Image | Linear Mapping | HE | CLAHE | AHPBC | MSR | Reinhard | LEP | Proposed |
|-------|--------|---------|--------|--------|--------|--------|--------|--------|
| IM1 | 2.4030 | 7.6034 | 4.7557 | 2.3148 | 2.3713 | 2.1673 | 2.5625 | 2.8865 |
| IM2 | 3.1617 | 8.7744 | 5.8055 | 3.3352 | 3.2184 | 3.2078 | 3.4816 | 4.0588 |
| IM3 | 3.3359 | 11.5356 | 7.4972 | 3.8602 | 3.3382 | 3.2956 | 3.9761 | 4.3825 |
| IM4 | 7.5140 | 11.9366 | 7.5140 | 4.2840 | 4.8551 | 4.7213 | 4.2182 | 4.8106 |
| IM5 | 3.5719 | 15.0618 | 7.2863 | 3.4961 | 3.6931 | 3.6239 | 3.6369 | 3.4917 |
| IM6 | 4.9581 | 13.3283 | 7.4248 | 4.9065 | 4.8822 | 4.5561 | 3.9335 | 4.0222 |
| IM7 | 3.9773 | 11.7403 | 7.9685 | 3.7186 | 3.9794 | 3.8734 | 3.5809 | 3.5237 |
Table 5. The Perception-based Image Quality Evaluator (PIQE) values of the images.

| Image | Linear Mapping | HE | CLAHE | AHPBC | MSR | Reinhard | LEP | Proposed |
|-------|---------|---------|---------|---------|---------|---------|---------|---------|
| IM1 | 33.7017 | 65.3970 | 59.9335 | 33.4366 | 33.3354 | 21.2885 | 22.8901 | 24.9244 |
| IM2 | 30.8086 | 76.7303 | 74.3567 | 32.4350 | 32.8172 | 34.5472 | 20.3241 | 22.4031 |
| IM3 | 16.0431 | 82.7883 | 79.3746 | 16.5842 | 19.2051 | 18.4810 | 13.8669 | 30.9339 |
| IM4 | 74.8258 | 81.6471 | 74.8258 | 35.7224 | 37.2663 | 39.0461 | 36.4915 | 16.0016 |
| IM5 | 52.9848 | 82.5220 | 80.2671 | 52.7876 | 54.4075 | 49.0024 | 49.4377 | 43.2992 |
| IM6 | 65.9284 | 81.1321 | 77.7412 | 62.9356 | 64.9387 | 58.8325 | 20.6870 | 16.2914 |
| IM7 | 58.0052 | 79.8022 | 74.0387 | 56.8523 | 52.6654 | 54.8697 | 16.8590 | 16.2839 |
Table 6. Running time of the test images. Unit: second.

| Image | HE | CLAHE | AHPBC | MSR | Reinhard | LEP | Proposed |
|-------|--------|--------|---------|--------|--------|--------|--------|
| IM1 | 0.1140 | 0.1999 | 27.7791 | 0.9225 | 0.0257 | 0.8482 | 2.5096 |
| IM2 | 0.1195 | 0.1439 | 26.8293 | 0.1770 | 0.0284 | 1.0943 | 2.4500 |
| IM3 | 0.1111 | 0.1455 | 30.8719 | 0.1070 | 0.2198 | 1.1543 | 2.6872 |
| IM4 | 0.1158 | 0.1387 | 30.6425 | 0.1040 | 0.0202 | 1.1325 | 2.7946 |
| IM5 | 0.1253 | 0.1422 | 40.4624 | 0.0990 | 0.0207 | 1.0314 | 2.6288 |
| IM6 | 0.1112 | 0.1434 | 28.5309 | 0.0981 | 0.0208 | 1.2840 | 2.5608 |
| IM7 | 0.1463 | 0.1869 | 27.4465 | 0.9937 | 0.0203 | 2.1250 | 2.7588 |
