Communication

Low-Light Image Enhancement via Retinex-Style Decomposition of Denoised Deep Image Prior

Xianjie Gao, Mingliang Zhang and Jinming Luo
1 Department of Basic Sciences, Shanxi Agricultural University, Jinzhong 030801, China
2 School of Mathematics and Statistics, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
3 School of Mathematical Sciences, Dalian University of Technology, Dalian 116024, China
* Author to whom correspondence should be addressed.
Sensors 2022, 22(15), 5593; https://doi.org/10.3390/s22155593
Submission received: 8 June 2022 / Revised: 10 July 2022 / Accepted: 24 July 2022 / Published: 26 July 2022
(This article belongs to the Section Sensing and Imaging)

Abstract

Low-light images commonly result from shooting in dim environments or with inadequate camera equipment, and they suffer from shortcomings such as low contrast, color distortion, uneven brightness, and heavy loss of detail. These shortcomings are not only subjectively annoying but also degrade the performance of many computer vision systems. Enhanced low-light images can be better used for image recognition, object detection, and image segmentation. This paper proposes a novel RetinexDIP method to enhance images. Noise is considered as a factor in image decomposition using deep learning generative strategies. The involvement of noise makes the decomposition more realistic, weakens the coupling between the three components, avoids overfitting, and improves generalization. Extensive experiments demonstrate that our method outperforms existing methods qualitatively and quantitatively.

1. Introduction

With the great breakthroughs of deep learning in computer vision, image processing has been widely used in many fields, e.g., face recognition [1], defect detection [2], medical image retrieval [3], traffic information systems [4], and text recognition [5]. Image defects can be attributed to uncontrolled factors such as insufficient or non-uniform lighting during image capture, arising from backlighting, underexposure, and night-time conditions. Low-light images are usually noisy, low-contrast, color-distorted, and of impaired quality. These shortcomings not only result in an unpleasant visual experience but also affect the performance of many computer vision systems, e.g., for image recognition, object detection, and image segmentation.
Image enhancement has a wide range of applications in different fields, e.g., underwater images [6], high-speed railway images [7], and robot vision [8]. In general, there are two ways to improve the image quality. One is to improve the hardware performance of photographic equipment and the other is to process the obtained image. However, the former has disadvantages such as manufacturing difficulties, high cost, and complicated technology. Therefore, in practical applications, improving the quality of low-light images through enhancement algorithms is of great significance. Low-light image enhancement has two main purposes: improving contrast and suppressing noise. The enhanced image is more suitable for human observation and computer vision systems.
Related studies on low-light image enhancement include both conventional methods and deep learning methods. Traditional low-light enhancement methods comprise histogram equalization (HE)-based methods and Retinex-model-based methods. Histogram equalization adjusts contrast using the image histogram (BPDHE [9], DHE [10], histogram modification [11]). HE methods may amplify the contrast of noise while reducing the contrast of useful signals. To address these shortcomings, many improved versions have been proposed, e.g., clipped AHE [12], CLAHE [13], CVC [14], and the contrast enhancement algorithm of [15].
Retinex-model-based methods decompose low-light images into reflectance and illumination components [16]. Given a low-light image S, it can be decomposed as $S = R \odot I$, where S represents the low-light image, R the reflectance, I the illumination map, and $\odot$ the element-wise (dot) product. In addition, many improved versions of the Retinex model have been derived from Retinex theory, including the single-scale Retinex model [17], the multi-scale Retinex model [18], the naturalness-preserved enhancement algorithm [19], the fusion-based enhancing method [20], and illumination map estimation [21]. There are also algorithms based on the variational Retinex model, e.g., the variational Retinex model formulated as a quadratic optimization problem [22], a variational framework for Retinex introducing a bright channel [23], the variational Retinex model based on the $L_2$-norm [24], the hybrid $L_2$-$L_p$ variational model with bright channel prior [25], and the maximum-entropy-based Retinex model [26]. Owing to the computational complexity of variational methods, their main disadvantage is that processing images is time-consuming.
With the development of artificial intelligence, deep learning methods have also been widely used in the field of low-light image enhancement. Lore et al. [27] proposed a method of enhancing natural low-light images using a stacked sparse denoising autoencoder. Tao et al. [28] introduced a CNN method utilizing multi-scale feature maps to perform low-light image enhancement. Ignatov et al. [29] proposed a residual convolutional network that combines composite perceptual error functions of content, color, and texture losses to improve the color and detail sharpness of the image. Shen et al. [30] put forward a convolutional neural network that directly learns the end-to-end mapping between dark and bright images for low-light image enhancement. Gharbi et al. [31] introduced a neural network architecture using input/output image pairs to perform image enhancement in real time on full-resolution images. Wei et al. [32] designed a deep network called Retinex-Net based on the Retinex model, including Decom-Net for decomposition and Enhance-Net for lighting adjustment. Wang et al. [33] proposed a convolutional neural network based on the global prior information generated in the encoder–decoder network to enhance images. Chen et al. [34] presented a fully end-to-end convolutional network for processing low-light images using raw image data. Chen et al. [35] proposed an unpaired learning method for image enhancement based on a bidirectional generative adversarial network (GAN) framework. Zhang et al. [36] constructed an efficient network (KinD) trained on paired images shot under different exposure conditions. Wang et al. [37] proposed a neural network for enhancing underexposed photos by introducing intermediate lighting into the network to correlate the input with the expected enhancement result. Jiang et al. [38] proposed an unsupervised generative adversarial network trained with unpaired images. Yang et al. [39] suggested a semi-supervised learning method for low-light image enhancement based on a deep recursive band network (DRBN). Lv et al. [40] presented an end-to-end lightweight network for non-uniform illumination image enhancement that retains the advantages of the Retinex model and overcomes its limitations. Wang et al. [41] proposed the Deep Lightening Network (DLN), composed of several lightening back-projection (LBP) blocks, to estimate the residuals between low-light and normal-light images. Zhu et al. [42] proposed the Edge-Enhanced Multi-Exposure Fusion Network (EEMEFN), which includes a multi-exposure fusion module and an edge enhancement module to enhance extremely low-light images. Liu et al. [43] obtained a Retinex-inspired Unrolling with Architecture Search (RUAS) model, where a cooperative architecture search was used to discover low-light prior architectures from a compact search space, and reference-free losses were used to train the network. Li et al. [44] presented a progressive–recursive image enhancement network (PRIEN) that uses a recursive unit to progressively enhance the input image. Zhang et al. [45] learned temporal consistency for low-light video enhancement from single images. Fu et al. [46] suggested a novel unsupervised low-light image enhancement network (LE-GAN) based on generative adversarial networks using unpaired low-light/normal-light images for training.
Zhao et al. [47] proposed a unified deep zero-reference framework termed RetinexDIP for enhancing low-light images; however, noise was not considered in the decomposition process. Liu et al. [48] proposed the Retinex-based fast algorithm (RBFA) to achieve low-light image enhancement. Liang et al. [49] proposed a low-light image enhancement model based on deep learning. Li et al. [50] presented a low-light image enhancement method based on a deep symmetric encoder–decoder convolutional network. Han et al. [51] proposed a DIP-based noise-robust super-resolution method. Ai and Kwon [52] used an attention U-Net for extreme low-light image enhancement. Zhao et al. [53] proposed a multi-path interaction network to improve image quality.
In this paper, we propose a novel RetinexDIP method to enhance images. Noise components are introduced into our network, and three components are generated by DIP networks. The involvement of noise makes the decomposition more realistic, weakens the coupling between the three components, avoids overfitting, and improves generalization. The illumination map is obtained by iterating and adjusting the input noise, and the enhanced image is then generated based on the Retinex model. Our training process is zero-reference and does not require any paired or even unpaired data (cf. EnlightenGAN [38], CycleGAN [54], Zero-DCE [55]). The proposed RetinexDIP method can be applied to various poorly lit environments and generalizes well. The loss function in this paper is composed of four parts: the reconstruction loss, illumination-consistency loss, reflectance loss, and illumination smoothness loss. The experimental results show that the normal-light images generated by our method are natural and clear, and that the method performs excellently according to both visual observation and objective evaluation indicators. The main contributions of this paper are as follows:
  • We propose a novel noise-added RetinexDIP method to enhance images.
  • Three components are generated by the DIP network.
  • The zero-reference process avoids the risk of overfitting and improves generalization.
  • The experimental results show that our method significantly outperforms some current state-of-the-art methods.
The rest of the paper is organized as follows. Section 2 details our proposed approach. Section 3 presents the experimental results, and the last section concludes the paper.

2. Materials and Methods

Given a low-light image S and considering noise, the image can be decomposed into:
$S = R \odot I + N$
or
$S = (R + N) \odot I$,
where S represents the low-light image, R represents the reflectance, I represents the illumination map, N denotes the noise, and $\odot$ represents the dot product operation. Adding hand-crafted priors to the components makes them more strongly coupled. The Deep Image Prior (DIP) means that complex prior knowledge does not need to be introduced explicitly, as it can be encoded in the structure of the neural network itself [56]. In practical problems, it is difficult to obtain pairs of low-light and normal-light images. Therefore, generative models are becoming increasingly important.
In this paper, we implement image decomposition based on Retinex theory and generative strategies, taking the noise factor into account. The overall framework of this method is shown in Figure 1. As can be seen from Figure 1, there are three encoder–decoder networks (DIP1, DIP2, and DIP3) in the model, all of which are purely convolutional. DIP1 is used to generate the noise N, while DIP2 and DIP3 are used to generate the reflectance R and the latent illumination I. All three DIP networks take white Gaussian noise as input, with $z_{1,2,3} \sim \mathcal{N}(0, \sigma^2)$, where $\sigma^2$ denotes the variance of the Gaussian distribution. The noise inputs are obtained via random sampling and have the same spatial size as the image.
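As a rough illustration of this setup, the sketch below builds three small convolutional encoder–decoder generators and their fixed Gaussian-noise inputs in PyTorch; the specific layer configuration, channel counts, the value of `sigma`, the single-channel illumination output, and names such as `make_dip` are assumptions, since those details are not fixed by the text above.

```python
import torch
import torch.nn as nn

def make_dip(in_ch=3, out_ch=3, width=32, final=None):
    # A small convolutional encoder-decoder standing in for DIP1/DIP2/DIP3.
    # Assumes the input height and width are divisible by 4.
    layers = [
        nn.Conv2d(in_ch, width, 3, stride=2, padding=1), nn.ReLU(),
        nn.Conv2d(width, width, 3, stride=2, padding=1), nn.ReLU(),
        nn.Upsample(scale_factor=2, mode='bilinear', align_corners=False),
        nn.Conv2d(width, width, 3, padding=1), nn.ReLU(),
        nn.Upsample(scale_factor=2, mode='bilinear', align_corners=False),
        nn.Conv2d(width, out_ch, 3, padding=1),
    ]
    if final is not None:
        layers.append(final)
    return nn.Sequential(*layers)

# Three DIP networks: DIP1 -> noise N, DIP2 -> reflectance R, DIP3 -> illumination I.
dip_n = make_dip()                                   # unbounded noise output
dip_r = make_dip(final=nn.Sigmoid())                 # reflectance in [0, 1]
dip_i = make_dip(out_ch=1, final=nn.Sigmoid())       # single-channel illumination in [0, 1]

# Fixed white-Gaussian-noise inputs z1, z2, z3 ~ N(0, sigma^2), same spatial size as the image.
sigma = 0.1                     # assumed standard deviation of the input noise
h, w = 480, 640                 # example image size
z1 = sigma * torch.randn(1, 3, h, w)
z2 = sigma * torch.randn(1, 3, h, w)
z3 = sigma * torch.randn(1, 3, h, w)

N, R, I = dip_n(z1), dip_r(z2), dip_i(z3)   # generated noise, reflectance, illumination
```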
To assess the quality of the enhanced images, the following four losses are employed to train our model.
Reconstruction Loss. The reconstruction loss is defined according to the following form:
$l_{rec} = \| g_I(z_3) \odot (g_R(z_2) + g_N(z_1)) - S_0 \|_2^2$,
where N is the noise generated by DIP1 (denoted by $g_N$), R is the latent reflectance generated by DIP2 (denoted by $g_R$), I is the illumination generated by DIP3 (denoted by $g_I$), and $S_0$ is the observed image.
Illumination-consistency Loss. As in [47], we also consider the illumination-consistency loss, which is defined as
$l_{ic} = \| g_I(z_3) - I_0 \|_1$,
where $I_0$ is the initial illumination obtained by
$I_0(p) = \max_{c \in \{R, G, B\}} S_c(p)$
for every pixel p.
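For instance, the initial illumination can be obtained with a channel-wise maximum as in the following sketch; the (3, H, W) tensor layout with values in [0, 1] is an assumption.

```python
import torch

def initial_illumination(s0: torch.Tensor) -> torch.Tensor:
    """Compute I_0(p) = max over c in {R, G, B} of S_c(p).

    s0: observed low-light image as a (3, H, W) tensor in [0, 1].
    Returns a (1, H, W) illumination map.
    """
    return s0.max(dim=0, keepdim=True).values

# Example with a random stand-in for the observed image S_0.
s0 = torch.rand(3, 480, 640)
i0 = initial_illumination(s0)   # shape (1, 480, 640)
```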
Reflectance Loss. In this paper, the reflectance R is considered, and the total variation (TV) constraint [57] is defined as
$l_{ref} = \| \nabla g_R(z_2) \|_1$,
where $\nabla$ denotes the first-order difference operator with a horizontal component $\nabla_h$ and a vertical component $\nabla_v$.
Illumination Smoothness Loss. We also use the illumination reflection gradient-weighted TV constraint, defined as
$l_{is} = \| W \odot \nabla g_I(z_3) \|_1$,
where W is the weight matrix. According to the weight strategy in [21], it is set via:
$W_{h,v} = \dfrac{1}{|\nabla_{h,v} I_0| + \epsilon}$,
where $\epsilon$ is a small positive constant that prevents the denominator from being zero.
Combining the four losses, we minimize the objective function as follows:
$\arg\min_{I, R, N} \; \| g_I(z_3) \odot (g_R(z_2) + g_N(z_1)) - S_0 \|_2^2 + \lambda_1 \| g_I(z_3) - I_0 \|_1 + \lambda_2 \| \nabla g_R(z_2) \|_1 + \lambda_3 \| W \odot \nabla g_I(z_3) \|_1$,
where $\lambda_1$, $\lambda_2$, and $\lambda_3$ are the balance parameters.
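To make the objective concrete, the sketch below implements the four losses and their weighted combination in PyTorch; the finite-difference gradient, the reduction by summation, the constant `eps`, and the helper names (`grad_hv`, `total_loss`) are assumptions rather than details given in the paper, while the default balance parameters follow the values reported in Section 3.1.

```python
import torch
import torch.nn.functional as F

def grad_hv(x):
    # First-order horizontal and vertical differences (zero-padded at the border).
    dh = F.pad(x[..., :, 1:] - x[..., :, :-1], (0, 1, 0, 0))
    dv = F.pad(x[..., 1:, :] - x[..., :-1, :], (0, 0, 0, 1))
    return dh, dv

def total_loss(I, R, N, s0, i0, lam1=1.0, lam2=1e-4, lam3=0.5, eps=1e-4):
    """I, R, N: outputs of DIP3, DIP2, DIP1; s0: observed image; i0: initial illumination.

    All tensors are assumed to be (B, C, H, W), with I and i0 single-channel.
    """
    # Reconstruction loss: || I * (R + N) - S_0 ||_2^2
    l_rec = ((I * (R + N) - s0) ** 2).sum()

    # Illumination-consistency loss: || I - I_0 ||_1
    l_ic = (I - i0).abs().sum()

    # Reflectance loss: total variation of R
    rh, rv = grad_hv(R)
    l_ref = rh.abs().sum() + rv.abs().sum()

    # Illumination smoothness loss: gradient-weighted TV of I,
    # with weights W_{h,v} = 1 / (|grad_{h,v} I_0| + eps).
    ih, iv = grad_hv(I)
    g0h, g0v = grad_hv(i0)
    l_is = (ih.abs() / (g0h.abs() + eps)).sum() + (iv.abs() / (g0v.abs() + eps)).sum()

    return l_rec + lam1 * l_ic + lam2 * l_ref + lam3 * l_is
```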
The image S is then recomposed from the noise N, the reflectance R, and the latent illumination I:
$S = (R + N) \odot I$,
or
$S = g_I(z_3) \odot (g_R(z_2) + g_N(z_1))$.
Next, enhancement using only the estimated illumination is described. There are two commonly used composition strategies: one removes the illumination component and takes the reflectance as the enhancement result, i.e., $\hat{S} = S / I$; the other adjusts the illumination and reconstructs the result with the reflectance, i.e., $\hat{S} = \hat{I} \odot R$. In this paper, we use a variant of the former strategy, namely $\hat{S} = S / \hat{I}$ (refer to [21] for details).
We adjust the illumination obtained from the decomposition using the gamma correction $\hat{I} = I^{\gamma}$, where $\gamma$ is the correction factor. To sum up, the enhanced result is given by:
$\hat{S}_c = S_c / \hat{I}, \quad c \in \{R, G, B\}$.
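A minimal sketch of this enhancement step is given below; the clamping and the small constant added to avoid division by zero are assumptions not stated in the text.

```python
import torch

def enhance(s0: torch.Tensor, illumination: torch.Tensor, gamma: float = 0.5,
            eps: float = 1e-4) -> torch.Tensor:
    """Gamma-correct the estimated illumination and divide it out channel-wise.

    s0: observed image, (3, H, W) in [0, 1];
    illumination: estimated illumination I, (1, H, W) in [0, 1].
    """
    i_hat = illumination.clamp(min=eps) ** gamma      # I_hat = I^gamma
    return (s0 / i_hat).clamp(0.0, 1.0)               # S_hat_c = S_c / I_hat

# Example usage with random stand-ins for S and I.
s0 = torch.rand(3, 480, 640)
illum = torch.rand(1, 480, 640)
s_hat = enhance(s0, illum, gamma=0.5)
```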
The whole operation process is shown in Algorithm 1.
Algorithm 1: our algorithm
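Since Algorithm 1 is presented as a figure, the following PyTorch-style sketch only illustrates the loop it describes (optimize the three DIP networks on a single image, then gamma-correct the estimated illumination and divide it out). It reuses `make_dip`, `total_loss`, and the enhancement step from the earlier sketches; the Adam optimizer, learning rate, and the function name `enhance_low_light` are assumptions, while K = 300 and γ = 0.5 follow Section 3.1.

```python
import torch

# Reuses make_dip and total_loss from the sketches above.

def enhance_low_light(s0, K=300, gamma=0.5, lr=1e-3, sigma=0.1):
    """Zero-reference enhancement of a single image s0 of shape (1, 3, H, W) in [0, 1].

    Assumes H and W are divisible by 4 (see make_dip above).
    """
    _, _, h, w = s0.shape
    i0 = s0.max(dim=1, keepdim=True).values             # initial illumination I_0

    dip_n = make_dip()                                   # DIP1 -> noise N
    dip_r = make_dip(final=torch.nn.Sigmoid())           # DIP2 -> reflectance R
    dip_i = make_dip(out_ch=1, final=torch.nn.Sigmoid()) # DIP3 -> illumination I

    z1, z2, z3 = (sigma * torch.randn(1, 3, h, w) for _ in range(3))
    params = list(dip_n.parameters()) + list(dip_r.parameters()) + list(dip_i.parameters())
    opt = torch.optim.Adam(params, lr=lr)

    for _ in range(K):                                   # K = 300 iterations (Section 3.1)
        opt.zero_grad()
        N, R, I = dip_n(z1), dip_r(z2), dip_i(z3)
        loss = total_loss(I, R, N, s0, i0)
        loss.backward()
        opt.step()
        # The paper also mentions adjusting the input noise during iteration;
        # a possible reading is a small per-iteration perturbation of z1, z2, z3.

    with torch.no_grad():
        I = dip_i(z3)
        i_hat = I.clamp(min=1e-4) ** gamma               # gamma-corrected illumination
        return (s0 / i_hat).clamp(0.0, 1.0)              # enhanced image S_hat
```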

3. Experiment

In this section, the experimental parameter settings, public low-light image datasets, and performance metrics are introduced. The results of our approach are also compared with those of other methods.

3.1. Settings

We implement our framework using PyTorch on an NVIDIA 2080Ti GPU. The experimental parameters were set as follows: $\lambda_1 = 1$, $\lambda_2 = 0.0001$, $\lambda_3 = 0.5$, $\delta = 0.01$, $\gamma = 0.5$, and $K = 300$. We use six public low-light image datasets for the experiments, including DICM [15], Fusion [58], LIME [21], MEF [59], NPE [19], and VV (https://sites.google.com/site/vonikakis/datasets (accessed on 1 June 2022)).

3.2. Performance Criteria

In this paper, we assess the experimental results using both visual observation and objective evaluation indicators. The following evaluation indicators were used.
Natural Image Quality Evaluator (NIQE). NIQE constructs a set of quality-aware features and fits them with a multivariate Gaussian model. During evaluation, the distance between the feature-model parameters of the image to be evaluated and those of the pre-established model determines the image quality. A lower NIQE score indicates better preservation of naturalness. For details, refer to [60].
No-reference Image Quality Metric for Contrast Distortion (NIQMC). NIQMC is defined as a simple linear fusion of global and local quality measures [61]. A higher NIQMC score represents better image contrast.
Colorfulness-Based Patch-Based Contrast Quality Index (CPCQI). CPCQI is a colorfulness-based extension of the PCQI metric that evaluates the enhancement between the input and the enhanced output in terms of mean intensity, signal strength, and signal structure components [62]. A larger CPCQI value indicates a higher contrast ratio.

3.3. Results

In this section, we show the effectiveness of the proposed method. We compare it with six other methods, i.e., LIME [21], NPE [19], SRIE [63], KinD [36], Zero-DCE [55], and RetinexDIP [47].
The step-by-step results of our method are shown in Figure 2.
First, we evaluate the different methods qualitatively. As shown in Figure 3, Figure 4, Figure 5 and Figure 6, we select local regions and zoom in on them for intuitive comparison with other methods. The following conclusions can be drawn from the observation of Figure 3. The enhancement effect of the NPE, SRIE, and KinD methods is not obvious. The LIME and RetinexDIP methods produce over-enhancement effects in these regions. The processing result of Zero-DCE has unnatural color. Our method yields natural exposure and clear details. In Figure 4, it can be seen that our method enhances the image and the edges are clearly visible. The result of KinD has an unnatural color. By considering Figure 5, it can be seen that the method proposed in this paper does not have the problems of overexposure and artifacts when improving the contrast. From Figure 6, it can also be concluded that our method improves the contrast effectively and maintains the natural color at the same time.
In the following, we compare the proposed method with other methods quantitatively. The red, green, and blue scores represent the top three in the corresponding dataset, respectively. Table 1 presents the NIQE metrics of different methods on the six datasets. Notably, a lower NIQE score indicates better preservation of naturalness. Our method achieves the best results on the MEF and VV datasets and the second-best results on the average of the six datasets and LIME. Table 2 presents the NIQMC metrics of the different methods on the six datasets. A higher NIQMC score represents better image contrast. Our method is in the top three for DICM, LIME, MEF, NPE, VV, and the average of the six datasets. Table 3 presents the CPCQI of the different methods on the six datasets. A larger CPCQI value indicates a higher contrast ratio. Our method achieves the best results on DICM, Fusion, NPE, and the average of the six datasets, and also performs well on several other datasets.
Table 4 compares the runtimes of the different methods. In the experiment, we compare the runtimes of three traditional methods (LIME, NPE, SRIE) and three deep learning methods (KinD, Zero-DCE, RetinexDIP) with that of our method for eight different input image sizes. Compared with NPE, SRIE, and RetinexDIP, our method is more efficient on high-resolution images. Unlike traditional methods such as NPE and SRIE, the proposed method uses DIP networks to compute the reflectance and illumination. Benefiting from the convolutional structure, the runtime of the DIP model changes very little as the image resolution grows. Compared with RetinexDIP, the proposed method converges faster and requires less runtime owing to the consideration of noise. Compared with Zero-DCE and KinD, our method can also save memory, since Zero-DCE and KinD are pixel-wise methods, whereas the proposed method is based on Retinex decomposition. The proposed method does not need to operate at the full resolution of the image, so its memory footprint does not increase significantly as the image resolution grows.

4. Conclusions

In this paper, we propose a novel low-light image enhancement method via Retinex-style decomposition with a denoised Deep Image Prior. Noise is considered in the image decomposition using deep learning generative strategies. For comparison, we also consider six other methods, i.e., LIME, NPE, SRIE, KinD, Zero-DCE, and RetinexDIP. Extensive experiments demonstrate that our method outperforms existing methods qualitatively and quantitatively. Unlike some other learning-based methods, the proposed method is a no-reference method, meaning that only the input images are required, without any extra data. By taking the noise in the reflectance into consideration, our experiments show that the denoised Deep Image Prior can produce images with less noise.
In real scenes, noise typically follows a scene-dependent distribution such as the Poisson distribution. In future work, other approaches such as normalizing flows will be considered to simulate a more realistic noise distribution than that assumed by DIP.

Author Contributions

Conceptualization, X.G. and J.L.; methodology, X.G. and J.L.; validation, X.G., M.Z., and J.L.; resources, X.G.; writing—original draft preparation, X.G.; writing—review and editing, M.Z. and J.L.; supervision, M.Z. and J.L.; project administration, X.G.; funding acquisition, X.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Science and Technology Innovation Fund Project of Shanxi Agricultural University (2021BQ10); the National Natural Science Foundation of China (12101378); the Scientific Research for Excellent Doctors Project, Shanxi Province, China (SXBYKY2021046); and the Shanxi Provincial Research Foundation for Basic Research, China (20210302124548).

Data Availability Statement

Not applicable.

Acknowledgments

We are grateful to the anonymous reviewers and the editors for their valuable comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hu, G.; Yang, Y.; Yi, D.; Kittler, J.; Christmas, W.; Li, S.Z.; Hospedales, T. When face recognition meets with deep learning: An evaluation of convolutional neural networks for face recognition. In Proceedings of the IEEE International Conference on Computer Vision Workshops, Santiago, Chile, 7–13 December 2015; pp. 142–150. [Google Scholar]
  2. Koch, C.; Georgieva, K.; Kasireddy, V.; Akinci, B.; Fieguth, P. A review on computer vision based defect detection and condition assessment of concrete and asphalt civil infrastructure. Adv. Eng. Inform. 2015, 29, 196–210. [Google Scholar] [CrossRef] [Green Version]
  3. Qayyum, A.; Anwar, S.M.; Awais, M.; Majid, M. Medical image retrieval using deep convolutional neural network. Neurocomputing 2017, 266, 8–20. [Google Scholar] [CrossRef] [Green Version]
  4. Buch, N.; Velastin, S.A.; Orwell, J. A review of computer vision techniques for the analysis of urban traffic. IEEE Trans. Intell. Transp. Syst. 2011, 12, 920–939. [Google Scholar] [CrossRef]
  5. Cheng, Z.; Bai, F.; Xu, Y.; Zheng, G.; Pu, S.; Zhou, S. Focusing attention: Towards accurate text recognition in natural images. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 5076–5084. [Google Scholar]
  6. Ancuti, C.; Ancuti, C.O.; Haber, T.; Bekaert, P. Enhancing underwater images and videos by fusion. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 81–88. [Google Scholar]
  7. Lyu, G.; Huang, H.; Yin, H.; Luo, S.; Jiang, X. A novel visual perception enhancement algorithm for high-speed railway in the low light condition. In Proceedings of the 2014 12th International Conference on Signal Processing (ICSP), Hangzhou, China, 19–23 October 2014; pp. 1022–1025. [Google Scholar]
  8. Cho, Y.; Jeong, J.; Kim, A. Model-assisted multiband fusion for single image enhancement and applications to robot vision. IEEE Robot. Autom. Lett. 2018, 3, 2822–2829. [Google Scholar]
  9. Ibrahim, H.; Kong, N.S.P. Brightness preserving dynamic histogram equalization for image contrast enhancement. IEEE Trans. Consum. Electron. 2007, 53, 1752–1758. [Google Scholar] [CrossRef]
  10. Abdullah-Al-Wadud, M.; Kabir, M.H.; Dewan, M.A.A.; Chae, O. A dynamic histogram equalization for image contrast enhancement. IEEE Trans. Consum. Electron. 2007, 53, 593–600. [Google Scholar] [CrossRef]
  11. Arici, T.; Dikbas, S.; Altunbasak, Y. A histogram modification framework and its application for image contrast enhancement. IEEE Trans. Image Process. 2009, 18, 1921–1935. [Google Scholar] [CrossRef]
  12. Pizer, S.M.; Amburn, E.P.; Austin, J.D.; Cromartie, R.; Geselowitz, A.; Greer, T.; ter Haar Romeny, B.; Zimmerman, J.B.; Zuiderveld, K. Adaptive histogram equalization and its variations. Comput. Vision, Graph. Image Process. 1987, 39, 355–368. [Google Scholar] [CrossRef]
  13. Pisano, E.D.; Zong, S.; Hemminger, B.M.; DeLuca, M.; Johnston, R.E.; Muller, K.; Braeuning, M.P.; Pizer, S.M. Contrast limited adaptive histogram equalization image processing to improve the detection of simulated spiculations in dense mammograms. J. Digit. Imaging 1998, 11, 193–200. [Google Scholar] [CrossRef] [Green Version]
  14. Celik, T.; Tjahjadi, T. Contextual and variational contrast enhancement. IEEE Trans. Image Process. 2011, 20, 3431–3441. [Google Scholar] [CrossRef] [Green Version]
  15. Lee, C.; Lee, C.; Kim, C.S. Contrast enhancement based on layered difference representation of 2D histograms. IEEE Trans. Image Process. 2013, 22, 5372–5384. [Google Scholar] [CrossRef] [PubMed]
  16. Land, E.H. The retinex theory of color vision. Sci. Am. 1977, 237, 108–129. [Google Scholar] [CrossRef] [PubMed]
  17. Jobson, D.J.; Rahman, Z.U.; Woodell, G.A. Properties and performance of a center/surround retinex. IEEE Trans. Image Process. 1997, 6, 451–462. [Google Scholar] [CrossRef]
  18. Jobson, D.J.; Rahman, Z.U.; Woodell, G.A. A multiscale retinex for bridging the gap between color images and the human observation of scenes. IEEE Trans. Image Process. 1997, 6, 965–976. [Google Scholar] [CrossRef] [Green Version]
  19. Wang, S.; Zheng, J.; Hu, H.M.; Li, B. Naturalness preserved enhancement algorithm for non-uniform illumination images. IEEE Trans. Image Process. 2013, 22, 3538–3548. [Google Scholar] [CrossRef] [PubMed]
  20. Fu, X.; Zeng, D.; Huang, Y.; Liao, Y.; Ding, X.; Paisley, J. A fusion-based enhancing method for weakly illuminated images. Signal Process. 2016, 129, 82–96. [Google Scholar] [CrossRef]
  21. Guo, X.; Li, Y.; Ling, H. LIME: Low-light image enhancement via illumination map estimation. IEEE Trans. Image Process. 2016, 26, 982–993. [Google Scholar] [CrossRef] [PubMed]
  22. Kimmel, R.; Elad, M.; Shaked, D.; Keshet, R.; Sobel, I. A variational framework for retinex. Int. J. Comput. Vis. 2003, 52, 7–23. [Google Scholar] [CrossRef]
  23. Fu, X.; Zeng, D.; Huang, Y.; Ding, X.; Zhang, X.P. A variational framework for single low light image enhancement using bright channel prior. In Proceedings of the 2013 IEEE Global Conference on Signal and Information Processing, Austin, TX, USA, 3–5 December 2013; pp. 1085–1088. [Google Scholar]
  24. Park, S.; Yu, S.; Moon, B.; Ko, S.; Paik, J. Low-light image enhancement using variational optimization-based retinex model. IEEE Trans. Consum. Electron. 2017, 63, 178–184. [Google Scholar] [CrossRef]
  25. Fu, G.; Duan, L.; Xiao, C. A hybrid L2-Lp variational model for single low-light image enhancement with bright channel prior. In Proceedings of the 2019 IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan, 22–25 September 2019; pp. 1925–1929. [Google Scholar]
  26. Zhang, Y.; Di, X.; Zhang, B.; Wang, C. Self-supervised image enhancement network: Training with low light images only. arXiv 2020, arXiv:2002.11300. [Google Scholar]
  27. Lore, K.G.; Akintayo, A.; Sarkar, S. LLNet: A deep autoencoder approach to natural low-light image enhancement. Pattern Recognit. 2017, 61, 650–662. [Google Scholar] [CrossRef] [Green Version]
  28. Tao, L.; Zhu, C.; Xiang, G.; Li, Y.; Jia, H.; Xie, X. LLCNN: A convolutional neural network for low-light image enhancement. In Proceedings of the 2017 IEEE Visual Communications and Image Processing (VCIP), St. Petersburg, FL, USA, 10–13 December 2017; pp. 1–4. [Google Scholar]
  29. Ignatov, A.; Kobyshev, N.; Timofte, R.; Vanhoey, K.; Van Gool, L. Dslr-quality photos on mobile devices with deep convolutional networks. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 3277–3285. [Google Scholar]
  30. Shen, L.; Yue, Z.; Feng, F.; Chen, Q.; Liu, S.; Ma, J. Msr-net: Low-light image enhancement using deep convolutional network. arXiv 2017, arXiv:1711.02488. [Google Scholar]
  31. Gharbi, M.; Chen, J.; Barron, J.T.; Hasinoff, S.W.; Durand, F. Deep bilateral learning for real-time image enhancement. ACM Trans. Graph. 2017, 36, 1–12. [Google Scholar] [CrossRef]
  32. Wei, C.; Wang, W.; Yang, W.; Liu, J. Deep retinex decomposition for low-light enhancement. arXiv 2018, arXiv:1808.04560. [Google Scholar]
  33. Wang, W.; Wei, C.; Yang, W.; Liu, J. Gladnet: Low-light enhancement network with global awareness. In Proceedings of the 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), Xi’an, China, 15–19 May 2018; pp. 751–755. [Google Scholar]
  34. Chen, C.; Chen, Q.; Xu, J.; Koltun, V. Learning to see in the dark. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 3291–3300. [Google Scholar]
  35. Chen, Y.S.; Wang, Y.C.; Kao, M.H.; Chuang, Y.Y. Deep photo enhancer: Unpaired learning for image enhancement from photographs with gans. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 6306–6314. [Google Scholar]
  36. Zhang, Y.; Zhang, J.; Guo, X. Kindling the darkness: A practical low-light image enhancer. In Proceedings of the 27th ACM International Conference on Multimedia, Nice, France, 21–25 October 2019; pp. 1632–1640. [Google Scholar]
  37. Wang, R.; Zhang, Q.; Fu, C.W.; Shen, X.; Zheng, W.S.; Jia, J. Underexposed photo enhancement using deep illumination estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 6849–6857. [Google Scholar]
  38. Jiang, Y.; Gong, X.; Liu, D.; Cheng, Y.; Fang, C.; Shen, X.; Yang, J.; Zhou, P.; Wang, Z. Enlightengan: Deep light enhancement without paired supervision. IEEE Trans. Image Process. 2021, 30, 2340–2349. [Google Scholar] [CrossRef]
  39. Yang, W.; Wang, S.; Fang, Y.; Wang, Y.; Liu, J. From fidelity to perceptual quality: A semi-supervised approach for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 3063–3072. [Google Scholar]
  40. Lv, F.; Liu, B.; Lu, F. Fast enhancement for non-uniform illumination images using light-weight CNNs. In Proceedings of the 28th ACM International Conference on Multimedia, Seattle, WA, USA, 12–16 October 2020; pp. 1450–1458. [Google Scholar]
  41. Wang, L.W.; Liu, Z.S.; Siu, W.C.; Lun, D.P. Lightening network for low-light image enhancement. IEEE Trans. Image Process. 2020, 29, 7984–7996. [Google Scholar] [CrossRef]
  42. Zhu, M.; Pan, P.; Chen, W.; Yang, Y. Eemefn: Low-light image enhancement via edge-enhanced multi-exposure fusion network. Proc. Aaai Conf. Artif. Intell. 2020, 34, 13106–13113. [Google Scholar] [CrossRef]
  43. Liu, R.; Ma, L.; Zhang, J.; Fan, X.; Luo, Z. Retinex-inspired unrolling with cooperative prior architecture search for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 10561–10570. [Google Scholar]
  44. Li, J.; Feng, X.; Hua, Z. Low-light image enhancement via progressive-recursive network. IEEE Trans. Circ. Syst. Video Technol. 2021, 31, 4227–4240. [Google Scholar] [CrossRef]
  45. Zhang, F.; Li, Y.; You, S.; Fu, Y. Learning temporal consistency for low light video enhancement from single images. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 4967–4976. [Google Scholar]
  46. Fu, Y.; Hong, Y.; Chen, L.; You, S. LE-GAN: Unsupervised low-light image enhancement network using attention module and identity invariant loss. Knowl. Based Syst. 2022, 240, 108010. [Google Scholar] [CrossRef]
  47. Zhao, Z.; Xiong, B.; Wang, L.; Ou, Q.; Yu, L.; Kuang, F. Retinexdip: A unified deep framework for low-light image enhancement. IEEE Trans. Circ. Syst. Video Technol. 2021, 32, 1076–1088. [Google Scholar] [CrossRef]
  48. Liu, S.; Long, W.; He, L.; Li, Y.; Ding, W. Retinex-based fast algorithm for low-light image enhancement. Entropy 2021, 23, 746. [Google Scholar] [CrossRef] [PubMed]
  49. Liang, H.; Yu, A.; Shao, M.; Tian, Y. Multi-Feature Guided Low-Light Image Enhancement. Appl. Sci. 2021, 11, 5055. [Google Scholar] [CrossRef]
  50. Li, Q.; Wu, H.; Xu, L.; Wang, L.; Lv, Y.; Kang, X. Low-light image enhancement based on deep symmetric encoder—decoder convolutional networks. Symmetry 2020, 12, 446. [Google Scholar] [CrossRef] [Green Version]
  51. Han, S.; Lee, T.B.; Heo, Y.S. Deep Image Prior for Super Resolution of Noisy Image. Electronics 2021, 10, 2014. [Google Scholar] [CrossRef]
  52. Ai, S.; Kwon, J. Extreme low-light image enhancement for surveillance cameras using attention U-Net. Sensors 2020, 20, 495. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Zhao, B.; Gong, X.; Wang, J.; Zhao, L. Low-Light Image Enhancement Based on Multi-Path Interaction. Sensors 2021, 21, 4986. [Google Scholar] [CrossRef] [PubMed]
  54. Zhu, J.Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2223–2232. [Google Scholar]
  55. Guo, C.; Li, C.; Guo, J.; Loy, C.C.; Hou, J.; Kwong, S.; Cong, R. Zero-reference deep curve estimation for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 1780–1789. [Google Scholar]
  56. Ulyanov, D.; Vedaldi, A.; Lempitsky, V. Deep image prior. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 9446–9454. [Google Scholar]
  57. Ng, M.K.; Wang, W. A total variation model for Retinex. SIAM J. Imaging Sci. 2011, 4, 345–365. [Google Scholar] [CrossRef]
  58. Yue, H.; Yang, J.; Sun, X.; Wu, F.; Hou, C. Contrast enhancement based on intrinsic image decomposition. IEEE Trans. Image Process. 2017, 26, 3981–3994. [Google Scholar] [CrossRef]
  59. Lee, C.; Lee, C.; Lee, Y.Y.; Kim, C.S. Power-constrained contrast enhancement for emissive displays based on histogram equalization. IEEE Trans. Image Process. 2011, 21, 80–93. [Google Scholar]
  60. Mittal, A.; Soundararajan, R.; Bovik, A.C. Making a "completely blind" image quality analyzer. IEEE Signal Process. Lett. 2012, 20, 209–212. [Google Scholar] [CrossRef]
  61. Gu, K.; Lin, W.; Zhai, G.; Yang, X.; Zhang, W.; Chen, C.W. No-reference quality metric of contrast-distorted images based on information maximization. IEEE T. Cybern. 2017, 47, 4559–4565. [Google Scholar] [CrossRef] [PubMed]
  62. Gu, K.; Tao, D.; Qiao, J.F.; Lin, W. Learning a no-reference quality assessment model of enhanced images with big data. IEEE Trans. Neural Netw. Learn. Syst. 2017, 29, 1301–1313. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  63. Fu, X.; Zeng, D.; Huang, Y.; Zhang, X.P.; Ding, X. A weighted variational model for simultaneous reflectance and illumination estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2782–2790. [Google Scholar]
Figure 1. Overall framework of our method, where $z_{1,2,3} \sim \mathcal{N}(0, \sigma^2)$ denote Gaussian noise inputs. DIP1 is used to generate the noise N, and DIP2 and DIP3 are used to generate the reflectance R and the latent illumination I. $l_{rec}$, $l_{ic}$, $l_{ref}$, and $l_{is}$ represent the reconstruction loss, illumination-consistency loss, reflectance loss, and illumination smoothness loss, respectively. $I_0$ is the initial illumination, $S_0$ is the input image, and S is the enhanced image.
Figure 2. Results of each step of our method: (a,f) are the input image and the enhanced result; (b–e) show the illumination I, reflectance R, noise N, and R + N.
Figure 3. Comparisons of enhanced images. Red boxes indicate the obvious differences. Compared with other methods, our method yields natural exposure and clear details.
Figure 4. Comparisons of enhanced images. Red boxes indicate the obvious differences. Compared with other methods, our method enhances the image and the edges are clearly visible.
Figure 5. Comparisons of enhanced images. Red boxes indicate the obvious differences. Compared with other methods, our method does not have the problems of overexposure and artifacts.
Figure 6. Comparisons of enhanced images. Red boxes indicate the obvious differences. Compared with other methods, our method improves the contrast effectively and maintains the natural color.
Table 1. Comparison of average NIQE on six datasets. (The red, green, and blue scores represent the top three in the corresponding dataset, respectively.)

Method       DICM     Fusion   LIME     MEF      NPE      VV       Average
LIME         3.5360   3.9183   4.1423   3.7022   4.2625   2.7475   3.5442
NPE          3.4530   3.8883   3.9031   3.5155   3.9501   3.0290   3.4928
SRIE         3.5768   3.9741   3.7868   3.4742   3.9883   3.1357   3.5668
KinD         4.2691   4.1027   4.3525   4.1318   3.9589   3.4255   4.0752
Zero-DCE     3.6091   4.2421   3.9354   3.4044   4.0944   3.2245   3.6332
RetinexDIP   3.7612   4.2308   3.6355   3.2721   4.1012   2.4890   3.5363
Ours         3.7911   4.0628   3.7615   3.2363   4.0426   2.4604   3.5294
Table 2. Comparison of average NIQMC on six datasets. (The red, green, and blue scores represent the top three in the corresponding dataset, respectively.)

Method       DICM     Fusion   LIME     MEF      NPE      VV       Average
LIME         5.3397   5.3686   5.4956   5.4168   5.4480   5.5805   5.4121
NPE          5.0895   4.5802   4.6168   4.8610   5.1738   5.2655   5.0104
SRIE         4.9990   4.3568   4.5032   4.7045   5.1848   5.3021   4.9246
KinD         4.6155   4.5248   4.6841   4.6725   4.5766   4.8159   4.6511
Zero-DCE     4.8984   4.7346   5.0678   5.0504   5.1068   5.3614   5.0062
RetinexDIP   4.9912   4.4449   4.7830   5.0151   5.3222   5.3915   5.0126
Ours         5.0093   4.5210   4.7996   5.0761   5.2931   5.4138   5.0398
Table 3. Comparison of average CPCQI on six datasets. (The red, green, and blue scores represent the top three in the corresponding dataset, respectively.)

Method       DICM     Fusion   LIME     MEF      NPE      VV       Average
LIME         0.8986   0.9642   1.0882   1.0385   0.9844   0.9555   0.9515
NPE          0.9139   0.9705   1.0812   1.0372   1.0228   0.9557   0.9609
SRIE         0.9056   1.0094   1.1121   1.0967   1.0258   0.9629   0.9721
KinD         0.7459   0.8148   0.8336   0.7877   0.8007   0.7418   0.7670
Zero-DCE     0.7818   0.8820   0.9803   0.9461   0.8578   0.8396   0.8415
RetinexDIP   0.9999   1.0680   1.1595   1.1088   1.0411   1.0525   1.0436
Ours         1.0038   1.0787   1.1585   1.0926   1.0524   1.0445   1.0437
Table 4. Runtime (RT) comparison (in seconds).

Method       640 × 480   1280 × 960   1920 × 1440   2560 × 1920   3200 × 2400   3840 × 2880   4480 × 3360   5120 × 3840
LIME         0.1133      0.4196       1.0148        1.5713        2.3901        3.3302        4.4058        5.7054
NPE          5.8861      26.6340      58.5019       104.8345      163.9938      235.7513      326.0996      427.1531
SRIE         4.7643      33.6684      121.5802      343.9839      726.5981      386.7066      544.0660      865.0404
KinD         0.1554      0.0464       -             -             -             -             -             -
Zero-DCE     0.12559     0.1390       0.2539        0.4051        0.83371       -             -             -
RetinexDIP   15.2482     15.5945      31.4575       52.1564       102.5126      139.4568      182.4861      212.1594
Ours         15.3655     15.4910      30.8131       48.9527       94.5498       122.6732      154.7097      189.1483
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

