Stable Sparse Model with Non-Tight Frame

Zhang, Min; Shi, Yunhui; Qi, Na; Yin, Baocai

doi:10.3390/app10051771

Open AccessArticle

Stable Sparse Model with Non-Tight Frame

¹

Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China

²

Computer Science and Technology, Dalian University of Technology, Dalian 116023, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2020, 10(5), 1771; https://doi.org/10.3390/app10051771

Submission received: 17 January 2020 / Revised: 24 February 2020 / Accepted: 25 February 2020 / Published: 4 March 2020

(This article belongs to the Special Issue Advances in Image Processing, Analysis and Recognition Technology)

Download

Browse Figures

Versions Notes

Abstract

:

Overcomplete representation is attracting interest in image restoration due to its potential to generate sparse representations of signals. However, the problem of seeking sparse representation must be unstable in the presence of noise. Restricted Isometry Property (RIP), playing a crucial role in providing stable sparse representation, has been ignored in the existing sparse models as it is hard to integrate into the conventional sparse models as a regularizer. In this paper, we propose a stable sparse model with non-tight frame (SSM-NTF) via applying the corresponding frame condition to approximate RIP. Our SSM-NTF model takes into account the advantage of the traditional sparse model, and meanwhile contains RIP and closed-form expression of sparse coefficients which ensure stable recovery. Moreover, benefitting from the pair-wise of the non-tight frame (the original frame and its dual frame), our SSM-NTF model combines a synthesis sparse system and an analysis sparse system. By enforcing the frame bounds and applying a second-order truncated series to approximate the inverse frame operator, we formulate a dictionary pair (frame pair) learning model along with a two-phase iterative algorithm. Extensive experimental results on image restoration tasks such as denoising, super resolution and inpainting show that our proposed SSM-NTF achieves superior recovery performance in terms of both subjective and objective quality.

Keywords:

sparse dictionary; stable recovery; frame; RIP

1. Introduction

Sparse representation of signals in dictionary domains has been widely studied and has provided promising performance in numerous signal processing tasks such as image denoising [1,2,3,4,5], super resolution [6,7,8], inpainting [9,10] and compression [11,12]. It is well known that images are represented by a linear combination of certain atoms of a dictionary. Overcomplete sparse representation is the overcomplete system with a sparse constraint. Common overcomplete systems differ from the traditional bases, such as DCT, DFT and Wavelet, because they offer a wider range of generating elements; potentially, this wider range allows more flexibility and effectiveness in signal sparse representation. However, it is a severely under-constrained illposed problem to find the underlying overcomplete representation due to the redundancy of the systems. When the underlying representation is sparse and the overcomplete systems have stable properties, the ill-posedness will disappear [13]. Sparse models are generally classified into two categories: Synthesis sparse models and analysis sparse model [14]. The commonly referred to sparse models are synthesis sparse models. The analysis ones characterize the signal by multiplying it with an analysis overcomplete dictionary, leading to a sparse outcome. A variety of effective sparse models have been investigated and established such as the classical synthesis sparse model [9,15], the classical analysis sparse model [14], the nonlocal sparse model [16,17] and the 2D sparse model [18]. Unfortunately, these models ignore the stability recovery property which claims that once a sufficient sparse solution is found, all alternative solutions necessarily reside very close to it [9]. Recently, the stable recovery of sparse representation has drawn attention in signal processing theory. Generally speaking, stable recovery can be guaranteed by two properties: Sufficient sparsity and a favorable structure of the dictionary [19]. Donoho defines the concept of mutual incoherence of the dictionary and applies it to prove some possibility of stable recovery [19]. The authors of [20] proposea sparsity-based orthogonal dictionary learning method to minimize the mutual incoherence. The authors of [21] propose an incoherent dictionary learning scheme by integrating a low rank gram matrix of the dictionary into the dictionary learning model.

A more powerful stable recovery guarantee developed by Candes and Tao, termed Restricted Isometry Property (RIP), makes consequent analysis easy [22]. A matrix

Φ

is said to satisfy the RIP of order k if there exists a constant

δ_{k} \in (0, 1)

such that

(1 - δ_{k}) {∥ y ∥}_{2}^{2} \leq {∥ Φ y ∥}_{2}^{2} \leq (1 + δ_{k}) {∥ y ∥}_{2}^{2}

(1)

holds for all k-sparse vectors

y

.

δ_{k}

is defined as the smallest constant which satisfies the above inequalities and is called the restricted isometry constant of

Φ

.

Most RIP research substantially investigates applying RIP as a stablility analysis instrument [17,23,24] or finding optimal RIP constant [25,26] which are all theoretical analyses rather than practical applications. According to the research of [21], the intrinsic property of a dictionary has a direct influence on its performance. All familiar algorithms are staggeringly unstable with a coherent or degenerate dictionary [19]. Recognizing the gap between theoretical analyses and practical applications of RIP, this paper aims to build a stable sparse model satisfying the RIP.

Recently, the frame as a stable overcomplete system has drawn some attention in signal processing as the given signal can be represented by its canonical expansion in a manner similar to conventional bases under the frame. Some data-driven approaches are proposed in [1,27,28,29,30]. The authors of [27,29,30] utilize redundant tight frame in compressed sensing and [28] applies tight frame to few-view image reconstruction. Study [1] presents a data-driven method that the dictionary atoms associated with the tight frame are generated by filters. These approaches achieve much better image processing performance than previous methods, and meanwhile the tight frame condition which requires the frame almost-orthogonality will limit the flexibility in sparse representation. Study [31] derives stable recovery result for

l_{1}

-analysis minimization in redundant, possibly non-tight frames. Inspired by this result and the relationship between RIP and frame, we aim to establish a stable sparse model with RIP based on non-tight frame.

We call a sequence

{ϕ_{i}}_{i = 1}^{M} \in H

a frame if and only if there exist two positive numbers A and B such that

{A ∥ x ∥}_{2}^{2} \leq \sum_{i = 1}^{M} | < x, ϕ_{i} {> |}^{2} \leq B {∥ x ∥}_{2}^{2} \forall x \in H^{N}

(2)

Here, A and B are called the bounds of the frame. We find that every submatrix

Φ_{k}

satisfied RIP is a non-tight frame with

(1 - δ_{k})

and

(1 + δ_{k})

as its frame bounds with a given k. Obviously, there is an essential connection between the non-tight frame and the RIP.

In this paper we focus on a stable sparse model and more specifically on the development of an algorithm that would learn a pair of non-tight frame based dictionaries from a set of signal examples. We propose a stable sparse model via applying the non-tight frame condition to approximate the RIP. This model shares the favorite overcomplete structures with the common sparse models, and meanwhile it contains RIP and closed-form sparse coefficient expression which ensure stable recovery. Recognizing that the optimal framebounds are essentially the maximum and minimum singular values of the frame, RIP is actually enforced on the dictionary pair (the frame and its dual frame) by constraining the singular values of them. We also formulate a dictionary pair learning model via applying the second-order truncated Taylor series to approximate the inverse frame operator. Then we present an efficient algorithm to learn the dictionary pair via a two-phase iterative approach. To summarize, this paper makes the following contributions:

We propose a stable sparse model along with a dictionary pair learning model. Non-tight frame condition is utilized to develop a relaxation of RIP to guarantee stable recovery of sparse representation. Moreover, the sparse coefficients are also modeled, which leads to a more stable recovery especially for seriously noisy image.
It is nearly impossible to solve the dictionary pair learning model in a straightforward way since the inverse frame operator is involved. We provide an effective way to modify the model via applying a second-order truncated Taylor series to approximate the inverse frame operator, and provide an efficient algorithm for the modified one.
We present the stability analysis of the proposed model and demonstrate it on natural and synthetic image denoising, super resolution and image inpainting. The denoising results show that the proposed approach outperforms synthesis models such as the KSVD and the data-driven tight frame based methods for natural image case in terms of average PSNR. Moreover, it also gains comparable performance to the Analysis KSVD for a piecewise-constant (PWC) image in terms of average PSNR. The meaningful structures in the trained dictionary pairs for natural images and a PWC image are observed. The super resolution results show that the SSM-NTF produces better performance than the Bicubic interpolation method and the method in [32]. The inpainting results show that our model is able to eliminate text of fonts completely.

This paper is organized as follows: Section 2 reviews the related work on frame, synthesis sparse model and analysis sparse model. Section 3 presents our stable sparse model with non-tight frame SSM-NTF along with a dictionary pair learning model. Section 4 proposes the corresponding dictionary pair learning algorithm. Section 5 proposes the image restoration method of our proposed SSM-NTF model. In Section 6 we analyze the computational complexity of our proposed algorithm. In Section 7, we demonstrate the the effectiveness of our SSM-NTF model by analyzing the convergence of the corresponding algorithm, denoising natural and piecewise constant images, super resolution and image inpainting. Finally, Section 8 concludes this paper.

2. Related Work

In this section, we briefly review the related work on frame, synthesis sparse model and analysis sparse model.

Frame: A frame

Φ

is called a tight frame if the frame bounds are equal in the Equation (2) [32]. There are two associated operators can be defined between the Hilbert space

H^{N}

and Square Integrable Space

l_{2}^{M} (\cdot)

once a frame is defined. One is the analysis operator

T

defined by

{(T x)}_{i} = < x, ϕ_{i} >, \forall x \in H^{N}

(3)

and the other is its adjoint operator

T^{*}

which is called the synthesis operator

\begin{matrix} T^{*} c = \sum_{i = 1}^{M} c ϕ_{i} \forall c = {(c_{i})}_{i \in J} \in l_{2}^{M} (T) \end{matrix}

(4)

then, the frame operator can be defined as the following canonical expansion

\begin{matrix} Fx = T^{*} T x = \sum_{i = 1}^{M} < x, ϕ_{i} > ϕ_{i} \end{matrix}

(5)

In Euclidean space, a given frame

Φ

can be represent in manner of matrix with its columns of it as the frame elements. Then one of its adjoint operator can be representated as

ψ_{i} = F^{- 1} ϕ_{i}

[32]. Let

x \in R^{N}

be an arbitrary vector, a reconstruction function can be expressed as the following form

\begin{matrix} x = \sum_{i = 1}^{M} < x, ψ_{i} > ϕ_{i} \end{matrix}

(6)

Synthesis sparse model: The conventional synthesis sparse model represents a vector

x

by the linear combination of a few atoms from a large dictionary

Φ

, denoted as

x = {Φ y, ∥ y ∥}_{0} \leq L

, where L is the sparsity of

y

. The computational techniques for approximating sparse coefficient

y

under a given dictionary

Φ

and

x

includes greedy pursuit (e.g., OMP [9]) and convex relaxation optimization, such as Lasso [33] and FISTA [8]. In order to improve the performance of sparse representation, some modified models such as the nonlocal sparse model [16], the frame based sparse model [21], and the MD sparse model [18] are also investigated.

Analysis sparse model: The analysis sparse model is defined as:

y = {Ω x, ∥ y ∥}_{0} = p - l

where

Ω \in R^{p \times d}

is a linear operator (also called as a dictionary), and l denotes the co-sparsity of the signal

x

. The analysis representative vector

y

is sparse with l zeros. The zeros in

y

denote the low-dimensional subspace to which the signal

x

belongs. The analysis sparse coding [14] and dictionary learning [34] approach are also been proposed.

However, all these models ignore the stability recovery property which provides stable reconstruction of the signals in presence of noise.

Dictionary learning methods: The dictionaries include analytical dictionaries, such as DCT, DWT, curvelets and contourlets and learned dictionaries. Some dictionary learning method are proposed, such as the classical KSVD [9] algorithm, the efficient sparse coding which convert the original dictionary learning problem to two least squares problem by applying the Lagrange dual [3], the non-local sparse model [16] which learns a set of PCA sub-dictionaries by cluster the samples into K clusters using image nonlocal self-similarity prior and its improved version which using the

l_{q}

-norm to instead the

l_{2}

-norm in order to handle different image contents. With the realization of stability, some mutual-coherence based methods are proposed. In [20] a sparsity-based orthogonal dictionary learning method is proposed to minimize the mutual-coherence of the dictionary. The authors of [21] propose an in coherent dictionary learning scheme by integrating a low rank gram matrix of the dictionary into the dictionary learning model. However, these methods only concern the capability of the dictionary without modeling the sparse coefficients which still has some probability of instability.

3. The Proposed SSM-NTF

In this section, we present the stable sparse model with non-tight frame, (Section 3.1), the stability analysis of the proposed model, (Section 3.2) and the dictionary pair (the frame pair) learning model, (Section 3.3).

3.1. Stable Sparse Model with Non-Tight Frame

In this section, we derive our stable sparse model with non-tight frame where the non-tight frame condition serves as an approximation to the RIP.

According to [35], a k-th RIP constant can be express as

δ_{k} (Φ) = \frac{Γ_{k} (Φ) - 1}{Γ_{k} (Φ) + 1}

(7)

where

Γ_{k} (Φ) = \frac{θ_{m a x}^{k}}{θ_{m i n}^{k}}

(8)

\begin{matrix} θ_{m a x}^{k} = max_{{∥ y ∥}_{0} = k} \frac{{∥ Φ y ∥}_{2}^{2}}{{∥ y ∥}_{2}^{2}}, θ_{m i n}^{k} = min_{{∥ y ∥}_{0} = k} \frac{{∥ Φ y ∥}_{2}^{2}}{{∥ y ∥}_{2}^{2}} \end{matrix}

(9)

The Equation (7) provides a new perspective in integrating the RIP to sparse model via applying

θ_{m a x}^{k}

and

θ_{m i n}^{k}

instead of the RIP constant

δ_{k} (Φ)

. The difficulty in building a stable sparse model decreases. However, the sparsity k varies with the noise level, and also, in a feasible numerical calculation method, it is impossible to sweep through all the samples satisfying

{∥ x ∥}_{0} = k

to pursue an unknown dictionary

Φ

.

Let

x

be a signal vector, the frame reconstruction function can be formulated as

x = Φ Ψ^{T} x

where

Ψ

is a dual frame of

Φ

. Adding a reasonable sparsity prior to the signal

x

over

Ψ

domain, we can derive

\frac{∥ Φ (Ψ^{T} x) ∥_{2}^{2}}{∥ Ψ^{T} {x ∥}_{2}^{2}} = \frac{{∥ x ∥}_{2}^{2}}{∥ Ψ^{T} {x ∥}_{2}^{2}} = \frac{1}{\frac{∥ Ψ^{T} {x ∥}_{2}^{2}}{{∥ x ∥}_{2}^{2}}}

(10)

Denoting the optimal frame bounds of

Φ

as A and B, the frame condition of

Ψ

can be formulated as

\frac{1}{B} \leq \frac{∥ Ψ^{T} {x ∥}_{2}^{2}}{{∥ x ∥}_{2}^{2}} \leq \frac{1}{A}

. Then a pair of bounds for Equation (10) can be obtained as

A \leq \frac{∥ Φ (Ψ^{T} x) ∥_{2}^{2}}{∥ Ψ^{T} {x ∥}_{2}^{2}} \leq B

. A formula similar to Equation (9) is derived as

B = max_{x \in J} \frac{∥ Φ (Ψ^{T} x) ∥_{2}^{2}}{∥ Ψ^{T} {x ∥}_{2}^{2}}

,

A = min_{x \in J} \frac{∥ Φ (Ψ^{T} x) ∥_{2}^{2}}{∥ Ψ^{T} {x ∥}_{2}^{2}}

where

J

is the data set. Imitating Equation (7), we can obtain a RIP-like constant expression

\hat{δ} (Φ) = \frac{\hat{Γ} (Φ) - 1}{\hat{Γ} (Φ) + 1}

(11)

where

\hat{Γ} (Φ) = \frac{B}{A}

. Obviously,

\hat{δ} (Φ)

can be regarded as an approximation of the RIP constant which benefits the computation due to the ignorance on sparsity degree. In a word, the RIP constraint can be satisfied by constraining the frame bounds. Thus, a stable overcomplete system with a sparsity prior can be established.

Now we discuss the characteristic of the frame bounds A and B. The Frame Condition (2) has a more compact form

\sqrt{A} \leq η_{Φ} \leq \sqrt{B}

where

η_{Φ}

denotes any singular value of

Φ

. More specifically,

\sqrt{A} = η_{m i n}

,

\sqrt{B} = η_{m a x}

where

η_{m a x}

and

η_{m i n}

denote the maximum and minimum singular values of

Φ

, respectively. Then, we can obtain

η_{m a x} \geq θ_{m a x}^{k}, η_{m i n} \leq θ_{m i n}^{k}

. It is easy to know that

\hat{δ} (Φ) \geq δ_{k} (Φ)

. Obviously,

\hat{δ} (Φ)

is a reasonable relaxation of

δ_{k} (Φ)

as

\hat{δ} (Φ)

is slightly exceed

δ_{k} (Φ)

but resides very close to it as long as the data is not seriously degraded. Therefore, the RIP constraint can be enforced on the frames by limiting the maximum and minimum singular values.

In this paper, we integrate non-frame to traditional sparse model to establish a stable sparse model with RIP. Let

x

be a signal vector. Under the assumption of the sparsity prior of

Ψ^{T} x

, we apply a soft thresholding operator

S_{λ} (\cdot)

(which shall be defined in the next subsection) on it such that

x = Φ S_{λ} (Ψ^{T} x)

(12)

where

λ

is a vector with elements

λ_{i}

corresponding to

ψ_{i}

,

i = 1, 2, \dots, M

. Therefore, we propose the stable sparse model with non-tight frame (SSM-NTF) as follows

\begin{matrix} y = S_{λ} (Ψ^{T} x), x = Φ y, \\ {s . t . ∥ y ∥}_{0} \leq s . \end{matrix}

(13)

Here, the correlation between the frame

Φ

and its dual frame

Ψ

is formulated as

Ψ = F^{- 1} Φ

. The frame operator

F

is formulated as

Φ Φ^{T}

which is indeed a gram matrix of

Φ

. The singular values of

Φ

are constrained by

\sqrt{A} \leq η_{Φ} \leq \sqrt{B}

to satisfy the RIP. Actually, by constraining the singular values of

Φ

, the elements of the gram matrix are also bounded which meets the theory of mutual coherence.

In order to be consistent with the traditional sparse models, we refer to the frame

Φ

and its dual frame

Ψ

as dictionary and its dual dictionary.

3.2. The Stability Analysis of the Proposed Model

In sparse representation problem, a given noiseless signal

x

, can be formulated formulated as

(P_{0}) : min_{x} {∥ y ∥}_{0} s . t . x = Φ y

(14)

where

Φ

is the sparse representation dictionary and

y

is the sparse coefficients. While

x = Φ y

is an underdetermined linear system, the problem

(P_{0})

has the unique solution

y_{0}

as soon as it satisfies the uniqueness property which is formulated as

∥ \hat{y} ∥_{0} < \frac{1}{2} (1 + 1 / μ)

(15)

where

μ

is the mutual-coherence of

Φ

[9]. However, the signals are usually acquired with noise, then the problem

(P_{0})

should be relaxed to the problem

(P_{ϵ})

which is expressed as

(P_{ϵ}) : min_{x} {∥ y ∥}_{0} s . t . {∥ x - Φ y ∥}_{F}^{2} \leq ϵ

(16)

where

ϵ

is an error-tolerant which exists due to the noise. The problem

(P_{ϵ})

will no longer maintain the uniqueness of solution as

x = Φ y + ϵ

is an inequality system. Thus, the notion of Uniqueness Property (15) is replaced by the notion of stability which claims that all the alternative solutions reside very close to the ideal solution. Under the stable guarantee, we can yet ensure that the recovery results of our methods produce meaningful solutions. Assume that

y_{0}

is the ideal solution to the problem

(P_{ϵ})

and

\hat{y}

is the candidate one, the traditional sparse model has a stability claim of the form [9]

∥ \hat{y} - y_{0} ∥_{2}^{2} \leq \frac{4 ϵ^{2}}{1 - (2 s_{0} - 1) μ},

(17)

where

μ

is the mutual coherence which is formulated as

μ = {max}_{i \neq j} | < ϕ_{i}, ϕ_{j} > |, i, j = 1, 2, \dots, M

. Apparently, the error bound of Equation (17) can only be determined with given sparsity

s_{0}

and the mutual coherence

μ

. However, the mutual coherence of an unknown dictionary is very difficult to calculate which lead to a result that we can not ensure the stability in the dictionary learning case. In contrast, we derive a similar stability claim of our proposed SSM-NTF model.

Defining

d = \hat{y} - y_{0}

with

y_{0}

as the ideal solution to the model, we have that

∥ Φ \hat{y} - Φ y_{0} ∥_{2} = {∥ Φ d ∥}_{2} \leq 2 ϵ

. From the previous subsection, we have know that the frame

Φ

satisfies the RIP with the corresponding parameter

\hat{δ} (Φ)

. Thus, using this property and exploiting the lower-bound part in Equation (1), we get

(1 - \hat{δ} (Φ)) {∥ d ∥}_{2}^{2} \leq {∥ Φ d ∥}_{2}^{2} \leq 4 ϵ^{2}

(18)

where

\hat{δ} (Φ) = \frac{\hat{Γ} (Φ) - 1}{\hat{Γ} (Φ) + 1} = \frac{\frac{B}{A} - 1}{\frac{B}{A} + 1}

. Thus, we get a stability claim of the form

{∥ d ∥}_{2}^{2} = {∥ \hat{y} - y_{0} ∥}_{2}^{2} \leq \frac{4 ϵ^{2}}{1 - \frac{\frac{B}{A} - 1}{\frac{B}{A} + 1}}

(19)

Obviously, the error bound of the SSM-NTF is determined by

\frac{B}{A}

, the ratio of the upper bound to the lower bound of the frame, rather than the specific values of A and B. Thus, for the convenience of numerical experiments, we usually set A to a fixed value. A main advantage of standard orthogonal transformations is that they maintain the energy of the signals in the transform domain as its frame bounds A and B are equal to 1. However, the standard orthogonal basis is non-redundant that limits its performance in sparse representation. In order to make a trade off between the represent accuracy and the degree of redundant, we usually set the lower frame bound A to a value a little smaller than 1 but not over-small as A is the minimum singular value of

Φ

which determines the condition number of

Φ

. Thus, once the tolerance error is given, the value of B can be easily calculated. Further, a pair of dictionaries conform to the given error can be obtained using the proposed SSM-NTF model. On the other hand, if the value of B is given by experience, the error bound of our model can be measured.

3.3. Learning Model of Dictionary Pair

Assuming

X \in R^{N \times L}

is the training data with signal vectors

x_{i} \in R^{N}, i = 1, 2, \dots, L

, as its columns. The dictionary pair learning model can be written as

\begin{matrix} min_{Φ, Ψ, λ, Y} {∥ X - Φ Y ∥}_{F}^{2} + γ_{1} ∥ Y - S_{λ} (Ψ^{T} X) ∥_{F}^{2} + γ_{2} {∥ Y ∥}_{0} + γ_{3} {∥ Ψ - F^{- 1} Φ ∥}_{F}^{2} \\ s . t . \sqrt{A} \leq η_{Φ} \leq \sqrt{B} \end{matrix}

(20)

However, the Problem (20) is difficult to solve. First, the inverse of the frame operator

F

has no closed-form explicit expression. Secondly, the thresholding operator is a highly nonlinear operator which makes the optimization with respect to

λ

hard to optimize.

Apparently, the Problem (20) is difficult to solve as the existence of the inverse of

F

. Fortunately, the matrix

F^{- 1}

can be expressed as a convergent series [36] which is formulated as

F^{- 1} = \frac{2}{A + B} \sum_{k = 0}^{\infty} {(I - \frac{2 F}{A + B})}^{k}

(21)

Here, we truncated the series at

k = 1

to make a tradeoff between computational complexity and approximation accuracy. It is formulated as

F^{- 1} \approx \frac{2}{A + B} + \frac{2}{A + B} (I - \frac{2 F}{A + B}) = \frac{2}{A + B} (2 I - \frac{2 F}{A + B})

(22)

In this way, once the frame bounds are given, the inverse of

F

can be calculated easily. Then the optimization problem for training RIP-dictionary pair is formulated as

\begin{matrix} min_{Φ, Ψ, λ, Y} {∥ X - Φ Y ∥}_{F}^{2} + γ_{1} ∥ Y - S_{λ} (Ψ^{T} X) ∥_{F}^{2} + γ_{2} {∥ Y ∥}_{0} + γ_{3} {∥ Ψ - \frac{2}{A + B} (2 I - \frac{2 F}{A + B}) Φ ∥}_{F}^{2} \\ s . t . \sqrt{A} \leq σ_{Φ} \leq \sqrt{B} \end{matrix}

(23)

where

S_{λ} (\cdot)

is the elementwise thresholding operator. There are two basic thresholding methods: The hard thresholding method whose thresholding operator defines as

S_{λ} (\cdot) \to m a x (| \cdot | - λ, 0)

and the soft thresholding whose operator is defined as

S_{λ} (\cdot) \to s g n (\cdot) m a x (| \cdot | - λ, 0)

. Both of the two operator are are non-convex and highly discontinuous which lead to big challenges to solve Problem (23). The mean reason is the fact that the update of the thresholding values

λ

causing non-smooth changes to the cost function. To solve this difficulty, we design an alternative direction method via global search and least square that will be introduce in Section 4.1.

4. Dictionary Pair Learning Algorithm

In this subsection, we propose the two-phase iterative algorithm for dictionary pair learning by dividing Problem (23) into two subproblems: The sparse coding phase which updates the sparse coefficients

Y

and thresholding values

λ

, and the dictionary pair update phase which computes

Φ

and

Ψ

.

4.1. Sparse coding phase

In this subsection, we discuss how to calculate the sparse coefficients

Y

and the threshold values

λ

with given

Φ

and

Ψ

under our SSM-NTF model.

Given a pair of dictionaries

Φ

and

Ψ

, calculating

Y

and

λ

from

X

is formulated as:

{\hat{Y}, \hat{λ}} = min_{Y, λ} {∥ X - Φ Y ∥}_{F}^{2} + γ_{1} ∥ Y - S_{λ} (Ψ^{T} X) ∥_{F}^{2} + γ_{2} {∥ Y ∥}_{0}

(24)

We pursue the two variables alternatively. Firstly, with fixed

λ

, we obtain the sparse coefficients

Y

by solving Problem (24) through OMP [9] as it can be easily convert to the classical synthesis sparse expression

{min ∥ Z - DY ∥}_{F}^{2}, s . t . {∥ Y ∥}_{0}

where

Z = [X {\sqrt{γ}}_{1} S_{λ} (Ψ^{T} X)]

and

D = [Φ {\sqrt{γ}}_{1} I]

.

Secondly, the pursue of

λ

is equivalent to solving the following problem

\hat{λ} = \underset{λ}{arg min} {∥ Y - S_{λ} (Ψ^{T} X) ∥}_{F}^{2}

(25)

which can be decomposed into M individual optimization problems

{\hat{λ}}_{i} = \underset{λ_{i}}{arg min} {∥ {\bar{y}}_{i} - S_{λ_{i}} (ψ_{i}^{T} X) ∥}_{2}^{2}, i = 1, \dots, M .

(26)

where

ψ_{i}

is the column of

Ψ

. From the definition of soft thresholding operator, we can know that the function of Problem (26) is discrete. By denoting the data indices set that remains intact after the thresholding as

J_{i}

, we split the data

X

into two parts:

X^{J_{i}}

and

X^{{\hat{J}}_{i}}

such that

\begin{matrix} S_{λ_{i}} (ψ_{i}^{T} X^{J_{i}}) & = ψ_{i}^{T} X^{J_{i}} - s g n (ψ_{i}^{T} X^{J_{i}}) λ_{i} \end{matrix}

(27)

\begin{matrix} S_{λ_{i}} (ψ_{i}^{T} X^{{\hat{J}}_{i}}) & = 0 . \end{matrix}

(28)

where

{\hat{J}}_{i}

is a supplementary to the intact indices

J_{i}

which turn the all elements to zero. It is clear to know that the variables

J_{i}

and

{\hat{J}}_{i}

are both functions of

λ_{i}

without explicit expressions which leads to a large challenge in optimization.

In order to solve Problem (26), an intermediate variable

μ_{i}

is necessarily to introduced to separate the whole problem into two parts: The update of the indices

J_{i}

and

{\hat{J}}_{i}

(determined by

μ_{i}

) and the update of the explicit thresholding value

λ_{i}

. Then Problem (26) can be transformed to another optimization problem:

\begin{matrix} {{\hat{λ}}_{i}, {\hat{μ}}_{i}} = \underset{λ_{i}}{arg min} ∥ {\bar{y}}_{i}^{{\hat{J}}_{i}} ∥_{2}^{2} + ∥ {\bar{y}}^{J_{i}} - [ψ_{i}^{T} X^{J_{i}} - s g n (ψ_{i}^{T} X^{J_{i}}) λ_{i}] ∥_{2}^{2} + τ / 2 {∥ λ_{i} - μ_{i} ∥}_{2}^{2} \end{matrix}

(29)

where

J_{i}

and

{\hat{J}}_{i}

are two functions of the intermediate variable

μ_{i}

.

At the k-th step, to obtain

μ_{i}

, we solve Problem (29) with

λ_{i}

fixed and denote the functions as

f (μ_{i}) + g (μ_{i}) + l (μ_{i})

where

f (μ_{i}) = {∥ {\bar{y}}_{i}^{{\hat{J}}_{i}} ∥}_{2}^{2}

,

g (μ_{i}) = {∥ {\bar{y}}^{J_{i}} - [ψ_{i}^{T} X^{J_{i}} - s g n (ψ_{i}^{T} X^{J_{i}}) λ_{i}] ∥}_{2}^{2}

,

l (μ_{i}) = τ / 2 {∥ λ_{i} - μ_{i} ∥}_{2}^{2}

. Optimizing this expression is obviously non-trivial as the target function is non-convex and highly discontinuous. Actually, with

λ_{i}

fixed, the minimization of

f (μ_{i}) + g (μ_{i})

can be globally solved due to its discrete finite nature. In another word, if a series of candidate terms of

μ_{i}

are given, the global search is guaranteed to succeed.

Once a

λ_{i}

is given, the

f (μ_{i}) + g (μ_{i})

will be a piecewise constant function. It means that the function values remain unchanged within a series of intervals which are determined by

| ψ_{i}^{T} x_{i} |, i = i_{1}, i_{2}, \dots, i_{l}

. Therefore,

| ψ_{i}^{T} x_{i} |, i = 1, 2, \dots, L

can be taken as a portion of candidate terms of

μ_{i}

. For the function

ł (μ_{i})

, it is clear that it minimizes at

μ_{i} = λ_{i}

and monotonically increases with the increasing distance between

ł (μ_{i})

and the given

ł (λ_{i})

. So, to minimize

ł (μ_{i})

, we only need to choose the closest point in the feasible region.

Without loss of generality, we assume that all the

| ψ_{i}^{T} x_{i} |, i = 1, 2, \dots, L

are ascending ordered and the corresponding signals are in the same order. We compute all the possible values of

f (μ_{i}) + g (μ_{i})

by

\begin{matrix} f (ψ_{i}^{T} X_{j + 1}) = f (ψ_{i}^{T} X_{j}) + y_{i (j + 1)}^{2} \\ g (ψ_{i}^{T} X_{j + 1}) = {∥ {\bar{y}}_{i} - ψ_{i}^{T} X + s g n (ψ_{i}^{T} X) λ_{i} ∥}_{2}^{2} - β_{k + 1} \end{matrix}

(30)

where

β_{k + 1} = β_{k} + {[y_{i (k + 1)} - ψ_{i}^{T} x_{k + 1} + s g n (ψ_{i}^{T} x_{k + 1}) λ_{i}]}^{2}, β_{1} = {[y_{i 1} - ψ_{i}^{T} x_{1} + s g n (ψ_{i}^{T} x_{1}) λ_{i}]}^{2}

. Sort

| ψ_{i}^{T} x_{i} |, i = 1, 2, \dots, L

in descending order of

f (μ_{i}) + g (μ_{i})

, and every two adjacent values form an interval on which the function value remains unchanged. In another word, the objective function

f (μ_{i}) + g (μ_{i}) + l (μ_{i})

is minimized at the point closest to

λ_{i}

in the interval. Thus, compute all the minimizer values on every interval and the minimum must be the optimal result.

With

μ_{i}

fixed, we solve the following problem in order to pursue

λ_{i}

:

\begin{matrix} {{\hat{λ}}_{i}} = \underset{λ_{i}}{arg min} ∥ {\bar{y}}^{J_{i}} - [ψ_{i}^{T} X^{J_{i}} - s g n (ψ_{i}^{T} X^{J_{i}}) λ_{i}] ∥_{2}^{2} + τ / 2 {∥ λ_{i} - μ_{i} ∥}_{2}^{2} \end{matrix}

(31)

This is a standard continuous convex function that can be easily sovled by least square.

We summarize our sparse coding method in Algorithm 1.

Algorithm 1 Sparse coding algorithm

Input and Initialization:
Training data

X \in R^{N \times L}

, iteration number r, initial value

λ_{i} = 0

.
Output:
Sparse coefficients

y

, and thresholding values

λ_{i}

1:: Compute the sparse coefficients $y$ via Problem (24) according to the OMP algorithm.
2:: Sort the columns of $X$ and $y$ in increasing order of $| {ψ_{i}}^{T} X |$ .
3:: For p=1:r
For j=1:L
Compute all the possible values for $f (μ_{i}) + g (μ_{i})$ by

$\begin{matrix} f (ψ_{i}^{T} X_{j + 1}) = f (ψ_{i}^{T} X_{j}) + y_{i (j + 1)}^{2} \\ g (ψ_{i}^{T} X_{j + 1}) = {∥ {\bar{y}}_{i} - ψ_{i}^{T} X + s g n (ψ_{i}^{T} X) λ_{i} ∥}_{2}^{2} - β_{k + 1} \end{matrix}$

where $β_{k + 1} = β_{k} + {[y_{i (k + 1)} - ψ_{i}^{T} x_{k + 1} + s g n (ψ_{i}^{T} x_{k + 1}) λ_{i}]}^{2}, β_{1} = {[y_{i 1} - ψ_{i}^{T} x_{1} + s g n (ψ_{i}^{T} x_{1}) λ_{i}]}^{2}$ .
Denote them as a vector $ν$ .
End for
4:: Sort the elements of $| {ψ_{i}}^{T} X |$ in descending order of $ν$ . Denote the intervals bounded as $ξ_{q}, q = 1, 2, L - 1$ .
5:: compute every $ν_{q} + l ({\hat{μ}}_{i})$ where ${\hat{μ}}_{i}$ is the point closest to $λ_{i}$ in $ξ_{j - 1}$ .
6:: ${\hat{μ}}_{i} = arg {min}_{μ_{i}} ν_{q} + l (μ_{i})$ .
7:: Compute $λ_{i}$ via Problem (31).
End for

4.2. Dictionary Pair Update Phase

To obtain

Ψ

, we solve the following problem with all other variables fixed:

\hat{Ψ} = \underset{Ψ}{arg min} ∥ Y - S_{λ} (Ψ^{T} X) ∥_{F}^{2} + \frac{γ_{3}}{γ_{1}} {∥ Ψ - F^{- 1} Φ ∥}_{F}^{2}

(32)

Such problem is a highly nonlinear optimization due to the definition of

S_{λ}

. Here we solve

Ψ

columnwisely by updating each column of

Ψ

.

For each

ψ_{i}

, we solve the following subproblem:

{\hat{ψ}}_{i} = \underset{ψ_{i}}{arg min} ∥ {\bar{y}}_{i} - S_{λ_{i}} (ψ_{i}^{T} X) ∥_{2}^{2} + \frac{γ_{3}}{γ_{1}} {∥ ψ_{i} - F^{- 1} ϕ_{i} ∥}_{2}^{2}

(33)

We denote

J_{i}

and

{\hat{J}}_{i}

as the indices set as before. Set the elements of

{\bar{y}}_{i}

corresponding to the indices

{\hat{J}}_{i}

to be zeros and denote the new vector as

{\bar{z}}_{i}

. This operation leads to a consequence that

ψ_{i}^{T} X^{{\hat{J}}_{i}}

≈0. Then we solve the following quadratic optimization problem that is easy to solve with least squares.

\begin{matrix} \hat{ψ_{i}} = \underset{ψ_{i}}{arg min} ∥ {\bar{z}}_{i} - ψ_{i}^{T} {X ∥}_{2}^{2} + \frac{γ_{3}}{γ_{1}} {∥ ψ_{i} - F^{- 1} ϕ_{i} ∥}_{2}^{2} \end{matrix}

(34)

The optimization problem to pursue

Φ

is formulated as

\begin{matrix} \hat{Φ} = \underset{Φ}{arg min} & {∥ X - Φ Y ∥}_{F}^{2} + γ_{3} {∥ Ψ - F^{- 1} Φ ∥}_{F}^{2} \end{matrix}

(35)

\begin{matrix} s . t . & \sqrt{A} \leq δ_{Φ} \leq \sqrt{B} \end{matrix}

(36)

where the frame operator

F

is given by

Φ Φ^{T}

and

F^{- 1}

is defined as Equation (22). The target function then becomes

{∥ X - Φ Y ∥}_{F}^{2} + η_{3} {∥ Ψ - \frac{2}{A + B} (2 I - \frac{2 Φ Φ^{T}}{A + B}) Φ ∥}_{F}^{2}

(37)

which is denoted by

h (Φ)

. We apply the gradient descent method to unconstraint version of Problem (35) and then project the solution to the feasible space. The gradient is given by a very complicated form as follows

\begin{matrix} \nabla h (Φ) = (X - Φ Y) Y^{T} - γ_{3} {\frac{4}{α} h (Φ) + \frac{4}{α^{2}} [Φ Φ^{T} h (Φ) + Φ h {(Φ)}^{T} Φ + h (Φ) Φ Φ^{T}]} . \end{matrix}

(38)

In order to reduce the complexity, the gradient can also be computed with the fixed

F

calculated in the previous step of the ADM. Then at the k-th iteration, the gradient can be written as

\nabla h (Φ^{k}) = (X - Φ^{k - 1} Y) Y^{T} - γ_{3} {F^{(- 1)}}^{T} (Ψ^{k} - F^{T} Φ^{k - 1})

(39)

where

F = Φ^{k - 1} {Φ^{(k - 1)}}^{T}

. The descent step length can be obtained by optimizing the problem

min_{θ} h (Φ + θ \nabla h (Φ))

with fixed

F

, which is given by

\begin{matrix} \hat{θ} = \frac{< a, b > + γ_{3} < c, d >}{{∥ a ∥}_{F}^{2} + γ_{3} {∥ c ∥}_{F}^{2}} \end{matrix}

(40)

where

a = \nabla h (Φ) Y, b = X - Φ Y, c = F^{- 1} \nabla h (Φ), d = Ψ - F^{- 1} Φ

. For the frame condition

\sqrt{A} \leq δ_{Φ} \leq \sqrt{B}

, we apply a SVD decomposition

Φ = U Σ V^{T}

and map the singular values to the interval

[\sqrt{A}, \sqrt{B}]

linearly. We denote the mapped singular matrix as

\hat{Σ}

and reconstruct

Φ

by

Φ = U \hat{Σ} V^{T}

.

We summarize our algorithm in Algorithm 2.

Algorithm 2 Dictionary pair learning algorithm

Input and Initialization:
Training data

X

, frame bound A,B, iteration

n u m

, gradient descent iterations r.
Build frame

Φ \in R^{N \times M}

and

Ψ \in R^{N \times M}

, either by using random entries, or using M randomly chosen data.
Output:
Frame

Φ, Ψ

, Sparse coefficients

Y

, and thresholding values

λ

For l=1:

n u m

Sparse Coding Step:

1:: Compute the sparse coefficients $Y$ and the thresholding values $λ$ via Algorithm 1.
Frame Update Step:
2:: Update $Ψ$ in column-wise. Compute $W = S (Ψ^{T} X) .$
For $i = 1 : M$
Denote ${\hat{J}}_{i}$ as the indices of zeros in the i-th column of W. Set $ψ_{i}^{T} X^{{\hat{J}}_{i}} = 0$ . Compute $ψ_{i}$ via Equation (34).
End For
3:: Update $Φ$ via Gradient Descent and singular value map.
For k=1:r
Calculate Gradient via Equation (39) and calculate the descent step via Equation (40)
End for
4:: Apply SVD decomposition $Φ = U Σ V^{T}$ , map $Σ$ to obtain $\hat{Σ}$ and reconstruct $Φ = U \hat{Σ} V^{T}$ .
End for

5. Restoration

The image restoration aims to reconstruct a high-quality image

I

from its degraded (e.g., noisy, blurred and/or downsampled) version

L

, denoted by

L = S H I + n

, where H represents a blurring filter, S the downsampling operator, and

n

is a noisy signal. For the signal satisfies the SSM-NTF, the restoration model based on SSM-NTF is formulated as

\begin{matrix} {\hat{I}, \hat{Y}, \hat{λ}} = min_{I, {y_{i}}_{i = 1}^{N}, \hat{λ}} ∥ L - S H I ∥_{F}^{2} + γ \sum_{i} ∥ R_{i} I - Φ y_{i} ∥_{F}^{2} + γ_{1} \sum_{i} ∥ y_{i} - S_{λ} (Ψ^{T} R_{i} I) ∥_{F}^{2} + γ_{2} \sum_{i} ∥ y_{i} ∥_{0} \end{matrix}

(41)

where

R_{i}

is an operator that extracts the i-th patch of the image

I

and

y_{i}

is the i-th column of

Y

.

λ

denotes a vector

[λ_{1}, λ_{2}, \dots, λ_{M}]

with

λ_{j}

operating on the j-th element of

Ψ^{T} R_{i} I

. On the right side of Equation (41), the first term is the global force that demands the proximity between the degraded image

L

, and its high-quality version

I

. The rest terms are the local constraints to make sure every patch at location i satisfies the SSM-NTF.

To solve Problem (41), we apply Algorithm 1 to obtain the sparse coefficients

Y

and the threshold values

λ

. We mainly state the iterative method to obtain

I

. Assume the sign of

Ψ^{T} R_{i} I^{k}

will not change much between two steps, we set it in the k-th step by

c^{k} = s i g n (Ψ^{T} R_{i} I^{k - 1})

. where

s i g n

is the sign function. Denote

d^{k} = Ψ^{T} R_{i} I^{k - 1}

. We set

O^{k}

as an index set that satisfies

| d_{l} | \leq λ_{l}, l \in O^{k}

. Set

u^{k} \in R^{M}

as a vector with elements

u_{l} = \{\begin{matrix} λ_{l} & l \in O^{k}, \\ 0 & o t h e r w i s e . \end{matrix}

Then the non-convex and non-smooth threshold can be removed with the substitution that

y_{i} - S_{λ} (Ψ^{T} R_{i} I^{k}) \approx y_{i} + c^{k} ⊙ u - Ψ^{T} R_{i} I^{k}

Thus, in the k-th step, the problem needs to be solved is expressed as

\begin{matrix} {\hat{I^{k}}} = min_{I^{k - 1}} ∥ L - S H I^{k - 1} ∥_{F}^{2} + γ \sum_{i} ∥ R_{i} I^{k - 1} - Φ y_{i} ∥_{F}^{2} + γ_{1} \sum_{i} {∥ y_{i} + c^{k} ⊙ u^{k} - Ψ^{T} R_{i} I^{k - 1} ∥}_{F}^{2} \end{matrix}

(42)

where ⊙ is point multiplication. This convex problem can be easily solved by gradient descent algorithm.

We summarized the restoration algorithm in Algorithm 3.

Algorithm 3 Restoration algorithm

Input
Training dictionaries

Φ

,

Ψ

, iteration number r, a degraded image

L

, set

I_{0} = L

.
Output:
The high quality image

\hat{I}

1:: Compute $Y$ and $λ$ via the method in Algorithm 1.
For k=1:r
2:: compute $d^{k} = Ψ^{T} R_{i} I^{k - 1}$ . Set $c^{k} = s i g n (d^{k})$ . set $O^{k}$ as an index set that satisfies $| d_{l}^{k} | \leq λ_{l}, l \in O^{k}$ .
Set $u_{l}^{k} = \{\begin{matrix} λ_{l} & l \in O^{k}, \\ 0 & o t h e r w i s e . \end{matrix}$ .
3:: Solving the Problem (42) via gradient descent algorithm.
End for

6. Complexity Analysis

In this section, we discuss the computational complexity of our sparse coding and dictionary pair learning algorithms with regard to those of conventional sparse model counterparts.

We first analyze complexities of the main components of the sparse coding (SC) and dictionary updating (DU) algorithms. In terms of SC, given a set of training samples,

X \in R^{N \times L}

, the complexity of BtOMP of calculating

\hat{Y} = min_{Y} {∥ X - Φ Y ∥}_{F}^{2} + γ_{1} ∥ Y - S_{λ} (Ψ^{T} X) ∥_{F}^{2} + γ_{2} {∥ Y ∥}_{0}

is

O (K^{2} M L)

where K is the target sparsity and the complexity of threshold of calculating

\hat{λ} = min_{λ} {∥ Y - S_{λ} (Ψ^{T} X) ∥}_{F}^{2}

is

O (N M L)

, which cost most of time in SC step at each iteration. The sparse coefficients

Y \in N \times L

and the the threshold values

λ

are computed with fixed dictionaries

Φ \in R^{N \times M}

and

Ψ \in R^{N \times M}

. Correspondingly, the traditional sparse coefficients

B \in N \times L

is sparse approximated by dictionary

D \in R^{N \times M}

and the computational complexity is

O (K^{2} M L)

.

In terms of DU, with the given training samples

X \in R^{N \times L}

, we learn a pair of dictionary

Φ \in R^{N \times M}

and

Ψ \in R^{N \times M}

. We update

Ψ

via Problem (34) with a computational complexity of

O (N^{2} L)

. In order to update

Φ

. we need to calculate the gradient via Equation (39) with a computational complexity of

O (N M L)

and the step size via Equation (40) with a computational complexity of

O (r N M L)

where r is the iteration number of the gradient descent. For the traditional dictionary learning, the corresponding training set is

X \in R^{N \times L}

and the dictionary

D \in R^{N \times M}

is updated by SVD decomposition of rank-1 with a computational complexity of

O (K M L)

.

7. Experimental Results

We demonstrate the effectiveness of our SSM-NTF model by first discussing the convergence of our dictionary pair learning algorithm and then evaluating the performance on natural and piecewise constant image denoising, super resolution and image inpainting.

7.1. Convergence Analysis

The convergence of the presented dictionary pair learning algorithm is evaluated in Figure 1. Here, we train a pair of dictionaries

Φ

and

Ψ

of size

100 \times 200

from 65,000 patches, which are of size 10 × 10 randomly sampled from six natural images. We apply the frame reconstruction function

Φ S_{λ} (Ψ^{T} x)

to reconstruct the patches. The convergence of the presented dictionary pair learning algorithm is evaluated in Figure 1. The dictionary pair is illustrated in Figure 2. They exhibit that our dictionary pair learning method is able to capture the feature of the image along with the convergence property.

7.2. Image Denoising

In this subsection, we evaluate the performance of our proposed SSM-NTF model on image denoising. Benefitting from the concept of non-tight frame, the proposed SSM-NTF model contains a pair of dictionaries: The frame and its dual frame. As a result, our proposed SSM-NTF model contains an analysis system and a synthesis system. The analysis-like system is denoted as

y = S_{λ} (Ψ^{T} x), ∥ y ∥ \leq s

(43)

which analyzes the signals in

Ψ

domain. The synthesis system is denoted as

x = Φ y, ∥ y ∥ \leq s

(44)

which reconstructs the analyzed signals. The two systems share the same sparse coefficients

Y

.

Therefore, we compare our proposed SSM-NTF with synthesis and analysis models, respectively. It is well known that the synthesis sparse model has advantage in dealing with the natural image while the analysis sparse model is mostly used to address the piecewise constant image. Therefore, we respectively, perform the denoising experiments on natural images and piecewise images comparing with the most related approaches.

7.2.1. Natural Image Denoising

We now turn to present experimental results on six classical natural images named ‘Barbara’, ‘Boat’, ‘Couple’, ‘Hill’, ‘Lena’ and ‘Man’ which are shown in [1], to evaluate the performance of the training algorithm. The denoising problem which has been widely studied in sparse representation is used as the target application. We add Gaussian white noise to these images at different noise levels

σ = 20, 30, 40, 50, 60, 70, 80, 90, 100

. Then we use the learned dictionary pair to denoise the natural images, with overlap of 1 pixel between adjacent patches of size

10 \times 10

. The patch denoising stage is followed by weighted averaging the overlapping patch recoveries to obtain the final clean image. The parameters in our scheme are

γ_{1} = 1.1

and

γ_{3} = 1.2 {(L / M)}^{2}

where L and M are the sample and dictionary size, respectively. We have stated in Section 3.2 that we usually set A to a positive number around but smaller than 1. In fact, we set A from 0.6 to 1 by a step of 0.03 to test the denoising performance to determine the specific value of it. Then, with fixed A, we set B from 1 to 4 with a step length of 0.3 to run experiments on every noise level to determine the values of B. The values of frame bounds A and B are shown in Table 1. For example, when the noise level

σ = 40

, A and B are set to be 0.8 and 1.8, respectively.

Table 2 shows the comparison results in terms of PSNR. There are three related image denoising methods involved, including the classical dictionary learning algorithm KSVD [9], the data-driven tight frame based denoising method [1] and the incoherent dictionary learning based method [21]. The patch size of KSVD [9] and the method in [21] are

8 \times 8

with stripe 1 and the dictionaries are of size

64 \times 256

at their optimal state according to the previous work. We point out that [1] works on filters of size

16 \times 16

instead of image patches and initialized by 64 3-level Harr wavelet filters in size

16 \times 16

. All the three compared methods can achieve their best performance with 50 iterations.

Table 2 shows that the incoherent dictionary learning method [21] outperforms the KSVD [9] in average as the mutual incoherent of dictionary can provide stable recovery. That [1] outperforms [21] implies that the tight frame is a more stable system. Then our stable sparse model based method outperforms [1] in average suggests that applying non-tight frame to approximate RIP can provide even better and more stable reconstruction results. Figure 3 shows two exemplified visual results on images ‘Man’ and ‘Couple’ at noise levels

σ = 50

and

σ = 40

, respectively. The proposed method shows much clearer and better visual results than the other competing methods.

7.2.2. Piecewise Constant Image Denoising

In this subsection, we demonstrate the analytical property of our SSM-NTF model using a synthetic image. The denoising problem which has been widely studied in sparse representation is used as the target application. We start with a piecewise constant of size

256 \times 256

contaminated by Gaussian white noise with noise level

σ = 5

and extract all possible

5 \times 5

image pathes. For the denoising we apply the dictionary pair learning algorithm with the parameters

γ_{1} = 1.5, γ_{3} = 1.2

L / M

,

A = 0.8

and

B = 1.8

in parallel with patch denosing with the synthesis KSVD [9] and the analysis KSVD [14]. We apply 100 iterations of the our dictionary learning method on this training set, and learning dictionary pair of size

25 \times 50

. The experimental set of the synthesis KSVD [9] and the analysis KSVD [14] are at their optimal state according to the previous work.

The learned dictionary pair

Φ

which exhibits much like the synthesis dictionary and

Ψ

which exhibits a high resemblance to the analysis dictionary are illustrated in Figure 4. The resulting PSNRs of the denoised images are 45.32 dB for Analysis KSVD, 43.60 dB for Synthesis KSVD, and 45.17 dB for our proposed algorithm. The figure shows that our dictionary pair learning method is able to capture the features of the piecewise constant image. Figure 5 shows the absolute difference images for each of the three methods. Note that these images are displayed in the dynamic range

[0, 20]

to make the differences more pronounced. Our proposed approach leads to a much better denoising result than the synthesis KSVD and is comparable with the analysis KSVD.

7.3. Super Resolution

We evaluate our SSM-NTF in comparison with two examplar-based scheme [37] for image Super Resolution (SR) Problem (41) with a bicubic filter. Figure 6 shows the 15 test natural images [18] with both rich texture and structure. All the schemes are applied to the illumination channel, where the scale factor is 3, we always use

3 \times 3

low-resolution patches with overlap of 1 pixel between adjacent patches, corresponding to

9 \times 9

patches with overlap of 3 pixels for the high-resolution patches. In these experiments, we have used the following parameters:

A = 0.8, B = 1.8

,

γ_{1} = 1.1

and

γ_{3} = 1.2

L / M

where L and M are the sample and dictionary size, respectively. In our scheme, dictionary learning is performed between HR and middle-level (MR) images which are the first-, and second-order derivatives of the upsampled version of one LR image by a factor of 2. The four

1 D

filters used to extract the derivatives are:

\begin{matrix} f_{1} = [- 1, 0, 1], f_{2} = f_{1}^{T} \\ f_{3} = [1, 0, - 2, 0, 1], f_{4} = f_{3}^{T} \end{matrix}

(45)

We train two pairs of HR/LR dictionaries

{Φ_{h}, Ψ_{h}}

and

{Φ_{l}, Ψ_{l}}

from 100,000 HR/LR patch pairs

[X_{h}, X_{l}]

randomly sampled from the collected natural images which are also used in [37] where

X_{h}

is sampled from the HR images and

X_{l}

is sampled from the four feature images. The feature images are obtained by applying the four filters to the upsampled LR image. Given

Φ

and

Ψ

and the four MR feature images, the sparse coefficients

Y

and threshold value

λ

can be calculated by Algorithm 1. With the theory in [37], the HR image can be recovered via Algorithm 3. In the experiment, our HR dictionary pair are of size

81 \times 450

and MR ones are of size

144 \times 450

. The dictionary size of [37] is 81 × 1024 (HR) and 144 × 1024 (MR) at its best performance as stated in the paper. Thus, the dictionary size of [37] is larger than the sum of our dictionaries. Table 3 shows the objective evaluation results of our proposed SSM-NTF compared with bicubic interpolation and [37]. On average, our SSM-NTF presents best in PSNR. Figure 7 presents the corresponding visual comparison of the illumination SR results of Image 12. We can observe that the result of bicubic interpolation is too smooth and the result of [37] suffers from obvious ringing artifact and noises. The HR reconstruction of our SSM-NTF method provides more clear details.

7.4. Image Inpainting

To illustrate the potential applicability of our proposed SSM-NTF model on image inpainting, we apply it to the applications of text removal. In these experiments, we have used the following parameters:

A = 0.8, B = 1.8

,

γ_{1} = 1.1

and

γ_{3} = 1.2

L / M

where L and M are the sample and dictionary size, respectively. We operate on the image ‘Adar’, ‘Lena’, ‘Couple’, ‘Hill’ with super-imposed text of various fonts.

In this experiment, we applied our SSM-NTF model to image inpainting in a way similar to the non-blind KSVD inpainting algorithm [9], which requires the knowledge of which pixels are corrupted and required inpainting. Actually, only the non-corrupted pixels are used to training the dictionary pair and inpainting the images. We operate our method on pathes of size

10 \times 10

that extract from the images with overlap of 1 pixel between adjacent. The trained dictionary pair are of size

100 \times 200

. The KSVD algorithm in this experiment is dealing with patches of size

8 \times 8

that extract from the images with overlap of 1 pixel between adjacent. The dictionary size is

64 \times 256

at its best performance according to [9]. The patch inpainting stage is followed solving Problem (41). Table 4 shows the objective evaluation results of our proposed SSM-NTF compared with DCT and KSVD [9]. The visual comparisons are shown in Figure 8 and Figure 9. We find that the proposed SSM-NTF method is able to eliminate text of fonts completely while the KSVD is dull. Our SSM-NTF method achieves better performance in terms of both subjective and objective quality.

8. Conclusions

In this paper, we propose a stable sparse model with non-tight frame (SSM-NTF) and further formulate a dictionary pair learning model to stably recover the signals. We theoretically analyze the rationality of the approximation for RIP with the non-tight frame condition. The proposed SSM-NTF has RIP and the closed-form expression of the sparse coefficients that ensure the stable recovery especially for seriously noise images. The proposed SSM-NTF contains both a synthesis sparse and an analysis system which share the common sparse coefficients without taking into account the thresholding. We also propose an efficient dictionary pair learning algorithm via developing an explicit analytical expression of the inherent relation between the dictionary pair. The proposed algorithm is capable of approximating structures of signals via a pair of adaptive dictionaries. The effectiveness of our proposed SSM-NTF and its corresponding algorithms are demonstrated in image denoising, image super-resolution and image inpainting. The results of numerical experiments show that the proposed SSM-NTF achieves superior to the compared methods in objective and subjective quality on most of the cases.

On the other hand, our proposed SSM-NTF is actually a 1D sparse model. The 1D sparse model suffers from high memory as well as high computational costs especially when handling high dimensional data. MD frame can be expressed as the kronecker product of a series of 1D frames. Benefitting from this good characteristic, in future work, we will extend our stable sparse model to propose an MD stable sparse model. Moreover, the proposed SSM-NTF is not effective enough to remove other kinds of noise (e.g., salt and pepper noise) as the loss function of SSM-NTF is gaussian. We would like to improve the performance of our model by changing the loss function.

Author Contributions

M.Z. derived the theory, analyzed the data, performed the performance and wrote the original draft; Y.S. and N.Q. researched the relevant theory, participated in discussions of the work and revised the manuscript; B.Y. supervised the project. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by NSFC (No.61672066,61976011, U1811463, U19B2039, 61906008, 61906009), Beijing municipal science and technology commission (No.Z171100004417023) and the Scientific Research Common Program of Beijing Municipal Commission of Education (KM202010005018).

Acknowledgments

This work was supported by Beijing Advanced Innovation Center for Future Internet Technology, Beijing Key Laboratory of Multimedia and Intelligent Software Technology. We deeply appreciate the organizations mentioned above.

Conflicts of Interest

The authors declare no conflict of interest.

References

Cai, J.F.; Ji, H.; Shen, Z.; Ye, G.B. Data-driven tight frame construction and image denoising. Appl. Comput. Harmon. Anal. 2014, 37, 89–105. [Google Scholar] [CrossRef]
Xie, J.; Feris, R.S.; Yu, S.S.; Sun, M.T. Joint super resolution and denoising from a single depth image. IEEE Trans. Multimed. 2015, 17, 1525–1537. [Google Scholar] [CrossRef]
Rubinstein, R.; Zibulevsky, M.; Elad, M. Efficient Implementation of the KSVD Algorithm and the Batch-OMP Method; Tech. Rep.; Department of Computer Science, Technion: Haifa, Israel, 2008. [Google Scholar]
Fadili, M.J.; Starck, J.-L.; Murtagh, F. Inpainting and zooming using sparse representations. Comput. J. 2009, 52, 64–79. [Google Scholar]
Liu, Y.; Zhai, G.; Gu, K.; Liu, X.; Zhao, D.; Gao, G. Reduced-reference image quality assessment in free-energy principle and sparse representation. IEEE Trans. Multimed. 2018, 20, 379–391. [Google Scholar] [CrossRef]
Zhao, J.; Hu, H.; Cao, F. Image super-resolution via adaptive sparse representation. Knowl. Based Syst. 2017, 124, 23–33. [Google Scholar] [CrossRef]
Zhu, Z.; Guo, F.; Yu, H.; Chen, C. Fast single image super-resolution via self-example learning and sparse representation. IEEE Trans. Multimed. 2014, 16, 2178–2190. [Google Scholar] [CrossRef]
Beck, A.; Teboulle, M. A fast iterative shrinkage thresholding algorithm for linear inverse problems. SIAM J. Imag. Sci. 2009, 2, 183–202. [Google Scholar] [CrossRef] [Green Version]
Elad, M. Sparse and Redundant Representations. In From Theory to Applications in Signal and Image Processing; Springer: Berlin, Germany, 2010. [Google Scholar]
Zhang, L.; Bioucas-Dias, J.M. Fast hyperspectral image denoising and inpainting based on low-rank and sparse representations. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 2018, 11, 730–742. [Google Scholar] [CrossRef]
Kalluri, M.; Jiang, M.; Ling, N.; Zheng, J.; Zhang, P. Adaptive RD optimal sparse coding with quantization for image compression. IEEE Trans. Multimed. 2019, 21, 39–50. [Google Scholar] [CrossRef]
Liu, Y.; Dimitris, A. Pados, Compressed-sensed-domain l₁-pca video surveillance. IEEE Trans. Multimed. 2016, 18, 351–363. [Google Scholar] [CrossRef]
Babaie-Zadeh, M.; Jutten, C. Corrections to on the stable recovery of the sparsest overcomplete representations in presence of noise [oct 10 5396–5400]. IEEE Trans. Signal Process. 2011, 59, 1913. [Google Scholar] [CrossRef]
Rubinstein, R.; Peleg, T.; Elad, M. Analysis KSVD: A dictionary-learning algorithm for the analysis sparse model. IEEE Trans. Signal Process. 2012, 61, 661–677. [Google Scholar] [CrossRef] [Green Version]
Elad, M.; Aharon, M. Image denoising via sparse and redundant representations over learned dictionaries. IEEE Trans. Image Process 2006, 15, 3736–3745. [Google Scholar] [CrossRef] [PubMed]
Dong, W.; Zhang, L.; Shi, G.; Li, X. Nonlocally centralized sparse representation for image restoration. IEEE Trans. Image Process. 2013, 22, 1620–1630. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Song, X.; Peng, X.; Xu, J.; Shi, G.; Wu, F. Distributed compressive sensing for cloud-based wireless image Transmission. IEEE Trans. Multimed. 2017, 19, 1351–1364. [Google Scholar] [CrossRef]
Qi, N.; Shi, Y.; Sun, X.; Yin, B. Tensr: Multi-dimensional tensor sparse representation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, 27–30 June 2016; pp. 5916–5925. [Google Scholar]
Donoho, D.L.; Elad, M.; Vladimir, N. Temlyakov, Stable recovery of sparse overcomplete representations in the presence of noise. IEEE Trans. Inf. Theory 2006, 52, 6–18. [Google Scholar] [CrossRef]
Bao, C.; Cai, J.-F.; Ji, H. Fast sparsity-based orthogonal dictionary learning for image restoration. In Proceedings of the IEEE International Conference on Computer Vision, ICCV 2013, Sydney, Australia, 1–8 December 2013; pp. 3384–3391. [Google Scholar]
Wang, J.; Cai, J.-F.; Shi, Y.; Yin, B. Incoherent dictionary learning for sparse representation based image denoising. In Proceedings of the IEEE International Conference on Image Processing, ICIP 2014, Paris, France, 27–30 October 2014; pp. 4582–4586. [Google Scholar]
Candès, E.J.; Tao, T. Decoding by linear programming. IEEE Trans. Inf. Theory 2005, 51, 4203–4215. [Google Scholar] [CrossRef] [Green Version]
Wang, J.; Li, G.; Rencker, L.; Wang, W.; Gu, Y. An rip-based performance guarantee of covariance-assisted matching pursuit. IEEE Signal Process. Lett. 2018, 25, 828–832. [Google Scholar] [CrossRef]
Akbari, A.; Trocan, M.; Granado, B. Sparse recovery-based error concealment. IEEE Trans. Multimed. 2017, 19, 1339–1350. [Google Scholar] [CrossRef]
Zhang, R.; Li, S. A proof of conjecture on restricted isometry property constants δ_t (0<t<4/3). IEEE Trans. Inf. Theory 2018, 64, 1699–1705. [Google Scholar]
Lin, J.; Li, S.; Shen, Y. New bounds for restricted isometry constants with coherent tight frames. IEEE Trans. Signal Process. 2013, 61, 611–621. [Google Scholar] [CrossRef]
Cao, C.; Gao, X. Compressed sensing image restoration based on data-driven multi-scale tight frame. J. Comput. Appl. Math. 2017, 309, 622–629. [Google Scholar] [CrossRef]
Li, J.; Zhang, W.; Zhang, H.; Li, L.; Yan, B. Data driven tight frames regularization for few-view image reconstruction. In Proceedings of the 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery, ICNC-FSKD 2017, Guilin, China, 29–31 July 2017; pp. 815–820. [Google Scholar]
Liu, Y.; Zhan, Z.; Cai, J.-F.; Guo, D.; Chen, Z.; Qu, X. Projected iterative softthresholding algorithm for tight frames in compressed sensing magnetic resonance imaging. IEEE Trans. Med. Imaging 2016, 35, 2130–2140. [Google Scholar] [CrossRef] [Green Version]
Bai, H.; Li, S.; He, X. Sensing matrix optimization based on equiangular tight frames with consideration of sparse representation error. IEEE Trans. Multimed. 2016, 18, 2040–2053. [Google Scholar] [CrossRef]
Haltmeier, M. Stable signal reconstruction via l₁-minimization in redundant, non-tight frames. IEEE Trans. Signal Process. 2013, 61, 420–426. [Google Scholar] [CrossRef]
Li, D.F.; Sun, W.C. Expansion of frames to tight frames. Acta Mathematica Sinica Eng. Ser. 2009, 25, 287–292. [Google Scholar] [CrossRef]
Tibshirani, R. Regression Shrinkage and Selection Via the Lasso. J. R. Statist. Soc. Ser. B 1996, 58, 267–288. [Google Scholar] [CrossRef]
Nam, S.; Davies, M.E.; Elad, M.; Gribonval, R. The cosparse analysis model and algorithms. Appl. Comput. Harmon. Anal. 2013, 34, 205–229. [Google Scholar] [CrossRef]
Zhang, Y. Theory of compressive sensing via l₁-minimization: A non-rip analysis and extensions. J. Opt. Res. Soc. China 2013, 1, 79–105. [Google Scholar] [CrossRef] [Green Version]
Christensen, O. Finite dimensional approximation of the inverse frame operator and applications to wavelet frames and Gabor frames. J. Fourier Anal. Appl. 2000, 1, 79–90. [Google Scholar] [CrossRef] [Green Version]
Yang, J.; Member, J.W.; Huang, T.S.; Ma, Y. Image Super-Resolution Via Sparse Representation. IEEE Trans. Image Process. 2010, 19, 2861–2873. [Google Scholar] [CrossRef]

Figure 1. Convergence analysis. The X-labels are the iteration number. The Y-labels are the is the objective function of System (20) (left) and the restoration result (measured by ‘PSNR’) (right). It is shown that our dictionary pair learning algorithm is a convergence one.

Figure 2. The exemplified dictionary pair (

Φ

,

Ψ

) in our stable sparse model with non-tight frame (SSM-NTF) model training by natural images.

Figure 2. The exemplified dictionary pair (

Φ

,

Ψ

) in our stable sparse model with non-tight frame (SSM-NTF) model training by natural images.

Figure 3. Visual comparison of reconstruction results by different methods on ‘Man’ (

σ = 50

) and ‘Couple’ (

σ = 40

). From left to right: original image, noise image, KSVD [9], method of [1], method of [21] and our proposed method.

Figure 3. Visual comparison of reconstruction results by different methods on ‘Man’ (

σ = 50

) and ‘Couple’ (

σ = 40

). From left to right: original image, noise image, KSVD [9], method of [1], method of [21] and our proposed method.

Figure 4. The exemplified dictionaries training by the piecewise constant. From left to right: Synthesis KSVD (

25 \times 100

), our proposed dictionary pair (

25 \times 50

) and analysis KSVD (

25 \times 50

).

Figure 4. The exemplified dictionaries training by the piecewise constant. From left to right: Synthesis KSVD (

25 \times 100

), our proposed dictionary pair (

25 \times 50

) and analysis KSVD (

25 \times 50

).

Figure 5. Visual quality comparison of denoising results for piecewise constant image. Images of the absolute errors are displayed in the dynamic range [0,20] (from left to right): Original image, noise image, analysis KSVD [14], synthesis KSVD [9], our proposed method.

Figure 6. Test images for image super-resolution performance evaluation [18].

Figure 7. Visual quality comparison of SR results for Image 12 corresponding to Table 3. From left to right: Original image, result of bicubic interpolation (

P S N R = 30.28

), [32] (

P S N R = 30.62

) and our SSM-NTF method (

P S N R = 30.99

), respectively.

Figure 7. Visual quality comparison of SR results for Image 12 corresponding to Table 3. From left to right: Original image, result of bicubic interpolation (

P S N R = 30.28

), [32] (

P S N R = 30.62

) and our SSM-NTF method (

P S N R = 30.99

), respectively.

Figure 8. Visual quality comparison of text image inpainting results. From left to right: Original image, text image, result of DCT (

P S N R = 27.11

), KSVD (

P S N R = 28.01

) and our SSM-NTF method (

P S N R = 28.95

), respectively.

Figure 8. Visual quality comparison of text image inpainting results. From left to right: Original image, text image, result of DCT (

P S N R = 27.11

), KSVD (

P S N R = 28.01

) and our SSM-NTF method (

P S N R = 28.95

), respectively.

Figure 9. Visual quality comparison of scratch image inpainting results. From left to right: Original image, scratch image, result of DCT (

P S N R = 30.81

), KSVD (

P S N R = 31.69

), and our SSM-NTF method (

P S N R = 32.02

), respectively.

Figure 9. Visual quality comparison of scratch image inpainting results. From left to right: Original image, scratch image, result of DCT (

P S N R = 30.81

), KSVD (

P S N R = 31.69

), and our SSM-NTF method (

P S N R = 32.02

), respectively.

Table 1. The valuesof A and B.

$σ$	20	30	40	50	60	70	80	90	100
A	0.8	0.8	0.8	0.8	0.8	0.8	0.8	0.8	0.8
B	1.8	1.8	1.8	2.4	3.0	3.3	3.3	3.6	3.6

Table 2. PSNR (dB) for nature image denoising results.

$σ$	Image	Barbara	Boat	Couple	Hill	Lena	Man	Average
20	KSVD [9]	31.01	30.50	30.15	30.27	32.51	30.26	30.78
	[21]	31.03	30.45	30.16	30.30	32.52	30.30	30.79
	[1]	31.07	30.35	30.20	30.31	32.56	30.07	30.76
	SSM-NTF	31.06	30.56	30.32	30.42	32.60	30.33	30.88
30	KSVD [9]	28.75	28.60	28.07	28.51	30.59	28.43	28.83
	[21]	28.78	28.63	28.06	28.53	30.60	28.47	28.85
	[1]	29.07	28.48	28.22	28.64	30.60	28.26	28.88
	SSM-NTF	29.03	28.71	28.32	28.74	30.82	28.59	29.04
40	KSVD [9]	27.03	27.23	26.54	27.23	29.13	27.17	27.39
	[21]	27.05	27.18	26.59	27.21	29.10	27.12	27.38
	[1]	27.58	27.20	26.87	27.49	29.25	26.99	27.56
	SSM-NTF	27.52	27.49	27.07	27.65	29.43	27.41	27.76
50	KSVD [9]	25.71	26.05	25.42	26.29	27.92	26.18	26.26
	[21]	25.77	26.08	25.40	26.31	27.87	26.23	26.28
	[1]	26.45	26.15	25.84	26.63	28.15	26.09	26.55
	SSM-NTF	26.40	26.41	26.04	26.88	28.49	26.49	26.79
60	KSVD [9]	24.45	25.18	24.57	25.69	27.01	25.40	25.38
	[21]	24.45	25.20	24.50	25.65	27.03	25.42	25.38
	[1]	25.64	25.33	25.04	25.91	27.22	25.38	25.75
	SSM-NTF	25.52	25.55	25.22	26.11	27.41	25.74	25.93
70	KSVD [9]	23.40	24.46	23.90	25.10	26.18	24.77	24.63
	[21]	23.46	24.46	23.93	25.15	26.19	24.79	24.66
	[1]	24.88	24.67	24.36	25.30	26.44	24.78	25.07
	SSM-NTF	24.74	25.05	24.60	25.50	26.61	25.06	25.26
80	KSVD [9]	22.80	23.85	23.44	24.69	25.34	24.19	24.05
	[21]	22.87	23.93	23.47	24.76	25.40	24.27	24.09
	[1]	24.18	24.09	23.79	24.77	25.74	24.25	24.47
	SSM-NTF	24.11	24.42	24.02	24.91	26.02	24.58	24.68
90	KSVD [9]	22.31	23.34	22.98	24.31	24.96	23.78	23.61
	[21]	22.31	23.38	23.01	24.36	24.97	23.84	23.64
	[1]	23.50	23.56	23.27	24.30	25.14	23.79	23.93
	SSM-NTF	23.41	23.84	23.47	24.58	25.43	24.13	24.14
100	KSVD [9]	21.88	22.92	22.64	23.96	24.44	23.39	23.21
	[21]	21.89	22.95	22.67	23.99	24.47	23.43	23.23
	[1]	23.01	23.11	22.83	23.86	24.59	23.38	23.46
	SSM-NTF	22.90	23.46	23.05	24.16	24.90	23.68	23.69

Table 3. PSNR (dB) for

3 \times

SR reconstructions results.

Table 3. PSNR (dB) for

3 \times

SR reconstructions results.

$I m a g e$	1	2	3	4	5	6	7	8
$B i c u b i c$ [37]	32.69	26.32	26.04	28.39	28.70	25.81	37.21	24.40
$S C$ [37]	32.89	26.53	26.16	28.51	29.09	26.25	37.59	24.61
$S S M - N T F$	33.10	26.75	26.41	28.82	29.31	26.40	37.49	24.77
$I m a g e$	9	10	11	12	13	14	15	$A v e r a g e$
$B i c u b i c$ [37]	31.40	28.72	33.24	30.28	26.49	28.11	31.90	29.31
$S C$ [37]	31.73	29.35	33.50	30.62	26.75	28.74	32.13	29.63
$S S M - N T F$	31.90	29.49	33.66	30.99	26.95	29.02	32.19	29.82

Table 4. PSNR (dB) for image inpainting results.

$Image$	Adar	Lena	Hill	Couple	Average
$D C T$ [9]	27.11	31.30	28.53	29.21	29.04
$K S V D$ [9]	28.01	31.69	28.90	29.50	29.53
$S S M - N T F$	28.95	32.02	29.31	29.87	30.04

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, M.; Shi, Y.; Qi, N.; Yin, B. Stable Sparse Model with Non-Tight Frame. Appl. Sci. 2020, 10, 1771. https://doi.org/10.3390/app10051771

AMA Style

Zhang M, Shi Y, Qi N, Yin B. Stable Sparse Model with Non-Tight Frame. Applied Sciences. 2020; 10(5):1771. https://doi.org/10.3390/app10051771

Chicago/Turabian Style

Zhang, Min, Yunhui Shi, Na Qi, and Baocai Yin. 2020. "Stable Sparse Model with Non-Tight Frame" Applied Sciences 10, no. 5: 1771. https://doi.org/10.3390/app10051771

APA Style

Zhang, M., Shi, Y., Qi, N., & Yin, B. (2020). Stable Sparse Model with Non-Tight Frame. Applied Sciences, 10(5), 1771. https://doi.org/10.3390/app10051771

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Stable Sparse Model with Non-Tight Frame

Abstract

1. Introduction

2. Related Work

3. The Proposed SSM-NTF

3.1. Stable Sparse Model with Non-Tight Frame

3.2. The Stability Analysis of the Proposed Model

3.3. Learning Model of Dictionary Pair

4. Dictionary Pair Learning Algorithm

4.1. Sparse coding phase

4.2. Dictionary Pair Update Phase

5. Restoration

6. Complexity Analysis

7. Experimental Results

7.1. Convergence Analysis

7.2. Image Denoising

7.2.1. Natural Image Denoising

7.2.2. Piecewise Constant Image Denoising

7.3. Super Resolution

7.4. Image Inpainting

8. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI