Generalized Inexact Newton-Landweber Iteration for Possibly Non-Smooth Inverse Problems in Banach Spaces

School of Science, Dalian Maritime University, Dalian 116026, China
* Author to whom correspondence should be addressed.
Mathematics 2023, 11(7), 1706; https://doi.org/10.3390/math11071706
Submission received: 10 February 2023 / Revised: 29 March 2023 / Accepted: 29 March 2023 / Published: 3 April 2023

Abstract

In this paper, we consider a generalized inexact Newton-Landweber iteration to solve nonlinear ill-posed inverse problems in Banach spaces, where the forward operator might not be Gâteaux differentiable. The method is designed with non-smooth convex penalty terms, including $L^1$-like and total variation-like penalty functionals, to capture special features of solutions such as sparsity and piecewise constancy. Furthermore, an inaccurate inner solver is incorporated into the minimization problem in each iteration step. Under some assumptions, based on the $\varepsilon$-subdifferential, we establish the convergence analysis of the proposed method. Finally, some numerical simulations are provided to illustrate the effectiveness of the method for solving both smooth and non-smooth nonlinear inverse problems.

1. Introduction

We are interested in solving the following ill-posed inverse problem
$$F(x) = y, \tag{1}$$
where $F : D(F) \subset X \to Y$ is a possibly non-smooth nonlinear operator between Banach spaces $X$ and $Y$. In general, (1) may have many solutions. To pick the one with the desired features, we choose a proper convex function $\Theta : X \to (-\infty, \infty]$ and determine a solution $x^\dagger$ such that
$$D_{\xi_0}\Theta(x^\dagger, x_0) := \min_{x \in D(\Theta) \cap D(F)} \left\{ D_{\xi_0}\Theta(x, x_0) : F(x) = y \right\}, \tag{2}$$
where $x_0 \in D(\Theta)$ and $\xi_0 \in \partial\Theta(x_0)$ denote the initial guesses, and $D_{\xi_0}\Theta(x, x_0)$ denotes the Bregman distance induced by $\Theta$. Due to measurement errors, instead of $y$ only perturbed data $y^\delta$ are available, satisfying
$$\|y^\delta - y\|_Y \le \delta \tag{3}$$
with known noise level $\delta > 0$. A typical property of such equations is their ill-posedness, i.e., small perturbations of the data may lead to huge deviations in the solutions of (1). Therefore, regularization techniques are needed to obtain a stable approximation of $x^\dagger$ from $y^\delta$; see [1,2,3] and the references therein.
In [4,5,6,7,8], inexact Newton regularization methods were developed for solving nonlinear inverse problems in Banach spaces, where the forward mapping $F$ is assumed to be smooth, i.e., continuously Fréchet differentiable. This type of method updates the current iterate $x_n^\delta$ by applying an iterative regularization scheme to approximately solve the local linearization of (1) at $x_n^\delta$, i.e.,
$$F'(x_n^\delta)(x - x_n^\delta) = y^\delta - F(x_n^\delta), \tag{4}$$
where $F'(x_n^\delta)$ is the Fréchet derivative of $F$ at $x_n^\delta$. Assuming the $n$th iterates $(\xi_n^\delta, x_n^\delta)$ are available, by applying the Landweber iteration of [9] to (4), the inexact Newton-Landweber iteration in [6] generates the inner iterates $\{(\xi_{n,k}^\delta, x_{n,k}^\delta)\}$ by
$$\begin{aligned} \xi_{n,k+1}^\delta &= \xi_{n,k}^\delta + \mu_{n,k}^\delta F'(x_n^\delta)^* J_r^Y\big(y^\delta - F(x_n^\delta) - F'(x_n^\delta)(x_{n,k}^\delta - x_n^\delta)\big), \\ x_{n,k+1}^\delta &= \arg\min_{x \in X} \left\{ \Theta(x) - \langle \xi_{n,k+1}^\delta, x \rangle_{X^*, X} \right\} \end{aligned} \tag{5}$$
with $\xi_{n,0}^\delta = \xi_n^\delta$, $x_{n,0}^\delta = x_n^\delta$ and suitable step lengths $\mu_{n,k}^\delta$, where $J_r^Y(y) := \partial\left(\frac{1}{r}\|y\|_Y^r\right)$ $(1 < r < \infty)$ denotes the duality mapping from $Y$ to its dual $Y^*$. Let $k_n^\delta$ be the smallest integer such that
$$\left\|y^\delta - F(x_n^\delta) - F'(x_n^\delta)(x_{n,k_n^\delta}^\delta - x_n^\delta)\right\|_Y < \gamma\left\|y^\delta - F(x_n^\delta)\right\|_Y \tag{6}$$
for some $0 < \gamma < 1$; the next outer iterates are then defined as
$$\xi_{n+1}^\delta = \xi_{n,k_n^\delta}^\delta \quad \text{and} \quad x_{n+1}^\delta = x_{n,k_n^\delta}^\delta. \tag{7}$$
It has been shown in [6] that this renders a regularization method if the outer iteration (7) is terminated by the discrepancy principle. Recently, the authors of [7,8] considered an inexact Newton regularization method employing a so-called two-point gradient method [10] as inner scheme and derived convergence under the discrepancy principle. When $X$ and $Y$ are both Hilbert spaces and $\Theta(x) = \frac{1}{2}\|x\|^2$, one may refer to [11,12,13,14,15,16] for convergence and convergence-rate results.
Apart from the tangential cone condition on the forward operator, the convergence analysis of the inexact Newton regularization methods in [4,5,6,7,8] requires the continuity of the derivative $F'(x)$ and the exact resolution of the minimization problem in (5). However, there are cases where the forward operator $F$ is not Gâteaux differentiable [17], which renders the existing inexact Newton methods impractical. Moreover, the exact solution of the minimization problem in (5) is available only for some special $\Theta$; in general it can only be solved inexactly by numerical methods. Therefore, it is necessary to generalize the method (5)–(7) to cover non-smooth forward operators and to incorporate an inexact inner solver for the minimization problem in (5).
It has been shown in [18] that the Fréchet derivative $F'(x)$ can be substituted by another bounded linear operator sufficiently close to $F'(x)$. In [17,19], by introducing the Bouligand subderivative, the authors considered the Bouligand–Landweber iteration and the Bouligand–Levenberg–Marquardt method in Hilbert spaces. By employing a bounded linear operator $A_F$ (depending only on a certain point) as a replacement for $F'(x)$, an extension of the Gauss–Newton method was proposed in [20,21,22]. Inspired by this previous work, we propose a generalized inexact Newton-Landweber iteration for solving inverse problems with possibly non-smooth nonlinear operators, given by
$$\begin{aligned} \xi_{n,k+1}^\delta &= \xi_{n,k}^\delta + \mu_{n,k}^\delta A_F^* J_r^Y\big(y^\delta - F(x_n^\delta) - A_F(x_{n,k}^\delta - x_n^\delta)\big), \\ \Theta(x_{n,k+1}^\delta) - \langle \xi_{n,k+1}^\delta, x_{n,k+1}^\delta \rangle_{X^*, X} &\le \min_{x \in X} \left\{ \Theta(x) - \langle \xi_{n,k+1}^\delta, x \rangle_{X^*, X} \right\} + \varepsilon_{n,k+1}, \end{aligned} \tag{8}$$
where $A_F$ is a bounded linear operator satisfying certain conditions (see (17)) and $\varepsilon_{n,k+1} > 0$. The inner stopping index is chosen as $k_n^\delta := \min\{\tilde{k}_n^\delta, k_{\max}\}$, where $\tilde{k}_n^\delta$ is the first integer such that
$$\left\|y^\delta - F(x_n^\delta) - A_F(x_{n,k}^\delta - x_n^\delta)\right\|_Y^p + \sigma\varepsilon_{n,k} < \left(\gamma\left\|y^\delta - F(x_n^\delta)\right\|_Y\right)^p$$
for $0 < \gamma < 1$, $\sigma > 0$ and a given integer $k_{\max} \ge 1$. The next iterates are then constructed by $\xi_{n+1}^\delta = \xi_{n,k_n^\delta}^\delta$ and $x_{n+1}^\delta = x_{n,k_n^\delta}^\delta$. Note that when $\varepsilon_{n,k+1} = 0$, $k_{\max} = \infty$, $F$ is continuously Fréchet differentiable and $A_F := F'(x_n^\delta)$, our method (8) reduces to the method (5) of [6]. In contrast to the existing methods [4,5,6,7,8], the proposed method (8) requires neither the Fréchet differentiability of $F$ nor an exact solver for the inner minimization problem; it is therefore applicable to both smooth and non-smooth nonlinear inverse problems. Under certain conditions on $F$ and $A_F$, based on the $\varepsilon$-subdifferential, we develop a detailed convergence analysis of the method in Section 3. The numerical results in Section 4 demonstrate its effectiveness.
The rest of this paper is organized as follows. In Section 2, we give some preliminaries on Banach spaces and convex analysis. In Section 3, we formulate the proposed method and establish its well-definedness and regularization property. Finally, in Section 4, we provide numerical results to demonstrate the effectiveness of the method in dealing with smooth as well as non-smooth inverse problems.

2. Preliminaries

In this section, we review some basic concepts on convex analysis and Banach spaces. More details are available in [23,24,25].
Let $X$ be a Banach space with norm $\|\cdot\|_X$, and let $X^*$ denote its dual space. For $x \in X$ and $x^* \in X^*$, we write $\langle x^*, x \rangle_{X^*, X} = x^*(x)$ for the duality pairing. Given another Banach space $Y$ and a bounded linear operator $A$ from $X$ to $Y$, we write $A^* : Y^* \to X^*$ for its adjoint, i.e., $\langle A^* y^*, x \rangle_{X^*, X} = \langle y^*, A x \rangle_{Y^*, Y}$ for any $x \in X$ and $y^* \in Y^*$. Let $N(A) = \{x \in X : Ax = 0\}$ denote the null space of $A$ and
$$N(A)^\perp := \{\xi \in X^* : \langle \xi, x \rangle = 0 \text{ for all } x \in N(A)\}$$
the annihilator of $N(A)$. When $X$ is reflexive, there holds $N(A)^\perp = \overline{R(A^*)}$, where $\overline{R(A^*)}$ denotes the closure of the range of $A^*$. On a Banach space $X$, for any $r \in (1, \infty)$, the subdifferential of the convex function $x \mapsto \|x\|_X^r / r$ gives
$$J_r^X(x) := \left\{\xi \in X^* : \|\xi\|_{X^*} = \|x\|_X^{r-1} \text{ and } \langle \xi, x \rangle_{X^*, X} = \|x\|_X^r\right\},$$
which is called the duality mapping $J_r^X : X \to 2^{X^*}$ of $X$ with gauge function $t \mapsto t^{r-1}$.
Moreover, for each $1 < r < \infty$, the duality mapping $J_r^X$ is single-valued and uniformly continuous on bounded sets when $X$ is uniformly smooth, in the sense that its modulus of smoothness
$$\rho_X(s) := \frac{1}{2}\sup\left\{\|\bar{x} + x\|_X + \|\bar{x} - x\|_X - 2 : \|\bar{x}\|_X = 1,\ \|x\|_X \le s\right\}$$
satisfies $\lim_{s \searrow 0} \rho_X(s)/s = 0$.
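In concrete sequence or Lebesgue spaces the duality mapping is available in closed form; for $Y = \ell^p$ with $1 < p < \infty$ one has $J_r^Y(y) = \|y\|_p^{r-p}\,|y|^{p-1}\operatorname{sign}(y)$ componentwise. The following minimal sketch (the function name and the finite-dimensional truncation are our own illustrative choices) evaluates this formula:

```python
import numpy as np

def duality_map(y, p=1.5, r=2.0):
    """Duality mapping J_r of (finite-dimensional) l^p, 1 < p < inf:
    J_r(y) = ||y||_p^(r-p) * |y|^(p-1) * sign(y), applied entrywise.
    One checks <J_r(y), y> = ||y||_p^r and ||J_r(y)||_{p*} = ||y||_p^(r-1)."""
    norm = np.linalg.norm(y, ord=p)
    if norm == 0.0:
        return np.zeros_like(y)  # J_r(0) = 0
    return norm ** (r - p) * np.abs(y) ** (p - 1) * np.sign(y)
```

For $p = r = 2$ this reduces to the identity, which is why Hilbert-space Landweber methods contain no explicit duality mapping.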
For a given convex function $\Theta : X \to (-\infty, \infty]$ with effective domain $D(\Theta) := \{x \in X : \Theta(x) < \infty\}$, we call $\Theta$ proper if $D(\Theta) \ne \emptyset$. For $\varepsilon \ge 0$, we define the $\varepsilon$-subdifferential of the function $\Theta$ at $x$ by
$$\partial_\varepsilon\Theta(x) := \left\{\xi \in X^* : \Theta(\bar{x}) \ge \Theta(x) + \langle \xi, \bar{x} - x \rangle_{X^*, X} - \varepsilon \text{ for all } \bar{x} \in X\right\}.$$
Any element $\xi \in \partial_\varepsilon\Theta(x)$ is an $\varepsilon$-subgradient of $\Theta$ at $x$ [24]. When $\varepsilon = 0$, the $\varepsilon$-subdifferential of $\Theta$ coincides with the subdifferential $\partial\Theta$. One can see that $\partial_\varepsilon\Theta(x) \ne \emptyset$ implies $x \in D(\Theta)$. If $\Theta$ is lower semicontinuous, then for any $x \in D(\Theta)$ the $\varepsilon$-subdifferential $\partial_\varepsilon\Theta(x)$ is non-empty for every $\varepsilon > 0$; see [23] (Theorem 2.4.4).
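For orientation, a one-dimensional example: for $\Theta(x) = \frac{1}{2}x^2$ on $\mathbb{R}$, requiring $\frac{1}{2}\bar{x}^2 \ge \frac{1}{2}x^2 + \xi(\bar{x} - x) - \varepsilon$ for all $\bar{x}$ and writing $\xi = x + t$ leads to $\frac{1}{2}(\bar{x} - x)^2 - t(\bar{x} - x) + \varepsilon \ge 0$, i.e., $t^2 \le 2\varepsilon$, so that
$$\partial_\varepsilon\Theta(x) = \left[x - \sqrt{2\varepsilon},\ x + \sqrt{2\varepsilon}\right],$$
which shrinks to the subdifferential $\partial\Theta(x) = \{x\}$ as $\varepsilon \to 0$.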
For $\xi \in \partial_\varepsilon\Theta(x)$ with $\varepsilon \ge 0$, define
$$D_\xi\Theta(\bar{x}, x) := \Theta(\bar{x}) - \Theta(x) - \langle \xi, \bar{x} - x \rangle_{X^*, X} + \varepsilon, \quad \bar{x} \in X, \tag{9}$$
which is called the $\varepsilon$-Bregman distance induced by $\Theta$ at $x$ in the direction $\xi$. Obviously, $D_\xi\Theta(\bar{x}, x) \ge 0$. When $\varepsilon = 0$, the $\varepsilon$-Bregman distance becomes the Bregman distance [24]. A proper convex function $\Theta : X \to (-\infty, \infty]$ is called uniformly convex if there exists a strictly increasing continuous function $\varphi : [0, \infty) \to [0, \infty)$ with $\varphi(0) = 0$ such that
$$\Theta(\lambda\bar{x} + (1 - \lambda)x) + \lambda(1 - \lambda)\varphi(\|\bar{x} - x\|_X) \le \lambda\Theta(\bar{x}) + (1 - \lambda)\Theta(x) \tag{10}$$
for all $\bar{x}, x \in X$ and $\lambda \in [0, 1]$. $\Theta$ is called $p$-convex if $\varphi(t) = c_0 t^p$ for some $c_0 > 0$ and $p \ge 2$; see [23] (Theorem 3.5.10).
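A standard example: in a Hilbert space $X$, the choice $\Theta(x) = \frac{1}{2}\|x\|_X^2$ is $2$-convex with $c_0 = \frac{1}{2}$, since
$$\Theta(\lambda\bar{x} + (1 - \lambda)x) + \frac{1}{2}\lambda(1 - \lambda)\|\bar{x} - x\|_X^2 = \lambda\Theta(\bar{x}) + (1 - \lambda)\Theta(x)$$
holds with equality, and the induced Bregman distance is $D_\xi\Theta(\bar{x}, x) = \frac{1}{2}\|\bar{x} - x\|_X^2$.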
Given a proper, lower semicontinuous, convex function $\Theta : X \to (-\infty, \infty]$, its Legendre-Fenchel conjugate is defined by
$$\Theta^*(\xi) := \sup_{x \in X}\left\{\langle \xi, x \rangle_{X^*, X} - \Theta(x)\right\}, \quad \xi \in X^*.$$
It is easily seen that $\Theta^*$ is also proper, lower semicontinuous, and convex. If, in addition, $X$ is reflexive, then [23] (Theorem 2.4.2)
$$\xi \in \partial_\varepsilon\Theta(x) \iff x \in \partial_\varepsilon\Theta^*(\xi) \iff \Theta(x) + \Theta^*(\xi) \le \langle \xi, x \rangle_{X^*, X} + \varepsilon. \tag{11}$$
The following lemma gives further results of p-convex functionals; refer to [23] (Corollary 3.5.11) and [26] (Lemma 2.1 and Lemma 2.3) for more details.
Lemma 1.
Let $X$ be a reflexive Banach space and let $\Theta : X \to (-\infty, \infty]$ be proper, lower semicontinuous and $p$-convex with $\varphi(t) = c_0 t^p$ for some $c_0 > 0$ and $p \ge 2$, and let $\frac{1}{p} + \frac{1}{p^*} = 1$. Then:
(i)
If $x \in D(\Theta)$ and $\xi \in \partial_\varepsilon\Theta(x)$ for some $\varepsilon \ge 0$, we have
$$c_0\|\bar{x} - x\|_X^p \le 2D_\xi\Theta(\bar{x}, x) + 2\varepsilon, \quad \bar{x} \in X. \tag{12}$$
(ii)
If $x \in D(\Theta)$ and $\xi \in \partial_\varepsilon\Theta(x)$ for some $\varepsilon \ge 0$, then for $\eta \in X^*$ there holds
$$\langle \eta, x - \nabla\Theta^*(\xi) \rangle_{X^*, X} \le \frac{1}{p^*(2c_0)^{p^*-1}}\|\eta\|_{X^*}^{p^*} + \varepsilon \tag{13}$$
and therefore
$$\|x - \nabla\Theta^*(\xi)\|_X^p \le \frac{p}{2c_0}\varepsilon. \tag{14}$$
(iii)
For $x \in D(\Theta)$, $\xi \in \partial_\varepsilon\Theta(x)$ and $\eta \in X^*$, there holds
$$\Theta^*(\eta) - \Theta^*(\xi) - \langle x, \eta - \xi \rangle_{X, X^*} \le \frac{1}{p^*(2c_0)^{p^*-1}}\|\eta - \xi\|_{X^*}^{p^*}. \tag{15}$$
(iv)
$D(\Theta^*) = X^*$, $\Theta^*$ is Fréchet differentiable on $X^*$, and its gradient $\nabla\Theta^* : X^* \to X$ satisfies
$$\|\nabla\Theta^*(\eta) - \nabla\Theta^*(\xi)\|_X \le \left(\frac{\|\eta - \xi\|_{X^*}}{2c_0}\right)^{p^*-1}$$
for all $\eta, \xi \in X^*$.

3. The Method

We consider (1), where $F : D(F) \subset X \to Y$ is a possibly non-smooth nonlinear operator between Banach spaces $X$ and $Y$. To carry out the convergence analysis, we pose the following assumptions on the convex function $\Theta$ and the operators $F$ and $A_F$.
Assumption 1.
$\Theta$ is a proper, weakly lower semicontinuous and uniformly convex function satisfying (10) with $\varphi(t) = c_0 t^p$ for some $c_0 > 0$ and $p \ge 2$.
Assumption 2.
(a) There is $\rho > 0$ such that $B_{2\rho}(x_0) \subset D(F)$, where $B_\rho(x_0) := \{x \in X : \|x - x_0\|_X \le \rho\}$. Equation (1) has a solution $x^*$ in $D(\Theta)$ satisfying
$$D_{\xi_0}\Theta(x^*, x_0) \le \frac{1}{4}c_0\rho^p. \tag{16}$$
(b) $F$ is weakly closed on $D(F)$, i.e., for any sequence $\{x_n\} \subset D(F)$ with $x_n \rightharpoonup x$ in $X$ and $F(x_n) \rightharpoonup v$ in $Y$, there hold $x \in D(F)$ and $F(x) = v$.
(c) There is a constant $0 \le \eta < 1$ such that
$$\|F(\bar{x}) - F(x) - A_F(\bar{x} - x)\|_Y \le \eta\|F(\bar{x}) - F(x)\|_Y \tag{17}$$
for all $\bar{x}, x \in B_{2\rho}(x_0) \cap D(F)$. Moreover, there is a constant $\hat{B} > 0$ such that $\|A_F\| \le \hat{B}$.
In Assumption 2, condition (17) can be viewed as a variant of the tangential cone condition widely used in nonlinear regularization methods [1,25,27]. When $X$ is reflexive, by the weak closedness of $F$ and the lower semi-continuity and $p$-convexity of $\Theta$, one can show that $x^\dagger$ in (2) exists. Moreover, it has been shown in [28] (Lemma 3.2) that $x^\dagger$ is uniquely defined. Note that our generalized inexact Newton-Landweber iteration (8) involves, in each iteration step, an inexact inner solver for the minimization problem
$$x := \arg\min_{z \in X}\left\{\Theta(z) - \langle \xi, z \rangle_{X^*, X}\right\} \tag{18}$$
for any $\xi \in X^*$. Concerning the inexact resolution of (18), we make the following assumption.
Assumption 3.
For any given $\varepsilon \ge 0$, there is a procedure $S_\varepsilon : X^* \to X$ for solving (18) such that for any $\xi \in X^*$ the element $x := S_\varepsilon(\xi)$ satisfies
$$\Theta(x) - \langle \xi, x \rangle_{X^*, X} \le \min_{z \in X}\left\{\Theta(z) - \langle \xi, z \rangle_{X^*, X}\right\} + \varepsilon. \tag{19}$$
Moreover, for each $\varepsilon \ge 0$, the mapping $S_\varepsilon : X^* \to X$ is continuous.
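For illustration, take $X = \ell^2$ and the sparsity-promoting penalty $\Theta(x) = \frac{1}{2}\|x\|_2^2 + \alpha\|x\|_1$. Then the minimization in (18) decouples componentwise and is solved exactly by soft-thresholding, so the sketch below (the function name and the value of $\alpha$ are our own assumptions) realizes $S_\varepsilon$ with $\varepsilon = 0$ and hence satisfies (19) for every $\varepsilon \ge 0$; for TV-like penalties no closed form exists and an iterative solver such as PDHG (used in Section 4) yields a genuinely inexact $S_\varepsilon$.

```python
import numpy as np

def S(xi, alpha=0.1):
    """Exact solver of (18) for Theta(z) = 0.5*||z||_2^2 + alpha*||z||_1:
    argmin_z { Theta(z) - <xi, z> } is componentwise soft-thresholding,
    so Assumption 3 holds here with eps = 0."""
    return np.sign(xi) * np.maximum(np.abs(xi) - alpha, 0.0)
```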
We summarize the generalized inexact Newton-Landweber iteration in Algorithm 1.
Algorithm 1 Generalized inexact Newton-Landweber iteration for noisy data
Input: Parameters $\eta < \gamma < 1$, $\mu_0, \mu_1 > 0$, $k_{\max} \ge 1$, $\sigma > 0$ and $\tau > 1$; a sequence of positive numbers $\{\varepsilon_{n,k}\}_{n \ge 0, k \ge 0}$ satisfying $\sum_{n=0}^{\infty}\sum_{k=0}^{\infty}\varepsilon_{n,k} < \infty$; the operator $A_F$.
Initial guess: $x_0 \in X$ and $\xi_0 \in \partial\Theta(x_0)$.
  Set $n = 0$.
  while $\|F(x_n^\delta) - y^\delta\|_Y > \tau\delta$ do
    (i)
Set $\xi_{n,0}^\delta := \xi_n^\delta$ and $x_{n,0}^\delta := x_n^\delta$; construct the inner iterates $\{(\xi_{n,k}^\delta, x_{n,k}^\delta)\}$ by
$$\xi_{n,k+1}^\delta = \xi_{n,k}^\delta + \mu_{n,k}^\delta A_F^* J_r^Y(s_{n,k}^\delta), \qquad x_{n,k+1}^\delta = S_{\varepsilon_{n,k+1}}(\xi_{n,k+1}^\delta), \tag{20}$$
where $s_{n,k}^\delta = y^\delta - F(x_n^\delta) - A_F(x_{n,k}^\delta - x_n^\delta)$ and
$$\mu_{n,k}^\delta = \tilde{\mu}_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{1-\frac{r}{p}} \tag{21}$$
with
$$\tilde{\mu}_{n,k}^\delta = \min\left\{\frac{\mu_0\|s_{n,k}^\delta\|_Y^{p(r-1)}}{\|A_F^* J_r^Y(s_{n,k}^\delta)\|_{X^*}^p},\ \mu_1\right\}.$$
    (ii)
Take $k_n^\delta = \min\{\tilde{k}_n^\delta, k_{\max}\}$, where $k_{\max} \ge 1$ is a given integer and $\tilde{k}_n^\delta \ge 1$ is the first integer satisfying
$$\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k} < \left(\gamma\|y^\delta - F(x_n^\delta)\|_Y\right)^p. \tag{22}$$
Update the outer iterates by
$$\xi_{n+1}^\delta = \xi_{n,k_n^\delta}^\delta \quad \text{and} \quad x_{n+1}^\delta = x_{n,k_n^\delta}^\delta.$$
Set $n := n + 1$.
end while
We denote by $n_\delta = n(\delta, y^\delta)$ the outer stopping index at which the discrepancy principle is fulfilled, i.e.,
$$\|F(x_{n_\delta}^\delta) - y^\delta\|_Y \le \tau\delta < \|F(x_n^\delta) - y^\delta\|_Y, \quad 0 \le n < n_\delta. \tag{23}$$
Output: An approximate solution $x_{n_\delta}^\delta$ of (1).
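To fix ideas, the following sketch implements Algorithm 1 in the simplest Hilbert-space setting $p = r = 2$, $\Theta(x) = \frac{1}{2}\|x\|^2$ and $\varepsilon_{n,k} = 0$, so that $S_\varepsilon$ and $J_r^Y$ are identities and the inner minimization is exact. The forward map F, the matrix A standing in for $A_F$, and all names are our own illustrative assumptions rather than part of the paper.

```python
import numpy as np

def gin_landweber(F, A, y_delta, x0, delta, tau=1.5, gamma=0.9,
                  mu0=0.5, mu1=1000.0, kmax=100, nmax=50):
    """Sketch of Algorithm 1 for p = r = 2, Theta(x) = 0.5*||x||^2,
    eps_{n,k} = 0.  F: callable forward map, A: ndarray playing the
    role of the fixed operator A_F."""
    x = x0.copy()
    for n in range(nmax):
        res = y_delta - F(x)
        if np.linalg.norm(res) <= tau * delta:   # discrepancy principle (23)
            return x
        xk = x.copy()
        for k in range(kmax):
            s = res - A @ (xk - x)               # inner residual s_{n,k}
            if np.linalg.norm(s) < gamma * np.linalg.norm(res):  # rule (22)
                break
            g = A.T @ s                          # A_F^* J_r^Y(s_{n,k})
            gn = np.linalg.norm(g)
            if gn == 0.0:
                break
            mu = min(mu0 * np.linalg.norm(s)**2 / gn**2, mu1)  # step (21)
            xk = xk + mu * g                     # xi-update; here x = S_0(xi) = xi
        x = xk                                   # outer update
    return x
```

The inner loop mirrors (20)–(22): it drives the residual of the local linear model below the fraction $\gamma$ of the outer residual, taking at most kmax steps.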
By the definition of $x_{n,k}^\delta$ in (20), we have
$$\Theta(x_{n,k}^\delta) - \langle \xi_{n,k}^\delta, x_{n,k}^\delta \rangle_{X^*, X} \le \Theta(x) - \langle \xi_{n,k}^\delta, x \rangle_{X^*, X} + \varepsilon_{n,k}, \quad x \in X,$$
which implies that $\xi_{n,k}^\delta \in \partial_{\varepsilon_{n,k}}\Theta(x_{n,k}^\delta)$. This fact will be used in the forthcoming theoretical analysis. The following lemma shows that Algorithm 1 is well-defined.
Lemma 2.
Let $X$ be reflexive and $Y$ uniformly smooth, and assume that Assumptions 1, 2 and 3 hold. Let $\{\varepsilon_{n,k}\}_{n \ge 0, k \ge 0}$ be a sequence of positive numbers satisfying
$$\sum_{n=0}^{\infty}\sum_{k=0}^{\infty}\varepsilon_{n,k} < \frac{1}{16}c_0\rho^p. \tag{24}$$
Let $\beta > 1$, $\eta < \gamma < 1$, $\mu_0 > 0$ and $\tau > 1$ be chosen such that
$$c_1 := \frac{1}{\beta} - \frac{\eta}{\gamma} - \frac{1+\eta}{\tau\gamma} - \frac{p-1}{p}\left(\frac{\mu_0}{2c_0}\right)^{\frac{1}{p-1}} > 0. \tag{25}$$
Then, there holds:
(i)
for each $0 \le n < n_\delta$, $\tilde{k}_n^\delta < \infty$ and $x_{n,k}^\delta \in B_{2\rho}(x_0)$ for all $0 \le k \le k_n^\delta$;
(ii)
Algorithm 1 terminates after $n_\delta < \infty$ iteration steps;
(iii)
for any solution $\hat{x}$ of (1) in $B_{2\rho}(x_0) \cap D(\Theta)$, we have
$$D_{\xi_{n+1}^\delta}\Theta(\hat{x}, x_{n+1}^\delta) \le D_{\xi_n^\delta}\Theta(\hat{x}, x_n^\delta) + 3\sum_{k=0}^{k_n^\delta-1}\varepsilon_{n,k}$$
for $0 \le n < n_\delta$. Here we may take $\varepsilon_{0,0} = 0$ since $\xi_0 \in \partial\Theta(x_0)$, and we write $\varepsilon_n := \varepsilon_{n,0}$.
Proof. 
We first show that if $x_n^\delta \in B_{2\rho}(x_0)$ for some $0 \le n < n_\delta$, then
$$D_{\xi_{n,k+1}^\delta}\Theta(\hat{x}, x_{n,k+1}^\delta) - D_{\xi_{n,k}^\delta}\Theta(\hat{x}, x_{n,k}^\delta) \le -c_1\mu_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} + 2\varepsilon_{n,k} + \varepsilon_{n,k+1} \tag{26}$$
for $0 \le k < k_n^\delta$. Using the definition (9) of the $\varepsilon$-Bregman distance, we arrive at, for $0 \le k < k_n^\delta$,
$$\begin{aligned} D_{\xi_{n,k+1}^\delta}\Theta(\hat{x}, x_{n,k+1}^\delta) - D_{\xi_{n,k}^\delta}\Theta(\hat{x}, x_{n,k}^\delta) ={}& \left[\Theta(x_{n,k}^\delta) - \langle\xi_{n,k}^\delta, x_{n,k}^\delta\rangle_{X^*,X} - \varepsilon_{n,k}\right] + \left[\langle\xi_{n,k+1}^\delta, x_{n,k+1}^\delta\rangle_{X^*,X} - \Theta(x_{n,k+1}^\delta)\right] \\ & - \langle\xi_{n,k+1}^\delta - \xi_{n,k}^\delta, \hat{x}\rangle_{X^*,X} + \varepsilon_{n,k+1}. \end{aligned}$$
Using (11) and the definition of $\Theta^*$, we further have
$$\begin{aligned} D_{\xi_{n,k+1}^\delta}\Theta(\hat{x}, x_{n,k+1}^\delta) - D_{\xi_{n,k}^\delta}\Theta(\hat{x}, x_{n,k}^\delta) \le{}& \Theta^*(\xi_{n,k+1}^\delta) - \Theta^*(\xi_{n,k}^\delta) - \langle\xi_{n,k+1}^\delta - \xi_{n,k}^\delta, \hat{x}\rangle_{X^*,X} + \varepsilon_{n,k+1} \\ ={}& \Theta^*(\xi_{n,k+1}^\delta) - \Theta^*(\xi_{n,k}^\delta) - \langle\xi_{n,k+1}^\delta - \xi_{n,k}^\delta, \nabla\Theta^*(\xi_{n,k}^\delta)\rangle_{X^*,X} \\ & + \langle\xi_{n,k+1}^\delta - \xi_{n,k}^\delta, \nabla\Theta^*(\xi_{n,k}^\delta) - x_{n,k}^\delta\rangle_{X^*,X} + \langle\xi_{n,k+1}^\delta - \xi_{n,k}^\delta, x_{n,k}^\delta - \hat{x}\rangle_{X^*,X} + \varepsilon_{n,k+1}. \end{aligned}$$
Since $\Theta$ is $p$-convex, we may use (13), (15) and the definition of $\xi_{n,k+1}^\delta$ to derive that
$$\begin{aligned} D_{\xi_{n,k+1}^\delta}\Theta(\hat{x}, x_{n,k+1}^\delta) - D_{\xi_{n,k}^\delta}\Theta(\hat{x}, x_{n,k}^\delta) \le{}& \frac{2}{p^*(2c_0)^{p^*-1}}(\mu_{n,k}^\delta)^{p^*}\|A_F^* J_r^Y(s_{n,k}^\delta)\|_{X^*}^{p^*} \\ & + \mu_{n,k}^\delta\langle J_r^Y(s_{n,k}^\delta), A_F(x_{n,k}^\delta - \hat{x})\rangle_{Y^*,Y} + \varepsilon_{n,k} + \varepsilon_{n,k+1}. \end{aligned}$$
Note that $A_F(x_{n,k}^\delta - \hat{x}) = -s_{n,k}^\delta + [y^\delta - F(x_n^\delta) - A_F(\hat{x} - x_n^\delta)]$; in view of (3), Assumption 2(c) and the properties of $J_r^Y$, we further have
$$\begin{aligned} D_{\xi_{n,k+1}^\delta}\Theta(\hat{x}, x_{n,k+1}^\delta) - D_{\xi_{n,k}^\delta}\Theta(\hat{x}, x_{n,k}^\delta) \le{}& \frac{2}{p^*(2c_0)^{p^*-1}}(\mu_{n,k}^\delta)^{p^*}\|A_F^* J_r^Y(s_{n,k}^\delta)\|_{X^*}^{p^*} - \mu_{n,k}^\delta\|s_{n,k}^\delta\|_Y^r \\ & + \mu_{n,k}^\delta\|s_{n,k}^\delta\|_Y^{r-1}\left((1+\eta)\delta + \eta\|y^\delta - F(x_n^\delta)\|_Y\right) + \varepsilon_{n,k} + \varepsilon_{n,k+1}. \end{aligned} \tag{27}$$
By the definition of $\mu_{n,k}^\delta$, it follows that
$$\begin{aligned} (\mu_{n,k}^\delta)^{p^*}\|A_F^* J_r^Y(s_{n,k}^\delta)\|_{X^*}^{p^*} &= \mu_{n,k}^\delta(\mu_{n,k}^\delta)^{p^*-1}\|A_F^* J_r^Y(s_{n,k}^\delta)\|_{X^*}^{p^*} \\ &\le \mu_0^{p^*-1}\mu_{n,k}^\delta\|s_{n,k}^\delta\|_Y^{p^*(r-1)}\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{(p-r)(p^*-1)}{p}} \le \mu_0^{p^*-1}\mu_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}}. \end{aligned} \tag{28}$$
Using again the definition of $\mu_{n,k}^\delta$, together with (22) and (23), we derive that
$$\begin{aligned} \mu_{n,k}^\delta\|s_{n,k}^\delta\|_Y^{r-1}\left((1+\eta)\delta + \eta\|y^\delta - F(x_n^\delta)\|_Y\right) &\le \left(\frac{1+\eta}{\tau} + \eta\right)\mu_{n,k}^\delta\|s_{n,k}^\delta\|_Y^{r-1}\|y^\delta - F(x_n^\delta)\|_Y \\ &\le \left(\frac{1+\eta}{\gamma\tau} + \frac{\eta}{\gamma}\right)\mu_{n,k}^\delta\|s_{n,k}^\delta\|_Y^{r-1}\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{1}{p}} \\ &\le \left(\frac{1+\eta}{\gamma\tau} + \frac{\eta}{\gamma}\right)\mu_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}}. \end{aligned} \tag{29}$$
Analogous to the proof of [26] (Lemma 3.1), we can show that
$$\mu_{n,k}^\delta\|s_{n,k}^\delta\|_Y^r \ge \frac{1}{\beta}\mu_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} - \kappa\tilde{\mu}_{n,k}^\delta\sigma\varepsilon_{n,k},$$
where $\sigma > 0$ and $\kappa\mu_1\sigma \le 1$ with
$$\kappa = \begin{cases} 1, & \text{if } p \ge r, \\ \left(\beta^{\frac{p}{r-p}} - 1\right)^{\frac{p-r}{p}}, & \text{if } p < r. \end{cases}$$
By inserting the above inequality, (28) and (29) into (27) and using $\tilde{\mu}_{n,k}^\delta \le \mu_1$, we obtain the estimate
$$\begin{aligned} D_{\xi_{n,k+1}^\delta}\Theta(\hat{x}, x_{n,k+1}^\delta) - D_{\xi_{n,k}^\delta}\Theta(\hat{x}, x_{n,k}^\delta) &\le -c_1\mu_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} + \kappa\tilde{\mu}_{n,k}^\delta\sigma\varepsilon_{n,k} + \varepsilon_{n,k} + \varepsilon_{n,k+1} \\ &\le -c_1\mu_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} + 2\varepsilon_{n,k} + \varepsilon_{n,k+1}, \end{aligned}$$
which yields the assertion (26). By summing (26) over $k$ from $k = 0$ to $k = l$ for any $l < \tilde{k}_n^\delta$, we have
$$c_1\sum_{k=0}^{l}\mu_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} \le D_{\xi_n^\delta}\Theta(\hat{x}, x_n^\delta) - D_{\xi_{n,l+1}^\delta}\Theta(\hat{x}, x_{n,l+1}^\delta) + 3\sum_{k=0}^{l}\varepsilon_{n,k}. \tag{30}$$
In view of Assumption 2(c) and the definition of $\mu_{n,k}^\delta$, we have
$$\mu_{n,k}^\delta \ge c_2\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{1-\frac{r}{p}} \quad \text{with } c_2 := \min\left\{\frac{\mu_0}{\hat{B}^p}, \mu_1\right\},$$
which together with (22) and (23) gives
$$\mu_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} \ge c_2\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right) \ge c_2\gamma^p\|y^\delta - F(x_n^\delta)\|_Y^p \ge c_2\gamma^p\tau^p\delta^p. \tag{31}$$
By inserting (31) into (30), there holds
$$c_1 c_2\gamma^p\tau^p\delta^p(l+1) \le D_{\xi_n^\delta}\Theta(\hat{x}, x_n^\delta) + 3\sum_{k=0}^{l}\varepsilon_{n,k} < \infty, \quad 0 \le l < \tilde{k}_n^\delta,$$
which implies that $\tilde{k}_n^\delta < \infty$. By taking $l = k_n^\delta - 1$ (recall that $k_n^\delta = \min\{\tilde{k}_n^\delta, k_{\max}\}$) in (30), we obtain
$$c_1\sum_{k=0}^{k_n^\delta-1}\mu_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} \le D_{\xi_n^\delta}\Theta(\hat{x}, x_n^\delta) - D_{\xi_{n+1}^\delta}\Theta(\hat{x}, x_{n+1}^\delta) + 3\sum_{k=0}^{k_n^\delta-1}\varepsilon_{n,k}, \tag{32}$$
which yields assertion (iii).
In view of (16) and (12), we have $\|x_0 - x^\dagger\|_X \le \rho$, i.e., $x^\dagger$ is a solution of (1) in $B_\rho(x_0)$. Then, by applying (26) with $\hat{x} = x^\dagger$, we can inductively deduce that
$$D_{\xi_{n,k}^\delta}\Theta(x^\dagger, x_{n,k}^\delta) \le D_{\xi_0}\Theta(x^\dagger, x_0) + 3\sum_{n=0}^{\infty}\sum_{k=0}^{k_n^\delta-1}\varepsilon_{n,k}.$$
Using (12) and (24), together with (16), we obtain
$$c_0\|x_{n,k}^\delta - x^\dagger\|_X^p \le 2D_{\xi_{n,k}^\delta}\Theta(x^\dagger, x_{n,k}^\delta) + 2\varepsilon_{n,k} \le 2D_{\xi_0}\Theta(x^\dagger, x_0) + 8\sum_{n=0}^{\infty}\sum_{k=0}^{k_n^\delta-1}\varepsilon_{n,k} \le \frac{1}{2}c_0\rho^p + \frac{1}{2}c_0\rho^p = c_0\rho^p,$$
which gives $\|x_{n,k}^\delta - x^\dagger\|_X \le \rho$, and thus $\|x_{n,k}^\delta - x_0\|_X \le 2\rho$, i.e., $x_{n,k}^\delta \in B_{2\rho}(x_0)$ for all $0 \le n < n_\delta$ and $0 \le k \le k_n^\delta$.
Finally, we prove that $n_\delta < \infty$. By summing (32) over $n$ from $n = 0$ to $n = m$ for any $m < n_\delta$, and using (31), we further obtain
$$c_1 c_2\gamma^p\tau^p\delta^p(m+1) \le c_1\sum_{n=0}^{m}\sum_{k=0}^{k_n^\delta-1}\mu_{n,k}^\delta\left(\|s_{n,k}^\delta\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} \le D_{\xi_0}\Theta(\hat{x}, x_0) + 3\sum_{n=0}^{\infty}\sum_{k=0}^{k_n^\delta-1}\varepsilon_{n,k} < \infty,$$
which yields $n_\delta < \infty$. □

3.1. Convergence Analysis

To carry out the convergence analysis of Algorithm 1, it is necessary to consider its counterpart with exact data. The algorithm for the noise-free case is reformulated as follows. Noting that the inner iteration number k n might not be unique (see Lemma 3 below), using different integer k n to update the outer iteration may lead to different iterative sequences ξ n , x n . Next, let Γ γ , μ 0 , μ 1 ξ 0 , x 0 denote the set of all possible sequences ξ n , x n generated by Algorithm 2 from ξ 0 , x 0 with k n chosen as in (35).
Algorithm 2 Generalized inexact Newton-Landweber iteration for exact data
Input: Parameters $\eta < \gamma < 1$, $\mu_0, \mu_1 > 0$, $k_{\max} \ge 1$ and $\sigma > 0$; a sequence of positive numbers $\{\varepsilon_{n,k}\}_{n \ge 0, k \ge 0}$ satisfying $\sum_{n=0}^{\infty}\sum_{k=0}^{\infty}\varepsilon_{n,k} < \infty$; the operator $A_F$.
Initial guess: $x_0 \in X$ and $\xi_0 \in \partial\Theta(x_0)$.
  Set $n = 0$.
  Repeat:
    (i)
Assuming that $(\xi_n, x_n)$ has been constructed, define the inner iterates $\{(\xi_{n,k}, x_{n,k})\}$ by setting $\xi_{n,0} = \xi_n$, $x_{n,0} = x_n$ and
$$\xi_{n,k+1} = \xi_{n,k} + \mu_{n,k}A_F^* J_r^Y(s_{n,k}), \qquad x_{n,k+1} = S_{\varepsilon_{n,k+1}}(\xi_{n,k+1}), \tag{33}$$
where $s_{n,k} = y - F(x_n) - A_F(x_{n,k} - x_n)$ and
$$\mu_{n,k} = \begin{cases} \tilde{\mu}_{n,k}\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right)^{1-\frac{r}{p}}, & \text{if } F(x_n) \ne y, \\ 0, & \text{otherwise}, \end{cases} \tag{34}$$
with
$$\tilde{\mu}_{n,k} = \min\left\{\frac{\mu_0\|s_{n,k}\|_Y^{p(r-1)}}{\|A_F^* J_r^Y(s_{n,k})\|_{X^*}^p},\ \mu_1\right\}.$$
    (ii)
Determine an integer $1 \le k_n \le k_{\max}$ satisfying
$$\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k} \ge \left(\gamma\|y - F(x_n)\|_Y\right)^p, \quad 0 \le k < k_n, \tag{35}$$
and define
$$\xi_{n+1} = \xi_{n,k_n} \quad \text{and} \quad x_{n+1} = x_{n,k_n}.$$
Set $n := n + 1$.
Until a stopping criterion is satisfied.
The following lemma shows the well-definedness of the sequences $\{(\xi_n, x_n)\} \in \Gamma_{\gamma,\mu_0,\mu_1}(\xi_0, x_0)$.
Lemma 3.
Let all the conditions in Lemma 2 hold. Then, for any sequence $\{(\xi_n, x_n)\} \in \Gamma_{\gamma,\mu_0,\mu_1}(\xi_0, x_0)$, we have $x_{n,k} \in B_{2\rho}(x_0)$ for all $n \ge 0$, and $k_n$ is well-defined. Moreover, for any solution $\hat{x}$ of (1) in $B_{2\rho}(x_0) \cap D(\Theta)$, there hold
$$c_3\sum_{k=0}^{k_n-1}\mu_{n,k}\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} \le D_{\xi_n}\Theta(\hat{x}, x_n) - D_{\xi_{n+1}}\Theta(\hat{x}, x_{n+1}) + 3\sum_{k=0}^{k_n-1}\varepsilon_{n,k} \tag{36}$$
and
$$\sum_{n=0}^{\infty}\sum_{k=0}^{k_n-1}\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right) < \infty \tag{37}$$
for all $n \ge 0$, with $c_3 := \frac{1}{\beta} - \frac{\eta}{\gamma} - \frac{p-1}{p}\left(\frac{\mu_0}{2c_0}\right)^{\frac{1}{p-1}} > 0$.
Proof. 
Proceeding as in the proof of Lemma 3.1 in [6], we immediately obtain (36). Summing (36) over all $n \ge 0$, we establish that
$$c_3\sum_{n=0}^{\infty}\sum_{k=0}^{k_n-1}\mu_{n,k}\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} \le D_{\xi_0}\Theta(\hat{x}, x_0) + 3\sum_{n=0}^{\infty}\sum_{k=0}^{k_n-1}\varepsilon_{n,k} < \infty. \tag{38}$$
By the definition of $\mu_{n,k}$, we have $\mu_{n,k}(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k})^{\frac{r}{p}} \ge c_2(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k})$, from which we obtain (37).
Next, we show that there exists $1 \le k_n \le k_{\max}$ such that (35) holds. If $F(x_n) = y$ for some $n$, then (34) yields $\mu_{n,k} = 0$ for all $k \ge 0$, and it follows from the definition of $\xi_{n,k+1}$ in (33) that $\xi_{n,k+1} = \xi_{n,k} = \xi_n$ for all $k \ge 0$. Since $S_\varepsilon$ is continuous, we have
$$x_{n,k+1} = S_{\varepsilon_{n,k+1}}(\xi_{n,k+1}) = S_{\varepsilon_{n,k}}(\xi_{n,k}) = x_{n,k} = x_n, \quad k \ge 0. \tag{39}$$
Therefore, (35) holds for any $1 \le k_n \le k_{\max}$. Now suppose that $F(x_n) \ne y$; in view of the definition of $\mu_{n,k}$ and (35), there holds, for $0 \le k < k_n$,
$$\mu_{n,k}\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} \ge c_2\gamma^p\|y - F(x_n)\|_Y^p. \tag{40}$$
Combining this with (36), we obtain
$$c_3 c_2\gamma^p k_n\|y - F(x_n)\|_Y^p \le c_3\sum_{k=0}^{k_n-1}\mu_{n,k}\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{r}{p}} \le D_{\xi_n}\Theta(\hat{x}, x_n) - D_{\xi_{n+1}}\Theta(\hat{x}, x_{n+1}) + 3\sum_{k=0}^{k_n-1}\varepsilon_{n,k} \le D_{\xi_n}\Theta(\hat{x}, x_n) + 3\sum_{k=0}^{k_n-1}\varepsilon_{n,k} < \infty,$$
which shows that $k_n$ is bounded above by some integer $\tilde{k}_n$. Therefore, (35) holds for any $k_n$ satisfying $1 \le k_n \le \min\{\tilde{k}_n, k_{\max}\}$. □
We next show the convergence of the sequences $\{(\xi_n, x_n)\} \in \Gamma_{\gamma,\mu_0,\mu_1}(\xi_0, x_0)$. The following proposition will be useful for the forthcoming convergence analysis.
Proposition 1.
Let all the conditions in Lemma 2 hold. Then, for any solution $\hat{x}$ of (1) in $B_{2\rho}(x_0) \cap D(\Theta)$ and any sequence $\{(\xi_n, x_n)\} \in \Gamma_{\gamma,\mu_0,\mu_1}(\xi_0, x_0)$, the sequence $\{D_{\xi_n}\Theta(\hat{x}, x_n)\}$ is convergent.
Proof. 
Please refer to [26] (Lemma 3.3) for a detailed proof. □
Theorem 1.
Let $X$ be reflexive and $Y$ uniformly smooth, and assume that Assumptions 1, 2 and 3 hold. Then, for any sequence $\{(\xi_n, x_n)\} \in \Gamma_{\gamma,\mu_0,\mu_1}(\xi_0, x_0)$, there is a solution $x^*$ of (1) in $B_{2\rho}(x_0) \cap D(\Theta)$ such that
$$\lim_{n\to\infty}\|x_n - x^*\|_X = 0 \quad \text{and} \quad \lim_{n\to\infty}D_{\xi_n}\Theta(x^*, x_n) = 0.$$
Moreover, there holds $x^* = x^\dagger$.
Proof. 
We first prove that $\{x_n\}$ has a convergent subsequence. By using (38), (40) and the fact that $k_n \ge 1$, we have
$$c_3 c_2\gamma^p\sum_{n=0}^{\infty}\|y - F(x_n)\|_Y^p \le D_{\xi_0}\Theta(\hat{x}, x_0) + 3\sum_{n=0}^{\infty}\sum_{k=0}^{k_n-1}\varepsilon_{n,k} < \infty.$$
Consequently, $\lim_{n\to\infty}\|F(x_n) - y\|_Y = 0$. If $F(x_n) = y$ for some $n$, by using the argument in deriving (39) repeatedly, we can show that $F(x_m) = y$ for all $m \ge n$. Therefore, we can find a strictly increasing sequence of indices $\{n_l\}$ such that $n_0 = 0$ and, for $l \ge 1$, $n_l$ is the first integer satisfying
$$n_l \ge n_{l-1} + 1 \quad \text{and} \quad \|F(x_{n_l}) - y\|_Y \le \|F(x_{n_{l-1}}) - y\|_Y.$$
For such a sequence $\{n_l\}$, it is easy to see that
$$\|F(x_{n_l}) - y\|_Y \le \|F(x_n) - y\|_Y, \quad 0 \le n \le n_l. \tag{41}$$
Next, we show that $\{x_{n_l}\}$ is a Cauchy sequence. For any solution $\hat{x}$ of (1) in $B_{2\rho}(x_0) \cap D(\Theta)$, by the definition of the $\varepsilon$-Bregman distance we have, for $0 \le j < l < \infty$,
$$D_{\xi_{n_j}}\Theta(x_{n_l}, x_{n_j}) = D_{\xi_{n_j}}\Theta(\hat{x}, x_{n_j}) - D_{\xi_{n_l}}\Theta(\hat{x}, x_{n_l}) + \langle\xi_{n_l} - \xi_{n_j}, x_{n_l} - \hat{x}\rangle_{X^*,X} + \varepsilon_{n_l}. \tag{42}$$
From Proposition 1, one can see that $D_{\xi_{n_j}}\Theta(\hat{x}, x_{n_j}) - D_{\xi_{n_l}}\Theta(\hat{x}, x_{n_l})$ tends to zero as $l, j \to \infty$. We next estimate the term $\langle\xi_{n_l} - \xi_{n_j}, x_{n_l} - \hat{x}\rangle_{X^*,X}$ for $0 \le j < l < \infty$. Using the facts that $\xi_n = \xi_{n,0}$ and $\xi_{n+1} = \xi_{n,k_n}$, we have
$$\langle\xi_{n_l} - \xi_{n_j}, x_{n_l} - \hat{x}\rangle_{X^*,X} = \sum_{n=n_j}^{n_l-1}\langle\xi_{n+1} - \xi_n, x_{n_l} - \hat{x}\rangle_{X^*,X} = \sum_{n=n_j}^{n_l-1}\langle\xi_{n,k_n} - \xi_{n,0}, x_{n_l} - \hat{x}\rangle_{X^*,X}. \tag{43}$$
By the definition of $\xi_{n,k}$ and $\mu_{n,k}$ and the properties of $J_r^Y$, we further have
$$\begin{aligned} \langle\xi_{n,k_n} - \xi_{n,0}, x_{n_l} - \hat{x}\rangle_{X^*,X} &= \sum_{k=0}^{k_n-1}\langle\xi_{n,k+1} - \xi_{n,k}, x_{n_l} - \hat{x}\rangle_{X^*,X} = \sum_{k=0}^{k_n-1}\mu_{n,k}\langle J_r^Y(s_{n,k}), A_F(x_{n_l} - \hat{x})\rangle_{Y^*,Y} \\ &\le \sum_{k=0}^{k_n-1}\mu_{n,k}\|s_{n,k}\|_Y^{r-1}\|A_F(x_{n_l} - \hat{x})\|_Y \le \sum_{k=0}^{k_n-1}\mu_1\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right)^{1-\frac{1}{p}}\|A_F(x_{n_l} - \hat{x})\|_Y. \end{aligned} \tag{44}$$
Using (41), (35) and Assumption 2(c), we have
$$\begin{aligned} \|A_F(x_{n_l} - \hat{x})\|_Y &\le \|A_F(x_{n_l} - x_n)\|_Y + \|A_F(x_n - \hat{x})\|_Y \le (1+\eta)\left(\|F(x_{n_l}) - F(x_n)\|_Y + \|F(x_n) - y\|_Y\right) \\ &\le (1+\eta)\left(\|F(x_{n_l}) - y\|_Y + 2\|F(x_n) - y\|_Y\right) \le 3(1+\eta)\|F(x_n) - y\|_Y \le \frac{3(1+\eta)}{\gamma}\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right)^{\frac{1}{p}} \end{aligned}$$
for $0 \le n \le n_l$ and $0 \le k < k_n$. By inserting the above inequality into (44), there holds
$$\langle\xi_{n,k_n} - \xi_{n,0}, x_{n_l} - \hat{x}\rangle_{X^*,X} \le \frac{3(1+\eta)\mu_1}{\gamma}\sum_{k=0}^{k_n-1}\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right).$$
Combining this with (43), we deduce that
$$\left|\langle\xi_{n_l} - \xi_{n_j}, x_{n_l} - \hat{x}\rangle_{X^*,X}\right| \le \frac{3(1+\eta)\mu_1}{\gamma}\sum_{n=n_j}^{n_l-1}\sum_{k=0}^{k_n-1}\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right). \tag{45}$$
Together with (37), it follows that
$$\lim_{j\to\infty}\sup_{l\ge j}\left|\langle\xi_{n_l} - \xi_{n_j}, x_{n_l} - \hat{x}\rangle_{X^*,X}\right| = 0.$$
Since $\lim_{n\to\infty}\varepsilon_n = 0$, we conclude from (42) that
$$D_{\xi_{n_j}}\Theta(x_{n_l}, x_{n_j}) \to 0 \quad \text{as } j, l \to \infty.$$
By the $p$-convexity of $\Theta$, there also holds $\|x_{n_l} - x_{n_j}\|_X \to 0$ as $j, l \to \infty$. Therefore, $\{x_{n_l}\}$ is a Cauchy sequence in $X$, and there exists some $x^* \in X$ such that $x_{n_l} \to x^*$ as $l \to \infty$. Since $\lim_{n\to\infty}\|F(x_n) - y\|_Y = 0$, it follows from the continuity of $F$ that $F(x^*) = y$.
We next prove that $x^* \in B_{2\rho}(x_0) \cap D(\Theta)$. Since $\{x_{n_l}\} \subset B_{2\rho}(x_0)$, we must have $x^* \in B_{2\rho}(x_0)$. By $\xi_{n_l} \in \partial_{\varepsilon_{n_l}}\Theta(x_{n_l})$, there holds
$$\Theta(x_{n_l}) \le \Theta(\hat{x}) + \langle\xi_{n_l}, x_{n_l} - \hat{x}\rangle_{X^*,X} + \varepsilon_{n_l}. \tag{46}$$
Using (45) and $x_{n_l} \to x^*$, we can find a constant $C_0$ such that
$$\left|\langle\xi_{n_l} - \xi_{n_0}, x_{n_l} - \hat{x}\rangle_{X^*,X}\right| \le C_0 \quad \text{and} \quad \left|\langle\xi_{n_0}, x_{n_l} - \hat{x}\rangle_{X^*,X}\right| \le C_0 \quad \text{for all } l.$$
Therefore, $|\langle\xi_{n_l}, x_{n_l} - \hat{x}\rangle_{X^*,X}| \le 2C_0$ for all $l$. Since $\Theta$ is lower semicontinuous and $\lim_{n\to\infty}\varepsilon_n = 0$, we derive from (46) that
$$\Theta(x^*) \le \liminf_{l\to\infty}\Theta(x_{n_l}) \le \Theta(\hat{x}) + 2C_0 < \infty,$$
which implies that $x^* \in D(\Theta)$. Thus, $x^* \in B_{2\rho}(x_0) \cap D(\Theta)$ is a solution of (1).
Finally, we prove the convergence of the whole sequence $\{x_n\}$ to $x^*$. Let
$$\eta_0 := \lim_{n\to\infty}D_{\xi_n}\Theta(x^*, x_n),$$
whose existence is guaranteed by Proposition 1. By the non-negativity of the $\varepsilon$-Bregman distance, we have $\eta_0 \ge 0$. Using (42) with $\hat{x} = x^*$ and taking $l \to \infty$, we have
$$D_{\xi_{n_j}}\Theta(x^*, x_{n_j}) \le D_{\xi_{n_j}}\Theta(x^*, x_{n_j}) - \eta_0 + \sup_{l\ge j}\left|\langle\xi_{n_l} - \xi_{n_j}, x_{n_l} - x^*\rangle_{X^*,X}\right|,$$
which suggests that
$$\eta_0 \le \sup_{l\ge j}\left|\langle\xi_{n_l} - \xi_{n_j}, x_{n_l} - x^*\rangle_{X^*,X}\right|$$
for all $j$. By virtue of (45) and taking $j \to \infty$, we obtain $\eta_0 \le 0$. Therefore, $\eta_0 = 0$, i.e., $\lim_{n\to\infty}D_{\xi_n}\Theta(x^*, x_n) = 0$. Using the $p$-convexity of $\Theta$ and $\lim_{n\to\infty}\varepsilon_n = 0$, we further conclude that $\lim_{n\to\infty}\|x_n - x^*\|_X = 0$.
It remains to show that $x^* = x^\dagger$. From the definition of $\xi_n$, we observe that
$$\xi_{n+1} - \xi_n \in R(A_F^*) \subset N(A_F)^\perp = \overline{R(A_F^*)}.$$
Therefore, we can apply the second part of [28] (Proposition 3.6) to complete the proof. □

3.2. Regularization Property

In this subsection, we prove the regularization property of Algorithm 1. Before proceeding, we present two stability results: Lemma 4 concerns the stability of the inner scheme, and Lemma 5 the stability of the whole algorithm.
Lemma 4.
Let all the conditions in Lemma 2 hold, let the sequence of noisy data $\{y^{\delta_l}\}$ satisfy $\|y^{\delta_l} - y\|_Y \le \delta_l \to 0$ as $l \to \infty$, and let the sequences $\{(\xi_n^{\delta_l}, x_n^{\delta_l})\}$ be produced by Algorithm 1. For any integer $n \ge 0$, if
$$\xi_n^{\delta_l} \to \xi_n \quad \text{and} \quad x_n^{\delta_l} \to x_n \quad \text{as } l \to \infty$$
for some $(\xi_n, x_n) \in X^* \times X$, then there holds, for each $k = 0, 1, \ldots$,
$$\xi_{n,k}^{\delta_l} \to \xi_{n,k} \quad \text{and} \quad x_{n,k}^{\delta_l} \to x_{n,k} \quad \text{as } l \to \infty,$$
with $\{(\xi_{n,k}, x_{n,k})\}$ defined by (33).
Proof. 
We use an induction argument on $k$. When $k = 0$, the result holds automatically since $\xi_{n,0}^{\delta_l} = \xi_n^{\delta_l} \to \xi_n = \xi_{n,0}$ and $x_{n,0}^{\delta_l} = x_n^{\delta_l} \to x_n = x_{n,0}$. Assume that the result holds for some $k \ge 0$; we show that it also holds for $k + 1$. The following two cases are considered:
(i) $s_{n,k} = 0$. By the definition of $\xi_{n,k+1}$, we have $\xi_{n,k+1} = \xi_{n,k}$. Consequently,
$$\xi_{n,k+1}^{\delta_l} - \xi_{n,k+1} = \xi_{n,k}^{\delta_l} - \xi_{n,k} + \mu_{n,k}^{\delta_l}A_F^* J_r^Y(s_{n,k}^{\delta_l}).$$
Together with $\mu_{n,k}^{\delta_l} \le \mu_1(\|s_{n,k}^{\delta_l}\|_Y^p + \sigma\varepsilon_{n,k})^{1-\frac{r}{p}}$ and the properties of $J_r^Y$, we deduce that
$$\|\xi_{n,k+1}^{\delta_l} - \xi_{n,k+1}\|_{X^*} \le \|\xi_{n,k}^{\delta_l} - \xi_{n,k}\|_{X^*} + \mu_1\hat{B}\left(\|s_{n,k}^{\delta_l}\|_Y^p + \sigma\varepsilon_{n,k}\right)^{1-\frac{r}{p}}\|s_{n,k}^{\delta_l}\|_Y^{r-1}.$$
By the induction hypothesis and the continuity of $F$, there holds $\xi_{n,k+1}^{\delta_l} \to \xi_{n,k+1}$ as $l \to \infty$. The definition of $x_{n,k+1}^{\delta_l}$ (namely $x_{n,k+1}^{\delta_l} = S_{\varepsilon_{n,k+1}}(\xi_{n,k+1}^{\delta_l})$) and the continuity of $S_\varepsilon$ then yield $x_{n,k+1}^{\delta_l} \to x_{n,k+1}$ as $l \to \infty$.
(ii) $s_{n,k} \ne 0$. We first show that $\mu_{n,k}^{\delta_l} \to \mu_{n,k}$ as $l \to \infty$. Recall that
$$\mu_{n,k} = \min\left\{\frac{\mu_0\|s_{n,k}\|_Y^{p(r-1)}}{\|A_F^* J_r^Y(s_{n,k})\|_{X^*}^p},\ \mu_1\right\}\left(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k}\right)^{1-\frac{r}{p}}$$
and
$$\mu_{n,k}^{\delta_l} = \min\left\{\frac{\mu_0\|s_{n,k}^{\delta_l}\|_Y^{p(r-1)}}{\|A_F^* J_r^Y(s_{n,k}^{\delta_l})\|_{X^*}^p},\ \mu_1\right\}\left(\|s_{n,k}^{\delta_l}\|_Y^p + \sigma\varepsilon_{n,k}\right)^{1-\frac{r}{p}}.$$
If $A_F^* J_r^Y(s_{n,k}) = 0$, we have $\mu_{n,k} = \mu_1(\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k})^{1-\frac{r}{p}}$ and $\mu_{n,k}^{\delta_l} = \mu_1(\|s_{n,k}^{\delta_l}\|_Y^p + \sigma\varepsilon_{n,k})^{1-\frac{r}{p}}$ for sufficiently large $l$, which gives $\mu_{n,k}^{\delta_l} \to \mu_{n,k}$ as $l \to \infty$. If $A_F^* J_r^Y(s_{n,k}) \ne 0$, it follows from the induction hypothesis that $\mu_{n,k}^{\delta_l} \to \mu_{n,k}$ as $l \to \infty$. Then, using the induction hypothesis and the continuity of $F$, $J_r^Y$ and $S_\varepsilon$, we conclude that $\xi_{n,k+1}^{\delta_l} \to \xi_{n,k+1}$ and $x_{n,k+1}^{\delta_l} \to x_{n,k+1}$ as $l \to \infty$. □
Lemma 5.
Let all the conditions in Lemma 2 hold, let the sequence of noisy data $\{y^{\delta_l}\}$ satisfy $\|y^{\delta_l} - y\|_Y \le \delta_l \to 0$ as $l \to \infty$, and let the sequences $\{(\xi_n^{\delta_l}, x_n^{\delta_l})\}$ be generated by Algorithm 1. Then, for any integer $n \ge 0$, by taking a subsequence of $\{y^{\delta_l}\}$ if necessary, there is a sequence $\{(\xi_n, x_n)\} \in \Gamma_{\gamma,\mu_0,\mu_1}(\xi_0, x_0)$ such that
$$\xi_m^{\delta_l} \to \xi_m \quad \text{and} \quad x_m^{\delta_l} \to x_m \quad \text{as } l \to \infty$$
for all $0 \le m \le n$.
Proof. 
We use induction to complete the proof. For $n = 0$, the result is trivial. Assume that, for some $n > 0$, the result holds for some sequence $\{(\xi_n, x_n)\} \in \Gamma_{\gamma,\mu_0,\mu_1}(\xi_0, x_0)$. We show that it is also valid for $n + 1$. To this end, we may obtain a new sequence in $\Gamma_{\gamma,\mu_0,\mu_1}(\xi_0, x_0)$ by redefining $\xi_{n+1}$ and $x_{n+1}$ in the sequence $\{(\xi_n, x_n)\}$ and applying Algorithm 2 to generate the remaining terms. By Lemma 4,
$$\xi_{n,k}^{\delta_l} \to \xi_{n,k}, \quad x_{n,k}^{\delta_l} \to x_{n,k} \quad \text{as } l \to \infty$$
for $k = 0, 1, \ldots$, where $\{(\xi_{n,k}, x_{n,k})\}$ are defined by (33).
Let $k_n^{\delta_l}$ be the integer used to define $\xi_{n+1}^{\delta_l}$ and $x_{n+1}^{\delta_l}$. By the definition of $k_n^{\delta_l}$ (i.e., $k_n^{\delta_l} = \min\{\tilde{k}_n^{\delta_l}, k_{\max}\}$), we know that $1 \le k_n^{\delta_l} \le k_{\max}$. By taking a subsequence of $\{y^{\delta_l}\}$ if necessary, we may assume that $k_n^{\delta_l}$ takes a constant integer value $k_n$. Then $1 \le k_n \le k_{\max}$ and
$$\|s_{n,k}^{\delta_l}\|_Y^p + \sigma\varepsilon_{n,k} \ge \left(\gamma\|y^{\delta_l} - F(x_n^{\delta_l})\|_Y\right)^p, \quad 0 \le k < k_n.$$
Letting $l \to \infty$ and using Lemma 4, we obtain
$$\|s_{n,k}\|_Y^p + \sigma\varepsilon_{n,k} \ge \left(\gamma\|y - F(x_n)\|_Y\right)^p, \quad 0 \le k < k_n.$$
With this choice of $k_n$, we redefine $\xi_{n+1}$ and $x_{n+1}$ in the sequence $\{(\xi_n, x_n)\}$ by $\xi_{n+1} := \xi_{n,k_n}$ and $x_{n+1} := x_{n,k_n}$. The application of Lemma 4 yields $\xi_{n+1}^{\delta_l} \to \xi_{n+1}$ and $x_{n+1}^{\delta_l} \to x_{n+1}$ as $l \to \infty$. The proof is thus complete. □
We are now in a position to show the regularization property of Algorithm 1.
Theorem 2.
Let $X$ be reflexive and let $Y$ be uniformly smooth, and let Assumptions 1, 2 and 3 hold. Let $\beta > 1$, $\eta < \gamma < 1$, $\mu_0 > 0$ and $\tau > 1$ be chosen such that (25) holds. Assume further that $\{y^\delta\}$ is a family of noisy data satisfying $\|y^\delta - y\|_Y \le \delta \to 0$, and let $n_\delta$ be determined by (23) for each $y^\delta$. Then
$$\|x_{n_\delta}^\delta - x^\dagger\|_X \to 0 \quad \text{and} \quad D_{\xi_{n_\delta}^\delta}\Theta(x^\dagger, x_{n_\delta}^\delta) \to 0$$
as $\delta \to 0$.
Proof. 
Due to the $p$-convexity of $\Theta$, it suffices to show $\lim_{\delta\to 0}D_{\xi_{n_\delta}^\delta}\Theta(x^\dagger, x_{n_\delta}^\delta) = 0$. Since $\varepsilon_{n,k} > 0$ for all $n, k$, from (22) and (23) we must have $n_\delta \to \infty$ as $\delta \to 0$. Then, for any arbitrary but fixed integer $\hat{n} > 0$, we have $n_\delta > \hat{n}$ for small $\delta$. From Lemma 2, we have
$$D_{\xi_{n_\delta}^\delta}\Theta(x^\dagger, x_{n_\delta}^\delta) \le D_{\xi_{\hat{n}}^\delta}\Theta(x^\dagger, x_{\hat{n}}^\delta) + 3\sum_{n=\hat{n}}^{n_\delta-1}\sum_{k=0}^{k_n^\delta-1}\varepsilon_{n,k}.$$
Together with Lemma 5 and the lower semi-continuity of $\Theta$, there holds
$$\begin{aligned} \limsup_{\delta\to 0}D_{\xi_{n_\delta}^\delta}\Theta(x^\dagger, x_{n_\delta}^\delta) &\le \Theta(x^\dagger) - \liminf_{\delta\to 0}\Theta(x_{\hat{n}}^\delta) - \lim_{\delta\to 0}\langle\xi_{\hat{n}}^\delta, x^\dagger - x_{\hat{n}}^\delta\rangle_{X^*,X} + 3\sum_{n=\hat{n}}^{\infty}\sum_{k=0}^{\infty}\varepsilon_{n,k} \\ &\le \Theta(x^\dagger) - \Theta(x_{\hat{n}}) - \langle\xi_{\hat{n}}, x^\dagger - x_{\hat{n}}\rangle_{X^*,X} + 3\sum_{n=\hat{n}}^{\infty}\sum_{k=0}^{\infty}\varepsilon_{n,k} \le D_{\xi_{\hat{n}}}\Theta(x^\dagger, x_{\hat{n}}) + 3\sum_{n=\hat{n}}^{\infty}\sum_{k=0}^{\infty}\varepsilon_{n,k}. \end{aligned}$$
Since $\hat{n} > 0$ is arbitrary, by letting $\hat{n} \to \infty$ we may use (24) and Theorem 1 to deduce that $\lim_{\delta\to 0}D_{\xi_{n_\delta}^\delta}\Theta(x^\dagger, x_{n_\delta}^\delta) = 0$. By the $p$-convexity of $\Theta$, we conclude that $\|x_{n_\delta}^\delta - x^\dagger\|_X \to 0$ as $\delta \to 0$. □

4. Numerical Experiments

In this section, we provide two numerical experiments. The first tests the effectiveness of Algorithm 1 in identifying non-smooth solutions of parameter identification problems; the second validates its efficiency for solving a non-smooth source-term problem.

4.1. Elliptic Parameter Identification

We first consider the reconstruction of the parameter $c$ in the boundary value problem [3]
$$-\Delta u + cu = f \quad \text{in } \Omega, \qquad u = g \quad \text{on } \partial\Omega, \tag{48}$$
from $L^2(\Omega)$-measurements of the state $u$, where $f \in H^{-1}(\Omega)$ and $g \in H^{1/2}(\partial\Omega)$. Let
$$D := \left\{c \in L^2(\Omega) : \|c - \hat{c}\|_{L^2(\Omega)} \le \gamma_0 \text{ for some } \hat{c} \ge 0 \text{ a.e.}\right\}$$
for some $\gamma_0 > 0$. If $c \in D$ is given, then (48) has a unique solution $u := u(c)$. Therefore, (48) reduces to solving $F(c) = u$ if we define the nonlinear operator $F : L^2(\Omega) \to L^2(\Omega)$ by $F(c) := u(c)$. From [3], it is known that $F$ is Fréchet differentiable; the Fréchet derivative of $F$ and its adjoint are given by
$$F'(c)h = -A(c)^{-1}(h\,F(c)) \quad \text{and} \quad F'(c)^*\omega = -u(c)\,A(c)^{-1}\omega$$
for $h, \omega \in L^2(\Omega)$, where $A(c) : H^2(\Omega) \cap H_0^1(\Omega) \to L^2(\Omega)$ is defined by $A(c)u = -\Delta u + cu$. According to [21], if we choose $A_F = F'(c_f)$ with a given $c_f \in D$, condition (c) in Assumption 2 holds.
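As a concrete illustration of how $F$ and $A_F = F'(c_f)$ can be realized in the one-dimensional experiment below, the following finite-difference sketch (our own simplification; function names, grid and boundary handling are assumptions) solves $-u'' + cu = f$ on a uniform grid and applies $F'(c_f)h = -A(c_f)^{-1}(h\,u(c_f))$:

```python
import numpy as np

def solve_bvp(c, f, u0=1.0, u1=6.0):
    """Finite-difference solve of -u'' + c*u = f on (0,1) with
    u(0) = u0, u(1) = u1; c and f hold values at the m interior
    grid points.  This realizes the forward map F(c) = u(c)."""
    m = c.size
    h = 1.0 / (m + 1)
    A = (np.diag(2.0 / h**2 + c)
         + np.diag(-np.ones(m - 1) / h**2, 1)
         + np.diag(-np.ones(m - 1) / h**2, -1))
    rhs = f.copy()
    rhs[0] += u0 / h**2    # fold Dirichlet data into the right-hand side
    rhs[-1] += u1 / h**2
    return np.linalg.solve(A, rhs)

def apply_AF(hdir, c_f, f):
    """A_F h = F'(c_f) h = -A(c_f)^{-1}(h * u(c_f)); the operator
    A(c_f) carries homogeneous boundary conditions."""
    u_cf = solve_bvp(c_f, f)
    return -solve_bvp(c_f, hdir * u_cf, u0=0.0, u1=0.0)
```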
In this experiment, we pick $\Omega = [0, 1]$ with boundary values $u(0) = 1$ and $u(1) = 6$. The sought parameter $c^\dagger$ is taken to be
$$c^\dagger(t) = \begin{cases} 0, & 0 \le t \le 0.1563, \\ 1.5, & 0.1563 < t \le 0.3125, \\ 2.5, & 0.3125 < t \le 0.5469, \\ 1.3, & 0.5469 < t \le 0.7813, \\ 0.5, & 0.7813 < t \le 1. \end{cases}$$
In addition, we take $u(c^\dagger) = 1 + 5t$ and the source term $f(t) = (1 + 5t)\,c^\dagger(t)$. Our aim is to reconstruct $c^\dagger$ from noisy data $u^\delta$ with noise level $\delta = \|u^\delta - u(c^\dagger)\|_{L^2} = 0.0001$. When implementing Algorithm 1, we use the initial guesses $c_0 = \xi_0 = 0$ and
$$\Theta(x) = \frac{1}{2\beta}\int_\Omega |x(w)|^2\,dw + \mathrm{TV}(x)$$
with $\beta = 1$ to capture the non-smooth features of $c^\dagger$. We fix $\eta = 0.01$, $\sigma = 0.001$, $k_{\max} = 500$, $\gamma = 0.98$ in (22), and $\tau = 1.5$ in (23). To meet condition (25), we require $\mu_0 < \frac{2}{\beta}\left(1 - \frac{\eta}{\gamma} - \frac{1+\eta}{\gamma\tau}\right)$; thus, we take $\mu_0 = \frac{1.8}{\beta}\left(1 - \frac{1}{\tau}\right)$ and $\mu_1 = 10{,}000$. Dividing the interval $[0, 1]$ into 128 subintervals of equal length, the involved differential equations are solved by the finite difference method. The minimization problems involving $\Theta$ are solved by the primal dual hybrid gradient (PDHG) method [29], terminated once the relative duality gap falls below $(n+1)^{-1.5}(k+1)^{-1.5}$.
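For these values the requirement on $\mu_0$ is met with some margin: assuming the bound as reconstructed above, with $\eta = 0.01$, $\gamma = 0.98$, $\tau = 1.5$ and $\beta = 1$,
$$\frac{2}{\beta}\left(1 - \frac{\eta}{\gamma} - \frac{1+\eta}{\gamma\tau}\right) \approx 2\,(1 - 0.0102 - 0.6871) \approx 0.605, \qquad \mu_0 = \frac{1.8}{\beta}\left(1 - \frac{1}{1.5}\right) = 0.6 < 0.605.$$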
Figure 1a,b show the reconstruction results of Algorithm 1 with $A_F = F'(c_f)$ for two different choices of $c_f$: $c_f = 0$ and $c_f = 1$. As a comparison, we also consider the inexact Newton-Landweber iteration of [6], i.e., Algorithm 1 with $A_F$ taken as the Fréchet derivative of $F$ at each iteration, $A_F = F'(c_n^\delta)$; the corresponding reconstruction is plotted in Figure 1c. As can be seen, the reconstructions of Algorithm 1 with different values of $c_f$ are of comparable quality to the one produced by the method of [6], while Algorithm 1 does not require the Fréchet derivative at each iteration. In Figure 1d, we present the evolution of the relative error $\|c_n^\delta - c^\dagger\|_{L^2}/\|c^\dagger\|_{L^2}$ with respect to the outer iteration number $n$. Since Algorithm 1 avoids the calculation of the Fréchet derivative at each iteration, the computational work per iteration is considerably reduced; it may require more iterations in total, but this is offset by the cheaper cost of each iteration.

4.2. A Non-Smooth Ill-Posed Problem

Let $\Omega \subset \mathbb{R}^2$ be a bounded domain with a Lipschitz boundary $\partial\Omega$. We consider the inverse problem of estimating the source term $f$ in the non-smooth semi-linear elliptic equation
$$-\Delta u + u^+ = f \quad \text{in } \Omega, \qquad u = 0 \quad \text{on } \partial\Omega, \tag{50}$$
from an $L^2(\Omega)$ measurement $\tilde{u}$ of the state $u$, where $u^+(x) = \max\{u(x), 0\}$ for a.e. $x \in \Omega$. It is easy to see that, for each $f \in L^2(\Omega)$, (50) has a unique solution $u := u(f) \in H_0^1(\Omega) \cap C(\bar{\Omega}) \subset L^2(\Omega)$; see [30] (Theorem 4.7). If we define $F(f) := u(f)$, the estimation of the source term becomes an inverse problem of the form (1). It has been shown that $F$ is weakly closed and is not Gâteaux differentiable at $f$ if the set $\{u(f) = 0\}$ has positive measure; see [17] and [30] (Proposition 3.4). In this case, a Bouligand subderivative of $F$ at $f$ exists, defined as a limit of Fréchet derivatives of $F$ taken at differentiable points. Christof et al. introduced in [31] a specific Bouligand subderivative of $F$: for $f \in L^2(\Omega)$, the bounded linear operator $G(f) : L^2(\Omega) \to L^2(\Omega)$ maps $h \in L^2(\Omega)$ to the unique solution $v := G(f)h \in H_0^1(\Omega)$ of
$$-\Delta v + \chi_{\{u(f) > 0\}}v = h \quad \text{in } \Omega, \qquad v = 0 \quad \text{on } \partial\Omega. \tag{51}$$
It has been shown in [21] that if we take $A_F = G(M_f)$ with a given $M_f \in L^2(\Omega)$, condition (17) holds for sufficiently small $\rho > 0$.
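For intuition, one application of $G(f)$ amounts to a single linear elliptic solve with the indicator coefficient $\chi_{\{u(f) > 0\}}$. A rough finite-difference sketch on the unit square (our own discretization for illustration; the experiment below uses finite elements instead) reads:

```python
import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla

def apply_G(h, u, m=128):
    """v = G(f) h from (51): solve -Lap(v) + chi_{u>0} v = h with a
    5-point Laplacian on the m-by-m interior grid of the unit square;
    u and h are vectors of length m*m on that grid."""
    hx = 1.0 / (m + 1)
    I = sp.identity(m)
    T = sp.diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(m, m))
    lap = (sp.kron(I, T) + sp.kron(T, I)) / hx**2   # discrete -Laplacian
    chi = sp.diags((u > 0).astype(float))
    return spla.spsolve((lap + chi).tocsc(), h)
```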
In the computation, we pick $\Omega := (0, 1) \times (0, 1) \subset \mathbb{R}^2$ and assume that the sought solution is
$$f^\dagger(x_1, x_2) := \max(u^\dagger(x_1, x_2), 0) + \left[4\pi^2 u^\dagger(x_1, x_2) - 2\left((2x_1 - 1)^2 + 2(x_1 - 1 + \beta)(x_1 - \beta)\right)\sin(2\pi x_2)\right]\chi_{[\beta, 1-\beta]}(x_1)$$
with $\beta = 0.1$, where
$$u^\dagger(x_1, x_2) := (x_1 - \beta)^2(x_1 - 1 + \beta)^2\sin(2\pi x_2)\,\chi_{[\beta, 1-\beta]}(x_1)$$
is the corresponding exact state. Obviously, $u^\dagger \in H_0^1(\Omega) \cap H^2(\Omega)$, and together with the right-hand side $f^\dagger$ it satisfies (50). Since $u^\dagger$ vanishes on a set of measure $2\beta$, the forward operator $F$ is not Gâteaux differentiable at $f^\dagger$. To carry out the computation, we employ a uniform Friedrichs-Keller triangulation on a $128 \times 128$ grid. The differential Equations (50) and (51) are discretized by a finite element method; see [17]. The resulting discrete systems are solved by a semi-smooth Newton iteration [32]. We generate noisy data $u^\delta$ by adding Gaussian noise to $u^\dagger$ with noise level $\delta = \|u^\delta - u^\dagger\|_{L^2}$. In the following, we pick $\delta = 0.001$ and the initial guess
$$f_0 = f^\dagger - 20\sin(\pi x_1)\sin(2\pi x_2).$$
When executing Algorithm 1, we use $\Theta(x) = \frac{1}{2}\int_\Omega |x(w)|^2\,dw$ and $\gamma = 0.9$, $\mu_0 = 0.8$, $\mu_1 = 1000$, $k_{\max} = 300$, $\tau = 1.4$.
In Figure 2, we report the reconstructions of Algorithm 1 corresponding to $A_F = G(M_f)$ with two different choices of $M_f$: $M_f = f_0$ and $M_f = 0$. Figure 2a displays the exact solution $f^\dagger$ and Figure 2b the noisy data $u^\delta$; the reconstruction results are presented in Figure 2c,d. Observe that our algorithm provides satisfactory results for this non-smooth inverse problem.
In summary, the above experiments show that Algorithm 1 can efficiently handle smooth as well as non-smooth inverse problems, provided the operator $A_F$ is chosen appropriately; when the forward mapping is not Gâteaux differentiable, $A_F$ can be taken to be a Bouligand subderivative, which serves as a replacement for the nonexistent Gâteaux derivative and extends the applicability of inexact Newton methods [4,5,6,7] to non-smooth inverse problems.

5. Conclusions

In this paper, by employing a bounded linear operator $A_F$ and inexact inner solvers, we have proposed a generalized inexact Newton-Landweber iteration. The method does not require the Fréchet derivative of the forward mapping, which makes it feasible not only for smooth but also for non-smooth nonlinear inverse problems in Banach spaces. Under certain conditions on $A_F$, a detailed convergence analysis is established. The numerical simulations on a smooth parameter identification problem and a non-smooth inverse source problem indicate that our method can effectively solve inverse problems with smooth as well as non-smooth forward operators once $A_F$ is chosen appropriately.
There are several possible lines of future research. First, the fixed bounded linear operator $A_F$ may be replaced by a family of bounded linear operators $A_F(x)$, and the corresponding convergence theory could be developed without relying on the continuity of $x \mapsto A_F(x)$ [17]. Second, the application of the method to other non-smooth inverse problems is another interesting direction.

Author Contributions

Conceptualization, R.G., H.F. and Z.W.; methodology, R.G., H.F. and Z.W.; formal analysis, R.G., H.F. and Z.W.; Figure creation and editing, R.G., H.F. and Z.W.; writing—original draft preparation, R.G., H.F. and Z.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China under grant number 42274166 and the Fundamental Research Funds for the Central Universities.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Kaltenbacher, B.; Neubauer, A.; Scherzer, O. Iterative Regularization Methods for Nonlinear Ill-Posed Problems; Walter de Gruyter: Berlin, Germany, 2008.
2. Scherzer, O. (Ed.) Handbook of Mathematical Methods in Imaging; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2010.
3. Engl, H.W.; Hanke, M.; Neubauer, A. Regularization of Inverse Problems; Kluwer Academic Publishers: Boston, MA, USA; London, UK, 1996.
4. Jin, Q. Inexact Newton-Landweber iteration for solving nonlinear inverse problems in Banach spaces. Inverse Probl. 2012, 28, 065002.
5. Margotti, F.; Rieder, A.; Leitão, A. A Kaczmarz version of the REGINN-Landweber iteration for ill-posed problems in Banach spaces. SIAM J. Numer. Anal. 2014, 52, 1439–1465.
6. Jin, Q. Inexact Newton-Landweber iteration in Banach spaces with nonsmooth convex penalty terms. SIAM J. Numer. Anal. 2015, 53, 2389–2413.
7. Gu, R.; Han, B. Inexact Newton regularization in Banach spaces based on two-point gradient method with uniformly convex penalty terms. Appl. Numer. Math. 2021, 160, 122–145.
8. Gu, R.; Fu, H.; Han, B. Generalized inexact Newton regularization for nonlinear ill-posed problems in Banach spaces. Inverse Probl. 2022, 38, 065001.
9. Boţ, R.I.; Hein, T. Iterative regularization with a general penalty term-theory and application to L1 and TV regularization. Inverse Probl. 2011, 28, 104010.
10. Zhong, M.; Wang, W.; Jin, Q. Regularization of inverse problems by two-point gradient methods in Banach spaces. Numer. Math. 2019, 143, 713–747.
11. Rieder, A. On the regularization of nonlinear ill-posed problems via inexact Newton iterations. Inverse Probl. 1999, 15, 309.
12. Neubauer, A. Optimal convergence rates for inexact Newton regularization with CG as inner iteration. J. Inverse Ill-Posed Probl. 2020, 28, 145–153.
13. Lechleiter, A.; Rieder, A. Towards a general convergence theory for inexact Newton regularizations. Numer. Math. 2010, 114, 521–548.
14. Hanke, M. Regularizing properties of a truncated Newton-CG algorithm for nonlinear inverse problems. Numer. Funct. Anal. Optim. 1997, 18, 971–993.
15. Rieder, A. On convergence rates of inexact Newton regularizations. Numer. Math. 2001, 88, 347–365.
16. Jin, Q. On the order optimality of the regularization via inexact Newton iterations. Numer. Math. 2012, 121, 237–260.
17. Clason, C.; Nhu, V.H. Bouligand-Landweber iteration for a non-smooth ill-posed problem. Numer. Math. 2019, 142, 789–832.
18. Scherzer, O. Convergence criteria of iterative methods based on Landweber iteration for solving nonlinear problems. J. Math. Anal. Appl. 1995, 194, 911–933.
19. Clason, C.; Nhu, V.H. Bouligand-Levenberg-Marquardt iteration for a non-smooth ill-posed problem. Electron. Trans. Numer. Anal. 2019, 51, 274–314.
20. Mahale, P.; Dixit, S.K. Convergence analysis of simplified iteratively regularized Gauss-Newton method in a Banach space setting. Appl. Anal. 2018, 97, 2686–2719.
21. Fu, Z.; Chen, Y.; Li, L.; Han, B. Analysis of a generalized regularized Gauss-Newton method under heuristic rule in Banach spaces. Inverse Probl. 2021, 37, 125003.
22. Jin, Q. On a class of frozen regularized Gauss-Newton methods for nonlinear inverse problems. Math. Comput. 2010, 79, 2191–2211.
23. Zalinescu, C. Convex Analysis in General Vector Spaces; World Scientific: Singapore, 2002.
24. Bregman, L.M. The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming. USSR Comput. Math. Math. Phys. 1967, 7, 200–217.
25. Schuster, T.; Kaltenbacher, B.; Hofmann, B.; Kazimierski, K.S. Regularization Methods in Banach Spaces; De Gruyter: Vienna, Austria, 2012.
26. Jin, Q. Landweber-Kaczmarz method in Banach spaces with inexact inner solvers. Inverse Probl. 2016, 32, 104005.
27. Schöpfer, F.; Louis, A.K.; Schuster, T. Nonlinear iterative methods for linear ill-posed problems in Banach spaces. Inverse Probl. 2006, 22, 311.
28. Jin, Q.; Wang, W. Landweber iteration of Kaczmarz type with general non-smooth convex penalty functionals. Inverse Probl. 2013, 29, 085011.
29. Zhu, M.; Chan, T.F. An Efficient Primal-Dual Hybrid Gradient Algorithm for Total Variation Image Restoration; UCLA CAM Report 08-34; UCLA: Los Angeles, CA, USA, 2008.
30. Tröltzsch, F. Optimal Control of Partial Differential Equations: Theory, Methods and Applications; American Mathematical Society: Providence, RI, USA, 2010.
31. Christof, C.; Clason, C.; Meyer, C.; Walter, S. Optimal control of a non-smooth semilinear elliptic equation. Math. Control Relat. Fields 2018, 8, 247–276.
32. Ulbrich, M. Semismooth Newton Methods for Variational Inequalities and Constrained Optimization Problems in Function Spaces; MOS-SIAM Series on Optimization; SIAM: Philadelphia, PA, USA, 2011.
Figure 1. Elliptic parameter identification ($\delta = 0.0001$). (a) Reconstruction by Algorithm 1 with $A_F = F'(c_f)$ and $c_f = 0$; (b) reconstruction by Algorithm 1 with $A_F = F'(c_f)$ and $c_f = 1$; (c) reconstruction by Algorithm 1 with $A_F = F'(c_n^\delta)$; (d) evolution of the relative error.
Figure 2. The non-smooth ill-posed problem ($\delta = 0.001$). (a) Exact solution $f^\dagger$; (b) noisy data $u^\delta$; (c) reconstruction by Algorithm 1 with $A_F = G(M_f)$ and $M_f = 0$; (d) reconstruction by Algorithm 1 with $A_F = G(M_f)$ and $M_f = f_0$.