Generalized Three-Step Numerical Methods for Solving Equations in Banach Spaces

Argyros, Michael I.; Argyros, Ioannis K.; Regmi, Samundra; George, Santhosh

doi:10.3390/math10152621


¹ Department of Computer Science, University of Oklahoma, Norman, OK 73019, USA

² Department of Mathematical Sciences, Cameron University, Lawton, OK 73505, USA

³ Department of Mathematics, University of Houston, Houston, TX 77204, USA

⁴ Department of Mathematical and Computational Sciences, National Institute of Technology Karnataka, Mangaluru 575 025, India

* Authors to whom correspondence should be addressed.

Mathematics 2022, 10(15), 2621; https://doi.org/10.3390/math10152621

Submission received: 8 July 2022 / Revised: 24 July 2022 / Accepted: 26 July 2022 / Published: 27 July 2022

(This article belongs to the Special Issue Numerical Methods for Solving Nonlinear Equations)


Abstract

In this article, we propose a new methodology to construct and study generalized three-step numerical methods for solving nonlinear equations in Banach spaces. These methods are very general and include other methods already in the literature as special cases. The convergence analysis of the specialized methods has been given by assuming the existence of high-order derivatives that do not appear in these methods. These assumptions limit the applicability of the methods to equations involving operators that are sufficiently many times differentiable, although the methods may converge without them. Moreover, the convergence of each method is shown under a different set of conditions. Motivated by optimization considerations and the above concerns, we present a unified convergence analysis for the generalized numerical methods relying on conditions involving only the operators appearing in the method. This is the novelty of the article. Special cases and examples are presented to conclude this article.

Keywords:

generalized three-step numerical method; convergence; Banach space

MSC:

49M15; 47H17; 65J15; 65G99; 41A25

1. Introduction

A plethora of applications from diverse disciplines of computational sciences are converted to nonlinear equations such as

F (x) = 0

(1)

using mathematical modeling [1,2,3,4]. The nonlinear operator F is defined on an open and convex subset

Ω

of a Banach space X with values in X. The solution of the equation is denoted by

x_{*} .

Numerical methods are mainly used to find

x_{*} .

This is the case since the analytic form of the solution

x_{*}

can be obtained only in special cases.

Researchers, as well as practitioners, have proposed numerous numerical methods, each analyzed under a different set of convergence conditions that use high-order derivatives not present in the methods themselves.

Let us consider an example.

Example 1.

Define the function F on

X = [- 0.5, 1.5]

by

F (t) = \begin{cases} t^{3} \ln t^{2} + t^{5} - t^{4}, & t \neq 0 \\ 0, & t = 0 . \end{cases}

Clearly, the point

t_{*} = 1

solves the equation

F (t) = 0 .

It follows that

F^{'''} (t) = 6 \ln t^{2} + 60 t^{2} - 24 t + 22 .

Then, the function F does not have a bounded third derivative in

X .

Hence, convergence results that require a bounded third derivative cannot be applied to many high-order methods for this equation, although the methods may converge. To address these concerns, we propose a unified approach to the convergence of these numerical methods that takes into account only the operators appearing in them. Hence, the use of these methods becomes possible under weaker conditions.
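As a rough numerical illustration of Example 1 (a sketch, not part of the original analysis), the following Python snippet evaluates the third derivative at points approaching t = 0, where the logarithmic term makes it unbounded, and then runs plain Newton iteration. The starting point t0 = 1.2 and the sample points are assumptions chosen only for illustration.

```python
import math

def F(t):
    # F from Example 1; F(0) is defined as 0
    return 0.0 if t == 0 else t**3 * math.log(t**2) + t**5 - t**4

def dF(t):
    # first derivative for t != 0: 3 t^2 ln t^2 + 5 t^4 - 4 t^3 + 2 t^2
    return 3 * t**2 * math.log(t**2) + 5 * t**4 - 4 * t**3 + 2 * t**2

def d3F(t):
    # third derivative for t != 0, as stated in Example 1
    return 6 * math.log(t**2) + 60 * t**2 - 24 * t + 22

# |F'''(t)| keeps growing as the sample points approach 0
blow_up = [abs(d3F(10.0 ** (-k))) for k in (2, 4, 8, 16)]

# Newton's method from the assumed starting point t0 = 1.2
t = 1.2
for _ in range(20):
    t = t - F(t) / dF(t)
```

In this run the Newton iterates approach the solution t = 1 even though no bound on the third derivative is available on X.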

Let

x_{0} \in Ω

be a starting point. Define the generalized numerical method

\forall n = 0, 1, 2, \dots

by

\begin{matrix} y_{n} & = & a_{n} = a (x_{n}) \\ z_{n} & = & b_{n} = b (x_{n}, y_{n}) \\ x_{n + 1} & = & c_{n} = c (x_{n}, y_{n}, z_{n}), \end{matrix}

(2)

where

a : Ω ⟶ X, b : Ω \times Ω ⟶ X

and

c : Ω \times Ω \times Ω ⟶ X

are given operators chosen so that

{lim}_{n ⟶ \infty} x_{n} = x_{*} .
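The structure of method (2) can be sketched in a few lines of Python. The function name `three_step`, the test equation t^2 - 2 = 0, the starting point, and the frozen-derivative Newton substeps below are all hypothetical choices made for illustration; they are one possible instance of the operators a, b, c, not the authors' implementation.

```python
def three_step(a, b, c, x0, steps=10):
    """Iterate y = a(x), z = b(x, y), x_next = c(x, y, z) as in method (2)."""
    x = x0
    history = [x]
    for _ in range(steps):
        y = a(x)
        z = b(x, y)
        x = c(x, y, z)
        history.append(x)
    return history

# Hypothetical scalar instance: F(t) = t^2 - 2 with frozen-derivative Newton substeps
F = lambda t: t * t - 2.0
dF = lambda t: 2.0 * t
a = lambda x: x - F(x) / dF(x)
b = lambda x, y: y - F(y) / dF(x)
c = lambda x, y, z: z - F(z) / dF(x)

iterates = three_step(a, b, c, x0=1.5, steps=5)
```

Keeping a, b, and c as separate callables mirrors the way the convergence conditions are later imposed on each substep individually.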

The specialization of (2) is

\begin{matrix} y_{n} & = & x_{n} + α_{n} F (x_{n}) \\ z_{n} & = & u_{n} + β_{n} F (x_{n}) + γ_{n} F (y_{n}) \\ x_{n + 1} & = & v_{n} + δ_{n} F (x_{n}) + ϵ_{n} F (y_{n}) + θ_{n} F (z_{n}), \end{matrix}

(3)

where

u_{n} = x_{n}

or

u_{n} = y_{n}, v_{n} = x_{n}

or

v_{n} = y_{n}

or

v_{n} = z_{n},

and

α_{n}, β_{n}, γ_{n}, δ_{n}, ϵ_{n}, θ_{n}

are linear operators on

Ω, Ω \times Ω

and

Ω \times Ω \times Ω,

with values in

X,

respectively. By choosing some of the linear operators in (3) equal to the zero operator O, we obtain the methods studied in [5]. Moreover, if

X = R^{k},

then we obtain the methods studied in [6,7]. In particular, the methods in [5] are of the special form

\begin{matrix} y_{n} & = & x_{n} - O_{1, n}^{- 1} F (x_{n}) \\ z_{n} & = & y_{n} - O_{2, n}^{- 1} F (y_{n}) \\ x_{n + 1} & = & z_{n} - O_{3, n}^{- 1} F (z_{n}), \end{matrix}

(4)

\begin{matrix} y_{n} & = & x_{n} - s F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ z_{n} & = & x_{n} - O_{4, n} F (x_{n}) \\ x_{n + 1} & = & z_{n} - O_{5, n} F (z_{n}), \end{matrix}

(5)

whereas the methods in [7,8] are of the form

\begin{matrix} y_{n} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ z_{n} & = & y_{n} - O_{6, n} F^{'} {(x_{n})}^{- 1} F (y_{n}) \\ x_{n + 1} & = & z_{n} - O_{7, n} F^{'} {(x_{n})}^{- 1} F (z_{n}), \end{matrix}

(6)

where

s \in R

is a given parameter, and

O_{k, n}, k = 1, 2, \dots, 7

are linear operators acting between

Ω

and

X .

In particular, operators must have a special form to obtain the fourth, seventh or eighth order of convergence.

Further specifications of the operators “O” lead to well-studied methods, a few of which are listed below (other choices can be found in [6,7,9,10]):

Newton method (second order) [1,4,11,12]:

$y_{n} = x_{n} - F^{'} {(x_{n})}^{- 1} F (x_{n}) .$

(7)
Jarratt method (second order) [13]:

$y_{n} = x_{n} - \frac{2}{3} F^{'} {(x_{n})}^{- 1} F (x_{n}) .$

(8)
Traub-type method (fifth order) [14]:

$\begin{matrix} y_{n} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ z_{n} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (y_{n}) \\ x_{n + 1} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (z_{n}) . \end{matrix}$

(9)
Homeier method (third order) [15]:

$\begin{matrix} y_{n} & = & x_{n} - \frac{1}{2} F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ x_{n + 1} & = & y_{n} - F^{'} {(x_{n})}^{- 1} F (y_{n}) . \end{matrix}$

(10)
Cordero–Torregrosa method (third order) [2]:

$\begin{matrix} y_{n} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ x_{n + 1} & = & x_{n} - 6 (F^{'} (x_{n}) + 4 F^{'} (\frac{x_{n} + y_{n}}{2})) F^{'} {(y_{n})}^{- 1} F (x_{n}) . \end{matrix}$

(11)

or

$\begin{matrix} y_{n} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ x_{n + 1} & = & x_{n} - 2 {[2 F^{'} (\frac{3 x_{n} + y_{n}}{4}) - F^{'} (\frac{x_{n} + y_{n}}{2}) + 2 F^{'} (\frac{x_{n} + 3 y_{n}}{4})]}^{- 1} F (x_{n}) . \end{matrix}$

(12)
Noor–Waseem method (third order) [3]:

$\begin{matrix} y_{n} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ x_{n + 1} & = & x_{n} - 4 {[3 F^{'} (\frac{2 x_{n} + y_{n}}{3}) + F^{'} (y_{n})]}^{- 1} F (x_{n}) . \end{matrix}$

(13)
Xiao–Yin method (third order) [16]:

$\begin{matrix} y_{n} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ x_{n + 1} & = & x_{n} - \frac{2}{3} [{(3 F^{'} (y_{n}) - F^{'} (x_{n}))}^{- 1} + F^{'} {(x_{n})}^{- 1}] F (x_{n}) . \end{matrix}$

(14)
Cordero–Torregrosa method (fifth order) [2]:

$\begin{matrix} y_{n} & = & x_{n} - \frac{2}{3} F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ z_{n} & = & x_{n} - \frac{1}{2} {(3 F^{'} (y_{n}) - F^{'} (x_{n}))}^{- 1} (3 F^{'} (y_{n}) + F^{'} (x_{n})) F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ x_{n + 1} & = & z_{n} - {(\frac{1}{2} F^{'} (y_{n}) + \frac{1}{2} F^{'} (x_{n}))}^{- 1} F (z_{n}) . \end{matrix}$

(15)

or

$\begin{matrix} y_{n} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ z_{n} & = & x_{n} - 2 {(F^{'} (y_{n}) + F^{'} (x_{n}))}^{- 1} F (x_{n}) \\ x_{n + 1} & = & z_{n} - F^{'} {(y_{n})}^{- 1} F (z_{n}) . \end{matrix}$

(16)
Sharma–Arora method (fifth order) [17,18]:

$\begin{matrix} y_{n} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ x_{n + 1} & = & x_{n} - (2 F^{'} {(y_{n})}^{- 1} - F^{'} {(x_{n})}^{- 1}) F (x_{n}) . \end{matrix}$

(17)
Xiao–Yin method (fifth order) [16]:

$\begin{matrix} y_{n} & = & x_{n} - \frac{2}{3} F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ z_{n} & = & x_{n} - \frac{1}{4} (3 F^{'} {(y_{n})}^{- 1} + F^{'} {(x_{n})}^{- 1}) F (x_{n}) \\ x_{n + 1} & = & x_{n} - \frac{1}{3} [{(3 F^{'} (y_{n}) - F^{'} (x_{n}))}^{- 1}] F (x_{n}) . \end{matrix}$

(18)
Traub-type method (second order) [14]:

$\begin{matrix} y_{n} & = & x_{n} - {[w_{n}, x_{n}; F]}^{- 1} F (x_{n}) \\ w_{n} & = & x_{n} + d F (x_{n}), \end{matrix}$

(19)

where $[., .; F] : Ω \times Ω ⟶ L (X, X)$ is a divided difference of order one.
Moccari–Lotfi method (fourth order) [19]:

$\begin{matrix} y_{n} & = & x_{n} - {[x_{n}, w_{n}; F]}^{- 1} F (x_{n}) \\ x_{n + 1} & = & y_{n} - {([y_{n}, w_{n}; F] + [y_{n}, x_{n}; F] - [x_{n}, w_{n}; F])}^{- 1} F (y_{n}) . \end{matrix}$

(20)
Wang–Zhang method (seventh order) [8,16,20]:

$\begin{matrix} y_{n} & = & x_{n} - {[w_{n}, x_{n}; F]}^{- 1} F (x_{n}) \\ z_{n} & = & M_{8} (x_{n}, y_{n}) \\ x_{n + 1} & = & z_{n} - {([z_{n}, x_{n}; F] + [z_{n}, y_{n}; F] - [y_{n}, x_{n}; F])}^{- 1} F (z_{n}), \end{matrix}$

(21)

where $M_{8}$ is any fourth-order Steffensen-type iteration method.
Sharma–Arora method (seventh order) [17]:

$\begin{matrix} y_{n} & = & x_{n} - {[w_{n}, x_{n}; F]}^{- 1} F (x_{n}) \\ z_{n} & = & y_{n} - (3 I - [w_{n}, x_{n}; F] ([y_{n}, x_{n}; F] + [y_{n}, w_{n}; F]) {[w_{n}, x_{n}; F]}^{- 1}) F (y_{n}) \\ x_{n + 1} & = & z_{n} - {[z_{n}, y_{n}; F]}^{- 1} ([w_{n}, x_{n}; F] + [y_{n}, x_{n}; F] - [z_{n}, x_{n}; F]) {[w_{n}, x_{n}; F]}^{- 1} F (z_{n}) . \end{matrix}$

(22)
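To make the derivative-free entries of this list more concrete, here is a minimal scalar sketch of the Traub-type iteration (19), taking the next iterate to be y_n. In one dimension the divided difference of order one reduces to a difference quotient. The test function e^t - 1, the parameter d = 1, the starting point, and the stopping tolerance are assumptions made for illustration only.

```python
import math

def divided_difference(F, u, v):
    # scalar first-order divided difference [u, v; F] = (F(u) - F(v)) / (u - v)
    return (F(u) - F(v)) / (u - v)

def traub_steffensen(F, x0, d=1.0, tol=1e-12, max_iter=50):
    # derivative-free iteration (19): w = x + d F(x), x_next = x - [w, x; F]^{-1} F(x)
    x = x0
    for _ in range(max_iter):
        fx = F(x)
        if abs(fx) <= tol:   # also avoids w == x, where [w, x; F] is undefined
            break
        w = x + d * fx
        x = x - fx / divided_difference(F, w, x)
    return x

root = traub_steffensen(lambda t: math.exp(t) - 1.0, x0=0.3)
```

The residual check before forming w matters in practice: once F(x) vanishes, w coincides with x and the difference quotient can no longer be evaluated.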

The local and the semi-local convergence analyses of methods (4) and (5) were presented in [17] using hypotheses involving only the operators appearing in these methods. However, the local convergence analysis of method (6) requires derivatives or divided differences of order higher than two, which do not appear in method (6). These high-order derivatives restrict the applicability of method (6) to equations whose operator F has high-order derivatives, although method (6) may converge (see Example 1).

Similar restrictions exist for the convergence of the aforementioned methods of order three or above.

It is also worth noting that the fifth-order method by Sharma [18]

\begin{matrix} y_{n} & = & x_{n} - F^{'} {(x_{n})}^{- 1} F (x_{n}) \\ z_{n} & = & y_{n} - 5 F^{'} {(x_{n})}^{- 1} F (y_{n}) \\ x_{n + 1} & = & y_{n} - \frac{1}{5} [9 F^{'} {(x_{n})}^{- 1} F (y_{n}) + F^{'} {(x_{n})}^{- 1} F (z_{n})] \end{matrix}

(23)

cannot be handled with the analyses given previously [5,6,7] for method (4), method (5), or method (6).
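For readers who want to experiment with method (23), a minimal scalar sketch follows. The function name `sharma_step`, the test equation e^t - 1 = 0 (with solution t = 0), the starting point, and the number of sweeps are illustrative assumptions; in a Banach space setting the reciprocal would be replaced by a linear solve with F'(x_n).

```python
import math

def sharma_step(F, dF, x):
    # one sweep of method (23); all three substeps reuse the derivative at x
    inv = 1.0 / dF(x)
    y = x - inv * F(x)
    z = y - 5.0 * inv * F(y)
    return y - (9.0 * inv * F(y) + inv * F(z)) / 5.0

F = lambda t: math.exp(t) - 1.0
dF = lambda t: math.exp(t)

x = 0.3  # assumed starting point
errors = [abs(x)]  # distance to the solution t = 0 after each sweep
for _ in range(4):
    x = sharma_step(F, dF, x)
    errors.append(abs(x))
```

Only one derivative evaluation is needed per sweep, which is the practical appeal of this scheme.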

Based on all of the above, clearly, it is important to study the convergence of method (2) and its specialization method (3) with the approach employed for method (4) or (5). This way, the resulting unified convergence criteria can apply to their specialized methods listed or not listed previously. Hence, this is the motivation as well as the novelty of the article.

There are two important types of convergence: the semi-local and the local. The semi-local uses information involving the initial point to provide criteria, assuring the convergence of the numerical method, while the local one is based on the information about the solution to find the radii of the convergence balls.

Local convergence results are vital even though the solution is unknown in general, since they allow the convergence order of the numerical method to be determined. Results of this kind also indicate how difficult it is to select starting points. There are cases in which the radius of convergence of the numerical method can be determined without knowledge of the solution.

As an example, let

X = R .

Suppose the function F satisfies an autonomous differential equation [5,21] of the form

H (F (t)) = F^{'} (t),

where H is a continuous function. Notice that

H (F (t_{*})) = F^{'} (t_{*})

or

F^{'} (t_{*}) = H (0) .

In the case of

F (t) = e^{t} - 1

, we can choose

H (t) = t + 1

(see also the numerical section).
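The relation H(F(t)) = F'(t) for this example can be checked numerically. The short sketch below uses arbitrarily chosen sample points (an assumption for illustration) and shows that the derivative at the solution is available as H(0) without knowing the solution itself.

```python
import math

F = lambda t: math.exp(t) - 1.0   # example from the text
dF = lambda t: math.exp(t)        # its derivative
H = lambda s: s + 1.0             # chosen so that H(F(t)) = F'(t)

# residual of the autonomous relation at a few sample points
residuals = [abs(H(F(t)) - dF(t)) for t in (-1.0, -0.25, 0.0, 0.5, 1.0)]

# F'(t_*) is available as H(0) without knowing t_*
dF_at_solution = H(0.0)
```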

Moreover, the local results can apply to projection numerical methods, such as Arnoldi’s, the generalized minimum residual numerical method (GMRES), the generalized conjugate numerical method (GCS) for combined Newton/finite projection numerical methods, and in relation to the mesh independence principle to develop the cheapest and most efficient mesh refinement techniques [1,5,11,21].

In this article, we introduce a majorant sequence and use our idea of recurrent functions to extend the applicability of the numerical method (2). Our analysis includes error bounds and results on the uniqueness of

x_{*}

based on computable Lipschitz constants not given before in [5,13,21,22,23,24] and in other similar studies using the Taylor series. This idea is very general. Hence, it applies also to other numerical methods [10,14,22,25].

The convergence analysis of method (2) and method (3) is given in Section 2. Special choices of the operators appearing in the method are treated in Section 3 and Section 4. Concluding remarks, open problems, and future work complete this article.

2. Convergence Analysis of Method (2)

The local convergence analysis is presented first, followed by the semi-local one. Let

S = [0, \infty)

and

S_{0} = [0, ρ_{0})

for some

ρ_{0} > 0 .

Let the functions

h_{1} : S_{0} ⟶ R, h_{2} : S_{0} \times S_{0} ⟶ R

and

h_{3} : S_{0} \times S_{0} \times S_{0} ⟶ R

be continuous and nondecreasing in each variable.

Suppose that equations

h_{i} (t) - 1 = 0, i = 1, 2, 3

(24)

have the smallest solutions,

ρ_{i} \in S - {0} .

The parameter

ρ

defined by

ρ = min {ρ_{i}}

(25)

shall be shown to be a radius of convergence for method (2). Let

S_{1} = [0, ρ) .

It follows by the definition of radius

ρ

that for all

t \in S_{1}

0 \leq h_{i} (t) < 1 .

(26)

The notation

U (x, ς)

denotes an open ball with center

x \in X

and of radius

ς > 0 .

By

U [x, ς]

, we denote the closure of

U (x, ς) .

The following conditions are used in the local convergence analysis of the method (2).

Suppose the following:

(H1): Equation $F (x) = 0$ has a solution $x_{*} \in Ω .$
(H2): $∥ a (x) - x_{*} ∥ \leq h_{1} (∥ x - x_{*} ∥) ∥ x - x_{*} ∥,$

$∥ b (x, y) - x_{*} ∥ \leq h_{2} (∥ x - x_{*} ∥, ∥ y - x_{*} ∥) ∥ x - x_{*} ∥$

and

$∥ c (x, y, z) - x_{*} ∥ \leq h_{3} (∥ x - x_{*} ∥, ∥ y - x_{*} ∥, ∥ z - x_{*} ∥) ∥ x - x_{*} ∥$

for all $x, y, z \in Ω_{0} = Ω \cap U (x_{*}, ρ_{0}) .$
(H3): Equations (24) have smallest solutions $ρ_{i} \in S_{0} - {0}$ ;
(H4): $U [x_{*}, ρ] \subset Ω,$ where the radius $ρ$ is given by Formula (25).

Next, the main local convergence analysis is presented for method (2).

Theorem 1.

Suppose that the conditions (H1)–(H4) hold and

x_{0} \in U (x_{*}, ρ) - {x_{*}} .

Then, the sequence

{x_{n}}

generated by method (2) is well defined and converges to

x_{*} .

Moreover, the following estimates hold

\forall n = 0, 1, 2, \dots

∥ y_{n} - x_{*} ∥ \leq h_{1} (∥ x_{n} - x_{*} ∥) ∥ x_{n} - x_{*} ∥ \leq ∥ x_{n} - x_{*} ∥ < ρ

(27)

∥ z_{n} - x_{*} ∥ \leq h_{2} (∥ x_{n} - x_{*} ∥, ∥ y_{n} - x_{*} ∥) ∥ x_{n} - x_{*} ∥ \leq ∥ x_{n} - x_{*} ∥

(28)

and

∥ x_{n + 1} - x_{*} ∥ \leq h_{3} (∥ x_{n} - x_{*} ∥, ∥ y_{n} - x_{*} ∥, ∥ z_{n} - x_{*} ∥) ∥ x_{n} - x_{*} ∥ \leq ∥ x_{n} - x_{*} ∥ .

(29)

Proof.

Let

x_{0} \in U (x_{*}, ρ) - {x_{*}} .

Then, it follows from the first condition in (H2), the definition of

ρ,

(26) (for

i = 1

) and the first substep of method (2) for

n = 0

that

∥ y_{0} - x_{*} ∥ \leq h_{1} (∥ x_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ \leq ∥ x_{0} - x_{*} ∥ < ρ,

(30)

showing estimate (27) for

n = 0

and the iterate

y_{0} \in U (x_{*}, ρ) .

Similarly,

\begin{matrix} ∥ z_{0} - x_{*} ∥ & \leq & h_{2} (∥ x_{0} - x_{*} ∥, ∥ y_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ \\ & \leq & h_{2} (∥ x_{0} - x_{*} ∥, ∥ x_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ \leq ∥ x_{0} - x_{*} ∥ \end{matrix}

(31)

and

\begin{matrix} ∥ x_{1} - x_{*} ∥ & \leq & h_{3} (∥ x_{0} - x_{*} ∥, ∥ y_{0} - x_{*} ∥, ∥ z_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ \\ \leq & h_{3} (∥ x_{0} - x_{*} ∥, ∥ x_{0} - x_{*} ∥, ∥ x_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ \\ \leq & ∥ x_{0} - x_{*} ∥, \end{matrix}

showing estimates (28), (29), respectively and the iterates

z_{0}, x_{1} \in U (x_{*}, ρ) .

By simply replacing

x_{0}, y_{0}, z_{0}, x_{1}

by

x_{k}, y_{k}, z_{k}, x_{k + 1}

in the preceding calculations, the induction for estimates (27)–(29) is completed. Then, from the estimate

∥ x_{k + 1} - x_{*} ∥ \leq d ∥ x_{k} - x_{*} ∥ < ρ,

where

d = h_{3} (∥ x_{0} - x_{*} ∥, ∥ x_{0} - x_{*} ∥, ∥ x_{0} - x_{*} ∥) \in [0, 1)

(32)

we conclude

x_{k + 1} \in U [x_{*}, ρ]

and

{lim}_{k ⟶ \infty} x_{k} = x_{*} .

□

Remark 1.

It follows from the proof of Theorem 1 that

y, z

can be chosen in particular as

y_{n} = a (x_{n})

and

z_{n} = b (x_{n}, y_{n}) .

Thus, the condition (H2) should hold for all

x, a (x), b (x, y) \in Ω_{0}

and not

x, y, z \in Ω_{0} .

Clearly, in this case, the resulting functions

{\bar{h}}_{i}

are at least as tight as the functions

h_{i}

, leading to an at least as large radius of convergence

\bar{ρ}

as ρ (see the numerical section).

Concerning the semi-local convergence of method (2), let us introduce scalar sequences

{t_{n}}, {s_{n}}

and

{u_{n}}

defined for

t_{0} = 0, s_{0} = η \geq 0

and the rest of the iterates, depending on operators

a, b, c

and F (see the next section for details). These sequences shall be shown to be majorizing for method (2). However, first, a convergence result for these sequences is needed.

Lemma 1.

Suppose that

\forall n = 0, 1, 2, \dots

t_{n} \leq s_{n} \leq u_{n} \leq t_{n + 1}

(33)

and

t_{n} \leq λ

(34)

for some

λ \geq 0 .

Then, the sequence

{t_{n}}

is convergent to its unique least upper bound

t_{*} \in [0, λ] .

Proof.

It follows from conditions (33) and (34) that sequence

{t_{n}}

is nondecreasing and bounded from above by

λ

, and as such, it converges to

t_{*} .

□

Theorem 2.

Suppose the following:

(H5) Iterates

{x_{n}}, {y_{n}}, {z_{n}}

generated by method (2) exist, belong in

U (x_{0}, t_{*})

and satisfy the conditions of Lemma 1 for all

n = 0, 1, 2, \dots

(H6)

∥ a (x_{n}) - x_{n} ∥ \leq s_{n} - t_{n},

∥ b (x_{n}, y_{n}) - y_{n} ∥ \leq u_{n} - s_{n}

and

∥ c (x_{n}, y_{n}, z_{n}) - z_{n} ∥ \leq t_{n + 1} - u_{n}

for all

n = 0, 1, 2, \dots

and

(H7)

U [x_{0}, t_{*}] \subset Ω .

Then, there exists

x_{*} \in U [x_{0}, t_{*}]

such that

{lim}_{n ⟶ \infty} x_{n} = x_{*} .

Proof.

It follows by condition (H5) that sequence

{t_{n}}

is Cauchy, since it is convergent. Thus, by condition (H6), sequence

{x_{n}}

is also Cauchy in the Banach space X, and as such, it converges to some

x_{*} \in U [x_{0}, t_{*}]

(since

U [x_{0}, t_{*}]

is a closed set). □

Remark 2.

(i) Additional conditions are needed to show

F (x_{*}) = 0 .

The same is true for the results on the uniqueness of the solution.

(ii) The limit point

t_{*}

is not available in closed form, so it can be replaced by λ in Theorem 2.

3. Special Cases I

The iterates of method (3) are assumed to exist, and operator F has a divided difference of order one.

Local Convergence

Three possibilities are presented for the local cases based on different estimates for the determination of the functions

h_{i} .

It follows by method (3) that

(P1): $y_{n} - x_{*} = x_{n} - x_{*} + α_{n} F (x_{n}) = (I + α_{n} [x_{n}, x_{*}; F]) (x_{n} - x_{*}),$

$\begin{matrix} z_{n} - x_{*} & = & (I + γ_{n} [y_{n}, x_{*}; F]) (y_{n} - x_{*}) + β_{n} [x_{n}, x_{*}; F] (x_{n} - x_{*}) \\ = & [(I + γ_{n} [y_{n}, x_{*}; F]) (I + α_{n} [x_{n}, x_{*}; F]) + β_{n} [x_{n}, x_{*}; F]] (x_{n} - x_{*}) \end{matrix}$

and

$\begin{matrix} x_{n + 1} - x_{*} & = & (I + θ_{n} [z_{n}, x_{*}; F]) (z_{n} - x_{*}) + δ_{n} [x_{n}, x_{*}; F] (x_{n} - x_{*}) \\ + ϵ_{n} [y_{n}, x_{*}; F] (y_{n} - x_{*}) \\ = & [(I + θ_{n} [z_{n}, x_{*}; F]) (I + γ_{n} [y_{n}, x_{*}; F]) (I + β_{n} [x_{n}, x_{*}; F]) \\ + δ_{n} [x_{n}, x_{*}; F] + ϵ_{n} [y_{n}, x_{*}; F] (I + α_{n} [x_{n}, x_{*}; F])] (x_{n} - x_{*}) \end{matrix}$

Hence, the functions $h_{i}$ are selected to satisfy $\forall x_{n}, y_{n}, z_{n} \in Ω$

$∥ I + α_{n} [x_{n}, x_{*}; F] ∥ \leq h_{1} (∥ x_{n} - x_{*} ∥),$

$∥ (I + γ_{n} [y_{n}, x_{*}; F]) (I + α_{n} [x_{n}, x_{*}; F]) + β_{n} [x_{n}, x_{*}; F] ∥ \leq h_{2} (∥ x_{n} - x_{*} ∥, ∥ y_{n} - x_{*} ∥)$

$\begin{matrix} ∥ (I + θ_{n} [z_{n}, x_{*}; F]) (I + γ_{n} [y_{n}, x_{*}; F]) (I + β_{n} [x_{n}, x_{*}; F]) \\ + δ_{n} [x_{n}, x_{*}; F] + ϵ_{n} [y_{n}, x_{*}; F] (I + α_{n} [x_{n}, x_{*}; F]) ∥ \\ \leq & h_{3} (∥ x_{n} - x_{*} ∥, ∥ y_{n} - x_{*} ∥, ∥ z_{n} - x_{*} ∥) . \end{matrix}$

A practical non-discrete choice for the function $h_{1}$ is given by

$∥ I + α (x) [x, x_{*}; F] ∥ \leq h_{1} (∥ x - x_{*} ∥) \forall x \in Ω .$

Another choice is given by

$h_{1} (t) = sup_{x \in Ω, ∥ x - x_{*} ∥ \leq t} ∥ I + α (x) [x, x_{*}; F] ∥ .$

The choices of functions $h_{2}$ and $h_{3}$ can follow similarly.
(P2): Let $M^{i} : Ω ⟶ X$ be a linear operator. By $M_{n}^{i}$ we denote $M^{i} (x_{n}) \forall n = 0, 1, 2, \dots .$
Then, it follows from method (3)

$\begin{matrix} y_{n} - x_{*} & = & x_{n} - x_{*} - M_{n}^{1} F (x_{n}) + (α_{n} + M_{n}^{1}) F (x_{n}) \\ & = & ((I - M_{n}^{1} [x_{n}, x_{*}; F]) + (α_{n} + M_{n}^{1}) [x_{n}, x_{*}; F]) (x_{n} - x_{*}), \\ z_{n} - x_{*} & = & ((I - M_{n}^{2} [y_{n}, x_{*}; F]) + (γ_{n} + M_{n}^{2}) [y_{n}, x_{*}; F]) (y_{n} - x_{*}) \end{matrix}$

and

$x_{n + 1} - x_{*} = ((I - M_{n}^{3} [z_{n}, x_{*}; F]) + (θ_{n} + M_{n}^{3}) [z_{n}, x_{*}; F]) (z_{n} - x_{*}) .$

Thus, the functions $h_{i}$ must satisfy

$∥ I + α_{n} ∥ \leq h_{1} (∥ x_{n} - x_{*} ∥),$

$∥ (I + γ_{n}) (I + α_{n}) ∥ \leq h_{2} (∥ x_{n} - x_{*} ∥, ∥ y_{n} - x_{*} ∥)$

and

$∥ x_{n + 1} - x_{*} ∥ \leq ∥ (I + θ_{n}) (I + γ_{n}) (I + α_{n}) ∥ \leq h_{3} (∥ x_{n} - x_{*} ∥, ∥ y_{n} - x_{*} ∥, ∥ z_{n} - x_{*} ∥) .$

Clearly, the function $h_{1}$ can be chosen again as in case (P1). The functions $h_{2}$ and $h_{3}$ can be defined similarly.
(P3): Assume there exists a function $φ_{0} : [0, \infty) ⟶ R$, continuous and non-decreasing, such that

$∥ F^{'} {(x_{*})}^{- 1} (F^{'} (x) - F^{'} (x_{*})) ∥ \leq φ_{0} (∥ x - x_{*} ∥) \forall x \in Ω .$

Then, we can write

$F (x_{n}) = F (x_{n}) - F (x_{*}) = \int_{0}^{1} F^{'} (x_{*} + θ (x_{n} - x_{*})) d θ (x_{n} - x_{*})$

leading to

$∥ F^{'} {(x_{*})}^{- 1} F (x_{n}) ∥ \leq \int_{0}^{1} φ_{0} (θ ∥ x_{n} - x_{*} ∥) d θ ∥ x_{n} - x_{*} ∥ .$

Then, by method (3) we obtain, in turn, that

$\begin{matrix} y_{n} - x_{*} & = & [I + α_{n} F^{'} (x_{*}) F^{'} {(x_{*})}^{- 1} \\ \times (\int_{0}^{1} F^{'} (x_{*} + θ (x_{n} - x_{*})) d θ - F^{'} (x_{*}) + F^{'} (x_{*}))] (x_{n} - x_{*}), \end{matrix}$

so, the function $h_{1}$ must satisfy

$∥ I + α_{n} \int_{0}^{1} F^{'} (x_{*} + θ (x_{n} - x_{*})) d θ ∥ \leq h_{1} (∥ x_{n} - x_{*} ∥)$

or

$h_{1} (t) = \sup_{∥ x - x_{*} ∥ \leq t, x \in Ω} ∥ I + α (x) \int_{0}^{1} F^{'} (x_{*} + θ (x - x_{*})) d θ ∥$

or

$∥ I + α_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ x_{n} - x_{*} ∥) d θ) \leq h_{1} (∥ x_{n} - x_{*} ∥)$

or

$h_{1} (t) = \sup_{∥ x - x_{*} ∥ \leq t, x \in Ω} ∥ I + α (x) F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ x - x_{*} ∥) d θ) .$

Similarly, for the other two steps, we obtain in the last choice

\begin{matrix} ∥ z_{n} - x_{*} ∥ & \leq & ∥ I + γ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ y_{n} - x_{*} ∥) d θ) ∥ y_{n} - x_{*} ∥ \\ + ∥ β_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ x_{n} - x_{*} ∥) d θ) ∥ x_{n} - x_{*} ∥ \end{matrix}

and

\begin{matrix} ∥ x_{n + 1} - x_{*} ∥ & \leq & ∥ I + θ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ z_{n} - x_{*} ∥) d θ) ∥ z_{n} - x_{*} ∥ \\ + ∥ δ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ x_{n} - x_{*} ∥) d θ) ∥ x_{n} - x_{*} ∥ \\ + ∥ ϵ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ y_{n} - x_{*} ∥) d θ) ∥ y_{n} - x_{*} ∥ . \end{matrix}

Thus, the function

h_{2}

satisfies

\begin{matrix} ∥ I + γ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ y_{n} - x_{*} ∥) d θ) ∥ y_{n} - x_{*} ∥ \\ + ∥ β_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ x_{n} - x_{*} ∥) d θ) \\ \leq & h_{2} (∥ x_{n} - x_{*} ∥, ∥ y_{n} - x_{*} ∥) \end{matrix}

or

\begin{matrix} h_{2} (s, t) & = & sup_{∥ x - x_{*} ∥ \leq s, ∥ y - x_{*} ∥ \leq t} [∥ I + γ (x) F^{'} (x_{*}) ∥ \\ \times (1 + \int_{0}^{1} φ_{0} (θ t) d θ) t) \\ + ∥ β (x) F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ s) d θ)] . \end{matrix}

Finally, concerning the choice of the function

h_{3},

by the third substep of method (3)

\begin{matrix} ∥ x_{n + 1} - x_{*} ∥ & \leq & ∥ I + θ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ z_{n} - x_{*} ∥) d θ) ∥ z_{n} - x_{*} ∥ \\ + ∥ δ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ x_{n} - x_{*} ∥) d θ) ∥ x_{n} - x_{*} ∥ \\ + ∥ ϵ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ y_{n} - x_{*} ∥) d θ) ∥ y_{n} - x_{*} ∥, \end{matrix}

so the function

h_{3}

must satisfy

\begin{matrix} ∥ I + θ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ z_{n} - x_{*} ∥) d θ) h_{2} (∥ x_{n} - x_{*} ∥, ∥ y_{n} - x_{*} ∥) \\ + ∥ δ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ x_{n} - x_{*} ∥) d θ) \\ + ∥ ϵ_{n} F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ ∥ y_{n} - x_{*} ∥) d θ) h_{1} (∥ x_{n} - x_{*} ∥) \\ \leq h_{3} (∥ x_{n} - x_{*} ∥, ∥ y_{n} - x_{*} ∥, ∥ z_{n} - x_{*} ∥) \end{matrix}

or

h_{3} (s, t, u) = \sup_{∥ x - x_{*} ∥ \leq s, ∥ y - x_{*} ∥ \leq t, ∥ z - x_{*} ∥ \leq u} μ (x, s, t, u),

where

\begin{matrix} μ (x, s, t, u) & = & ∥ I + θ (x) F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ u) d θ) h_{2} (s, t) \\ & & + ∥ δ (x) F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ s) d θ) \\ & & + ∥ ϵ (x) F^{'} (x_{*}) ∥ (1 + \int_{0}^{1} φ_{0} (θ t) d θ) h_{1} (s) . \end{matrix}

The functions

h_{2}

and

h_{3}

can also be defined with the other two choices as those of function

h_{1}

given previously.
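The first identity in case (P1), y_n - x_* = (I + α_n [x_n, x_*; F]) (x_n - x_*), can be verified numerically in the scalar case, where the divided difference is a difference quotient. The following sketch assumes the test function e^t - 1 with solution 0, a hypothetical iterate x_n = 0.4, and the Newton-type choice α_n = -1/F'(x_n); none of these values come from the article.

```python
import math

def divided_difference(F, u, v):
    # scalar divided difference of order one: [u, v; F] = (F(u) - F(v)) / (u - v)
    return (F(u) - F(v)) / (u - v)

F = lambda t: math.exp(t) - 1.0
dF = lambda t: math.exp(t)
x_star = 0.0                      # known solution of F(t) = 0

x_n = 0.4                         # assumed current iterate
alpha_n = -1.0 / dF(x_n)          # Newton-type choice of alpha_n in method (3)

y_n = x_n + alpha_n * F(x_n)      # first substep of method (3)
factor = 1.0 + alpha_n * divided_difference(F, x_n, x_star)

lhs = y_n - x_star
rhs = factor * (x_n - x_star)
```

The absolute value of `factor` is exactly the quantity that a function h_1 has to dominate at this iterate, so a value below 1 corresponds to a contraction in the first substep.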

Semi-local Convergence

In this case, condition (H6) of Theorem 2 can be replaced, for method (3), by

∥ α_{n} F (x_{n}) ∥ \leq s_{n} - t_{n},

∥ β_{n} F (x_{n}) + γ_{n} F (y_{n}) ∥ \leq u_{n} - s_{n}

and

∥ δ_{n} F (x_{n}) + ϵ_{n} F (y_{n}) + θ_{n} F (z_{n}) ∥ \leq t_{n + 1} - u_{n} \forall n = 0, 1, 2, \dots .

Notice that under these choices,

∥ y_{n} - x_{n} ∥ \leq s_{n} - t_{n}

∥ z_{n} - y_{n} ∥ \leq u_{n} - s_{n}

and

∥ x_{n + 1} - z_{n} ∥ \leq t_{n + 1} - u_{n} .

Then, the conclusions of Theorem 2 hold for method (3). Even more specialized choices of the linear operators appearing in these methods, as well as of the functions

h_{i}

can be found in the Introduction, the next section, or in [1,2,11,21] and the references therein.

4. Special Cases II

This section contains even more specialized cases of method (2) and method (3). In particular, we study the local and semi-local convergence first of method (23) and second of method (20). Notice that to obtain method (23), we set in method (3)

\begin{matrix} α_{n} & = & - F^{'} {(x_{n})}^{- 1}, u_{n} = y_{n}, β_{n} = O, γ_{n} = - 5 F^{'} {(x_{n})}^{- 1}, \\ v_{n} & = & y_{n}, δ_{n} = O, ϵ_{n} = - \frac{9}{5} F^{'} {(x_{n})}^{- 1} \text{ and } θ_{n} = - \frac{1}{5} F^{'} {(x_{n})}^{- 1} . \end{matrix}

(35)

Moreover, for method (20), we let

\begin{matrix} α_{n} & = & - {[x_{n}, w_{n}; F]}^{- 1}, u_{n} = y_{n}, β_{n} = O, z_{n} = x_{n + 1}, \\ γ_{n} & = & - {([y_{n}, w_{n}; F] + [y_{n}, x_{n}; F] - [x_{n}, w_{n}; F])}^{- 1}, δ_{n} = ϵ_{n} = θ_{n} = O \end{matrix}

(36)

and

v_{n} = z_{n} .
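As a sanity check on the parameter choice (35), the scalar sketch below performs one sweep of method (3) with u_n = v_n = y_n, β_n = δ_n = 0, and compares it with method (23) written out directly. It takes θ_n = -(1/5) F'(x_n)^{-1}, which is what the third substep of (23) requires. The helper names, the test function e^t - 1, and the iterate x = 0.3 are assumptions for illustration.

```python
import math

def method3_step(F, x, alpha, gamma, eps, theta):
    # one sweep of method (3) with u_n = v_n = y_n and beta_n = delta_n = 0
    y = x + alpha * F(x)
    z = y + gamma * F(y)
    return y + eps * F(y) + theta * F(z)

def method23_step(F, dF, x):
    # one sweep of method (23) written out directly
    inv = 1.0 / dF(x)
    y = x - inv * F(x)
    z = y - 5.0 * inv * F(y)
    return y - (9.0 * inv * F(y) + inv * F(z)) / 5.0

F = lambda t: math.exp(t) - 1.0
dF = lambda t: math.exp(t)
x = 0.3                      # assumed iterate
inv = 1.0 / dF(x)

via_method3 = method3_step(F, x, alpha=-inv, gamma=-5.0 * inv,
                           eps=-9.0 * inv / 5.0, theta=-inv / 5.0)
direct = method23_step(F, dF, x)
```

The two results agree up to rounding, which is consistent with reading (35) as the specialization of method (3) that yields method (23).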

5. Local Convergence of Method (23)

The local convergence analysis of method (23) uses some scalar functions and parameters. Let

S = [0, \infty) .

Suppose the following:

(i): There exists a function $w_{0} : S ⟶ R$, continuous and non-decreasing, such that the equation

$w_{0} (t) - 1 = 0$

has a smallest solution $ρ_{0} \in S - {0} .$ Let $S_{0} = [0, ρ_{0}) .$
(ii): There exists a function $w : S_{0} ⟶ R$, continuous and non-decreasing, such that the equation

$h_{1} (t) - 1 = 0$

has a smallest solution $ρ_{1} \in S_{0} - {0},$ where the function $h_{1} : S_{0} ⟶ R$ is defined by

$h_{1} (t) = \frac{\int_{0}^{1} w ((1 - θ) t) d θ}{1 - w_{0} (t)} .$
(iii): Equation

$w_{0} (h_{1} (t) t) - 1 = 0$

has a smallest solution ${\bar{ρ}}_{1} \in S_{0} - {0} .$ Let

${\bar{\bar{ρ}}}_{0} = min {ρ_{0}, {\bar{ρ}}_{1}}$

and ${\tilde{S}}_{1} = [0, {\bar{\bar{ρ}}}_{0}) .$
(iv): Equation

$h_{2} (t) - 1 = 0$

has a smallest solution $ρ_{2} \in {\tilde{S}}_{1} - {0},$ where the function $h_{2} : {\tilde{S}}_{1} ⟶ R$ is defined as

$\begin{matrix} h_{2} (t) & = & [\frac{\int_{0}^{1} w ((1 - θ) h_{1} (t) t) d θ}{1 - w_{0} (h_{1} (t) t)} \\ & & + \frac{w ((1 + h_{1} (t)) t) (1 + \int_{0}^{1} w_{0} (θ h_{1} (t) t) d θ)}{(1 - w_{0} (t)) (1 - w_{0} (h_{1} (t) t))} \\ & & + \frac{4 (1 + \int_{0}^{1} w_{0} (θ h_{1} (t) t) d θ)}{1 - w_{0} (t)}] h_{1} (t) . \end{matrix}$
(v): Equation

$h_{3} (t) - 1 = 0$

has a smallest solution $ρ_{3} \in {\tilde{S}}_{1} - {0},$ where the function $h_{3} : {\tilde{S}}_{1} ⟶ R$ is defined by

$\begin{matrix} h_{3} (t) & = & h_{1} (t) + \frac{1}{5} [\frac{9 (1 + \int_{0}^{1} w_{0} (θ h_{1} (t) t) d θ) h_{1} (t)}{1 - w_{0} (t)} \\ & & + (1 + \int_{0}^{1} w_{0} (θ h_{2} (t) t) d θ) h_{2} (t)] . \end{matrix}$

The parameter

ρ

defined by

ρ = min {ρ_{j}} j = 1, 2, 3

(37)

is proven to be a radius of convergence for method (23) in Theorem 3. Let

S_{1} = [0, ρ) .

Then, it follows by these definitions that

\forall t \in S_{1}

0 \leq w_{0} (t) < 1

(38)

0 \leq w_{0} (h_{1} (t) t) < 1

(39)

and

0 \leq h_{i} (t) < 1 .

(40)
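The radii defined above are the smallest positive solutions of scalar equations, so they can be approximated numerically. The sketch below assumes linear majorant functions w_0(t) = L_0 t and w(t) = L t with illustrative constants L_0 = L = 1 (these are assumptions, not values from the article), for which the integrals in h_1 and h_2 have closed forms. It computes ρ_1 and ρ_2 only; h_3 is left out of the sketch. The helper `smallest_root` is a hypothetical name for a plain grid scan followed by bisection.

```python
def smallest_root(h, upper, grid=10000, bisections=80):
    # first t in (0, upper) with h(t) = 1, located by a grid scan and then bisection
    lo = 0.0
    for k in range(1, grid):
        hi = upper * k / grid
        if h(hi) >= 1.0:
            for _ in range(bisections):
                mid = 0.5 * (lo + hi)
                if h(mid) >= 1.0:
                    hi = mid
                else:
                    lo = mid
            return hi
        lo = hi
    return None  # no crossing found on the grid

# Assumed linear majorants w0(t) = L0 * t and w(t) = L * t (illustrative constants)
L0, L = 1.0, 1.0
rho0 = 1.0 / L0                     # smallest solution of w0(t) - 1 = 0

def h1(t):
    # integral of w((1 - theta) t) over [0, 1] equals L t / 2 for linear w
    return (L * t / 2.0) / (1.0 - L0 * t)

def h2(t):
    s = h1(t) * t                   # bound on ||y_n - x_*||
    avg = 1.0 + L0 * s / 2.0        # 1 + integral of w0(theta s) over [0, 1]
    first = (L * s / 2.0) / (1.0 - L0 * s)
    second = L * (1.0 + h1(t)) * t * avg / ((1.0 - L0 * t) * (1.0 - L0 * s))
    third = 4.0 * avg / (1.0 - L0 * t)
    return (first + second + third) * h1(t)

rho1 = smallest_root(h1, rho0)
rho2 = smallest_root(h2, rho1)
```

For linear majorants, ρ_1 has the closed form 2 / (2 L_0 + L), which gives a quick check of the root finder. The grid scan is used because a bare bisection could miss the smallest root when a function h_i is not monotone.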

The conditions required are as follows:

(C1) Equation

F (x) = 0

has a simple solution

x_{*} \in Ω .

(C2)

∥ F^{'} {(x_{*})}^{- 1} (F^{'} (x) - F^{'} (x_{*})) ∥ \leq w_{0} (∥ x - x_{*} ∥)

\forall x \in Ω .

Set

Ω_{1} = U (x_{*}, ρ_{0}) \cap Ω .

(C3)

∥ F^{'} {(x_{*})}^{- 1} (F^{'} (y) - F^{'} (x)) ∥ \leq w (∥ y - x ∥)

\forall x, y \in Ω_{1}

and

(C4)

U [x_{*}, ρ] \subset Ω .

Next, the main local convergence result follows for method (23).

Theorem 3.

Suppose that conditions (C1)–(C4) hold and

x_{0} \in U (x_{*}, ρ) - {x_{*}} .

Then, the sequence

{x_{n}}

generated by method (23) is well defined in

U (x_{*}, ρ),

remains in

U (x_{*}, ρ) \forall n = 0, 1, 2, \dots

and is convergent to

x_{*} .

Moreover, the following assertions hold:

∥ y_{n} - x_{*} ∥ \leq h_{1} (∥ x_{n} - x_{*} ∥) ∥ x_{n} - x_{*} ∥ \leq ∥ x_{n} - x_{*} ∥ < ρ,

(41)

∥ z_{n} - x_{*} ∥ \leq h_{2} (∥ x_{n} - x_{*} ∥) ∥ x_{n} - x_{*} ∥ \leq ∥ x_{n} - x_{*} ∥,

(42)

and

∥ x_{n + 1} - x_{*} ∥ \leq h_{3} (∥ x_{n} - x_{*} ∥) ∥ x_{n} - x_{*} ∥ \leq ∥ x_{n} - x_{*} ∥,

(43)

where functions

h_{i}

are defined previously and the radius ρ is given by Formula (37).

Proof.

Let

u \in U (x_{*}, ρ) - {x_{*}} .

By using conditions (C1), (C2) and (37), we have that

∥ F^{'} {(x_{*})}^{- 1} (F^{'} (u) - F^{'} (x_{*})) ∥ \leq w_{0} (∥ u - x_{*} ∥) \leq w_{0} (ρ) < 1 .

(44)

It follows by (44) and the Banach lemma on invertible operators [11,15] that

F^{'} {(u)}^{- 1} \in L (X, X)

and

∥ F^{'} {(u)}^{- 1} F^{'} (x_{*}) ∥ \leq \frac{1}{1 - w_{0} (∥ u - x_{*} ∥)} .

(45)

If

u = x_{0},

then the iterate

y_{0}

is well defined by the first substep of method (23) and we can write

\begin{matrix} y_{0} - x_{*} & = & x_{0} - x_{*} - F^{'} {(x_{0})}^{- 1} F (x_{0}) \\ & = & F^{'} {(x_{0})}^{- 1} \int_{0}^{1} (F^{'} (x_{0}) - F^{'} (x_{*} + θ (x_{0} - x_{*}))) d θ (x_{0} - x_{*}) . \end{matrix}

(46)

In view of (C1)–(C3), (45) (for

u = x_{0}

), (40) (for

i = 1

) and (46), we obtain in turn that

\begin{matrix} ∥ y_{0} - x_{*} ∥ & \leq & \frac{\int_{0}^{1} w ((1 - θ) ∥ x_{0} - x_{*} ∥) d θ ∥ x_{0} - x_{*} ∥}{1 - w_{0} (∥ x_{0} - x_{*} ∥)} \\ \leq & h_{1} (∥ x_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ < ∥ x_{0} - x_{*} ∥ < ρ . \end{matrix}

(47)

Thus, the iterate

y_{0} \in U (x_{*}, ρ)

and (41) holds for

n = 0 .

The iterate

z_{0}

is well defined by the second substep of method (23), so we can write

\begin{matrix} z_{0} - x_{*} & = & y_{0} - x_{*} - 5 F^{'} {(x_{0})}^{- 1} F (y_{0}) \\ & = & y_{0} - x_{*} - F^{'} {(y_{0})}^{- 1} F (y_{0}) \\ & & + F^{'} {(y_{0})}^{- 1} (F^{'} (x_{0}) - F^{'} (y_{0})) F^{'} {(x_{0})}^{- 1} F (y_{0}) \\ & & - 4 F^{'} {(x_{0})}^{- 1} F (y_{0}) . \end{matrix}

(48)

Notice that linear operator

F^{'} {(y_{0})}^{- 1}

exists by (45) (for

u = y_{0}

). It follows by (37), (40) (for

i = 2

), (C3), (45) (for

u = x_{0}, y_{0}

), in turn that

\begin{matrix} ∥ z_{0} - x_{*} ∥ & \leq & [\frac{\int_{0}^{1} w ((1 - θ) ∥ y_{0} - x_{*} ∥) d θ}{1 - w_{0} (∥ y_{0} - x_{*} ∥)} \\ & & + \frac{w (∥ y_{0} - x_{0} ∥) (1 + \int_{0}^{1} w_{0} (θ ∥ y_{0} - x_{*} ∥) d θ)}{(1 - w_{0} (∥ x_{0} - x_{*} ∥)) (1 - w_{0} (∥ y_{0} - x_{*} ∥))} \\ & & + \frac{4 (1 + \int_{0}^{1} w_{0} (θ ∥ y_{0} - x_{*} ∥) d θ)}{1 - w_{0} (∥ x_{0} - x_{*} ∥)}] ∥ y_{0} - x_{*} ∥ \\ & \leq & h_{2} (∥ x_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ \leq ∥ x_{0} - x_{*} ∥ . \end{matrix}

(49)

Thus, the iterate

z_{0} \in U (x_{*}, ρ)

and (42) holds for

n = 0,

where we also used (C1) and (C2) to obtain the estimate

\begin{matrix} ∥ F^{'} {(x_{*})}^{- 1} F (y_{0}) ∥ & = & ∥ F^{'} {(x_{*})}^{- 1} [\int_{0}^{1} F^{'} (x_{*} + θ (y_{0} - x_{*})) d θ - F^{'} (x_{*}) \\ + F^{'} (x_{*})] (y_{0} - x_{*}) ∥ \\ \leq & (1 + \int_{0}^{1} w_{0} (θ ∥ y_{0} - x_{*} ∥) d θ) ∥ y_{0} - x_{*} ∥ . \end{matrix}

Moreover, the iterate

x_{1}

is well defined by the third substep of method (23), so we can have

x_{1} - x_{*} = y_{0} - x_{*} - \frac{1}{5} F^{'} {(x_{0})}^{- 1} (9 F (y_{0}) + F (z_{0})),

leading to

\begin{matrix} ∥ x_{1} - x_{*} ∥ & \leq & ∥ y_{0} - x_{*} ∥ + \frac{1}{5} (\frac{9 (1 + \int_{0}^{1} w_{0} (θ ∥ y_{0} - x_{*} ∥) d θ) ∥ y_{0} - x_{*} ∥}{1 - w_{0} (∥ y_{0} - x_{*} ∥)} \\ + (1 + \int_{0}^{1} w_{0} (θ ∥ z_{0} - x_{*} ∥) d θ) ∥ z_{0} - x_{*} ∥) \\ \leq & h_{3} (∥ x_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ \leq ∥ x_{0} - x_{*} ∥ < ρ . \end{matrix}

(50)

Therefore, the iterate

x_{1} \in U (x_{*}, ρ)

and (43) holds for

n = 0 .

Replace

x_{0}, y_{0}, z_{0}, x_{1}

by

x_{m}, y_{m}, z_{m}, x_{m + 1} \forall m = 0, 1, 2 \dots

in the preceding calculations to complete the induction for the estimates (41)–(43). Then, by the estimate

∥ x_{m + 1} - x_{*} ∥ \leq d ∥ x_{m} - x_{*} ∥ < ρ,

(51)

where

d = h_{3} (∥ x_{0} - x_{*} ∥) \in [0, 1)

, we obtain that

x_{m + 1} \in U (x_{*}, ρ)

and

lim_{m ⟶ \infty} x_{m} = x_{*} .

□

The uniqueness of the solution result for method (23) follows.

Proposition 1.

Suppose the following:

(i): Equation $F (x) = 0$ has a simple solution $x_{*} \in U (x_{*}, r) \subset Ω$ for some $r > 0 .$
(ii): Condition (C2) holds.
(iii): There exists $r_{1} \geq r$ such that

$\int_{0}^{1} w_{0} (θ r_{1}) d θ < 1 .$

(52)

Set

Ω_{2} = U [x_{*}, r_{1}] \cap Ω .

Then, the only solution of equation

F (x) = 0

in the set

Ω_{2}

is

x_{*} .

Proof.

Let

y_{*} \in Ω_{2}

be such that

F (y_{*}) = 0 .

Define the linear operator

J = \int_{0}^{1} F^{'} (x_{*} + θ (y_{*} - x_{*})) d θ .

It then follows by (ii) and (52) that

\begin{matrix} ∥ F^{'} {(x_{*})}^{- 1} (J - F^{'} (x_{*})) ∥ & \leq & \int_{0}^{1} w_{0} (θ ∥ y_{*} - x_{*} ∥) d θ \\ & \leq & \int_{0}^{1} w_{0} (θ r_{1}) d θ < 1 . \end{matrix}

Hence, we deduce

x_{*} = y_{*}

by the invertibility of J and the identity

J (x_{*} - y_{*}) = F (x_{*}) - F (y_{*}) = 0 .

□

Remark 3.

Under all conditions of Theorem 3, we can set

ρ = r .

Example 2.

Consider the motion system

F_{1}^{'} (v_{1}) = e^{v_{1}}, F_{2}^{'} (v_{2}) = (e - 1) v_{2} + 1, F_{3}^{'} (v_{3}) = 1

with

F_{1} (0) = F_{2} (0) = F_{3} (0) = 0 .

Let

F = {(F_{1}, F_{2}, F_{3})}^{t r} .

Let

X = R^{3}, Ω = U [0, 1], x_{*} = {(0, 0, 0)}^{t r} .

Let the function F be defined on Ω for

v = {(v_{1}, v_{2}, v_{3})}^{t r}

by

F (v) = {(e^{v_{1}} - 1, \frac{e - 1}{2} v_{2}^{2} + v_{2}, v_{3})}^{t r} .

Using this definition, we obtain the derivative as

F^{'} (v) = [\begin{matrix} e^{v_{1}} & 0 & 0 \\ 0 & (e - 1) v_{2} + 1 & 0 \\ 0 & 0 & 1 \end{matrix}] .

Hence,

F^{'} (x_{*}) = I .

Let

v \in R^{3}

with

v = {(v_{1}, v_{2}, v_{3})}^{t r} .

Moreover, the norm for

N = (n_{j, i}) \in R^{3 \times 3}

is

∥ N ∥ = max_{1 \leq j \leq 3} \sum_{i = 1}^{3} | n_{j, i} | .

Conditions (C1)–(C3) are verified for

w_{0} (t) = (e - 1) t

and

w (t) = 2 (1 + \frac{1}{e - 1}) t .

Then, the radii are

ρ_{1} = 0.3030, ρ_{2} = 0.1033 = ρ \text{ and } ρ_{3} = 0.1461 .
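The behaviour predicted by Theorem 3 can be observed numerically on this example. The following is a minimal sketch, assuming the three substeps of method (23) as they can be read off from the proof of Theorem 3 (a Newton substep, then z_n = y_n - 5 F'(x_n)^{-1} F(y_n), then x_{n+1} = y_n - (1/5) F'(x_n)^{-1} (9 F(y_n) + F(z_n))). The starting point, the number of sweeps and all function names are hypothetical choices made for illustration; since F' is diagonal here, the linear solves reduce to componentwise divisions.

```python
import math

def F(v):
    """Operator of Example 2, evaluated componentwise (expm1(v) = e**v - 1)."""
    v1, v2, v3 = v
    return [math.expm1(v1), 0.5 * (math.e - 1.0) * v2 ** 2 + v2, v3]

def dF_diag(v):
    """Diagonal of the Jacobian F'(v); all off-diagonal entries are zero."""
    v1, v2, v3 = v
    return [math.exp(v1), (math.e - 1.0) * v2 + 1.0, 1.0]

def step(x):
    """One sweep of the three substeps, with F'(x_n) reused in each of them."""
    d = dF_diag(x)
    y = [xi - fi / di for xi, fi, di in zip(x, F(x), d)]
    Fy = F(y)
    z = [yi - 5.0 * fi / di for yi, fi, di in zip(y, Fy, d)]
    Fz = F(z)
    return [yi - (9.0 * fy + fz) / (5.0 * di)
            for yi, fy, fz, di in zip(y, Fy, Fz, d)]

def max_norm(v):
    return max(abs(c) for c in v)

x = [0.1, 0.1, 0.1]       # hypothetical start with max-norm 0.1 < rho = 0.1033
errors = [max_norm(x)]    # x_* = 0, so the error equals the norm of the iterate
for _ in range(3):
    x = step(x)
    errors.append(max_norm(x))
print(errors)
```

In this run the recorded errors decrease from one sweep to the next, which is the behaviour stated in estimate (43). The sketch does not verify the radii themselves.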

Example 3.

If

X = C [0, 1]

is equipped with the max-norm,

Ω = U [0, 1],

consider

G : Ω ⟶ E_{1}

given as

G (λ) (x) = λ (x) - 6 \int_{0}^{1} x τ λ {(τ)}^{3} d τ .

(53)

We obtain

G^{'} (λ (ξ)) (x) = ξ (x) - 18 \int_{0}^{1} x τ λ {(τ)}^{2} ξ (τ) d τ, \text{ for each } ξ \in Ω .

Clearly,

x_{*} = 0

and the conditions (C1)–(C3) hold for

w_{0} (t) = 9 t

and

w (t) = 18 t .

Then, the radii are

ρ_{1} = 0.0556, ρ_{2} = 0.0089 = ρ \text{ and } ρ_{3} = 0.0206 .

6. Semi-Local Convergence of Method

As in the local case, we use some functions and parameters for the method (23).

Suppose:

There exists function

v_{0} : S ⟶ R

that is continuous and non-decreasing such that equation

v_{0} (t) - 1 = 0

has a smallest solution

τ_{0} \in S - {0} .

Consider function

v : S_{0} ⟶ R

to be continuous and non-decreasing. Define the scalar sequences for

η \geq 0

and

\forall n = 0, 1, 2, \dots

by

\begin{matrix} t_{0} & = & 0, s_{0} = η \\ u_{n} & = & s_{n} + \frac{5 \int_{0}^{1} v (θ (s_{n} - t_{n})) d θ (s_{n} - t_{n})}{1 - v_{0} (t_{n})}, \\ t_{n + 1} & = & u_{n} + \frac{1}{1 - v_{0} (t_{n})} [(1 + \int_{0}^{1} v_{0} (u_{n} + θ (u_{n} - s_{n})) d θ) (u_{n} - s_{n}) \\ & & + 3 \int_{0}^{1} v (θ (s_{n} - t_{n})) d θ (s_{n} - t_{n})] \\ s_{n + 1} & = & t_{n + 1} + \frac{1}{1 - v_{0} (t_{n + 1})} [\int_{0}^{1} v (θ (t_{n + 1} - t_{n})) d θ (t_{n + 1} - t_{n}) \\ & & + (1 + \int_{0}^{1} v_{0} (θ t_{n}) d θ) (t_{n + 1} - s_{n})] . \end{matrix}

(54)

This sequence is proven to be majorizing for method (23) in Theorem 4. However, first, we provide a general convergence result for sequence (54).

Lemma 2.

Suppose that

\forall n = 0, 1, 2, \dots

v_{0} (t_{n}) < 1

(55)

and there exists

τ \in [0, τ_{0})

such that

t_{n} \leq τ .

(56)

Then, sequence

{t_{n}}

converges to some

t_{*} \in [0, τ] .

Proof.

It follows by (54)–(56) that sequence

{t_{n}}

is non-decreasing and bounded from above by

τ .

Hence, it converges to its unique least upper bound

t_{*} .

□
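To make sequence (54) concrete, the following worked specialization assumes linear majorant functions v_0(t) = L_0 t and v(t) = L t, where L_0 and L are hypothetical positive constants. This choice is made here for illustration only; it has the same form as the functions used in Example 4. Under this assumption, the integrals in (54) can be evaluated in closed form:

```latex
\begin{aligned}
u_{n} &= s_{n} + \frac{5 L (s_{n} - t_{n})^{2}}{2 (1 - L_{0} t_{n})}, \\
t_{n+1} &= u_{n} + \frac{1}{1 - L_{0} t_{n}}
  \Big[ \Big( 1 + \tfrac{L_{0}}{2} (3 u_{n} - s_{n}) \Big) (u_{n} - s_{n})
  + \tfrac{3 L}{2} (s_{n} - t_{n})^{2} \Big], \\
s_{n+1} &= t_{n+1} + \frac{1}{1 - L_{0} t_{n+1}}
  \Big[ \tfrac{L}{2} (t_{n+1} - t_{n})^{2}
  + \Big( 1 + \tfrac{L_{0}}{2} t_{n} \Big) (t_{n+1} - s_{n}) \Big].
\end{aligned}
```

In this linear case the smallest solution of v_0(t) - 1 = 0 is τ_0 = 1 / L_0, so condition (55) reads L_0 t_n < 1.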

Next, the operator F is related to the scalar functions.

Suppose the following:

(h1): There exists $x_{0} \in Ω, η \geq 0$ such that $F^{'} {(x_{0})}^{- 1} \in L (X, X)$ and $∥ F^{'} {(x_{0})}^{- 1} F (x_{0}) ∥ \leq η .$
(h2): $∥ F^{'} {(x_{0})}^{- 1} (F^{'} (x) - F^{'} (x_{0})) ∥ \leq v_{0} (∥ x - x_{0} ∥)$ for all $x \in Ω .$
Set $Ω_{3} = Ω \cap U (x_{0}, τ_{0}) .$
(h3): $∥ F^{'} {(x_{0})}^{- 1} (F^{'} (y) - F^{'} (x)) ∥ \leq v (∥ y - x ∥)$ for all $x, y \in Ω_{3} .$
(h4): Conditions of Lemma 2 hold.
and
(h5): $U [x_{0}, t_{*}] \subset Ω .$

We present the semi-local convergence result for the method (23).

Theorem 4.

Suppose that conditions (h1)–(h5) hold. Then, sequence

{x_{n}}

given by method (23) is well defined, remains in

U [x_{0}, t_{*}]

and converges to a solution

x_{*} \in U [x_{0}, t_{*}]

of equation

F (x) = 0 .

Moreover, the following assertions hold:

∥ y_{n} - x_{n} ∥ \leq s_{n} - t_{n},

(57)

∥ z_{n} - y_{n} ∥ \leq u_{n} - s_{n}

(58)

and

∥ x_{n + 1} - z_{n} ∥ \leq t_{n + 1} - u_{n} .

(59)

Proof.

Mathematical induction is used to show estimates (57)–(59). By (h1) and method (23), we have for

n = 0

∥ y_{0} - x_{0} ∥ = ∥ F^{'} {(x_{0})}^{- 1} F (x_{0}) ∥ \leq η = s_{0} - t_{0} \leq t_{*} .

Thus, the iterate

y_{0} \in U [x_{0}, t_{*}]

and (57) holds for

n = 0 .

Let

u \in U [x_{0}, t_{*}] .

Then, as in Theorem 3, we get

∥ F^{'} {(u)}^{- 1} F^{'} (x_{0}) ∥ \leq \frac{1}{1 - v_{0} (∥ u - x_{0} ∥)} .

(60)

Hence, if we set

u = x_{0}

, iterates

y_{0}, z_{0}

and

x_{1}

are well defined by method (23) for

n = 0 .

Suppose iterates

x_{k}, y_{k}, z_{k}, x_{k + 1}

also exist for all integer values k smaller than

n .

Then, we have the estimates

\begin{matrix} ∥ z_{n} - y_{n} ∥ & = & 5 ∥ F^{'} {(x_{n})}^{- 1} F (y_{n}) ∥ \\ & \leq & \frac{5 \int_{0}^{1} v (θ ∥ y_{n} - x_{n} ∥) d θ ∥ y_{n} - x_{n} ∥}{1 - v_{0} (∥ x_{n} - x_{0} ∥)} \\ & \leq & \frac{5 \int_{0}^{1} v (θ (s_{n} - t_{n})) d θ (s_{n} - t_{n})}{1 - v_{0} (t_{n})} = u_{n} - s_{n}, \end{matrix}

\begin{matrix} ∥ x_{n + 1} - z_{n} ∥ & = & ∥ \frac{1}{5} F^{'} {(x_{n})}^{- 1} (F (y_{n}) - F (z_{n})) + 3 F^{'} {(x_{n})}^{- 1} F (y_{n}) ∥ \\ & \leq & \frac{1}{1 - v_{0} (∥ x_{n} - x_{0} ∥)} [(1 + \frac{1}{5} \int_{0}^{1} v_{0} (∥ z_{n} - x_{0} ∥ + θ ∥ z_{n} - y_{n} ∥) d θ) ∥ z_{n} - y_{n} ∥ \\ & & + 3 \int_{0}^{1} v (θ ∥ y_{n} - x_{n} ∥) d θ ∥ y_{n} - x_{n} ∥] \\ & \leq & t_{n + 1} - u_{n} \end{matrix}

and

\begin{matrix} ∥ y_{n + 1} - x_{n + 1} ∥ & = & ∥ F^{'} {(x_{n + 1})}^{- 1} F (x_{n + 1}) ∥ \\ \leq & ∥ F^{'} {(x_{n + 1})}^{- 1} F^{'} (x_{0}) ∥ ∥ F^{'} {(x_{0})}^{- 1} F (x_{n + 1}) ∥ \\ \leq & \frac{1}{1 - v_{0} (∥ x_{n + 1} - x_{0} ∥)} [\int_{0}^{1} v (θ ∥ x_{n + 1} - x_{n} ∥) d θ ∥ x_{n + 1} - x_{n} ∥ \\ + (1 + \int_{0}^{1} v_{0} (θ ∥ x_{n} - x_{0} ∥) d θ) ∥ x_{n + 1} - y_{n} ∥] \\ \leq & s_{n + 1} - t_{n + 1}, \end{matrix}

where we also used

\begin{matrix} F (y_{n}) & = & F (y_{n}) - F (x_{n}) - F^{'} (x_{n}) (y_{n} - x_{n}) \\ = & \int_{0}^{1} [F^{'} (x_{n} + θ (y_{n} - x_{n})) d θ - F^{'} (x_{n})] (y_{n} - x_{n}), \end{matrix}

so

∥ F^{'} {(x_{0})}^{- 1} F (y_{n}) ∥ \leq \int_{0}^{1} v (θ ∥ y_{n} - x_{n} ∥) d θ ∥ y_{n} - x_{n} ∥

and

\begin{matrix} F (x_{n + 1}) & = & F (x_{n + 1}) - F (x_{n}) - F^{'} (x_{n}) (y_{n} - x_{n}) \\ - F^{'} (x_{n}) (x_{n + 1} - x_{n}) + F^{'} (x_{n}) (x_{n + 1} - x_{n}) \\ = & F (x_{n + 1}) - F (x_{n}) - F^{'} (x_{n}) (x_{n + 1} - x_{n}) + F^{'} (x_{n}) (x_{n + 1} - y_{n}), \end{matrix}

so

\begin{matrix} ∥ F^{'} {(x_{0})}^{- 1} F (x_{n + 1}) ∥ & \leq & \int_{0}^{1} v (θ ∥ x_{n + 1} - x_{n} ∥) d θ ∥ x_{n + 1} - x_{n} ∥ \\ + (1 + v_{0} (∥ x_{n} - x_{0} ∥)) ∥ x_{n + 1} - y_{n} ∥ \\ \leq & \int_{0}^{1} v (θ (t_{n + 1} - t_{n})) d θ (t_{n + 1} - t_{n}) \\ + (1 + v_{0} (t_{n})) (t_{n + 1} - s_{n}), \\ ∥ z_{n} - x_{0} ∥ & \leq & ∥ z_{n} - y_{n} ∥ + ∥ y_{n} - x_{0} ∥ \\ \leq & u_{n} - s_{n} + s_{n} - t_{0} \leq t_{*} \end{matrix}

(61)

and

\begin{matrix} ∥ x_{n + 1} - x_{0} ∥ & \leq & ∥ x_{n + 1} - z_{n} ∥ + ∥ z_{n} - x_{0} ∥ \\ \leq & t_{n + 1} - u_{n} + u_{n} - t_{0} \leq t_{*} . \end{matrix}

Hence, sequence

{t_{n}}

is majorizing for method (23) and iterates

{x_{n}}, {y_{n}}, {z_{n}}

belong in

U [x_{0}, t_{*}] .

The sequence

{x_{n}}

is therefore fundamental (Cauchy) in the Banach space X and, as such, it converges to some

x_{*} \in U [x_{0}, t_{*}] .

By using the continuity of F and letting

n ⟶ \infty

in (61), we deduce

F (x_{*}) = 0 .

□

Proposition 2.

Suppose:

(i): There exists a solution $x_{*} \in U (x_{0}, ρ_{2})$ of equation $F (x) = 0$ for some $ρ_{2} > 0 .$
(ii): Condition (h2) holds.
(iii): There exists $ρ_{3} \geq ρ_{2}$ such that

$\int_{0}^{1} v_{0} ((1 - θ) ρ_{2} + θ ρ_{3}) d θ < 1 .$

(62)

Set

Ω_{4} = Ω \cap U [x_{0}, ρ_{3}] .

Then,

x_{*}

is the only solution of equation

F (x) = 0

in the region

Ω_{4} .

Proof.

Let

y_{*} \in Ω_{4}

with

F (y_{*}) = 0 .

Define the linear operator

Q = \int_{0}^{1} F^{'} (x_{*} + θ (y_{*} - x_{*})) d θ .

Then, by (h2) and (62), we obtain in turn that

\begin{matrix} ∥ F^{'} {(x_{0})}^{- 1} (Q - F^{'} (x_{0})) ∥ & \leq & \int_{0}^{1} v_{0} ((1 - θ) ∥ x_{*} - x_{0} ∥ + θ ∥ y_{*} - x_{0} ∥) d θ \\ & \leq & \int_{0}^{1} v_{0} ((1 - θ) ρ_{2} + θ ρ_{3}) d θ < 1 . \end{matrix}

Thus,

x_{*} = y_{*} .

□

The next two examples show how to choose the functions

v_{0}, v

, and the parameter

η .

Example 4.

Set

X = R .

Let us consider a scalar function F defined on the set

Ω = U [x_{0}, 1 - μ]

for

μ \in (0, 1)

by

F (x) = x^{3} - μ .

Choose

x_{0} = 1 .

Then, the conditions (h1)–(h3) are verified for

η = \frac{1 - μ}{3}, v_{0} (t) = (3 - μ) t

and

v (t) = 2 (1 + \frac{1}{3 - μ}) t .
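The choices of Example 4 can be checked numerically. The sketch below assumes the hypothetical value μ = 0.95, evaluates sequence (54) in closed form for the linear functions v_0 and v given above, and runs the three substeps of method (23) as they appear in the proof of Theorem 4. The function names and the number of steps are illustrative assumptions, not part of the original analysis.

```python
def majorizing(eta, L0, L, n_steps):
    """Sequence (54) for v0(t) = L0*t and v(t) = L*t (integrals done by hand)."""
    t, s = 0.0, eta
    ts, ss = [t], [s]
    for _ in range(n_steps):
        if L0 * t >= 1.0:
            raise ValueError("condition (55) fails: v0(t_n) >= 1")
        u = s + 2.5 * L * (s - t) ** 2 / (1.0 - L0 * t)
        t_next = u + ((1.0 + 0.5 * L0 * (3.0 * u - s)) * (u - s)
                      + 1.5 * L * (s - t) ** 2) / (1.0 - L0 * t)
        if L0 * t_next >= 1.0:
            raise ValueError("condition (55) fails: v0(t_{n+1}) >= 1")
        s_next = t_next + (0.5 * L * (t_next - t) ** 2
                           + (1.0 + 0.5 * L0 * t) * (t_next - s)) / (1.0 - L0 * t_next)
        t, s = t_next, s_next
        ts.append(t)
        ss.append(s)
    return ts, ss

def method23(mu, x0, n_steps):
    """Run the three substeps on F(x) = x**3 - mu; record the gaps |y_n - x_n|."""
    def F(x):
        return x ** 3 - mu
    x, gaps = x0, []
    for _ in range(n_steps):
        d = 3.0 * x ** 2                 # F'(x_n), reused in all substeps
        y = x - F(x) / d
        z = y - 5.0 * F(y) / d
        gaps.append(abs(y - x))
        x = y - (9.0 * F(y) + F(z)) / (5.0 * d)
    return x, gaps

mu = 0.95                                # hypothetical value in (0, 1)
L0 = 3.0 - mu                            # v0(t) = (3 - mu) t
L = 2.0 * (1.0 + 1.0 / (3.0 - mu))       # v(t) = 2 (1 + 1/(3 - mu)) t
eta = (1.0 - mu) / 3.0
ts, ss = majorizing(eta, L0, L, 6)
x_end, gaps = method23(mu, 1.0, 4)
print(ts[-1], x_end, gaps)
```

For this value of μ the computed t_n stay below 1 - μ, so the ball U[x_0, t_*] remains inside Ω, and the gaps |y_n - x_n| stay below s_n - t_n as in estimate (57). Other values of μ may violate condition (55), in which case the sketch raises an error.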

Example 5.

Consider

X = C [0, 1]

and

Ω = U [0, 1] .

Then the problem [5]

Ξ (0) = 0, Ξ (1) = 1,

Ξ^{″} = - Ξ^{3} - ι Ξ^{2}

can also be written as an integral equation of the form

Ξ (q_{2}) = q_{2} + \int_{0}^{1} Θ (q_{2}, q_{1}) (Ξ^{3} (q_{1}) + ι Ξ^{2} (q_{1})) d q_{1}

where ι is a constant and

Θ (q_{2}, q_{1})

is the Green’s function

Θ (q_{2}, q_{1}) = \{\begin{matrix} q_{1} (1 - q_{2}), & q_{1} \leq q_{2} \\ q_{2} (1 - q_{1}), & q_{2} < q_{1} . \end{matrix}

Consider

F : Ω ⟶ X

as

[F (x)] (q_{2}) = x (q_{2}) - q_{2} - \int_{0}^{1} Θ (q_{2}, q_{1}) (x^{3} (q_{1}) + ι x^{2} (q_{1})) d q_{1} .

Choose

Ξ_{0} (q_{2}) = q_{2}

and

Ω = U (Ξ_{0}, ϵ_{0}) .

Then, clearly

U (Ξ_{0}, ϵ_{0}) \subset U (0, ϵ_{0} + 1),

since

∥ Ξ_{0} ∥ = 1 .

If

2 ι < 5,

then conditions (C1)–(C3) are satisfied for

w_{0} (t) = \frac{2 ι + 3 ρ_{0} + 6}{8} t, w (t) = \frac{ι + 6 ρ_{0} + 3}{4} t .

Hence,

w_{0} (t) \leq w (t) .

7. Local Convergence of Method

The local analysis relies on certain parameters and real functions. Let

L_{0}, L

and

α

be positive parameters. Set

T_{1} = [0, \frac{1}{(2 + α) L_{0}})

provided that

(2 + α) L_{0} < 1 .

Define the function

h_{1} : T_{1} ⟶ R

by

h_{1} (t) = \frac{(1 + α) L t}{1 - (2 + α) L_{0} t} .

Notice that the parameter

ρ = \frac{1}{(1 + α) L + (2 + α) L_{0}}

is the only solution of equation

h_{1} (t) - 1 = 0

in the set

T_{1} .

Define the parameter

ρ_{0}

by

ρ_{0} = \frac{1}{(2 + α) (L_{0} + L)} .

Notice that

ρ_{0} < ρ .

Set

T_{0} = [0, ρ_{0}] .

Define the function

h_{2} : T_{0} ⟶ R

by

h_{2} (t) = \frac{(2 + 2 α + h_{1} (t)) L h_{1} (t) t}{1 - (2 + α) (L_{0} + L) t} .

The equation

h_{2} (t) - 1 = 0

has a smallest solution

ρ \in T_{0} - {0}

by the intermediate value theorem, since

h_{2} (0) - 1 = - 1

and

h_{2} (t) ⟶ \infty

as

t ⟶ ρ_{0}^{-} .

It shall be shown that ρ is a radius of convergence for method (20). It follows by these definitions that

\forall t \in T_{0}

0 \leq (L_{0} + L) (2 + α) t < 1

(63)

0 \leq h_{1} (t) < 1

(64)

and

0 \leq h_{2} (t) < 1 .

(65)
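The parameters above are straightforward to evaluate once L_0, L and α are known. The following is a minimal sketch that computes the zero of h_1 - 1 from its closed form, the parameter ρ_0, and the smallest zero of h_2 - 1 by bisection on [0, ρ_0). The numerical values of L_0, L and α, as well as the function names, are hypothetical and serve only as an illustration.

```python
def h1(t, L0, L, alpha):
    return (1.0 + alpha) * L * t / (1.0 - (2.0 + alpha) * L0 * t)

def h2(t, L0, L, alpha):
    a = h1(t, L0, L, alpha)
    return (2.0 + 2.0 * alpha + a) * L * a * t / (1.0 - (2.0 + alpha) * (L0 + L) * t)

def radii(L0, L, alpha, tol=1e-14):
    """Return (zero of h1 - 1, rho_0, smallest zero of h2 - 1 in (0, rho_0))."""
    r1 = 1.0 / ((1.0 + alpha) * L + (2.0 + alpha) * L0)
    rho0 = 1.0 / ((2.0 + alpha) * (L0 + L))
    # h2(0) = 0 < 1 and h2(t) grows without bound as t -> rho0 from the left;
    # h2 is increasing on [0, rho0), so bisection isolates its only crossing of 1.
    lo, hi = 0.0, rho0 * (1.0 - 1e-12)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if h2(mid, L0, L, alpha) < 1.0:
            lo = mid
        else:
            hi = mid
    return r1, rho0, 0.5 * (lo + hi)

# Hypothetical constants with (2 + alpha) * L0 < 1.
r1, rho0, rho = radii(L0=0.1, L=0.2, alpha=1.0)
print(r1, rho0, rho)
```

For these sample constants the computed values satisfy rho < rho0 < r1, in agreement with the ordering stated in the text.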

The following conditions are used:

(C1): There exists a solution $x_{*} \in Ω$ of equation $F (x) = 0$ such that $F^{'} {(x_{*})}^{- 1} \in L (X, X) .$
(C2): There exist positive parameters $L_{0}$ and $α$ such that $\forall v, z \in Ω$

$∥ F^{'} {(x_{*})}^{- 1} ([v, z; F] - F^{'} (x_{*})) ∥ \leq L_{0} (∥ v - x_{*} ∥ + ∥ z - x_{*} ∥)$

and

$∥ F (x) ∥ \leq α ∥ x - x_{*} ∥ .$

Set $Ω_{1} = U (x_{*}, ρ) \cap Ω .$
(C3): There exists a positive constant $L > 0$ such that $\forall x, y, v, z \in Ω_{1}$

$∥ F^{'} {(x_{*})}^{- 1} ([x, y; F] - [v, z; F]) ∥ \leq L (∥ x - v ∥ + ∥ y - z ∥)$

and
(C4): $U [x_{0}, ρ] \subset Ω .$

Next, the local convergence of method (20) is presented using the preceding terminology and conditions.

Theorem 5.

Under conditions (C1)–(C4), further suppose that

x_{0} \in U (x_{*}, ρ) .

Then, the sequence

{x_{n}}

generated by method (20) is well defined in

U (x_{*}, ρ),

stays in

U (x_{*}, ρ) \forall n = 0, 1, 2, \dots

and is convergent to

x_{*}

so that

∥ y_{n} - x_{*} ∥ \leq h_{1} (∥ x_{n} - x_{*} ∥) ∥ x_{n} - x_{*} ∥ \leq ∥ x_{n} - x_{*} ∥ < ρ

(66)

and

∥ x_{n + 1} - x_{*} ∥ \leq h_{2} (∥ x_{n} - x_{*} ∥) ∥ x_{n} - x_{*} ∥ \leq ∥ x_{n} - x_{*} ∥,

(67)

where the functions

h_{1}, h_{2}

and the radius ρ are defined previously.

Proof.

It follows by method (20), (C1), (C2) and

x_{0} \in U (x_{*}, ρ)

in turn that

\begin{matrix} ∥ F^{'} {(x_{*})}^{- 1} (A_{0} - F^{'} (x_{*})) ∥ & = & ∥ F^{'} {(x_{*})}^{- 1} ([x_{0}, x_{0} + F (x_{0}); F] - F^{'} (x_{*})) ∥ \\ \leq & L_{0} (2 ∥ x_{0} - x_{*} ∥ + ∥ F (x_{0}) - F (x_{*}) ∥) \\ \leq & L_{0} (2 + α) ∥ x_{0} - x_{*} ∥ \\ < & L_{0} (2 + α) ρ . \end{matrix}

(68)

It follows by (68) and the Banach lemma on invertible operators [24] that

A_{0}^{- 1} \in L (X, X)

and

∥ A_{0}^{- 1} F^{'} (x_{*}) ∥ \leq \frac{1}{1 - (2 + α) L_{0} ∥ x_{0} - x_{*} ∥} .

(69)

Hence, the iterate

y_{0}

exists by the first substep of method (20) for

n = 0 .

It follows from the first substep of method (20), (C2) and (C3), that

\begin{matrix} ∥ y_{0} - x_{*} ∥ & = & ∥ x_{0} - x_{*} - A_{0}^{- 1} F (x_{0}) ∥ \\ & = & ∥ A_{0}^{- 1} F^{'} (x_{*}) F^{'} {(x_{*})}^{- 1} (A_{0} - [x_{0}, x_{*}; F]) (x_{0} - x_{*}) ∥ \\ & \leq & ∥ A_{0}^{- 1} F^{'} (x_{*}) ∥ ∥ F^{'} {(x_{*})}^{- 1} (A_{0} - [x_{0}, x_{*}; F]) ∥ ∥ x_{0} - x_{*} ∥ \\ & \leq & \frac{L (∥ x_{0} - x_{*} ∥ + ∥ F (x_{0}) - F (x_{*}) ∥) ∥ x_{0} - x_{*} ∥}{1 - L_{0} (2 + α) ∥ x_{0} - x_{*} ∥} \\ & \leq & h_{1} (∥ x_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ \leq ∥ x_{0} - x_{*} ∥ < ρ . \end{matrix}

(70)

Thus, the iterate

y_{0} \in U (x_{*}, ρ)

and (66) holds for

n = 0 .

Similarly, by the second substep of method (20), we have

\begin{matrix} ∥ F^{'} {(x_{*})}^{- 1} (B_{0} - F^{'} (x_{*})) ∥ & = & ∥ F^{'} {(x_{*})}^{- 1} ([y_{0}, w_{0}; F] \\ & & + [y_{0}, x_{0}; F] - [x_{0}, w_{0}; F] - [x_{*}, x_{*}; F]) ∥ \\ & \leq & L ∥ y_{0} - w_{0} ∥ + L_{0} (∥ y_{0} - x_{*} ∥ + ∥ w_{0} - x_{*} ∥) \\ & \leq & L (∥ y_{0} - x_{*} ∥ + ∥ w_{0} - x_{*} ∥) + L_{0} (∥ y_{0} - x_{*} ∥ + ∥ w_{0} - x_{*} ∥) \\ & < & (L + L_{0}) (2 + α) ρ \leq \frac{L + L_{0}}{L + L_{0}} = 1 . \end{matrix}

(71)

Hence,

B_{0}^{- 1} \in L (X, X)

and

∥ B_{0}^{- 1} F^{'} (x_{*}) ∥ \leq \frac{1}{1 - (L + L_{0}) (2 + α) ∥ x_{0} - x_{*} ∥} .

(72)

Thus, the iterate

x_{1}

exists by the second sub-step of method (20). Then, as in (70) we obtain in turn that

\begin{matrix} ∥ x_{1} - x_{*} ∥ & = & ∥ y_{0} - x_{*} - B_{0}^{- 1} F (y_{0}) ∥ \\ & \leq & ∥ B_{0}^{- 1} F^{'} (x_{*}) ∥ ∥ F^{'} {(x_{*})}^{- 1} (B_{0} - [y_{0}, x_{*}; F]) ∥ ∥ y_{0} - x_{*} ∥ \\ & \leq & \frac{∥ F^{'} {(x_{*})}^{- 1} ([y_{0}, w_{0}; F] + [y_{0}, x_{0}; F] - [x_{0}, w_{0}; F] - [y_{0}, x_{*}; F]) ∥}{1 - (L + L_{0}) (2 + α) ∥ x_{0} - x_{*} ∥} ∥ y_{0} - x_{*} ∥ \\ & \leq & \frac{L (2 + 2 α + h_{1} (∥ x_{0} - x_{*} ∥)) ∥ x_{0} - x_{*} ∥}{1 - (L + L_{0}) (2 + α) ∥ x_{0} - x_{*} ∥} h_{1} (∥ x_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ \\ & \leq & h_{2} (∥ x_{0} - x_{*} ∥) ∥ x_{0} - x_{*} ∥ \leq ∥ x_{0} - x_{*} ∥ < ρ . \end{matrix}

(73)

Therefore, the iterate

x_{1} \in U (x_{*}, ρ)

and (67) holds for

n = 0 .

Simply replace

x_{0}, y_{0}, x_{1}

by

x_{m}, y_{m}, x_{m + 1} \forall m = 0, 1, 2 \dots

in the preceding calculations to complete the induction for (66) and (67). It then follows from the estimate

∥ x_{m + 1} - x_{*} ∥ \leq μ ∥ x_{m} - x_{*} ∥ < ρ,

(74)

where

μ = h_{2} (∥ x_{0} - x_{*} ∥) \in [0, 1),

that

x_{m + 1} \in U (x_{*}, ρ)

and

lim_{m ⟶ \infty} x_{m} = x_{*} .

□
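The structure of method (20) used in the preceding proof can be illustrated in the scalar case. The sketch below assumes the substeps as they can be read off from that proof: w_n = x_n + F(x_n), A_n = [x_n, w_n; F], y_n = x_n - A_n^{-1} F(x_n), B_n = [y_n, w_n; F] + [y_n, x_n; F] - [x_n, w_n; F] and x_{n+1} = y_n - B_n^{-1} F(y_n). The test function F(x) = e^x - 1, the starting point and the stopping tolerance are hypothetical choices; the starting point is not checked against the radius of Theorem 5.

```python
import math

def dd(F, a, b):
    """First-order divided difference [a, b; F] of a scalar function (a != b)."""
    return (F(a) - F(b)) / (a - b)

def method20(F, x0, tol=1e-14, max_iter=20):
    """Derivative-free two-step iteration; returns the list of iterates x_n."""
    x, history = x0, [x0]
    for _ in range(max_iter):
        Fx = F(x)
        if abs(Fx) < tol:        # stop early; this also guarantees w != x below
            break
        w = x + Fx
        A = dd(F, x, w)          # A_n = [x_n, x_n + F(x_n); F]
        y = x - Fx / A           # first substep
        B = dd(F, y, w) + dd(F, y, x) - dd(F, x, w)
        x = y - F(y) / B         # second substep
        history.append(x)
    return history

history = method20(math.expm1, 0.3)   # F(x) = e**x - 1 with solution x_* = 0
print(history)
```

No derivative of F is evaluated; only divided differences are needed, which is why conditions (C2) and (C3) are stated in terms of [·, ·; F].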

Concerning the uniqueness of the solution

x_{*}

(not given in [9]), we provide the result.

Proposition 3.

Suppose:

(i): The point $x_{*}$ is a simple solution $x_{*} \in U (x_{*}, r) \subset Ω$ for some $r > 0$ of equation $F (x) = 0 .$
(ii): There exists positive parameter $L_{1}$ such that $\forall y \in Ω$

$∥ F^{'} {(x_{*})}^{- 1} ([x_{*}, y; F] - F^{'} (x_{*})) ∥ \leq L_{1} ∥ y - x_{*} ∥$

(75)
(iii): There exists $r_{1} \geq r$ such that

$L_{1} r_{1} < 1 .$

(76)

Set

Ω_{2} = U [x_{*}, r_{1}] \cap Ω .

Then,

x_{*}

is the only solution of equation

F (x) = 0

in the set

Ω_{2} .

Proof.

Set

P = [x_{*}, y_{*}; F]

for some

y_{*} \in Ω_{2}

with

F (y_{*}) = 0 .

It follows by (i), (75) and (76) that

∥ F^{'} {(x_{*})}^{- 1} (P - F^{'} (x_{*})) ∥ \leq L_{1} ∥ y_{*} - x_{*} ∥ \leq L_{1} r_{1} < 1 .

Thus, we conclude

x_{*} = y_{*}

by the invertibility of P and the identity

P (x_{*} - y_{*}) = F (x_{*}) - F (y_{*}) = 0 .

□

Remark 4.

(i) Notice that not all conditions of Theorem 5 are used in Proposition 3. If they were, we could set

r_{1} = ρ .

(ii) By the definition of set

Ω_{1}

we have

Ω_{1} \subset Ω .

(77)

Therefore, the parameter

L \leq L_{2},

(78)

where

L_{2}

is the corresponding Lipschitz constant in [1,3,9,19] appearing in the condition

\forall x, y, v, z \in Ω

∥ F^{'} {(x_{*})}^{- 1} ([x, y; F] - [v, z; F]) ∥ \leq L_{2} (∥ x - v ∥ + ∥ y - z ∥) .

(79)

Thus, the radius of convergence

R_{0}

in [1,7,8,20] uses

L_{2}

instead of

L .

That is by (78)

R_{0} \leq ρ .

(80)

Examples where (77), (78) and (80) are strict can be found in [2,5,11,12,13,15,21,22,23,24].

8. Majorizing Sequences for Method

Let

K_{0}, K,

be given positive parameters and

δ \in [0, 1),

K_{0} \leq K, η \geq 0,

and

T = [0, 1) .

Consider recurrent polynomials defined on the interval T for

n = 1, 2, \dots

by

\begin{matrix} f_{n}^{(1)} (t) & = & K t^{2 n} η + K t^{2 n - 1} η + 2 K_{0} (1 + t + \dots + t^{2 n + 1}) η \\ + K_{0} (t^{2 n + 1} + 2 t^{2 n}) t^{2 n + 1} η + δ - 1, \\ f_{n}^{(2)} (t) & = & K t^{2 n + 1} η + K (t^{2 n + 1} + 2 t^{2 n}) t^{2 n} η \\ + 2 K_{0} (1 + t + \dots + t^{2 n + 2}) η + δ - 1, \\ g_{n}^{(1)} (t) & = & K t^{3} + K t^{2} - K t - K + 2 K_{0} (t^{3} + t^{4}) \\ + K_{0} (t^{2 n + 3} + 2 t^{2 n + 2}) t^{4} η - K_{0} (t^{2 n + 1} + 2 t^{2 n}) t^{2} η, \\ g_{n}^{(2)} (t) & = & K t^{3} + K (t^{3} + 2 t^{2}) t^{2 n + 2} η \\ + 2 K_{0} (t^{3} + t^{4}) - K t - K (t + 2) t^{2 n} η, \\ h_{n + 1}^{(1)} (t) & = & g_{n + 1}^{(1)} (t) - g_{n}^{(1)} (t), \\ h_{n + 1}^{(2)} (t) & = & g_{n + 1}^{(2)} (t) - g_{n}^{(2)} (t), \end{matrix}

and polynomials

g_{\infty}^{(1)} (t) = g_{1} (t) = K t^{3} + K t^{2} - K t - K + 2 K_{0} (t^{3} + t^{4}),

g_{\infty}^{(2)} (t) = g_{2} (t) = K t^{3} + 2 K_{0} (t^{3} + t^{4}) - K t = g_{3} (t) t

and

g (t) = {(t - 1)}^{2} (t^{5} + 4 t^{4} + 6 t^{3} + 6 t^{2} + 5 t + 2) .

Then, the following auxiliary result connecting these polynomials can be shown.

Lemma 3.

The following assertions hold:

f_{n + 1}^{(1)} (t) = f_{n}^{(1)} (t) + g_{n}^{(1)} (t) t^{2 n - 1} η,

(81)

f_{n + 1}^{(2)} (t) = f_{n}^{(2)} (t) + g_{n}^{(2)} (t) t^{2 n} η,

(82)

h_{n + 1}^{(1)} (t) = g (t) K_{0} t^{2 n + 2} η,

(83)

h_{n + 1}^{(2)} (t) = g (t) K t^{2 n} η,

(84)

polynomials

g_{1}

and

g_{2}

have smallest zeros in the interval

T - {0}

denoted by

ξ_{1}

and

ξ_{2},

respectively,

h_{n + 1}^{(1)} (t) \geq 0 \forall t \in [0, ξ_{1})

(85)

and

h_{n + 1}^{(2)} (t) \geq 0 \forall t \in [0, ξ_{2}) .

(86)

Moreover, define functions on the interval T by

g_{\infty}^{(1)} (t) = lim_{n ⟶ \infty} g_{n}^{(1)} (t)

(87)

and

g_{\infty}^{(2)} (t) = lim_{n ⟶ \infty} g_{n}^{(2)} (t) .

(88)

Then,

g_{\infty}^{(1)} (t) = g_{1} (t) \forall t \in [0, ξ_{1}),

(89)

g_{\infty}^{(2)} (t) = g_{2} (t) \forall t \in [0, ξ_{2}),

(90)

f_{n + 1}^{(1)} (t) \leq f_{n}^{(1)} (t) + g_{1} (t) t^{2 n - 1} η \forall t \in [0, ξ_{1}),

(91)

f_{n + 1}^{(2)} (t) \leq f_{n}^{(2)} (t) + g_{2} (t) t^{2 n} η \forall t \in [0, ξ_{2}),

(92)

f_{n + 1}^{(1)} (ξ_{1}) \leq f_{n}^{(1)} (ξ_{1}),

(93)

and

f_{n + 1}^{(2)} (ξ_{2}) \leq f_{n}^{(2)} (ξ_{2}) .

(94)

Proof.

Assertions (81)–(84) hold by the definition of these functions and basic algebra. By the intermediate value theorem polynomials

g_{1}

and

g_{3}

have zeros in the interval

T - {0},

since

g_{1} (0) = - K, g_{1} (1) = 4 K_{0}, g_{3} (0) = - K

and

g_{3} (1) = 4 K_{0} .

Then, assertions (85) and (86) follow by the definition of these polynomials and zeros

ξ_{1}

and

ξ_{2} .

Next, assertions (91) and (94) also follow from (87), (88) and the definition of these polynomials. □

The preceding result is connected to the scalar sequence defined

\forall n = 0, 1, 2, \dots

by

t_{0} = 0, s_{0} = η,

\begin{matrix} t_{1} & = & s_{0} + \frac{K (η + δ) η}{1 - K_{0} (2 η + δ)}, \\ s_{n + 1} & = & t_{n + 1} + \frac{K (t_{n + 1} - t_{n} + s_{n} - t_{n}) (t_{n + 1} - s_{n})}{1 - K_{0} (2 t_{n + 1} + γ_{n} + δ)} \\ t_{n + 2} & = & s_{n + 1} + \frac{K (s_{n + 1} - t_{n + 1} + γ_{n}) (s_{n + 1} - t_{n + 1})}{1 - K_{0} (2 s_{n + 1} + δ)}, \end{matrix}

(95)

where

γ_{n} = K (t_{n + 1} - t_{n} + s_{n} - t_{n}) (t_{n + 1} - s_{n}), δ \geq γ_{0} .

Moreover, define parameters

ξ_{1} = \frac{K (s_{1} - t_{1} + γ_{0})}{1 - K_{0} (2 s_{1} + δ)}, ξ_{2} = \frac{K (t_{1} + s_{0})}{1 - K_{0} (2 t_{1} + γ_{0} + δ)}

and

a = max {ξ_{1}, ξ_{2}} .

Then, the first convergence result for sequence

{t_{n}}

follows.

Lemma 4.

Suppose

K η \leq 1, 0 < ξ_{1}, 0 < ξ_{2}, a < ξ < 1,

(96)

f_{1}^{(1)} (ξ_{1}) \leq 0

(97)

and

f_{1}^{(2)} (ξ_{2}) \leq 0 .

(98)

Then, scalar sequence

{t_{n}}

is non-decreasing, bounded from above by

t_{* *} = \frac{η}{1 - ξ},

and converges to its unique least upper bound

t_{*} \in [0, t_{* *}] .

Moreover, the following error bounds hold

0 < t_{n + 1} - s_{n} \leq ξ (s_{n} - t_{n}) \leq ξ^{2 n + 1} η,

(99)

0 < s_{n} - t_{n} \leq ξ (t_{n} - s_{n - 1}) \leq ξ^{2 n} η

(100)

and

γ_{n + 1} \leq γ_{n} \leq γ_{0} .

(101)

Proof.

Assertions (99)–(101) hold if we show using induction that

0 < \frac{K (t_{n + 1} - t_{n} + s_{n} - t_{n})}{1 - K_{0} (2 t_{n + 1} + γ_{n} + δ)} \leq ξ_{1},

(102)

0 < \frac{K (s_{n + 1} - t_{n + 1} + γ_{n})}{1 - K_{0} (2 s_{n + 1} + δ)} \leq ξ_{2},

(103)

and

t_{n} \leq s_{n} \leq t_{n + 1} .

(104)

By the definition of

t_{1},

we obtain

\frac{t_{1}}{s_{0}} = \frac{1 - K η}{1 - K_{0} (2 η + δ)} > 1,

so

s_{0} < t_{1},

and (103) holds for

n = 0 .

Suppose assertions (102)–(104) hold for each

m = 0, 1, 2, 3, \dots, n .

By (99) and (100) we have

\begin{matrix} s_{m} & \leq & t_{m} + ξ^{2 m} η \leq s_{m - 1} + ξ^{2 m - 1} η + ξ^{2 m} η \\ \leq & η + ξ η + \dots + ξ^{2 m} η \\ = & \frac{1 - ξ^{2 m + 1}}{1 - ξ} η \leq t_{* *} \end{matrix}

(105)

and

\begin{matrix} t_{m + 1} & \leq & s_{m} + ξ^{2 m + 1} η \leq t_{m} + ξ^{2 m + 1} η + ξ^{2 m} η \\ \leq & η + ξ η + \dots + ξ^{2 m + 1} η \\ = & \frac{1 - ξ^{2 m + 2}}{1 - ξ} η \leq t_{* *} . \end{matrix}

(106)

By the induction hypotheses sequences

{t_{m}}, {s_{m}}

are increasing. Evidently, estimate (102) holds if

\begin{matrix} K ξ^{2 m + 1} η + K ξ^{2 m} η + 2 K_{0} ξ \frac{1 - ξ^{2 m + 2}}{1 - ξ} η \\ + K_{0} ξ δ + ξ γ_{m} K_{0} - ξ \leq 0 \end{matrix}

or

f_{m}^{(1)} (t) \leq 0 \text{ at } t = ξ_{1},

(107)

where

γ_{m} \leq K (ξ^{2 m + 1} + 2 ξ^{2 m}) ξ^{2 m + 1} η^{2} .

By (91), (93), and (97), estimate (107) holds.

Similarly, assertion (103) holds if

\begin{matrix} K ξ^{2 m + 2} η + K^{2} (ξ^{2 m + 1} η + 2 ξ^{2 m} η) ξ^{2 m + 1} η \\ + 2 ξ K_{0} (1 + ξ + \dots + ξ^{2 m + 2}) η + δ ξ - ξ \leq 0 \end{matrix}

or

f_{m}^{(2)} (t) \leq 0 \text{ at } t = ξ_{2} .

(108)

By (92) and (94), assertion (108) holds. Hence, (100) and (103) also hold. Notice that

γ_{n}

can be written as

γ_{n} = K (E_{n} + E_{n}^{1}) E_{n}^{2},

where

E_{n} = t_{n + 1} - t_{n} > 0, E_{n}^{1} = s_{n} - t_{n},

and

E_{n}^{2} = t_{n + 1} - s_{n} > 0 .

Hence, we get

E_{n + 1} - E_{n} = t_{n + 2} - 2 t_{n + 1} + t_{n} \leq ξ^{2 n} (ξ^{2} - 1) (ξ + 1) η < 0,

E_{n + 1}^{1} - E_{n}^{1} = s_{n + 1} - t_{n + 1} - (s_{n} - t_{n}) \leq ξ^{2 n} (ξ^{2} - 1) η < 0,

and

E_{n + 1}^{2} - E_{n}^{2} = t_{n + 2} - s_{n + 1} - (t_{n + 1} - s_{n}) \leq ξ^{2 n + 1} (ξ^{2} - 1) η < 0,

so

γ_{n + 1} \leq γ_{n} \leq γ_{0} .

It follows that sequence

{t_{n}}

is non-decreasing, bounded from above by

t_{* *} .

Thus, it converges to

t_{*} .

□

Next, a second convergence result for sequence (95) is presented. Its sufficient criteria are weaker but more difficult to verify than those of Lemma 4.

Lemma 5.

Suppose

K_{0} δ < 1,

(109)

K_{0} (2 t_{n + 1} + γ_{n} + δ) < 1,

(110)

and

K_{0} (2 s_{n + 1} + δ) < 1

(111)

hold. Then, sequence

{t_{n}}

is increasing and bounded from above by

t_{1}^{* *} = \frac{1 - K_{0} δ}{2 K_{0}},

so it converges to its unique least upper bound

t_{1}^{*} \in [0, t_{1}^{* *}] .

Proof.

It follows from the definition of sequence (95), and conditions (109)–(111). □
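Conditions (109)–(111) are most easily checked by generating sequence (95) directly. The following is a minimal sketch that does so and stops with an error as soon as one of the denominators in (95) fails to be positive. The values of K_0, K, δ and η are hypothetical and were picked so that K_0 ≤ K and δ ≥ γ_0; the function name is also an assumption.

```python
def sequence95(K0, K, delta, eta, n_steps):
    """Generate t_n, s_n and gamma_n of sequence (95); fail if (110)/(111) break."""
    if K0 * (2.0 * eta + delta) >= 1.0:
        raise ValueError("denominator of t_1 is not positive")
    t = [0.0, eta + K * (eta + delta) * eta / (1.0 - K0 * (2.0 * eta + delta))]
    s = [eta]
    gamma = []
    for n in range(n_steps):
        g = K * (t[n + 1] - t[n] + s[n] - t[n]) * (t[n + 1] - s[n])
        gamma.append(g)
        d1 = 1.0 - K0 * (2.0 * t[n + 1] + g + delta)      # condition (110)
        if d1 <= 0.0:
            raise ValueError("condition (110) fails at n = %d" % n)
        s.append(t[n + 1] + g / d1)
        d2 = 1.0 - K0 * (2.0 * s[n + 1] + delta)          # condition (111)
        if d2 <= 0.0:
            raise ValueError("condition (111) fails at n = %d" % n)
        step = s[n + 1] - t[n + 1]
        t.append(s[n + 1] + K * (step + g) * step / d2)
    return t, s, gamma

# Hypothetical data with K0 <= K and K0 * delta < 1.
t, s, gamma = sequence95(K0=0.5, K=1.0, delta=0.1, eta=0.1, n_steps=5)
upper = (1.0 - 0.5 * 0.1) / (2.0 * 0.5)   # the bound (1 - K0*delta) / (2*K0)
print(t[-1], upper, gamma[0])
```

For these sample values the sequence is non-decreasing and stays well below the bound of Lemma 5, and γ_0 does not exceed δ as required after (95).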

9. Semi-Local Convergence of Method

The conditions (C) shall be used in the semi-local convergence analysis of method (20).

Suppose

(C1): There exist $x_{0} \in Ω, η \geq 0, δ \in [0, 1)$ such that $A_{0}^{- 1} \in L (X, X), ∥ A_{0}^{- 1} F (x_{0}) ∥ \leq η,$ and $∥ F (x_{0}) ∥ \leq δ .$
(C2): There exists $K_{0} > 0$ such that for all $u, v \in Ω$

$∥ A_{0}^{- 1} ([u, v; F] - A_{0}) ∥ \leq K_{0} (∥ u - x_{0} ∥ + ∥ v - w_{0} ∥) .$

Set $Ω_{0} = U (x_{0}, \frac{1 - K_{0} δ}{2 K_{0}}) \cap Ω$ for $K_{0} δ < 1 .$
(C3): There exists $K > 0$ such that for all $u, v, \bar{u}, \bar{v} \in Ω_{0}$

$∥ A_{0}^{- 1} ([u, v; F] - [\bar{u}, \bar{v}; F]) ∥ \leq K (∥ u - \bar{u} ∥ + ∥ v - \bar{v} ∥) .$
(C4): $U [x_{0}, ρ + δ] \subset Ω,$ where $ρ = \{\begin{matrix} t_{*} + γ_{0} \text{ or } t_{* *}, & \text{if the conditions of Lemma 4 hold} \\ t_{1}^{*} + γ_{0} \text{ or } t_{1}^{* *}, & \text{if the conditions of Lemma 5 hold.} \end{matrix}$

Remark 5.

The results in [19] are given in the non-affine form. The benefits of using affine invariant results over non-affine are well-known [1,5,11,21]. In particular, they assumed

∥ A_{0}^{- 1} ∥ \leq β

and

(C3)′

∥ [x, y; F] - [\bar{x}, \bar{y}; F] ∥ \leq \bar{K} (∥ x - \bar{x} ∥ + ∥ y - \bar{y} ∥)

holds for all

x, y, \bar{x}, \bar{y} \in Ω .

By the definition of the set

Ω_{0},

we get

Ω_{0} \subset Ω,

(112)

so

K_{0} \leq β \bar{K}

(113)

and

K \leq β \bar{K} .

(114)

Hence, K can replace

β \bar{K}

in the results in [19]. Notice also that using (C3)′ they estimated

∥ B_{n + 1}^{- 1} A_{0} ∥ \leq \frac{1}{1 - β \bar{K} (2 {\bar{s}}_{n + 1} + δ)}

(115)

and

∥ A_{n + 1}^{- 1} A_{0} ∥ \leq \frac{1}{1 - β \bar{K} ({\bar{t}}_{n + 1} - {\bar{t}}_{0} + {\bar{γ}}_{n} + δ)},

(116)

where

{{\bar{t}}_{n}}, {{\bar{s}}_{n}}

are defined for

n = 0, 1, 2, \dots

by

{\bar{t}}_{0} = 0, {\bar{s}}_{0} = η,

\begin{matrix} {\bar{t}}_{1} & = & {\bar{s}}_{0} + \frac{β \bar{K} (η + δ) η}{1 - β \bar{K} (2 {\bar{s}}_{0} + δ)}, \\ {\bar{s}}_{n + 1} & = & {\bar{t}}_{n + 1} + \frac{β {\bar{γ}}_{n}}{1 - β \bar{K} (2 {\bar{t}}_{n + 1} + {\bar{γ}}_{n} + δ)} \\ {\bar{t}}_{n + 2} & = & {\bar{s}}_{n + 1} + \frac{β \bar{K} ({\bar{s}}_{n + 1} - {\bar{t}}_{n + 1} + {\bar{γ}}_{n}) ({\bar{s}}_{n + 1} - {\bar{t}}_{n + 1})}{1 - β \bar{K} (2 {\bar{s}}_{n + 1} + δ)}, \end{matrix}

(117)

where

{\bar{γ}}_{n} = \bar{K} ({\bar{t}}_{n + 1} - {\bar{t}}_{n} + {\bar{s}}_{n} - {\bar{t}}_{n}) ({\bar{t}}_{n + 1} - {\bar{s}}_{n}), δ \geq {\bar{γ}}_{0} .

But using the weaker condition (C2) we obtain respectively,

∥ B_{n + 1}^{- 1} A_{0} ∥ \leq \frac{1}{1 - K_{0} (2 s_{n + 1} + δ)}

(118)

and

∥ A_{n + 1}^{- 1} A_{0} ∥ \leq \frac{1}{1 - K_{0} (t_{n + 1} - t_{0} + γ_{n} + δ)}

(119)

which are tighter estimates than (115) and (116), respectively. Hence,

K_{0}, K

can replace

β \bar{K},

β, \bar{K}

and (118), (119) can replace (115), (116), respectively, in the proof of Theorem 3 in [19]. Examples where (112)–(114) are strict can be found in [1,5,11,21]. Simple induction shows that

0 < s_{n} - t_{n} \leq {\bar{s}}_{n} - {\bar{t}}_{n}

(120)

0 < t_{n + 1} - s_{n} \leq {\bar{t}}_{n + 1} - {\bar{s}}_{n}

(121)

and

t_{*} \leq {\bar{t}}^{*} = lim_{n ⟶ \infty} {\bar{t}}_{n} .

(122)

These estimates justify the claims made in the introduction of this work along the same lines. The local results in [19] can also be extended using our technique.

Next, we present the semi-local convergence result for the method (20).

Theorem 6.

Suppose that conditions (C) hold. Then, the sequence

{x_{n}}

generated by method (20) exists in

U [x_{0}, t_{*}],

remains in

U [x_{0}, t_{*}]

and

{lim}_{n ⟶ \infty} x_{n} = x_{*} \in U [x_{0}, t_{*}]

with

F (x_{*}) = 0,

so that

∥ x_{n} - x_{*} ∥ \leq t_{*} - t_{n} .

Proof.

It follows from the comment above Theorem 6. □

Next, we present the uniqueness of the solution result, where conditions (C) are not necessarily utilized.

Proposition 4.

Suppose the following:

(i): There exists a simple solution $x_{*} \in U (x_{0}, r) \subset Ω$ for some $r > 0 .$
(ii): Condition (C2) holds
and
(iii): There exists $r^{*} \geq r$ such that $K_{0} (r + r^{*} + δ) < 1 .$

Set

Ω_{1} = U (x_{0}, \frac{1 - K_{0} (δ + r)}{K_{0}}) \cap Ω .

Then, the element

x_{*}

is the only solution of equation

F (x) = 0

in the region

Ω_{1} .

Proof.

Let

z^{*} \in Ω_{1}

with

F (z^{*}) = 0 .

Define

Q = [x_{*}, z^{*}; F] .

Then, in view of (ii) and (iii),

∥ A_{0}^{- 1} (Q - A_{0}) ∥ \leq K_{0} (∥ x_{*} - x_{0} ∥ + ∥ z^{*} - w_{0} ∥) \leq K_{0} (r + r^{*} + δ) < 1 .

Therefore, by the Banach lemma on invertible operators, Q is invertible. The conclusion

z^{*} = x_{*}

then follows from the identity

Q (x_{*} - z^{*}) = F (x_{*}) - F (z^{*}) = 0 .

□
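The argument in the proof can be illustrated in the scalar case, where the divided difference is [x, y; F] = (F(x) - F(y)) / (x - y). The sketch below is a minimal illustration, not the authors' implementation: it evaluates the radius of the region Ω_1, checks the inequality in condition (iii), and shows that two distinct zeros force Q = 0, so Q cannot be invertible. The function names and the numerical values of K_0, δ, r and r^* are hypothetical.

```python
def divided_difference(F, x, y):
    """Scalar first-order divided difference [x, y; F] for x != y."""
    if x == y:
        raise ValueError("x and y must be distinct")
    return (F(x) - F(y)) / (x - y)

def uniqueness_radius(K0, delta, r):
    """Radius (1 - K0 * (delta + r)) / K0 of the ball defining Omega_1."""
    if K0 <= 0:
        raise ValueError("K0 must be positive")
    return (1.0 - K0 * (delta + r)) / K0

def condition_iii_holds(K0, delta, r, r_star):
    """Check r_star >= r and K0 * (r + r_star + delta) < 1."""
    return r_star >= r and K0 * (r + r_star + delta) < 1.0

# Hypothetical constants, chosen only to exercise the formulas.
K0, delta, r = 0.5, 0.1, 0.2
print(uniqueness_radius(K0, delta, r))
print(condition_iii_holds(K0, delta, r, 0.3))
print(condition_iii_holds(K0, delta, r, 2.0))

# F has the two distinct zeros 2 and -2, so Q (x - z) = F(x) - F(z) = 0
# with x != z, which is only possible if Q = 0 (Q is not invertible).
F = lambda x: x * x - 4.0
Q = divided_difference(F, 2.0, -2.0)
print(Q)
```

Read in the other direction, this is the content of the proof: once Q is known to be invertible, the identity Q (x_* - z^*) = 0 leaves no room for a second zero.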

Remark 6.

(i) Notice that r can be chosen to be

t_{*} .

(ii) The results can be extended further as follows. Replace the corresponding condition in (C) by

(C3)″

∥ A_{0}^{- 1} ([u, v; F] - [\bar{u}, \bar{v}; F]) ∥ \leq \tilde{K} (∥ u - \bar{u} ∥ + ∥ v - \bar{v} ∥),

\forall u, \bar{u} \in Ω_{0}, v = u - A {(u)}^{- 1} F (u)

and

\bar{v} = \bar{u} - A {(\bar{u})}^{- 1} F (\bar{u}) .

Then, we have

(iii)

\tilde{K} \leq K .

Another way is to define the set

Ω_{2} = U (x_{1}, \frac{1 - K_{0} (δ + γ_{0})}{2 K_{0}} - η)

provided that

K_{0} (δ + γ_{0}) < 1 .

Moreover, suppose

Ω_{2} \subset Ω .

Then, we have

Ω_{2} \subset Ω_{0} .

If condition (C3)″ holds on

Ω_{2}

with some constant

{\tilde{K}}_{0} ,

then

{\tilde{K}}_{0} \leq K

also holds. Hence, tighter

\tilde{K}

or

{\tilde{K}}_{0}

can replace K in Theorem 6.

10. Conclusions

The convergence analysis is developed for generalized three-step numerical methods. The advantages of the new approach include weaker convergence criteria and a uniform set of conditions that use only information on the operators appearing in these methods. This is in contrast to earlier works on special cases of these methods, where the existence of high-order derivatives is assumed in order to prove convergence. The methodology is very general and does not depend on the particular method. It can therefore be applied to multi-step and other numerical methods, which will be the topic of future work.

The weak point of this methodology is that the majorant functions “h” are hard to compute at this level of generality. Notice that this is not the case for the special cases of method (2) or method (3) given below them (see, for example, Examples 4 and 5). As far as we know, no other methodology is comparable to the one introduced in this article for handling the semi-local or the local convergence of method (2) or method (3) at this generality.

Author Contributions

Conceptualization, M.I.A., I.K.A., S.R. and S.G.; methodology, M.I.A., I.K.A., S.R. and S.G.; software, M.I.A., I.K.A., S.R. and S.G.; validation, M.I.A., I.K.A., S.R. and S.G.; formal analysis, M.I.A., I.K.A., S.R. and S.G.; investigation, M.I.A., I.K.A., S.R. and S.G.; resources, M.I.A., I.K.A., S.R. and S.G.; data curation, M.I.A., I.K.A., S.R. and S.G.; writing—original draft preparation, M.I.A., I.K.A., S.R. and S.G.; writing—review and editing, M.I.A., I.K.A., S.R. and S.G.; visualization, M.I.A., I.K.A., S.R. and S.G.; supervision, M.I.A., I.K.A., S.R. and S.G.; project administration, M.I.A., I.K.A., S.R. and S.G.; funding acquisition, M.I.A., I.K.A., S.R. and S.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Appell, J.; DePascale, E.; Lysenko, J.V.; Zabrejko, P.P. New results on Newton-Kantorovich approximations with applications to nonlinear integral equations. Numer. Funct. Anal. Optim. 1997, 18, 1–17.
2. Ezquerro, J.A.; Hernandez, M.A. Newton’s Method: An Updated Approach of Kantorovich’s Theory; Birkhäuser: Cham, Switzerland, 2018.
3. Proinov, P.D. New general convergence theory for iterative processes and its applications to Newton-Kantorovich type theorems. J. Complex. 2010, 26, 3–42.
4. Regmi, S.; Argyros, I.K.; George, S.; Argyros, C. Numerical Processes for Approximating Solutions of Nonlinear Equations. Axioms 2022, 11, 307.
5. Argyros, I.K. The Theory and Applications of Iteration Methods, 2nd ed.; Engineering Series; CRC Press: Boca Raton, FL, USA; Taylor and Francis Group: Abingdon, UK, 2022.
6. Zhanlav, K.H.; Otgondorj, K.H.; Sauul, L. A unified approach to the construction of higher-order derivative-free iterative methods for solving systems of nonlinear equations. Int. J. Comput. Math. 2021.
7. Zhanlav, T.; Chun, C.; Otgondorj, K.H.; Ulziibayar, V. High order iterations for systems of nonlinear equations. Int. J. Comput. Math. 2020, 97, 1704–1724.
8. Wang, X. An Ostrowski-type method with memory using a novel self-accelerating parameters. J. Comput. Appl. Math. 2018, 330, 710–720.
9. Moccari, M.; Lofti, T. On a two-step optimal Steffensen-type method: Relaxed local and semi-local convergence analysis and dynamical stability. J. Math. Anal. Appl. 2018, 468, 240–269.
10. Shakhno, S.M.; Gnatyshyn, O.P. On an iterative Method of order 1.839… for solving nonlinear least squares problems. Appl. Math. Comput. 2005, 161, 253–264.
11. Argyros, I.K. Unified Convergence Criteria for Iterative Banach Space Valued Methods with Applications. Mathematics 2021, 9, 1942.
12. Potra, F.-A.; Pták, V. Nondiscrete Induction and Iterative Processes; Pitman Publishing: Boston, MA, USA, 1984.
13. Cordero, A.; Torregrosa, J.R. Variants of Newton’s method using fifth-order quadrature formulas. Appl. Math. Comput. 2007, 190, 686–698.
14. Traub, J.F. Iterative Methods for the Solution of Equations; Prentice Hall: Hoboken, NJ, USA, 1964.
15. Kantorovich, L.V.; Akilov, G.P. Functional Analysis; Pergamon Press: Oxford, UK, 1982.
16. Xiao, X.; Yin, H. Achieving higher order of convergence for solving systems of nonlinear equations. Appl. Math. Comput. 2017, 311, 251–261.
17. Sharma, J.R.; Arora, H. Efficient derivative-free numerical methods for solving systems of nonlinear equations. Comput. Appl. Math. 2016, 35, 269–284.
18. Sharma, J.R.; Guha, R.K. Simple yet efficient Newton-like method for systems of nonlinear equations. Calcolo 2016, 53, 451–473.
19. Noor, M.A.; Waseem, M. Some iterative methods for solving a system of nonlinear equations. Comput. Math. Appl. 2009, 57, 101–106.
20. Wang, X.; Zhang, T. A family of Steffensen type methods with seventh-order convergence. Numer. Algor. 2013, 62, 429–444.
21. Argyros, I.K.; Magréñan, A.A. A Contemporary Study of Iterative Methods; Elsevier: Amsterdam, The Netherlands; Academic Press: New York, NY, USA, 2018.
22. Grau-Sanchez, M.; Grau, A.; Noguera, M. Ostrowski type methods for solving system of nonlinear equations. Appl. Math. Comput. 2011, 218, 2377–2385.
23. Homeier, H.H.H. A modified Newton method with cubic convergence: The multivariate case. J. Comput. Appl. Math. 2004, 169, 161–169.
24. Kou, J.; Wang, X.; Li, Y. Some eight order root finding three-step methods. Commun. Nonlinear Sci. Numer. Simul. 2010, 15, 536–544.
25. Verma, R. New Trends in Fractional Programming; Nova Science Publisher: New York, NY, USA, 2019.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
