1. Introduction
A plethora of applications from diverse disciplines of computational sciences are converted to nonlinear equations such as
using modeling (mathematical) [
1,
2,
3,
4]. The nonlinear operator
F is defined on an open and convex subset
of a Banach space
X with values in
X. The solution of the equation is denoted by
Numerical methods are mainly used to find
This is the case since the analytic form of the solution
is obtained in special cases.
Researchers, as well as practitioners, have proposed numerous numerical methods under a different set of convergence conditions using high-order derivatives, which are not present in the methods.
Let us consider an example.
Example 1. Define the function F on byClearly, the point solves the equation It follows that Then, the function F does not have a bounded third derivative in
Hence, many high convergence methods (although they may converge) cannot apply to show convergence. In order to address these concerns, we propose a unified approach for dealing with the convergence of these numerical methods that take into account only the operators appearing on them. Hence, the usage of these methods becomes possible and under weaker conditions.
Let
be a starting point. Define the generalized numerical method
by
where
and
are given operators chosen so that
The specialization of (
2) is
where
or
or
or
and
are linear operators on
and
with values in
respectively. By choosing some of the linear operators equal to the
O linear operators in (
3), we obtain the methods studied in [
5]. Moreover, if
then we obtain the methods studied in [
6,
7]. In particular, the methods in [
5] are of the special form
where they, as the methods in [
7,
8], are of the form
where
is a given parameter, and
are linear operators acting between
and
In particular, operators must have a special form to obtain the fourth, seventh or eighth order of convergence.
Further specifications of operators “
” lead to well-studied methods, a few of which are listed below (other choices can be found in [
6,
7,
9,
10]):
Newton method (second order) [1,4,11,12]: Jarrat method (second order) [13]: Traub-type method (fifth order) [14]: Homeir method (third order) [15]: Cordero–Torregrosa (third Order) [2]: or
Noor–Wasseem method (third order) [3]: Xiao–Yin method (third order) [16]: Corder–Torregrosa method (fifth order) [2]: or
Sharma–Arora method (fifth order) [17,18]: Xiao–Yin method (fifth order) [16]: Traub-type method (second order) [14]:
where
is a divided difference of order one.
Moccari–Lofti method (fourth order) [19]: Wang–Zang method (seventh order) [8,16,20]:
where
is any fourth-order Steffensen-type iteration method.
Sharma–Arora method (seventh order) [17]:
The local, as well as the semi-local, convergence for methods (
4) and (
5), were presented in [
17], respectively, using hypotheses relating only to the operators on these methods. However, the local convergence analysis of method (
6) requires the usage of derivatives or divided differences of higher than two orders, which do not appear in method (
6). These high-order derivatives restrict the applicability of method (
6) to equations whose operator
F has high-order derivatives, although method (
6) may converge (see Example 1).
Similar restrictions exist for the convergence of the aforementioned methods of order three or above.
It is also worth noticing that the fifth convergence order method by Sharma [
18]
cannot be handled with the analyses given previously [
5,
6,
7] for method (
4), method (
5), or method (
6).
Based on all of the above, clearly, it is important to study the convergence of method (
2) and its specialization method (
3) with the approach employed for method (
4) or (
5). This way, the resulting unified convergence criteria can apply to their specialized methods listed or not listed previously. Hence, this is the motivation as well as the novelty of the article.
There are two important types of convergence: the semi-local and the local. The semi-local uses information involving the initial point to provide criteria, assuring the convergence of the numerical method, while the local one is based on the information about the solution to find the radii of the convergence balls.
The local convergence results are vital, although the solution is unknown in general since the convergence order of the numerical method can be found. This kind of result also demonstrates the degree of difficulty in selecting starting points. There are cases when the radius of convergence of the numerical method can be determined without the knowledge of the solution.
As an example, let
Suppose function
F satisfies an autonomous differential [
5,
21] equation of the form
where
H is a continuous function. Notice that
or
In the case of
, we can choose
(see also the numerical section).
Moreover, the local results can apply to projection numerical methods, such as Arnoldi’s, the generalized minimum residual numerical method (GMRES), the generalized conjugate numerical method (GCS) for combined Newton/finite projection numerical methods, and in relation to the mesh independence principle to develop the cheapest and most efficient mesh refinement techniques [
1,
5,
11,
21].
In this article, we introduce a majorant sequence and use our idea of recurrent functions to extend the applicability of the numerical method (
2). Our analysis includes error bounds and results on the uniqueness of
based on computable Lipschitz constants not given before in [
5,
13,
21,
22,
23,
24] and in other similar studies using the Taylor series. This idea is very general. Hence, it applies also to other numerical methods [
10,
14,
22,
25].
The convergence analysis of method (
2) and method (
3) is given in
Section 2. Moreover, the special choices of operators appear in the method in
Section 3 and
Section 4. Concluding remarks, open problems, and future work complete this article.
2. Convergence Analysis of Method
The local is followed by the semi-local convergence analysis. Let and for some Consider functions and be continuous and nondecreasing in each variable.
Suppose that equations
have the smallest solutions,
The parameter
defined by
shall be shown to be a radius of convergence for method (
2). Let
It follows by the definition of radius
that for all
The notation denotes an open ball with center and of radius By , we denote the closure of
The following conditions are used in the local convergence analysis of the method (
2).
Suppose the following:
- (H1)
Equation has a solution
- (H2)
and
for all
- (H3)
Equations (
24) have smallest solutions
;
- (H4)
where the radius
is given by Formula (
25).
Next, the main local convergence analysis is presented for method (
2).
Theorem 1. Suppose that the conditions (H1)–(H4) hold and Then, the sequence generated by method (2) is well defined and converges to Moreover, the following estimates hold and Proof. Let
Then, it follows from the first condition in (H1) the definition of
(
26) (for
) and the first substep of method (
2) for
that
showing estimate (
27) for
and the iterate
Similarly,
and
showing estimates (
28), (
29), respectively and the iterates
By simply replacing
by
in the preceding calculations, the induction for estimates (
27)–(
29) is terminated. Then, from the estimate
where
we conclude
and
□
Remark 1. It follows from the proof of Theorem 1 that can be chosen in particular as and Thus, the condition (H2) should hold for all and not Clearly, in this case, the resulting functions are at least as tight as the functions , leading to an at least as large radius of convergence as ρ (see the numerical section).
Concerning the semi-local convergence of method (
2), let us introduce scalar sequences
and
defined for
and the rest of the iterates, depending on operators
and
F (see how in the next section). These sequences shall be shown to be majorizing for method (
2). However, first, a convergence result for these sequence is needed.
Lemma 1. Suppose that andfor some Then, the sequence is convergent to its unique least upper bound Proof. It follows from conditions (
33) and (
34) that sequence
is nondecreasing and bounded from above by
, and as such, it converges to
□
Theorem 2. Suppose the following:
(H5) Iterates generated by method (2) exist, belong in and satisfy the conditions of Lemma 1 for all (H6) andfor all and (H7)
Then, there exists such that
Proof. It follows by condition (H5) that sequence is complete as convergent. Thus, by condition (H6), sequence is also complete in a Banach space X, and as such, it converges to some (since is a closed set). □
Remark 2. (i) Additional conditions are needed to show The same is true for the results on the uniqueness of the solution.
(ii) The limit point is not given in the closed form. So, it can be replaced by λ in Theorem 2.
3. Special Cases I
The iterates of method (
3) are assumed to exist, and operator
F has a divided difference of order one.
Local Convergence
Three possibilities are presented for the local cases based on different estimates for the determination of the functions
It follows by method (
3) that
- (P1)
and
Hence, the functions
are selected to satisfy
A practical non-discrete choice for the function
is given by
Another choice is given by
The choices of functions and can follow similarly.
- (P2)
Let be a linear operator. By we denote
Then, it follows from method (
3)
and
Thus, the functions
must satisfy
and
Clearly, the function can be chosen again as in case (P1). The functions and can be defined similarly.
- (P3)
Assume ∃ function
continuous and non-decreasing such that
Then, we can write
leading to
Then, by method (
3) we obtain, in turn, that
so, the function
must satisfy
or
or
or
Similarly, for the other two steps, we obtain in the last choice
and
Thus, the function
satisfies
or
Finally, concerning the choice of the function
by the third substep of method (
3)
so the function
must satisfy
or
where
The functions and can also be defined with the other two choices as those of function given previously.
Semi-local Convergence
Concerning this case, we can have instead of the conditions of Theorem 2 (see (H6)) but for method (
3)
and
Notice that under these choices,
and
Then, the conclusions of Theorem 2 hold for method (
3). Even more specialized choices of linear operators appearing on these methods as well as function
can be found in the Introduction, the next section, or in [
1,
2,
11,
21] and the references therein.
5. Local Convergence of Method
The local convergence analysis of method (
23) utilizes some functions parameters. Let
Suppose the following:
- (i)
∃ function
continuous and non-decreasing such that equation
has a smallest solution
Let
- (ii)
∃ function
continuous and non-decreasing such that equation
has a smallest solution
where the function
defined by
- (iii)
Equation
has a smallest solution
Let
and
- (iv)
Equation
has a smallest solution
where the function
is defined as
- (v)
Equation
has a smallest solution
where the function
is defined by
The parameter
defined by
is proven to be a radius of convergence for method (
2) in Theorem 3. Let
Then, it follows by these definitions that
and
The conditions required are as follows:
(C1) Equation has a simple solution
(C2)
Set
(C3)
and
(C4)
Next, the main local convergence result follows for method (
23).
Theorem 3. Suppose that conditions (C1)–(C4) hold and Then, the sequence generated by method (23) is well defined in remains in and is convergent to Moreover, the following assertions hold: where functions are defined previously and the radius ρ is given by Formula (37). Proof. Let
By using conditions (C1), (C2) and (
37), we have that
It follows by (
44) and the Banach lemma on invertible operators [
11,
15] that
and
If
then the iterate
is well defined by the first substep of method (
23) and we can write
In view of (C1)–(C3), (
45) (for
), (
40) (for
) and (
46), we obtain in turn that
Thus, the iterate
and (
41) holds for
The iterate
is well defined by the second substep of method (
23), so we can write
Notice that linear operator
exists by (
45) (for
). It follows by (
37), (
40) (for
), (C3), (
45) (for
), in turn that
Thus, the iterate
and (
42) holds for
where we also used (C1) and (C2) to obtain the estimate
Moreover, the iterate
is well defined by the third substep of method (
23), so we can have
leading to
Therefore, the iterate
and (
43) holds for
Switch
by
in the preceding calculations to complete the induction for the estimates (
41)–(
43). Then, by the estimate
where
, we obtain that
and
□
The uniqueness of the solution result for method (
23) follows.
Proposition 1. Suppose the following:
- (i)
Equation has a simple solution for some
- (ii)
Condition (C2) holds.
- (iii)
There exists such that
Set Then, the only solution of equation in the set is
Proof. Let
be such that
Define the linear operator
It then follows by (ii) and (
52) that
Hence, we deduce by the invertibility of J and the estimate □
Remark 3. Under all conditions of Theorem 3, we can set
Example 2. Consider the motion systemwith Let Let Let function F on Ω
for given as Using this definition, we obtain the derivative as Hence, Let with Moreover, the nor for is Conditions (C1)–(C3) are verified for and Then, the radii are Example 3. If is equipped with the max-norm, consider given as Clearly, and the conditions (C1)–(C3) hold for and Then, the radii are 6. Semi-Local Convergence of Method
As in the local case, we use some functions and parameters for the method (
23).
Suppose:
There exists function
that is continuous and non-decreasing such that equation
has a smallest solution
Consider function
to be continuous and non-decreasing. Define the scalar sequences for
and
by
This sequence is proven to be majorizing for method (
23) in Theorem 4. However, first, we provide a general convergence result for sequence (
54).
Lemma 2. Suppose that and there exists such that Then, sequence converges to some
Proof. It follows by (
54)–(
56) that sequence
is non-decreasing and bounded from above by
Hence, it converges to its unique least upper bound
□
Next, the operator F is related to the scalar functions.
Suppose the following:
- (h1)
There exists such that and
- (h2)
for all
Set
- (h3)
for all
- (h4)
Conditions of Lemma 2 hold.
and
- (h5)
We present the semi-local convergence result for the method (
23).
Theorem 4. Suppose that conditions (h1)–(h5) hold. Then, sequence given by method (23) is well defined, remains in and converges to a solution of equation Moreover, the following assertions hold: Proof. Mathematical induction is utilized to show estimates (
57)–(
59). Using (h1) and method (
23) for
Thus, the iterate
and (
57) holds for
Let
Then, as in Theorem 3, we get
Hence, if we set
, iterates
and
are well defined by method (
23) for
Suppose iterates
also exist for all integer values
k smaller than
Then, we have the estimates
and
where we also used
so
and
so
and
Hence, sequence
is majorizing for method (
2) and iterates
belong in
The sequence
is complete in Banach space
X and as such, it converges to some
By using the continuity of
F and letting
in (
61), we deduce
□
Proposition 2. Suppose:
- (i)
There exists a solution of equation for some
- (ii)
Condition (h2) holds.
- (iii)
There exists such that
Set Then, is the only solution of equation in the region
Proof. Let
with
Define the linear operator
Then, by (h2) and (
62), we obtain in turn that
Thus, □
The next two examples show how to choose the functions , and the parameter
Example 4. Set Let us consider a scalar function F defined on the set for by Choose Then, the conditions (h1)–(h3) are verified for and
Example 5. Consider and Then the problem [5]is also given as integral equation of the formwhere ι is a constant and is the Green’s function Choose and Then, clearly since If Then, conditions (C1)–(C3) are satisfied for Hence,
7. Local Convergence of Method
The local analysis is using on certain parameters and real functions. Let and be positive parameters. Set provided that
Define the function
by
Notice that parameter
is the only solution of equation
in the set
Define the parameter
by
Notice that Set
Define the function
by
The equation
has a smallest solution
by the intermediate value theorem, since
and
as
It shall be shown that
R is a radius of convergence for method (
20). It follows by these definitions that
and
The following conditions are used:
- (C1)
There exists a solution of equation such that
- (C2)
There exist positive parameters
and
such that
and
Set
- (C3)
There exists a positive constant
such that
and
- (C4)
Next, the local convergence of method (
20) is presented using the preceding terminology and conditions.
Theorem 5. Under conditions (C1)–(C4), further suppose that Then, the sequence generated by method (20) is well defined in stays in and is convergent to so that andwhere the functions and the radius ρ are defined previously. Proof. It follows by method (
20), (C1), (C2) and
in turn that
It follows by (
68) and the Banach lemma on invertible operators [
24] that
and
Hence, the iterate
exists by the first substep of method (
20) for
It follows from the first substep of method (
20), (C2) and (C3), that
Thus, the iterate
and (
66) holds for
Similarly, by the second substep of method (
20), we have
Hence,
and
Thus, the iterate
exists by the second sub-step of method (
20). Then, as in (
70) we obtain in turn that
Therefore, the iterate
and (
67) holds for
Simply replace
by
in the preceding calculations to complete the induction for (
66) and (
67). It then follows from the estimate
where,
leading to
and
□
Concerning the uniqueness of the solution
(not given in [
9]), we provide the result.
Proposition 3. Suppose:
- (i)
The point is a simple solution for some of equation
- (ii)
There exists positive parameter such that - (iii)
There exists such that
Set Then, is the only solution of equation in the set
Proof. Set
for some
with
It follows by (i), (
75) and (
76) that
Thus, we conclude by the invertability of P and identity □
Remark 4. (i) Notice that not all conditions of Theorem 5 are used in Proposition 3. If they were, then we can set
(ii) By the definition of set we have Therefore, the parameterwhere is the corresponding Lipschitz constant in [1,3,9,19] appearing in the condition Thus, the radius of convergence in [1,7,8,20] uses instead of That is by (78) Examples where (77), (78) and (80) are strict can be found in [2,5,11,12,13,15,21,22,23,24]. 8. Majorizing Sequences for Method
Let
be given positive parameters and
and
Consider recurrent polynomials defined on the interval
T for
by
and polynomials
and
Then, the following auxiliary result connecting these polynomials can be shown.
Lemma 3. The following assertions hold:polynomials and have smallest zeros in the interval denoted by and respectively,and Moreover, define functions on the interval T byand Proof. Assertions (
81)–(
84) hold by the definition of these functions and basic algebra. By the intermediate value theorem polynomials
and
have zeros in the interval
since
and
Then, assertions (
85) and (
86) follow by the definition of these polynomials and zeros
and
Next, assertions (
91) and (
94) also follow from (
87), (
88) and the definition of these polynomials. □
The preceding result is connected to the scalar sequence defined
by
where
Moreover, define parameters and
Then, the first convergence result for sequence follows.
Lemma 4. Then, scalar sequence is non-decreasing, bounded from above by and converges to its unique least upper bound Moreover, the following error bounds holdand Proof. Assertions (
99)–(
101) hold if we show using induction that
and
By the definition of
we obtain
so
and (
103) holds for
Suppose assertions (
101)–(
103) hold for each
By (
99) and (
100) we have
and
By the induction hypotheses sequences
are increasing. Evidently, estimate (
101) holds if
or
where
By (
91), (
93), and (
98) estimate (
107) holds.
Similarly, assertion (
103) holds if
or
By (
92) and (
94), assertion (
108) holds. Hence, (
100) and (
103) also hold. Notice that
can be written as
where
and
Hence, we get
and
so
It follows that sequence is non-decreasing, bounded from above by Thus, it converges to □
Next, a second convergence result for sequence (
95) is presented but the sufficient criteria are weaker but more difficult to verify than those of Lemma 4.
Lemma 5. Supposeandhold. Then, sequence is increasing and bounded from above by so it converges to its unique least upper bound Proof. It follows from the definition of sequence (
95), and conditions (
109)–(
111). □