Next Article in Journal
Towards A Global Cosmic Ray Sensor Network: CREDO Detector as the First Open-Source Mobile Application Enabling Detection of Penetrating Radiation
Previous Article in Journal
DSP: Schema Design for Non-Relational Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Local and Semilocal Convergence of Nourein’s Iterative Method for Finding All Zeros of a Polynomial Simultaneously

by
Petko D. Proinov
and
Maria T. Vasileva
*
Faculty of Mathematics and Informatics, University of Plovdiv Paisii Hilendarski, 24 Tzar Asen, 4000 Plovdiv, Bulgaria
*
Author to whom correspondence should be addressed.
Symmetry 2020, 12(11), 1801; https://doi.org/10.3390/sym12111801
Submission received: 8 September 2020 / Revised: 11 October 2020 / Accepted: 26 October 2020 / Published: 30 October 2020
(This article belongs to the Section Mathematics)

Abstract

:
In 1977, Nourein (Intern. J. Comput. Math. 6:3, 1977) constructed a fourth-order iterative method for finding all zeros of a polynomial simultaneously. This method is also known as Ehrlich’s method with Newton’s correction because it is obtained by combining Ehrlich’s method (Commun. ACM 10:2, 1967) and the classical Newton’s method. The paper provides a detailed local convergence analysis of a well-known but not well-studied generalization of Nourein’s method for simultaneous finding of multiple polynomial zeros. As a consequence, we obtain two types of local convergence theorems as well as semilocal convergence theorems (with verifiable initial condition and a posteriori error bound) for the classical Nourein’s method. Each of the new semilocal convergence results improves the result of Petković, Petković and Rančić (J. Comput. Appl. Math. 205:1, 2007) in several directions. The paper ends with several examples that show the applicability of our semilocal convergence theorems.

1. Introduction

This paper deals with the convergence of two iterative methods for finding all zeros of a polynomial simultaneously. The first one is due to Nourein [1] and it has quartic convergence when all zeros of the polynomial are simple and has linear convergence otherwise. The second one is a generalization of Nourein’s method for simultaneously finding all zeros of an arbitrary polynomial that has at least one multiple zero. To our knowledge, Nourein’s method for multiple zeros appears for the first time in the book of Sendov, Andreev and Kjurkchiev [2].
Throughout this paper, ( K , | · | ) stands for a valued field with absolute value | · | and K [ z ] denotes the ring of polynomials over K .
Weierstrass’ method and elementary symmetric functions. Historically, the first iterative method for simultaneous finding all zeros of a polynomial was constructed by Weierstrass [3] in 1891. Let
f ( z ) = c 0 z n + c 1 z n 1 + + c n
be a polynomial in K [ z ] of degree n 2 . Furthermore, let us define the elementary symmetric functions φ ν : K n K , ν = 1 , 2 , , n as follows:
φ ν ( x 1 , , x n ) = ( 1 ) ν 1 j 1 < < j ν n x j 1 x j ν .
It is well-known that a vector ξ = ( ξ 1 , , ξ n ) is a solution of the symmetric system
φ ν ( x 1 , , x n ) = c ν c 0 , ν = 1 , , n ,
if and only if ξ 1 , , ξ n are all zeros of the polynomial f. Using this fact, Weierstrass [3] derived an iterative algorithm for solving the symmetric system (2). Weierstrass’s method is defined by the following iteration:
x ( k + 1 ) = x ( k ) W f ( x ( k ) ) , k = 0 , 1 , 2 , ,
where the so-called Weierstrass correction W f : D K n K n is defined by
W f ( x ) = ( W 1 ( x ) , , W n ( x ) ) with W i ( x ) = f ( x i ) a 0 j i ( x i x j ) ( i = 1 , , n )
and D is the set of all vectors in K n with pairwise distinct coordinates. Weierstrass’s method (3) has quadratic convergence when all zeros of f are simple and has linear convergence otherwise. For historical notes and advanced results for Weierstrass’s method, we refer to [4,5].
Ehrlich’s method. In 1967, Ehrlich [6] introduced a third-order simultaneous method defined by the following iteration:
x ( k + 1 ) = G ( x ( k ) ) , k = 0 , 1 , 2 , ,
where the iteration function G : D G K n K n is defined by G ( x ) = ( G 1 ( x ) , , G n ( x ) ) with
G i ( x ) = x i f ( x i ) f ( x i ) f ( x i ) j = 1 j i n 1 x i x j ( i = 1 , , n ) .
In 1975, Börsch-Supan [7] presented another third-order simultaneous method. In 1982, Werner [8] proved that both Ehrlich’s and Börsch-Supan’s methods are identical. For historical notes and advanced results for Ehrlich’s method, we refer to [9,10,11].
Local and semilocal convergence analysis. Recently, a general convergence theory of iterative methods of the type x ( k + 1 ) = T ( x ( k ) ) , where T : D X X is an iteration function of a metric space X, was developed in [12,13]. Central to this theory is the concept of the function of initial approximations (see ([13], Section 3)). Roughly speaking, this is a real-valued function E : D X R + that sets the initial conditions. The initial condition of any convergence theorem of an iterative method can be represented in the form
E ( x ( 0 ) ) J ,
where J is an interval on R of the form [ 0 , R ] , [ 0 , R ) or [ 0 , + ) , where R is a positive number.
Convergence analysis of an iterative method always is done with respect to a function of the initial conditions E. The goal of convergence is to find initial guesses that guarantees convergence of the iteration sequence to x ( k + 1 ) = T ( x ( k ) ) to a fixed point ξ D of the iteration function T.
Definition 1.
A convergence analysis is called semilocal when the function of initial conditions E does not depend on the fixed point ξ and is local otherwise.
Semilocal convergence theorems have great practical applications because their initial conditions are computer-verifiable. In general, local convergence theorems have mainly theoretical significance. However, very recently in [14], it was proved that two kinds of local convergence theorems for iterative methods for simultaneous approximation of polynomial zeros can be transformed into semilocal convergence results.
Nourein’s method for simple zeros. There are different ways to increase the convergence order of an iterative method for simultaneous computation of polynomial zeros. In 1977, Nourein [1,15] constructed three simultaneous methods that increase the convergence order of Weierstrass’s, Ehrlich’s and Börsch-Supan’s methods. Each of these three methods was constructed as a combination of two already known iterative methods. In particular, combining Ehrlich’s method and the classical Newton’s method, Nourein [1] constructed in K n the following fourth-order iterative method (for simple zeros):
x ( k + 1 ) = F ( x ( k ) ) , k = 0 , 1 , 2 , ,
where the iteration function F : D F K n K n is defined by F ( x ) = ( F 1 ( x ) , , F n ( x ) ) with
F i ( x ) = x i f ( x i ) f ( x i ) f ( x i ) j = 1 j i n 1 x i x j + f ( x j ) / f ( x j ) ( i = 1 , , n ) .
Nourein’s method (7) is also known as Ehrlich’s method with Newton’s corrections. As it is shown in Milovanović and Petković [16] and in Petković and Herceg [17], the iterative method (7) is one of the most efficient and powerful simultaneous methods with globally convergent performance.
In 1998, Petković, Herceg and Ilić [18] proved that the method (7) is convergent under the initial condition
W f ( x ( 0 ) ) < δ ( x ( 0 ) ) 3 n ,
where the function δ : K n R + is defined by
δ ( x ) = min i j | x i x j | .
In 2007, Petković, Petković and Rančić [19] (see also ([20], Theorem 3.10)) established the following improvement of this result.
Theorem 1
(Petković, Petković and Rančić [19]). Suppose f C [ z ] is a polynomial of degree n 3 with simple zeros. If an initial approximation x ( 0 ) C n with distinct coordinates satisfies the condition
W f ( x ( 0 ) ) < c n δ ( x ( 0 ) ) w i t h c n = 1 / ( 2.2 n + 1.9 ) , 3 n 21 , 1 / ( 2.2 n ) , n 22 ,
then Nourein’s iteration (7) is convergent with an order of convergence of four.
Nourein’s method for multiple zeros. Nourein’s method (7) has a well-known but not well-studied generalization for the simultaneous finding of multiple polynomial zeros. Let f K [ z ] be a polynomial of degree n 2 which splits in K , and let ξ 1 , , ξ s (s is a positive integer such that 1 s n ) be all distinct zeros of f with multiplicity m 1 , , m s ( m 1 + + m s = n ) , respectively.
In what follows, we use a binary relation # on K s defined by
x # y x i y j for all i , j I s with i j ,
where I s = { 1 , 2 , , s } .
Let us define Nourein’s method for polynomials that has at least one multiple zero. First, we define the Schröder’s iteration function N : D N K s K s by [21]:
N ( x ) = ( N 1 ( x ) , , N s ( x ) ) with N i ( x ) = x i m i f ( x i ) f ( x i ) if f ( x i ) 0 , x i if f ( x i ) = 0 ,
where the domain of N is the set
D N = { x K s : f ( x i ) 0 whenever f ( x i ) 0 } .
Now the generalized Nourein’s method for simultaneously finding all the zeros of f is defined in K s by the following fixed-point iteration (see, e.g., ([2], Section 20) and ([22], Section 7.2)):
x ( k + 1 ) = Φ ( x ( k ) ) , k = 0 , 1 , 2 , ,
where the iteration function Φ : D Φ K s K s is defined by
Φ ( x ) = ( Φ 1 ( x ) , , Φ s ( x ) ) with Φ i ( x ) = x i m i f ( x i ) f ( x i ) j = 1 j i s m j x i N j ( x ) if f ( x i ) 0 , x i if f ( x i ) = 0 ,
and the domain of Φ is the set
D Φ = { x D N : x # N ( x ) and f ( x i ) f ( x i ) j = 1 j i s m j x i N j ( x ) 0 whenever f ( x i ) 0 } .
Contributions. In this paper, we present a detailed local convergence analysis for generalized Nourein’s method (11) for multiple zeros. As a consequence of these results, we obtain two types of local convergence theorems as well as semilocal convergence theorems (with verifiable initial condition and a posteriori error bound) for the classical Nourein’s method (7). Each of the new semilocal convergence results improves Theorem 1 in several directions.
The paper is structured as follows: Section 2 gives some notations that are used throughout the paper without specific quoting. In Section 3, we study the local convergence of generalized Nourein’s iteration (11) with respect to a function of initial conditions of the first kind. In this section, the main new result is Theorem 3. In Section 4, we study the convergence of generalized Nourein’s iteration (11) with respect to a function of initial conditions of the second kind. The main new result of this section is Theorem 5, which plays an important role in Section 7 for obtaining new semilocal convergence results for the classical Nourein’s method (7).
In Section 5, we present two new local convergence results of the first kind (Theorem 6 and Corollary 1) for the classical Nourein’s method (7). In Section 6, we obtain three new local convergence results of the second kind (Theorems 7 and 8 and Corollary 3) for the classical Nourein’s method (7).
To the best of authors’ knowledge, the theorems given in Section 3, Section 4, Section 5 and Section 6 are the first local convergence results in the literature about both Nourein’s methods (for simple or multiple zeros).
In Section 7, we provide three new semilocal convergence results for polynomials with simple zeros (Theorems 11 and 12 and Corollary 5). Each of these semilocal convergence results improves the result of Petković, Petković and Rančić [19] in several directions. Note that these results are based on some results given in [14] and some local convergence results obtained in the previous sections. Section 8 provides several numerical examples that show the applicability of our semilocal convergence theorems. Finally, the paper ends with a conclusion section.

2. Notations

In this short section, we give some notations that are used throughout the paper without specific quoting. We denote by R and R + the real and the nonnegative numbers, respectively.
Let R s be equipped with partial coordinate-wise ordering defined by
x y if and only if x i y i for each i I s ,
and let the vector space K s be equipped with the norm
x p = i = 1 s | x i | p 1 / p for some 1 p ,
and with a vector norm · (with values in R s ) defined by
x = ( | x 1 | , , | x s | ) .
Let x K s and y R s be two vectors. We denote by x y a vector in R s defined by
x y = | x 1 | y 1 , , | x s | y s
if y has only nonzero coordinates. We define a function d : K s R s by
d ( x ) = ( d 1 ( x ) , , d s ( x ) ) with d i ( x ) = min j i | x i x j | ( i = 1 , , s ) .
Also, we define a function δ : K n R + by
δ ( x ) = min i j | x i x j | .
We assume by definition that 0 0 = 1 . For two integers k 0 and r 0 , we define the quantity S k ( r ) by
S k ( r ) = j = 0 k 1 r j if k 1 ,
and S k ( r ) = 0 if k = 0 . In the short, we write
j i instead of j = 1 j i s .
Throughout the paper, J denotes an interval on R + containing 0, that is, an interval of the form [ 0 , R ] , [ 0 , R ) or [ 0 , + ) , where R is a positive number.
Definition 2
([13]). A function φ : J R + R + is said to be quasi-homogeneous of degree u 0 if
φ ( λ t ) λ u φ ( t ) f o r a l l λ [ 0 , 1 ] a n d t J .

3. Local Convergence Theorem of the First Kind for Multiple Zeros

Let f K [ z ] be a polynomial of degree n 2 which splits in K , and let ξ 1 , , ξ s be all distinct zeros of f with multiplicity m 1 , , m s , respectively. In this section, we investigate the local convergence of Nourein’s iteration (11) with respect to a function of initial conditions E : K s R + defined as follows:
E ( x ) = x ξ d ( ξ ) p ( 1 p ) ,
where ξ = ( ξ 1 , , ξ s ) . The function of initial conditions (15) has been used in [23,24,25] for studying the local convergence of the first kind of some iterative methods for simultaneous approximation of multiple polynomial zeros.
We define the quantities m = m ( m 1 , , m s ) , a = a ( p , m 1 , , m s ) and b = b ( p ) as follows:
m = min 1 i s m i and a = max 1 i s 1 m i j i m j q 1 / q and b = 2 1 / q ,
where 1 q is defined by
1 p + 1 q = 1 .
We begin this section with some useful inequalities in K s which play an important role in the proofs of our results.
Lemma 1
([5], Lemma 6.1). Let x , ξ K s and ξ have pairwise distinct coordinates. Then for all i j , the following inequalities hold:
| x i x j | ( 1 b E ( x ) ) d i ( ξ ) a n d | x i ξ j | ( 1 E ( x ) ) d i ( ξ ) ,
where E : K s R + is defined by (15) and b is defined by (16).
Lemma 2.
Let α 0 and x , y , ξ K s be three vectors such that
y ξ α x ξ .
If ξ is a vector with pairwise distinct coordinates, then for all i j , we have
| x i y j | ( 1 ( 1 + α ) E ( x ) ) d j ( ξ ) ,
where E : K s R + is defined by (15).
Proof. 
From the triangle inequality in K and the inequality (17), we obtain
| x i y j | | ξ i ξ j | | x i ξ i | | y j ξ j | | ξ i ξ j | | x i ξ i | α | x j ξ j | 1 | x i ξ i | d i ( ξ ) α | x j ξ j | d j ( ξ ) | ξ i ξ j | 1 ( 1 + α ) x ξ d ( ξ ) p d j ( ξ ) ,
which completes the proof. □
The following general convergence theorem plays a substantial role in our paper.
Theorem 2
(Proinov [26]). Let T : D K s K s be an iteration function, and let ξ K s be a vector with pairwise distinct coordinates, and let a function E : K s R + be defined by (15). Suppose ϕ : J R + is a quasi-homogeneous function of degree u 0 such that, for every vector x K s with E ( x ) J , the following conditions hold:
x D a n d T x ξ ϕ ( E ( x ) ) x ξ .
Let x ( 0 ) K s be an initial guess such that
E ( x ( 0 ) ) J a n d ϕ ( E ( x ( 0 ) ) ) < 1 .
Then the Picard iteration x ( k + 1 ) = T ( x ( k ) ) is well-defined and converges to ξ with order r = u + 1 and with error estimates
x ( k + 1 ) ξ λ r k x ( k ) ξ a n d x ( k ) ξ λ S k ( r ) x ( 0 ) ξ f o r a l l k 0 ,
where λ = ϕ ( E ( x ( 0 ) ) ) .
Before formulating the main result of this section, we need a few more lemmas.
Lemma 3
([12], Lemma 4.4). Let f K [ z ] be a polynomial of degree n 2 which splits in K , and let ξ 1 , , ξ s be all distinct zeros of f with multiplicity m 1 , , m s , respectively. Suppose a vector x K s satisfies
E ( x ) < m n ,
where the function E : K s R + is defined by (15) and m is given by (16). Then
x D N a n d N ( x ) ξ ( n m ) E ( x ) m n E ( x ) x ξ ,
where the iteration function N : D N K s K s is defined by (9).
Lemma 4.
Let f K [ z ] be a polynomial of degree n 2 which splits in K , and let ξ 1 , , ξ s be all distinct zeros of f with multiplicity m 1 , , m s , respectively. Suppose a vector x D Φ is such that f ( x i ) 0 for some i. Then
Φ i ( x ) ξ i = σ i 1 σ i ( x i ξ i ) ,
where the iteration function Φ : D Φ K s K s is given by (12) and σ i K is given by
σ i = x i ξ i m i j i m j ( N j ( x ) ξ j ) ( x i ξ j ) ( x i N j ( x ) ) .
Proof. 
Taking into account that ξ 1 , , ξ s are the zeros of f with multiplicity m 1 , , m s , we get
f ( x i ) f ( x i ) j i m j x i N j ( x ) = m i x i ξ i + j i m j x i ξ j j i m j x i N j ( x ) = m i x i ξ i j i m j ( N j ( x ) ξ j ) ( x i ξ j ) ( x i N j ( x ) ) = m i ( 1 σ i ) x i ξ i ,
where σ i is given by (19). From (12) and (20), we have
Φ i ( x ) ξ i = x i ξ i x i ξ i 1 σ i = σ i 1 σ i ( x i ξ i ) ,
which proves (18). □
In this and the next section, we use the following polynomial functions:
A ( t ) = ( 1 t ) ( m ( n + m ) t + m t 2 ) a ( n m ) t 3 ,
B ( t ) = ( 1 t ) ( m ( n + m ) t + m t 2 ) 2 a ( n m ) t 3 ,
Q ( t ) = ( 1 b t ) ( 1 t ) ( m ( n + m ) t + m t 2 ) 2 a ( n m ) t 3 ,
where m, a and b are given by (16).
It is easy to show that each of the functions A and B strictly decreases on R and has a unique zero which lies in the interval ( 0 , m / n ] . If η ( 0 , m / n ) is the unique solution of the equation A ( t ) = 0 , then it can be shown that
A ( t ) > Q ( t ) for every t ( 0 , η ) .
Lemma 5.
Let f K [ z ] be a polynomial of degree n 2 which splits in K , and let ξ 1 , , ξ s be all distinct zeros of f with multiplicity m 1 , , m s , respectively. Suppose x K s is such that
A ( E ( x ) ) > 0 ,
where the function E : K s R + is defined by (15) and the function A is defined by (21). Then x D Φ and
Φ ( x ) ξ ϕ ( E ( x ) ) x ξ ,
where the function ϕ is defined by
ϕ ( t ) = a ( n m ) t 3 ( 1 t ) ( m ( n + m ) t + m t 2 ) a ( n m ) t 3
with a and m defined by (16).
Proof. 
First we prove that x D Φ . It follows from (25) that E ( x ) < m / n , which according to Lemma 3 shows that x D N . We can apply Lemma 3 with y = N ( x ) since the inequality (17) holds with
α = ( n m ) E ( x ) m n E ( x ) .
Then by Lemma 2 and (25), we obtain
| x i N j ( x ) | 1 m ( 1 E ( x ) ) m n E ( x ) E ( x ) d j ( ξ ) = m ( n + m ) E ( x ) + m E ( x ) 2 m n E ( x ) d j ( ξ ) > 0
for all i j . Consequently, x # N ( x ) . Now suppose f ( x i ) 0 . According to (13), it remains to prove that
f ( x i ) f ( x i ) j i m j x i N j ( x ) 0 .
Using (20), we get that (30) holds true if and only if σ i 1 , where σ i is given by (19). By the triangle inequality, Lemma 3, the second part of Lemma 1, the inequality (29), Hölder’s inequality and condition (25), we obtain for σ i the following estimate:
| σ i | | x i ξ i | d i ( ξ ) ( n m ) E ( x ) ( 1 E ( x ) ) ( m ( n + m ) E ( x ) m E ( x ) 2 ) 1 m i j i m i | x j ξ j | d j ( ξ ) a ( n m ) E ( x ) 3 ( 1 E ( x ) ) ( m ( n + m ) E ( x ) + m E ( x ) 2 ) < 1 ,
which yields σ i 1 . Hence, x D Φ . To prove (26), we have to show that
| Φ i ( x ) ξ i | ϕ ( E ( x ) ) | x i ξ i | for every i = 1 , , s .
If x i = ξ i , then Φ i ( x ) = ξ i and so (32) becomes an equality. Suppose x i ξ i . From Lemma 4 and the triangle inequality, we obtain
| Φ i ( x ) ξ i | | σ i | 1 | σ i | | x i ξ i | a ( n m ) E ( x ) 3 ( 1 E ( x ) ) ( m ( n + m ) E ( x ) + m E ( x ) 2 ) a ( n m ) E ( x ) 3 | x i ξ i | = ϕ ( E ( x ) ) | x i ξ i | ,
which completes the proof. □
We are ready now to state the first main result of this paper.
Theorem 3.
Let f K [ z ] be a polynomial of degree n 2 which splits in K , and let ξ 1 , , ξ s be all distinct zeros of f with multiplicity m 1 , , m s , respectively. Suppose x ( 0 ) K s is an initial guess satisfying the following condition:
B ( E ( x ( 0 ) ) ) > 0 ,
where the functions E and B are defined by (15) and (22), respectively. Then Nourein’s iteration (11) is well-defined and converges with fourth-order to the root vector ξ = ( ξ 1 , , ξ s ) with the following error estimates:
x ( k + 1 ) ξ λ 4 k x ( k ) ξ a n d x ( k ) ξ λ ( 4 k 1 ) / 3 x ( 0 ) ξ f o r a l l k 0 ,
where λ = ϕ ( E ( x ( 0 ) ) ) and the function ϕ is defined by (27).
Proof. 
We shall apply Theorem 2 to the iteration function Φ : D Φ K s K s defined by (12). Let η > 0 be the unique solution of the equation A ( t ) = 0 . The function ϕ is quasi-homogeneous of degree m = 3 on [ 0 , η ) . It follows from Lemma 5 that, for every vector x K s with E ( x ) < η , we have x D Φ and that the inequality (26) holds. Then it follows from Theorem 2 that, under the initial condition
E ( x ( 0 ) ) < η and ϕ ( E ( x ( 0 ) ) ) < 1 ,
the iteration (11) is well-defined and converges to ξ with order r = 4 and with error estimates (34). It is easy to see that the above initial condition is equivalent to (33). This completes the proof. □

4. Local Convergence Theorem of the Second Kind for Multiple Zeros

Let f K [ z ] be a polynomial of degree n 2 which splits in K , and let ξ 1 , , ξ s be all distinct zeros of f with multiplicity m 1 , , m s , respectively. In this section, we investigate the local convergence of Nourein’s iteration (11) with respect to the function of initial conditions E : D R + defined as follows:
E ( x ) = x ξ d ( x ) p ( 1 p ) ,
where ξ = ( ξ 1 , , ξ s ) . Recall that D denotes the set of all vectors in K s with pairwise distinct coordinates. We note that the function of initial conditions (35) has been used in [23,24,25] to study the local convergence of the second kind of some iterative methods for finding simultaneously multiple polynomial zeros.
Lemma 6
([5], Lemma 7.1). Let x , ξ K s . If x has pairwise distinct coordinates, then for all i j , the following inequalities hold:
| x i ξ j | ( 1 E ( x ) ) d i ( x ) a n d | x i x j | d j ( x )
where E : D R + is defined by (35).
Lemma 7
([27], Lemma 3.4). Let α 0 and x , y , ξ K s be three vectors satisfying (17). If x is a vector with pairwise distinct coordinates, then for all i j , we have
| x i y j | ( 1 ( 1 + α ) E ( x ) ) d j ( x ) ,
where E : D R + is defined by (35).
Theorem 4
(Proinov [26]). Let T : D K s K s be an iteration function, ξ K s be a vector, and let a function E : D K s R + be defined by (35). Suppose β : J R + is a nonzero quasi-homogeneous function of degree u 0 such that, for any x D with E ( x ) J , we have
x D a n d T x ξ β ( E ( x ) ) x ξ .
Let x ( 0 ) D be an initial guess such that
E ( x ( 0 ) ) J a n d Ψ ( E ( x ( 0 ) ) ) 0 ,
where the function Ψ : J R is defined by
Ψ ( t ) = 1 b t β ( t ) ( 1 + b t ) .
Then the Picard iteration x ( k + 1 ) = T ( x ( k ) ) is well-defined and converges to ξ with error estimates
x ( k + 1 ) ξ θ λ r k x ( k ) ξ a n d x ( k ) ξ θ k λ S k ( r ) x ( 0 ) ξ f o r a l l k 0 ,
where r = u + 1 , λ = ϕ ( E ( x ( 0 ) ) ) , θ = ψ ( E ( x ( 0 ) ) ) and the functions ψ and ϕ are defined by
ψ ( t ) = 1 b t ( 1 + β ( t ) ) a n d ϕ ( t ) = β ( t ) / ψ ( t ) .
Besides, if the inequality in (36) is strict, then the order of convergence is at least r.
Now, we define the real functions β , Ψ , ψ , as follows:
β ( t ) = a ( n m ) t 3 ( 1 t ) ( m ( n + m ) t + m t 2 ) a ( n m ) t 3 ,
Ψ ( t ) = 1 b t β ( t ) ( 1 + b t ) = ( 1 b t ) ( 1 t ) ( m ( n + m ) t + m t 2 ) 2 a ( n m ) t 3 ( 1 t ) ( m ( n + m ) t + m t 2 ) a ( n m ) t 3 ,
ψ ( t ) = 1 b t ( 1 + β ( t ) ) = 1 b t ( 1 t ) ( m ( n + m ) t + m t 2 ) ( 1 t ) ( m ( n + m ) t + m t 2 ) a ( n m ) t 3 ,
where a, b and m are defined by (16). Let η > 0 be the unique zero of the function A, where A is defined by (21). It is easy to show that β strictly increases on [ 0 , η ) and that it is quasi-homogeneous of degree u = 3 on [ 0 , η ) . The function Ψ strictly decreases [ 0 , η ) and it has a unique zero on [ 0 , η ) because Ψ ( 0 ) = 1 and lim t η Ψ ( t ) = + . On the other hand, Ψ ( t ) = Q ( t ) / A ( t ) for every t [ 0 , η ) , where the functions A and Q are defined by (21) and (23). Thus, we conclude that the function Q has a unique zero on [ 0 , η ) too.
Analogously to Lemma 3, we can prove the following lemma:
Lemma 8.
Let f K [ z ] be a polynomial of degree n 2 which splits in K , and let ξ 1 , , ξ s be all distinct zeros of f with multiplicity m 1 , , m s , respectively. Suppose x D is a vector satisfying
E ( x ) < m n ,
where the function E : D R + is defined by (35) and m is defined by (16). Then,
x D N a n d N ( x ) ξ ( n m ) E ( x ) m n E ( x ) x ξ ,
where the iteration function N : D N K s K s is defined by (9).
Proof. 
The first part of (40) follows from the first part of Lemma 3. The second part of (40) is equivalent to
| N i ( x ) ξ i | ( n m ) E ( x ) m n E ( x ) | x i ξ i |
for every i = 1 , , s . If x i = ξ i , then (41) holds trivially. Suppose that x i ξ i . Then it is easy to show that
N i ( x ) ξ i = μ i 1 + μ i ( x i ξ i ) ,
where
μ i = x i ξ i m i j i m i x i ξ j .
From the triangle inequality and Lemma 6, we obtain
| μ i | | x i ξ i | m i j i m i | x i ξ j | | x i ξ i | m i ( 1 E ( x ) ) d i ( x ) j i m j ( n m ) E ( x ) m ( 1 E ( x ) ) < 1 .
From (43), we obtain the following estimate:
| 1 + μ i | 1 | μ i | m n E ( x ) m ( 1 E ( x ) ) > 0 .
From (42) and the estimates (43) and (44), we get (41), which completes the proof. □
Lemma 9.
Let f K [ z ] be a polynomial of degree n 2 which splits in K , and let ξ 1 , , ξ s be all distinct zeros of f with multiplicity m 1 , , m s , respectively. Suppose a vector x K s with distinct coordinates satisfies
A ( E ( x ) ) > 0 ,
where the functions E and A are defined by (35) and (21) respectively. Then x D Φ and
Φ ( x ) ξ β ( E ( x ) ) x ξ ,
where the function β is defined by (37).
Proof. 
The proof is carried out in the same way as the proof of Lemma 5 using Lemma 6, Lemma 7 and Lemma 9 instead of Lemma 1, Lemma 2 and Lemma 3, respectively. □
Now, we are ready to state and prove the main result in this section.
Theorem 5.
Let f K [ z ] be a polynomial of degree n 2 which splits in K , and let ξ 1 , , ξ s be all distinct zeros of f with multiplicity m 1 , , m s , respectively. Suppose x ( 0 ) K n is an initial guess with distinct coordinates which satisfies the following conditions:
A ( E ( x ( 0 ) ) ) > 0 a n d Q ( E ( x ( 0 ) ) ) 0 ,
where the functions E, A and Q are defined by (35), (21) and (23), respectively. Then the iteration (11) is well-defined and converges to ξ with error estimates
x ( k + 1 ) ξ θ λ 4 k x ( k ) ξ a n d x ( k ) ξ θ k λ ( 4 k 1 ) / 3 x ( 0 ) ξ f o r a l l k 0 ,
where λ = ϕ ( E ( x ( 0 ) ) ) , θ = ψ ( E ( x ( 0 ) ) ) , ϕ = β / ψ and ψ is defined by (39). Moreover, if the second inequality in (47) is strict, then the convergence order is at least four.
Proof. 
We shall apply Theorem 4 to the iteration function Φ : D Φ K s K s defined by (12). Let η be the unique positive solution of the equation A ( t ) = 0 . The function β is quasi-homogeneous of degree m = 3 on [ 0 , η ) . It follows from Lemma 9 that, for every vector x D with E ( x ) < η , we have x D Φ and the inequality (46) holds. Then it follows from Theorem 4 that under the initial condition
E ( x ( 0 ) ) < η and Ψ ( E ( x ( 0 ) ) ) 0 ,
the iteration (11) is well-defined and converges to ξ with order r = 4 and with error estimates (48). Taking into account that Ψ ( t ) = Q ( t ) / A ( t ) , we can see that the initial conditions (47) and (49) are equivalent. This completes the proof. □

5. Local Convergence Theorem of the First Kind for Simple Zeros

Let f K [ z ] be a polynomial of degree n 2 which has n simple zeros in K , and let ξ K n be a root vector of the polynomial f. In this section, we study the local convergence of the classical Nourein’s method (7) with respect to the function of initial conditions E : K n R + defined by
E ( x ) = x ξ d ( ξ ) p ( 1 p ) .
Now the real functions A, B, Q and ϕ , considered in Section 3, take the following forms:
A ( t ) = ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) a ( n 1 ) t 3 ,
B ( t ) = ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) 2 a ( n 1 ) t 3 ,
Q ( t ) = ( 1 b t ) ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) 2 a ( n 1 ) t 3 ,
ϕ ( t ) = a ( n 1 ) t 3 ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) a ( n 1 ) t 3 ,
where a and b are defined by
a = ( n 1 ) 1 / q and b = 2 1 / q .
As a consequence of Theorem 3, we get the following two convergence results for simple zeros.
Theorem 6.
Let f K [ z ] be a polynomial of degree n 2 which has n simple zeros in K , and let ξ K n be a root vector of f. Suppose that x ( 0 ) K n is an initial guess satisfying the following condition:
B ( E ( x ( 0 ) ) ) > 0 ,
where the functions E and B are defined by (50) and (52), respectively. Then Nourein’s iteration (7) is well-defined and converges with fourth-order to ξ with the following error estimates:
x ( k + 1 ) ξ λ 4 k x ( k ) ξ a n d x ( k ) ξ λ ( 4 k 1 ) / 3 x ( 0 ) ξ f o r a l l k 0 ,
where λ = ϕ ( E ( x ( 0 ) ) ) and the function ϕ is defined by (54).
Corollary 1.
Let f K [ z ] be a polynomial of degree n 2 that has n simple zeros in K , and let ξ K n be a root vector of f. If x ( 0 ) K n is an initial guess satisfying
E ( x ( 0 ) ) = x ( 0 ) ξ d ( ξ ) < 4 7 n 2 ,
then Nourein’s iteration (7) is well-defined and converges with fourth-order to ξ with error estimates
x ( k + 1 ) ξ λ 4 k x ( k ) ξ a n d x ( k ) ξ λ ( 4 k 1 ) / 3 x ( 0 ) ξ f o r a l l k 0 ,
where the real function ϕ is defined by
ϕ ( t ) = ( n 1 ) 2 t 3 ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) ( n 1 ) 2 t 3 .
Proof. 
Let x ( 0 ) K n be an initial guess satisfying condition (57). In view of Theorem 6 ( p = ), we have to prove that x ( 0 ) satisfies the initial condition (56). In the case p = , the function B defined by (52) takes the form
B ( t ) = ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) 2 ( n 1 ) 2 t 3 .
Using condition (57) and the monotonicity of the function B, we obtain
B ( E ( x ( 0 ) ) ) > B 4 7 n 2 = 147 n 3 590 n 2 + 740 n 296 ( 7 n 2 ) 3 = ( n 2 ) ( 147 n 2 296 n 2 + 148 ) ( 7 n 2 ) 3 0
which completes the proof. □
The following convergence result is an immediate consequence of Corollary 1.
Corollary 2.
Let f K [ z ] be a polynomial of degree n 2 that has n simple zeros in K , and let ξ K n be a root vector of f. Suppose a vector x ( 0 ) K n satisfies an initial condition
x ( 0 ) ξ < 4 7 n 2 δ ( ξ ) ,
where the function δ is defined by (14). Then Nourein’s iteration (7) is well-defined and converges with fourth-order to ξ.
Remark 1.
For the first time, an initial condition of the type (58) was presented by Dochev [28]. He has proved that, if f C [ z ] is a polynomial of degree n 2 and has only simple zeros, then Weierstrass’s iteration (3) is well-defined and convergent quadratically to a root vector ξ C n of f under the initial condition
x ( 0 ) ξ < 2 n 1 1 2 2 n 1 1 δ ( ξ ) ,
where the function δ is defined by (14).
For other local convergence theorems of the first kind for other simultaneous iterative methods, we refer to [5,9,29,30,31,32,33].

6. Local Convergence Theorem of the Second Kind for Simple Zeros

Let f K [ z ] be a polynomial of degree n 2 that has n simple zeros in K and let ξ K n be a root vector of f. We study the convergence of the classical Nourein’s method (7) with respect to a function of initial conditions E : D R + defined by
E ( x ) = x ξ d ( x ) p ( 1 p ) .
Define real functions β , Ψ , ψ and ϕ as follows:
β ( t ) = a ( n 1 ) t 3 ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) a ( n 1 ) t 3 ,
Ψ ( t ) = ( 1 b t ) ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) 2 a ( n 1 ) t 3 ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) 2 ( n 1 ) t 3 ,
ϕ ( t ) = a ( n 1 ) t 3 1 2 t ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) , ψ ( t ) = 1 1 b t ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) a ( n 1 ) t 3 ,
where a and b are defined by (55).
Applying Theorem 5 to the polynomials with simple zeros, we get the following convergence theorem.
Theorem 7.
Let f K [ z ] be a polynomial of degree n 2 that has n simple zeros in K , and let ξ K n be a root vector of f. Suppose that x ( 0 ) K n is an initial guess with distinct coordinates which satisfies the following condition:
A ( E ( x ( 0 ) ) ) > 0 a n d Q ( E ( x ( 0 ) ) ) 0 ,
where the functions E and Q are defined by (59) and (53), respectively. Then Nourein’s iteration (7) is well-defined and converges to ξ with error estimates
x ( k + 1 ) ξ θ λ 4 k x ( k ) ξ a n d x ( k ) ξ θ k λ ( 4 k 1 ) / 3 x ( 0 ) ξ f o r a l l k 0 ,
where λ = ϕ ( E ( x ( 0 ) ) ) , θ = ψ ( E ( x ( 0 ) ) ) and the functions ϕ and ψ are defined by (62). Moreover, if the second inequality in (63) is strict, then the rate of convergence is of order four.
In the case p = , as a consequence of Theorem 5, we obtain the next two results. Now, we define the function of initial conditions E : D R + by
E ( x ) = x ξ d ( x ) .
Theorem 8.
Let f K [ z ] be a polynomial of degree n 2 that has n simple zeros in K , and let ξ K n be a root vector of f. Suppose that x ( 0 ) K n is an initial guess with distinct coordinates which satisfies the following condition:
E ( x ( 0 ) ) < 1 n a n d Q ( E ( x ( 0 ) ) ) 0 ,
where the function E is defined by (65) and the function Q is defined by
Q ( t ) = ( 1 2 t ) ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) 2 ( n 1 ) 2 t 3 .
Then Nourein’s iteration (7) is well-defined and converges to ξ with error estimates (64), where λ = ϕ ( E ( x ( 0 ) ) ) , θ = ψ ( E ( x ( 0 ) ) ) and ϕ and ψ are defined by
ϕ ( t ) = ( n 1 ) 2 t 3 1 2 t ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) a n d ψ ( t ) = 1 2 t ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) ( 1 t ) ( 1 ( n + 1 ) t + t 2 ) ( n 1 ) 2 t 3 .
Moreover, if the second inequality in (66) is strict, then the convergence order is at least four.
Proof. 
According to Theorem 7, we have to prove that A ( E ( x ( 0 ) ) ) > 0 . It can be proved that Q strictly decreases on [ 0 , 1 / n ) , Q ( 0 ) = 1 and Q ( 1 / n ) = ( n 1 ) ( 3 n 2 4 n + 1 ) / n 4 < 0 . Hence, Q has a unique zero R on ( 0 , 1 / n ) . Then it follows from (66) that
E ( x ( 0 ) ) R .
On the other hand, we have mentioned above that the function Q has a unique zero on ( 0 , η ) . Consequently, R < η . Then we deduce that E ( x ( 0 ) ) < η , which yields A ( E ( x ( 0 ) ) ) > 0 . This completes the proof. □
Corollary 3.
Let f K [ z ] be a polynomial of degree n 2 that has n simple zeros in K , and let ξ K n be a root vector of f. Suppose that x ( 0 ) K n is an initial guess with distinct coordinates which satisfies the following initial condition:
E ( x ( 0 ) ) 23 24 n + 44 ,
where the function E is defined by (65). Then Nourein’s iteration (7) is well-defined and converges to ξ with order of convergence four and with error estimates (64).
Proof. 
In view of Theorem 8, we have to show that x ( 0 ) satisfies the initial condition (66). We shall prove the second inequality in (66) because the first one is trivial. From condition (68) and the fact that Q defined by (67) strictly decreases on ( 0 , 1 / n ) , we obtain
Q ( E ( x ( 0 ) ) ) Q 23 24 n + 44 = 6912 n 4 128712 n 3 + 591572 n 2 + 1098464 n 565861 128 ( 6 n + 11 ) 4 > 0 .
Hence, the condition (66) is satisfied, which completes the proof. □
From Corollary 3, we immediately obtain the following convergence result.
Corollary 4.
Let f K [ z ] be a polynomial of degree n 2 that has n simple zeros and let ξ K n be a root vector of f. Suppose x 0 K n is a vector with distinct coordinates satisfying the initial condition
x ( 0 ) ξ < 23 24 n + 44 δ ( ξ ) ,
where the function δ is defined by (14). Then Nourein’s iteration (7) is well-defined and converges to ξ with order of convergence four.
Remark 2.
For the first time, an initial condition of the type (69) was presented by Wang and Zhao [34]. They proved that, if f C [ z ] is a polynomial of degree n 2 and has only simple zeros, then Ehrlich’s iteration (5) is well-defined and convergent cubically to a root vector ξ C n of f under the initial condition
x ( 0 ) ξ < 2 8 + 8 n 7 δ ( x ( 0 ) ) ,
where the function δ is defined by (14).
For local convergence of the second kind for other iterative method, we refer to [5,9,27,29,31,32,35].

7. Semilocal Convergence Analysis for Simple Zeros

In this section, we establish three semilocal convergence theorems for the classical Nourein’s method (7). Each of these results improves Theorem 1 in several directions.
Let f K [ z ] be a polynomial of degree n 2 . We study the convergence of the method (7) with respect to the function of initial conditions E f : D R + defined by
E f ( x ) = W f ( x ) d ( x ) p ( 1 p ) ,
where the operator W f is defined by (4). We define a relation of equivalence ≡ on K n by x y if there exists a permutation ( i 1 , , i n ) of the indexes ( 1 , , n ) such that
( x 1 , , x n ) = ( y i 1 , , y i n ) .
Now, we can define a distance between two vectors x , y C n as follows [32,33]:
ρ ( x , y ) = min v y x v p .
For proof of the main theorem of this section, we need two results from [14]. In order to make the paper self-contained, we include recall of these results.
Theorem 9
([14], Theorem 5.1). Let K be an algebraically closed field and let f K [ z ] be a polynomial of degree n 2 . Suppose that x K n is a vector with distinct coordinates satisfying
E f ( x ) = W f ( x ) d ( x ) p < τ = 1 ( 1 + a ) 2 ,
where a is defined by (55). Then f has only simple zeros and there exists a root vector ξ K n of f such that
ρ ( x , ξ ) α ( E f ( x ) ) W f ( x ) p a n d x ξ d ( x ) p < h ( E f ( x ) ) ,
where the distance function ρ is defined by (71) and the real functions α , h : [ 0 , τ ] R + are defined by
α ( t ) = 2 1 ( a 1 ) t + ( 1 ( a 1 ) t ) 2 4 t a n d h ( t ) = t α ( t ) .
We note that the functions α and h strictly increase on [ 0 , τ ] , where τ is defined by (72).
Theorem 10
([14], Theorem 5.2). Let K be algebraically closed field, and let f K [ z ] be a polynomial of degree n 2 . Suppose that x K n is a vector with distinct coordinates such that
E f ( x ) = W f ( x ) d ( x ) p < R ( 1 R ) 1 + ( a 1 ) R
and 0 R 1 / ( 1 + a ) , where W f is defined by (4) and a is defined by (55). Then polynomial f has only simple zeros in K and there exists a root vector ξ K n of f such that
ρ ( x , ξ ) α ( E f ( x ) ) W f ( x ) a n d x ξ d ( x ) p < R ,
where the function α is defined by (74).
Now, we can state and prove the main results of this paper.
Theorem 11.
Let K be an algebraically closed field. and let f K [ z ] be a polynomial of degree n 2 . Suppose that x ( 0 ) K n is an initial guess with distinct coordinates satisfying
E f ( x ( 0 ) ) < n 1 n ( a + n + 1 ) , A ( h ( E f ( x ( 0 ) ) ) ) > 0 a n d Ψ ( h ( E f ( x ( 0 ) ) ) ) > 0 ,
where a is defined by (55) and the functions E f , A, Q and h are defined by (70), (51), (53) and (74), respectively. Then f has only simple zeros in K and Nourein’s iteration (7) is well-defined and converges to a root vector ξ of f with order of convergence four and with a posteriori error estimate
ρ ( x ( k ) , ξ ) α ( E f ( x ( k ) ) ) W f ( x ( k ) ) p f o r a l l k 0 s u c h t h a t E f ( x ( k ) ) < τ ,
where the distance function ρ is defined by (71), τ is defined by (72) and the real function α is defined by (74).
Proof. 
First, we note that the function h is defined on [ 0 , τ ] by (74), where τ is defined by (72). It follows from the first inequality of (77) that
E f ( x ( 0 ) ) < n 1 n ( a + n + 1 ) τ .
Then by Theorem 9, it follows that f has n simple zeros in K and there exists a root vector ξ K n of f such that
E ( x ( 0 ) ) < h ( E f ( x ( 0 ) ) ) ,
where the function E : D R + is defined by (59). From the second inequality of (77), we conclude that h ( E f ( x ( 0 ) ) ) < η , where η is the unique solution of the equation A ( t ) = 0 . Now from (79), we conclude that E f ( x ( 0 ) ) < η , which yields the inequality
A ( E f ( x ( 0 ) ) ) > 0 .
Define the real function Ψ on [ 0 , η ) by (61). From the inequality (79), taking into account that the functions Ψ strictly decreases on [ 0 , η ) , we obtain Ψ ( E ( x ( 0 ) ) ) > Ψ ( h ( E f ( x ( 0 ) ) ) ) > 0 , which implies
Q ( E f ( x ( 0 ) ) ) > 0
since E f ( x ( 0 ) ) < η and Ψ ( t ) = Q ( t ) / A ( t ) for every t [ 0 , η . Hence, the initial guess x ( 0 ) satisfies the condition (63). Now, it follows from Theorem 7 that the iteration (7) is well-defined and converges to ξ with order of convergence four. The error estimate (78) follows from Theorem 9. □
Furthermore, we shall consider two semilocal convergence results in the case p = . In this case, the function of initial conditions E f : D R + is defined by
E f ( x ) = W f ( x ) d ( x )
and the functions α and h are defined on [ 0 , τ ] by
α ( t ) = 2 1 ( n 2 ) t + ( 1 ( n 2 ) t ) 2 4 t and h ( t ) = t α ( t ) ,
where τ > 0 is defined by
τ = 1 ( 1 + n 1 ) 2 .
The distance function ρ is defined on K n by
ρ ( x , y ) = min u y x u .
Let us define a real function Ω as follows
Ω ( t ) = Q ( h ( t ) ) ,
where Q and h are defined by (67) and (81), respectively.
Theorem 12.
Let K be an algebraically closed field, and let f K [ z ] be a polynomial of degree n 2 . Suppose that x ( 0 ) K n is an initial guess with distinct coordinates satisfying
E f ( x ( 0 ) ) < 1 2 n a n d Ω ( E f ( x ( 0 ) ) ) > 0 ,
where the functions E f and Ω are defined by (80) and (84), respectively. Then f has only simple zeros in K and Nourein’s iteration (7) is well-defined and converges to a root vector ξ of f with order of convergence four and with a posteriori error estimate
ρ ( x ( k ) , ξ ) α ( E f ( x ( k ) ) ) W f ( x ( k ) ) f o r a l l k 0 s u c h t h a t E f ( x ( k ) ) < τ ,
where the distance function ρ is defined by (83), τ is defined by (82) and the real function α is defined by (81).
Proof. 
It follows from the first inequality of (85) that E f ( x ( 0 ) ) < 1 / ( 2 n ) τ , where τ is defined by (82). By Theorem 9, we conclude that f has only simple zeros and there exists a root vector ξ K n of f such that E ( x ( 0 ) ) < h ( E f ( x ( 0 ) ) ) , where the function E : D R + is defined by (65). Then by monotonicity of h, we get
E ( x ( 0 ) ) < h ( E f ( x ( 0 ) ) ) < h 1 2 n = 1 n .
From this and the second inequality in (77), taking into account that Q is strictly decreasing on [ 0 , 1 / n ) , we obtain
Q ( E ( x ( 0 ) ) ) > Q ( h ( E f ( x ( 0 ) ) ) ) = Ω ( E f ( x ( 0 ) ) ) > 0 .
It follows from Theorem 8 that the iteration (7) is well-defined and converges to ξ with order of convergence four. The error estimate (86) follows from the Theorem 9. □
Using Corollary 3 and Theorem 10, we obtain the next semilocal result.
Corollary 5.
Let K be an algebraically closed field and f K [ z ] be a polynomial of degree n 2 . Suppose that x ( 0 ) K n is an initial guess with distinct coordinates satisfying
E f ( x ( 0 ) ) 69 ( 8 n + 7 ) 1128 n 2 + 2020 n 88 ,
where the function E is defined by (80). Then f has only simple zeros in K and Nourein’s iteration (7) is well-defined and converges to a root vector ξ of f with order of convergence four and with a posteriori error estimate (86).
Proof. 
The initial condition (87) can be represented in the form (75) with R defined by
R = 23 24 n + 44 .
It is easy to check that R < 1 / ( 1 + n 1 ) . Then it follows from Theorem 10 that f has only simple zeros in K and there exists a root vector ξ K n of f such that
E ( x ( 0 ) ) < R ,
where the function E : D R + is defined by (65). Now, Corollary 3 implies that Nourein’s iteration (7) converges to ξ with order of convergence four. As we have mentioned in the proof of the previous theorem, the error estimate (86) follows from the Theorem 9. This ends the proof. □
Remark 3.
We note that each of our semilocal convergence results (Theorems 11 and 12 and Corollary 5) improves and complements (with a posteriori error estimate) the result of Petković, Petković and Rančić [19] (see Theorem 1 above). In particular, they give larger convergence domains than Theorem 1 and they do not require in advance the simplicity of the zeros of f. For instance, let us prove that Corollary 5 is an improvement of Theorem 1. Let an initial vector satisfies the initial condition (8). Then,
E f ( x ( 0 ) ) = W f ( x ( 0 ) ) d ( x ( 0 ) ) W f ( x ( 0 ) ) δ ( x ( 0 ) ) < c n < 69 ( 8 n + 7 ) 1128 n 2 + 2020 n 88 ,
which shows that the initial condition (87). Hence, it follows from Corollary 5 that the conclusion of Theorem 1 holds.
Semilocal convergence of the same kind as above results can be found in [5,27,31,32,33,35,36].

8. Numerical Examples

In this section, we present three numerical examples to show the applicability of Theorem 12. Let f C [ z ] be a polynomial of degree n 2 , and let x ( 0 ) C n be an initial guess. Starting from the vector x ( 0 ) , we generate Nourein’s iterative sequence ( x ( k ) ) k = 0 for the polynomial f. Each of our semilocal convergence results (Theorem 11 and 12 and Corollary 5) gives the following two very useful criteria:
  • Convergence criterion that determines whether Nourein’s method is convergent.
  • Accuracy criterion that determines whether Nourein’s method has reached a preset accuracy ε > 0 . It can be used as stopping criterion.
Below, we consider the criteria obtained from Theorem 12. As in the previous sections, we define the functions E f , W f , Ω and α by (4), (80), (81) and (84), respectively.
Convergence criterion. If there exists an integer m 0 such that
E f ( x ( m ) ) μ = 1 2 n and Ω ( E f ( x ( m ) ) ) 0 ,
then f has only simple zeros and Nourein’s iteration (7) starting from x ( 0 ) is well-defined and converges to a root vector ξ of f with order of convergence four. In each example, we calculate the smallest m that satisfies convergence criterion (88).
Accuracy criterion (stopping criterion). Let ε > 0 . If there exists an integer k 0 , such that
E f ( x ( k ) ) τ = 1 ( 1 + n 1 ) 2 and ε k = α ( E f ( x ( k ) ) ) W f ( x ( k ) ) ε ,
then the iterate x ( k ) approximates the vector of zeros of f with accuracy ε . Moreover, the guaranteed accuracy is ε k . Indeed, according to Theorem 11, we have
ρ ( x ( k ) , ξ ) < ε ,
where the distance function ρ is defined by (71) and ξ C n is a root vector of f. In each example, we calculate the smallest k that satisfies accuracy criterion (89) with
ε = 10 15 .
In the examples, we apply Nourein’s method to three monic polynomials f of degree 16 n 21 taken from [17]. In each example, we choose two types of very crude initial approximations x ( 0 ) C n as follows:
First type of initial approximations. For a monic polynomial
f ( z ) = z n + a 1 z n 1 + + a n 1 z + a n
of degree n 2 , we choose the coordinates x 1 ( 0 ) , , x n ( 0 ) of the initial vector x ( 0 ) C n by Aberth’s formula [37]:
x j ( 0 ) = a 1 n + r 0 exp ( i θ j ) , θ ν = π n 2 j 3 2 , j = 1 , , n ,
where r 0 > 0 is a parameter.
The Aberth’s initial approximations x 1 ( 0 ) , , x n ( 0 ) are equidistantly spaced on the circle with radius with center a 1 / n and radius r 0 . We take very crude Aberth’s approximations on the circle with radius r 0 = 200 for Example 1 and r 0 = 10 for Examples 2 and 3.
Second type of initial approximations. We choose the coordinates x 1 ( 0 ) , , x n ( 0 ) of the initial vector x ( 0 ) C n randomly in the square
{ z C : | Re ( z ) | r 0 and | Im ( z ) | r 0 } ,
where r 0 is the same as in the previous type of initial approximations.
We use CAS Wolfram Mathematica 11 to implement the corresponding algorithms and to present approximations of higher accuracy.
Example 1.
Let us consider Wilkinson’s ill-conditioned polynomial, often a hard nut to crack for most methods:
f 1 ( z ) = z 18 76 z 17 + 2451 z 16 42636 z 15 + 405042 z 14 1480632 z 13 9162218 z 12 + 124928648 z 11 407525547 z 10 1153431708 z 9 + 11555719383 z 8 18182560188 z 7 73778959736 z 6 + 272611286816 z 5 23388233616 z 4 923526085824 z 3 + 833270250240 z 2 + 670127385600 z 747242496000 = k = 5 k 0 13 ( z k ) .
Our random initial guess x ( 0 ) for Example 1 is
x ( 0 ) = { 23.842 12.426 i , 196.390 136.008 i , 180.322 50.643 i , 183.774 + 14.551 i , 113.220 + 159.627 i , 159.800 + 22.698 i , 27.676 167.497 i , 195.857 + 18.631 i , 178.262 + 114.435 i , 23.150 + 157.953 i , 69.927 + 5.095 i , 81.282 137.910 i , 154.174 68.812 i , 57.413 151.660 i , 10.138 + 98.167 i , 115.399 33.393 i , 106.991 161.731 i , 190.202 80.620 i } .
Example 2.
Consider the following polynomial with clusters
f 2 ( z ) = z 16 + z 10 10 z 9 + 45 z 8 120 z 7 + 210 z 6 252 z 5 + 210 z 4 120 z 3 + 45 z 2 10 z + 1 .
The random initial guess x ( 0 ) for Example 2 is
x ( 0 ) = { 2.631 + 3.166 i , 2.600 + 9.323 i , 6.468 7.619 i , 7.365 + 6.063 i , 5.325 1.413 i , 4.844 2.670 i , 5.890 4.534 i , 0.205 + 9.302 i , 9.232 9.540 i , 9.672 + 7.694 i , 4.235 8.552 i , 0.012 + 1.758 i , 1.980 + 8.182 i , 9.820 + 6.746 i , 1.307 3.871 i , 6.293 0.111 i } .
Example 3.
Consider the polynomial with ring zeros:
f 3 ( z ) = z 21 + 7 z 20 9765626 z 11 68359382 z 10 + 9765625 z + 68359375 = ( z + 7 ) ( z 10 1 ) ( z 10 5 10 )
The random initial guess x ( 0 ) for Example 3 is
x ( 0 ) = { 8.057 3.640 i , 0.257 4.298 i , 7.718 + 8.568 i , 0.852 + 1.961 i , 9.167 5.264 i , 4.866 5.726 i , 0.552 9.817 i , 7.496 2.718 i , 6.733 4.709 i , 7.203 + 9.421 i , 5.545 0.359 i , 5.599 + 5.612 i , 7.335 + 4.054 i , 6.545 0.471 i , 4.882 9.617 i , 2.488 5.469 i , 7.712 + 6.404 i } .
Numerical results. In Table 1 are presented the results for the considered examples, we exhibit the values of m, E f ( x ( m ) ) , Ω ( E f ( x ( m ) ) ) , k and ε k . We recall that:
  • m is the smallest nonnegative integer that satisfies the convergence criterion (88);
  • ε m is defined in (89) and denotes the guaranteed accuracy (by Theorem 12) for the approximation x m of the zeros of f;
  • k is the smallest nonnegative integer that satisfies convergence accuracy criterion (89) with the preset accuracy ε = 10 15 ;
  • ε k is the guaranteed accuracy (Theorem 12) for the approximation x k of the zeros of f.
It can be seen from the table that, in all six experiments, Theorem 12 guarantees that Nourein’s method (7) is convergent under the given very rough initial approximations. Also it shows on which iteration that the preset accuracy is reached.
For instance, for Example 1, under the first initial approximation, it is seen that the convergence criterion (88) is satisfied for m = 31 and that the accuracy criterion (89) is satisfied for k = 33 , which means that the preset accuracy 10 15 is reached. Moreover, the table shows that, at 33 iterations, Theorem 12 guarantees an accuracy of 10 41 and at 34 iterations, it guarantees that each of the roots of the polynomial (91) is calculated with a guaranteed accuracy of 10 167 .
In Figure 1, Figure 2 and Figure 3, we present the trajectories of approximations x 1 ( 0 ) , x 2 ( 0 ) , , x n ( 0 ) in the complex plane with different colors. For the random initial guess (second type), all initial approximations are numbered and colored by blue. From these figures, one can observe that all trajectories starting with Aberth’s initial approximations have regular paths. However, it is not necessary that the initial approximations lie in an inclusion disk with a minimum radius to have this nice and regular form. About the second type of initial approximations, one can see that some initial points during iterating are not going to the nearest zero of the polynomial.

9. Conclusions

In this paper, we studied the convergence of two well-known iterative methods for finding all zeros of a polynomial simultaneously. The first one is due to Nourein [1] and it has quartic convergence when all zeros of the polynomial are simple. It is also known as Ehrlich’s method with Newton’s corrections because it is obtained by combining Ehrlich’s method [6] and the classical Newton’s method. The second one is a generalization of Nourein’s method for simultaneously finding all zeros of a polynomial that has at least one multiple zero. To our knowledge, it appears for the first time in the book of Sendov, Andreev and Kjurkchiev [2].
We have proved several new local and semilocal convergence theorems (Theorems 6–8, 11 and 12) for the classical Nourein’s method (7) under different initial conditions. The initial conditions and error bounds of the semilocal convergence results (Theorems 11 and 12) are computationally verifiable, which is of practical importance. Each of our semilocal convergence results improves the previous result due to Petković, Petković and Rančić [19] in several directions. We note that our approach to semilocal convergence analysis is different from those of the previous authors [18,19]. In Section 8, we present several numerical examples that show the applicability of our semilocal convergence results.
We have obtained two new local convergence theorems (Theorems 3 and 5) for the generalized Nourein’s method (11) under different initial conditions. To the best of authors’ knowledge, our local convergence theorems are the first local convergence results in the literature about Nourein’s method (for simple or multiple zeros).

Author Contributions

The authors contributed equally to the writing and approved the final manuscript of this paper. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Science Fund of the Bulgarian Ministry of Education and Science under grant DN 12/12.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Nourein, A.W.M. An improvement on two iteration methods for simultaneous determination of the zeros of a polynomial. Intern. J. Comput. Math. 1977, 6, 241–252. [Google Scholar] [CrossRef]
  2. Sendov, B.; Andreev, A.; Kjurkchiev, N. Numerical Solution of Polynomial Equations. In Handbook of Numerical Analysis; Ciarlet, P., Lions, J., Eds.; Elsevier: Amsterdam, The Netherlands, 1994; Volume III, pp. 625–778. [Google Scholar] [CrossRef]
  3. Weierstrass, K. Neuer Beweis des Satzes, dass jede ganze rationale Function einer Veränderlichen dargestellt werden kann als ein Product aus linearen Functionen derselben Veränderlichen. Sitzungsber. Königl. Akad. Wiss. Berlin 1891, II, 1085–1101. [Google Scholar] [CrossRef]
  4. Proinov, P.D.; Petkova, M.D. A new semilocal convergence theorem for the Weierstrass method for finding zeros of a polynomial simultaneously. J. Complex. 2014, 30, 366–380. [Google Scholar] [CrossRef]
  5. Proinov, P.D. General convergence theorems for iterative processes and applications to the Weierstrass root-finding method. J. Complex. 2016, 33, 118–144. [Google Scholar] [CrossRef]
  6. Ehrlich, L. A modified Newton method for polynomials. Commun. ACM 1967, 10, 107–108. [Google Scholar] [CrossRef]
  7. Börsch-Supan, W. Residuenabschätzung für Polynom-Nullstellen mittels Lagrange Interpolation. Numer. Math. 1970, 14, 287–296. [Google Scholar] [CrossRef]
  8. Werner, W. On the simultaneous determination of polynomial roots. Lect. Notes Math. 1982, 953, 188–202. [Google Scholar]
  9. Proinov, P.D. On the local convergence of Ehrlich method for numerical computation of polynomial zeros. Calcolo 2016, 53, 413–426. [Google Scholar] [CrossRef]
  10. Proinov, P.D. A general semilocal convergence theorem for simultaneous methods for polynomial zeros and its applications to Ehrlich’s and Dochev-Byrnev’s methods. Appl. Math. Comput. 2016, 284, 102–114. [Google Scholar] [CrossRef] [Green Version]
  11. Shemyakov, S.; Chernov, R.; Rumiantsau, D.; Schleicher, D.; Schmitt, S.; Shemyakov, A. Finding polynomial roots by dynamical systems—A case study. Discret. Contin. Dyn. Syst. Ser. A 2020, 40, 6845–6965. [Google Scholar] [CrossRef]
  12. Proinov, P.D. General local convergence theory for a class of iterative processes and its applications to Newton’s method. J. Complex. 2009, 25, 38–62. [Google Scholar] [CrossRef] [Green Version]
  13. Proinov, P.D. New general convergence theory for iterative processes and its applications to Newton Kantorovich type theorems. J. Complex. 2010, 26, 3–42. [Google Scholar] [CrossRef] [Green Version]
  14. Proinov, P.D. Relationships between different types of initial conditions for simultaneous root finding methods. Appl. Math. Lett. 2016, 52, 102–111. [Google Scholar] [CrossRef]
  15. Nourein, A.W. An improvement on Noureins method for the simultaneous determination of the zeros of a polynomial (an algorithm). J. Comput. Appl. Math. 1977, 3, 109–110. [Google Scholar] [CrossRef] [Green Version]
  16. Milovanović, G.; Petković, M. On computational efficiency of the iterative methods for simultaneous approximation of polynomial zeros. ACM Trans. Math. Softw. 1986, 12, 295–306. [Google Scholar] [CrossRef]
  17. Petković, I.; Herceg, Đ. Computer methodologies for comparison of computational efficiency of simultaneous methods for finding polynomial zeros. J. Comput. Appl. Math. 2020, 368, 112513. [Google Scholar] [CrossRef]
  18. Petković, M.; Herceg, Đ.; Ilić, S. Point estimation and some applications to iterative methods. BIT Numer. Math. 1998, 38, 112–126. [Google Scholar] [CrossRef]
  19. Petković, M.S.; Petković, L.D.; Rančić, L.Z. Point estimation of simultaneous methods for solving polynomial equations: A survey (II). J. Comput. Appl. Math. 2007, 205, 32–52. [Google Scholar] [CrossRef] [Green Version]
  20. Petkovic, M. Point Estimation of Root Finding Methods; Lecture Notes in Mathematics; Springer: Berlin, Germany, 2008; Volume 1933. [Google Scholar] [CrossRef]
  21. Schröder, E. Über unendlich viele Algorithmen zur Autlösung der Glleichungen. Math. Anal. 1870, 2, 317–365. [Google Scholar] [CrossRef] [Green Version]
  22. Petković, M.S.; Neta, B.; Petković, L.D.; Džunić, J. Multipoint Methods for Solving Nonlinear Equations; Elsevier/Academic Press: Amsterdam, The Netherlands, 2013. [Google Scholar] [CrossRef]
  23. Proinov, P.D.; Cholakov, S.I. Convergence of Chebyshev-like method for simultaneous approximation of multiple polynomial zeros. Compt. Rend. Acad. Bulg. Sci. 2014, 67, 907–918. [Google Scholar]
  24. Proinov, P.D. On the local convergence of Gargantini-Farmer-Loizou method for simultaneous approximation of multiple polynomial zeros. J. Nonlinear Sci. Appl. 2018, 11, 1045–1055. [Google Scholar] [CrossRef]
  25. Proinov, P.D.; Vasileva, M.T. On the convergence of high-order Gargantini-Farmer-Loizou type iterative methods for simultaneous approximation of polynomial zeros. Appl. Math. Comput. 2019, 361, 202–214. [Google Scholar] [CrossRef]
  26. Proinov, P.D. Unified convergence analysis for Picard iteration in n-dimensional vector spaces. Calcolo 2018, 55, 6. [Google Scholar] [CrossRef]
  27. Proinov, P.D.; Vasileva, M.T. On the convergence of high-order Ehrlich-type iterative methods for approximating all zeros of a polynomial simultaneously. J. Inequal. Appl. 2015, 2015, 336. [Google Scholar] [CrossRef]
  28. Dochev, K. A variant of Newton’s method for the simultaneous approximation of all roots of an algebraic equation. Phys. Math. J. Bulg. Acad. Sci. 1962, 5, 136–139. [Google Scholar]
  29. Cholakov, S.I. Local convergence of Chebyshev-like method for simultaneous finding polynomial zeros. Comp. Rend. Acad. Bulg. Sci. 2013, 66, 1081–1090. [Google Scholar]
  30. Cholakov, S.I.; Petkova, M.D. On the convergence of a fourth-order method for simultaneous finding polynomial zeros. Indian J. Appl. Res. 2015, 5, 397–402. [Google Scholar]
  31. Cholakov, S.I. Local and semilocal convergence of Wang-Zheng’s method for simultaneous finding polynomial zeros. Symmetry 2019, 11, 736. [Google Scholar] [CrossRef] [Green Version]
  32. Proinov, P.D.; Ivanov, S.I. Convergence analysis of Sakurai-Torii-Sugiura iterative method for simultaneous approximation of polynomial zeros. J. Comput. Appl. Math. 2019, 357, 56–70. [Google Scholar] [CrossRef]
  33. Proinov, P.D.; Petkova, M.D. Local and semilocal convergence of a family of multi-point Weierstrass-type root-finding methods. Mediterr. J. Math. 2020, 17, 107. [Google Scholar] [CrossRef]
  34. Wang, D.R.; Zhao, F.G. Complexity analysis of a process for simultaneously obtaining all zeros of polynomials. Computing 1989, 43, 187–197. [Google Scholar] [CrossRef]
  35. Cholakov, S.I.; Vasileva, M.T. A convergence analysis of a fourth-order method for computing all zeros of a polynomial simultaneously. J. Comput. Appl. Math. 2017, 321, 270–283. [Google Scholar] [CrossRef]
  36. Ivanov, S.I. A unified semilocal convergence analysis of a family of iterative algorithms for computing all zeros of a polynomial simultaneously. Numer. Algorithms 2017, 75, 1193–1204. [Google Scholar] [CrossRef]
  37. Aberth, O. Iteration methods for finding all zeros of a polynomial simultaneously. Math. Comput. 1973, 27, 339–344. [Google Scholar] [CrossRef]
Figure 1. Trajectories of approximations for Example 1.
Figure 1. Trajectories of approximations for Example 1.
Symmetry 12 01801 g001
Figure 2. Trajectories of approximations for Example 2.
Figure 2. Trajectories of approximations for Example 2.
Symmetry 12 01801 g002
Figure 3. Trajectories of approximations for Example 3.
Figure 3. Trajectories of approximations for Example 3.
Symmetry 12 01801 g003
Table 1. Numerical results.
Table 1. Numerical results.
Examplem E f ( x ( m ) ) Ω ( E f ( x ( m ) ) ) ε m k ε k ε k + 1
First type initial approximations
Example 131 6.254 × 10 3 0.848 6.988 × 10 3 33 1.042 × 10 41 1.442 × 10 167
Example 220 2.845 × 10 5 0.999 3.498 × 10 6 22 5.275 × 10 20 5.711 × 10 76
Example 314 6.688 × 10 5 0.998 4.139 × 10 5 15 2.719 × 10 17 5.946 × 10 66
Second type initial approximations
Example 131 2.774 × 10 5 0.999 2.775 × 10 5 32 5.366 × 10 20 1.170 × 10 78
Example 218 1.945 × 10 3 0.960 2.460 × 10 4 20 2.684 × 10 51 6.697 × 10 202
Example 314 1.045 × 10 6 0.999 6.463 × 10 7 15 1.622 × 10 24 7.769 × 10 95
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Proinov, P.D.; Vasileva, M.T. Local and Semilocal Convergence of Nourein’s Iterative Method for Finding All Zeros of a Polynomial Simultaneously. Symmetry 2020, 12, 1801. https://doi.org/10.3390/sym12111801

AMA Style

Proinov PD, Vasileva MT. Local and Semilocal Convergence of Nourein’s Iterative Method for Finding All Zeros of a Polynomial Simultaneously. Symmetry. 2020; 12(11):1801. https://doi.org/10.3390/sym12111801

Chicago/Turabian Style

Proinov, Petko D., and Maria T. Vasileva. 2020. "Local and Semilocal Convergence of Nourein’s Iterative Method for Finding All Zeros of a Polynomial Simultaneously" Symmetry 12, no. 11: 1801. https://doi.org/10.3390/sym12111801

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop