Next Article in Journal
Categorical Functional Data Analysis. The cfda R Package
Next Article in Special Issue
Convergence Criteria of Three Step Schemes for Solving Equations
Previous Article in Journal
An Ideal-Based Dot Total Graph of a Commutative Ring
Previous Article in Special Issue
Extended Kung–Traub Methods for Solving Equations with Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Note on Traub’s Method for Systems of Nonlinear Equations

Department of Applied Mathematics, Naval Postgraduate School, Monterey, CA 93943, USA
Mathematics 2021, 9(23), 3073; https://doi.org/10.3390/math9233073
Submission received: 13 October 2021 / Revised: 14 November 2021 / Accepted: 27 November 2021 / Published: 29 November 2021
(This article belongs to the Special Issue Application of Iterative Methods for Solving Nonlinear Equations)

Abstract

:
Traub’s method was extended here to systems of nonlinear equations and compared to Steffensen’s method. Even though Traub’s method is only of order 1.839 and not quadratic, it performed better in the 10 examples.

1. Introduction

Recently, Traub’s [1] method was compared to other superlinear methods, and to Steffensen’s scheme [2], see also Amat and Busquier [3], Chicharro et al. [4], Cordero et al. [5]. Neta [6] has shown that Traub’s method is superior in terms of its dynamics. The method is given by
x k + 1 = x k + f ( x k ) f [ x k 2 , x k ] f [ x k 2 , x k 1 ] + f [ x k 1 , x k ] , k = 0 , 1 , ,
being x 0 , x 1 and x 2 their initial estimations and the divided differences are f [ x i , x j ] = f ( x i ) f ( x j ) x i x j . The method is of order p = 1.839 and it was used as a first step to obtain higher order methods for finding simple roots, see Neta [6,7]. The reasons for its stability were discussed by Cordero et al. [8].
In this article, we extend Traub’s method to solve systems of nonlinear equations and compare its performance to Steffensen’s method, which is used currently as a basis for derivative-free multistep methods because of its quadratic convergence. Chun and Neta [9] have compared Steffensen’s method to several higher order methods (see [10,11,12,13,14,15]). and found that in 21 examples, a three-step fourth order method based on Neta’s method was best.
In Section 2, we state the methods and list several examples for the comparative study. In Section 3, the numerical performance of the method is checked in terms of the number of function evaluations, CPU run-time, and the accuracy of the solution.

2. Numerical Examples

In this section, we compare two methods, namely Traub’s method and Steffensen’s method. Traub’s method for systems of nonlinear equations F ( x ) = 0 where x R n and F : D R n R n is given by
x n + 1 = x n B n 1 F ( x n )
where
B n = [ x n 2 , x n ; F ] [ x n 2 , x n 1 ; F ] + [ x n 1 , x n ; F ]
and the divided difference
[ x , y ; F ] i j = f i ( x 1 , , x j , y j + 1 , , y n ) f j ( x 1 , , x j 1 , y j , , y n ) x i y j
The method is of order 1.839, which is lower than Steffensen’s and requires only one function evaluation per step. It requires two additional initial vectors, which we took as a small number added to the components of x 0 .
Theorem 1.
Let the function F : D R m R m be sufficiently differentiable in a convex set D containing asimplezero α of F ( x ) . Then the local convergence order of the method (2) is 1.839, and the error equation is given by
e n + 1 = 2 A 3 e n ϵ n E n +
where e n = x n α R m ,   ϵ n = x n 1 α R m , E n = x n 2 α R m , e n i = ( e n , e n , . . . , e n ) i t i m e s , A j = ( 1 / j ! ) [ F ( α ) ] 1 F ( j ) ( α ) L i ( R m , R m ) , F ( j ) L ( R m × × R m , R m ) and [ F ( α ) ] 1 L ( R m ) , and similarly for ϵ n i and E n i .
Proof. 
Let α be the zero. Given x 0 , x 1 and x 2 , our method is given by (2) where B n = 0 1 F ( x n + t h 1 ) d t 0 1 F ( x n + σ h 2 ) d σ + 0 1 F ( x n + τ h 3 ) d τ , where h 1 = x n 2 x n , h 2 = x n 2 x n 1 and h 3 = x n 1 x n .
Now we expand in the Taylor series
F ( x n ) = F ( α ) e n + A 2 e n 2 + A 3 e n 3 + A 4 e n 4 + ,
where e n = x n α , ϵ n = x n 1 α and E n = x n 2 α . We would like to prove that e n + 1 = 2 A 3 e n ϵ n E n + .
Now we have
B n = F ( x n ) + 1 2 F ( 2 ) ( x n ) ( E n e n ) + 1 6 F ( 3 ) ( x n ) ( E n e n ) 2 + F ( x n ) + 1 2 F ( 2 ) ( x n ) ( E n ϵ n ) + 1 6 F ( 3 ) ( x n ) ( E n ϵ n ) 2 + + F ( x n ) + 1 2 F ( 2 ) ( x n ) ( e n ϵ n ) + 1 6 F ( 3 ) ( x n ) ( e n ϵ n ) 2 + = F ( x n ) + 1 3 F ( 3 ) ( x n ) ( e n 2 e n E n ϵ n ( e n E n ) + .
From the Taylor series expansion of F ( x n ) given above, we get
F ( x n ) = F ( α ) ( I + 2 A 2 e n + 3 A 3 e n 2 + 4 A 4 e n 3 + )
F ( 2 ) ( x n ) = F ( α ) ( 2 A 2 + 6 A 3 e n + 12 A 4 e n 2 + )
F ( 3 ) ( x n ) = F ( α ) ( 6 A 3 + 24 A 4 e n + 60 A 5 e n 2 + )
Substituting all of these expansions in B n , we have
B n = F ( α ) I + 2 A 2 e n + 2 A 3 ( e n ϵ n + e n E n + ϵ n E n ) +
The inverse is
B n 1 = I 2 A 2 e n 2 A 3 ( e n ϵ n + e n E n + ϵ n E n ) + 4 A 2 2 e n 2 + F ( α ) 1 .
Therefore
e n + 1 = e n I 2 A 2 e n 2 A 3 ( e n ϵ n + e n E n + ϵ n E n ) + 4 A 2 2 e n 2 + e n + A 2 e n 2 + .
Expanding and collecting lower order terms leads to
e n + 1 = 2 A 3 e n ϵ n E n +
Therefore, the method is of order 1.839.
Steffensen’s method is given by
x n + 1 = x n C n 1 F ( x n )
where C n = [ x n , y n ; F ] and y n = x n + F ( x n ) . The method is of order 2, and requires two function evaluations per step.
The criteria for comparisons are the number of function evaluations for convergence. We use a tolerance of 10 15 and allow a maximum of 50 iterations. We also collected the accuracy and the CPU run-time.
The systems used for our comparative study are mostly taken from [9]:
  • Example 1
    This system has six solutions.
    x 1 2 + x 1 x 2 3 9 = 0 3 x 1 2 x 2 x 2 3 4 = 0
    x 0 = ( 1.2 , 2.5 ) T
    α 1 = ( 0.9012661905 , 2.086587595 ) T
    α 2 = ( 9.985950982 , 2.086587595 ) T
    α 3 = ( 2.998375993 , 0.1481079950 ) T
    α 4 = ( 3.001624887 , 0.1481079950 ) T
    α 5 = ( 1.336355377 , 1.754235198 ) T
    α 6 = ( 6.734735503 , 1.754235198 ) T
  • Example 2
    cos x 2 sin x 1 = 0 x 3 x 1 1 / x 2 = 0 e x 1 x 3 2 = 0
    x 0 = ( 1.2 , 0.5 , 1.5 ) T
    α 1 = ( 0.9095694944 , 0.6612268323 , 1.575834144 ) T
    α 2 = ( 0.9095694944 , 0.6612268323 , 0.6345845493 ) T
  • Example 3
    x 1 5 + x 2 3 x 3 4 + 1 = 0 x 1 2 x 2 x 3 = 0 x 3 4 1 = 0
    x 0 = ( 8 , 2 , 8 ) T
    α = ( 1 , 0 , 1 ) T
  • Example 4
    6 x 1 2 + x 2 37 6 = 0 x 1 6 x 2 2 5 6 = 0 x 1 + x 2 + x 3 1 2 = 0
    x 0 = ( 1.5 , 0.5 , 0.5 ) T
    α 1 = ( 1 , 1 / 6 , 2 / 3 ) T
    α 2 = ( 1.028512437 , 0.1803603357 , 0.3481521018 ) T
    and two other complex conjugate solutions.
  • Example 5
    12 x 1 3 x 2 2 4 x 3 7.17 = 0 x 1 2 + 10 x 2 x 3 11.54 = 0 x 2 3 + 7 x 3 7.631 = 0
    x 0 = ( 3 , 0 , 1 ) T
    α 1 = ( 1.2 , 1.1 , 0.9 ) T
    α 2 = ( 7.809384276 , 3.953119569 , 9.915287083 ) T
    and two other pairs of complex conjugate solutions.
  • Example 6
    x 2 x 3 + x 4 ( x 2 + x 3 ) = 0 x 1 x 3 + x 4 ( x 1 + x 3 ) = 0 x 1 x 2 + x 4 ( x 1 + x 2 ) = 0 x 1 x 2 + x 1 x 3 + x 2 x 3 1 = 0
    x 0 = ( 1.7 , 0.7 , 1.8 , 0.8 ) T
    α 1 = ( 1 / 3 , 1 / 3 , 1 / 3 , 1 / ( 2 3 ) ) T
    α 2 = ( 1 / 3 , 1 / 3 , 1 / 3 , 1 / ( 2 3 ) ) T
  • Example 7
    x i x i + 1 1 = 0 , i = 1 , 2 , , n 1 x n x 1 1 = 0
    x 0 = ( 2 , 2 , , 2 ) T
    If n is odd, there are two solutions:
    α 1 = ( 1 , 1 , , 1 ) T
    α 2 = ( 1 , 1 , , 1 ) T
    If n is even, then choose x n
    x 1 = x 3 = = x n 1 = 1 x n
    x 2 = x 4 = = x n 2 = x n
    We used n = 9 in this example.
  • Example 8
    This is another example from [16,17].
    x k 2 x k + 1 1 = 0 , 1 k n 1 , x n 2 x 1 1 = 0 , k = n
    x 0 = ( 0.1 , 0.1 , , 0.1 ) T
    α = ( 1 , 1 , , 1 ) T
    We used n = 10 in this example.
  • Example 9
    The process of penetration of the magnetic field into the material is modeled by the averaged integro-differential equation:
    U t 1 + 0 t 0 1 U x 2 d x d τ 2 U x 2 = f ( x , t ) ,
    with Dirichlet boundary conditions
    U ( 0 , t ) = U ( 1 , t ) = 0 ,
    and the initial condition
    U ( x , 0 ) = U 0 ( x ) ,
    in the rectangle Q T = ( 0 , 1 ) × ( 0 , T ) , where T is a positive constant, f = f ( x , t ) and U 0 = U 0 ( x ) are given functions of their arguments. The resulting nonlinear system is [18]
    1 τ u i j + 1 A u j + 1 u i + 1 j + 1 2 u i j + 1 + u i 1 j + 1 h 2 f i j 1 τ u i j = 0 , i = 1 , , M 1 .
    where
    A u j + 1 = 1 + τ h = 1 M k = 1 j + 1 u k u 1 k h 2 .
    This system can be written in matrix form
    T u j + 1 u j + 1 = 1 τ u j + f j .
    The vector u containing all the unknowns u 1 , , u M 1 at the level indicated. The matrix T is symmetric and tridiagonal with elements
    T i r = 1 τ + 2 A h 2 , r = i , A h 2 , r = i ± 1 .
    In our experiments, we take M = 100 , so h = 0.01 . The initial solution is given by
    U 0 ( x ) = x ( 1 x ) e x .
    In order to get the exact solution U ( x , t ) = x ( 1 x ) e x t , we picked the following forcing term
    f ( x , t ) = 9 8 3 8 e 2 1 8 3 8 e 2 e 2 t ( 4 + 5 x x 2 ) e x t x ( 1 x ) e x t .
  • Example 10
    Consider the Hammerstein integral equation [19],
    x ( s ) = 1 + 1 5 0 1 K ( s , t ) x ( t ) 3 d t
    where x C [ 0 , 1 ] , s , t [ 0 , 1 ] and the kernel
    K ( s , t ) = ( 1 s ) t t s , s ( 1 t ) s t .
    Solving the system numerically by using the Gauss–Legendre quadrature formula with 7 points, yield
    5 x i 5 j = 1 7 a i j x j 3 = 0
    where
    a i j = w j t j ( 1 t i ) j i w j t i ( 1 t j ) i j ,
    and w j are the weights and t j are the nodes. The initial solution is x ( 0 ) = ( 1 , , 1 ) T . The approximate solution is
    x ( k + 1 ) = ( 1.0027414204 , 1.0125680365 , 1.0235854047 , 1.0285959684 , 1.0243765818 , 1.0179001828 , 1.0026562769 ) T .

3. Numerical Experiments

We ran the following two methods:
  • Steffensen’s method
  • Traub’s method
on each of the following examples with the corresponding initial guesses listed.
All computations were conducted in MAPLE 2016 with 100-digit floating point arithmetic in a laptop with Windows 10 Home and processor Intel(R) Core(TM) i7-7500u CPU 2.90GHZ. The results for the number of function evaluations are given in Table 1. We also collected the CPU times used, which are listed in Table 2. The number of function evaluations and CPU times used by each method for each example are given with tolerance F ( x k ) < 10 15 , and “Div” denotes that the method does not converge within 50 iterations.
As can be seen in the tables, Steffensen’s method, is the only one that diverged in several examples. Both methods converged to the first root α 1 , except Steffensen’s method converged to α 2 in examples 4 and 5. Steffensen’s method requires more function evaluations than Traub’s method, even when converged.
In Table 2, we list the CPU times required for each method and each example. Clearly, we did not list the CPU times for Steffensen’s method in the several examples where the methods did not converge. It is no clear cut as to which method is faster, but the main result is that Traub’s method is more robust.
We now compute the error norm for each method and each example, see Table 3. We see that Traub has much smaller error than Steffensen in 3 out of the 10 examples. In two other examples, they both have the same accuracy, and only in one case the accuracy of Steffensen is higher.
Overall, Traub’s method is best, since it almost always converges (except 1 in 10) and is fastest with the fewest number of function evaluations. If we discount the examples for which the methods diverge, then we find that Traub’s method requires the least number of function evaluations to converge (4.3 iterations on average). We thus conclude that Traub’s method is a better, derivative-free method on which to build multistep methods for the solution of systems of nonlinear equations.

4. Conclusions

We compared Traub’s method to Steffensen’s method and showed that, in ten examples, the latter diverged in four cases and the former in only one. Traub’s method requires fewer function evaluations to converge and it is almost always more accurate. In future research, we plan to develop multistep derivative-free methods based on Traub’s method as the first step. We will compare the new method to currently available schemes based on Steffensen’s method as the first step.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The author would like to thank the anonymous reviewers for their suggestions and comments that have improved the final version of this manuscript.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Traub, J.F. Iterative Methods for the Solution of Equations; Prentice-Hall: Englewood Cliffs, NJ, USA, 1964. [Google Scholar]
  2. Steffensen, I.F. Remarks on iteration. Scand. Actuar. J. 1933, 1933, 64–72. [Google Scholar] [CrossRef]
  3. Amat, S.; Busquier, S. (Eds.) Advances in Iterative Methods for Nonlinear Equations; Springer: Cham, Switzerland, 2016. [Google Scholar]
  4. Chicharro, F.I.; Cordero, A.; Torregrosa, J.R. Dynamics and fractal dimension of Steffensen-type methods. Algorithms 2015, 8, 271–279. [Google Scholar] [CrossRef] [Green Version]
  5. Cordero, A.; Soleymani, F.; Torregrosa, J.R.; Shateyi, S. Basins of attraction for various Steffensen-type methods. J. Appl. Math. 2014, 2014, 539707. [Google Scholar] [CrossRef]
  6. Neta, B. Basin attractors for derivative-free methods to find simple roots of nonlinear equations. Rev. D’Anal. Numérique Théorie L’Approx. 2020, 49, 177–189. [Google Scholar]
  7. Neta, B. A New Derivative-Free Method to Solve Nonlinear Equations. Mathematics 2021, 9, 583. [Google Scholar] [CrossRef]
  8. Cordero, A.; Neta, B.; Torregrosa, J.R. Reasons for stability in the construction of derivative-free multistep iterative methods. submitted for publication.
  9. Chun, C.; Neta, B. An efficient derivative-free method for the solution of systems of equations. Numer. Funct. Anal. Optim. 2021, 42, 838–848. [Google Scholar] [CrossRef]
  10. Cordero, A.; Maimo, J.G.; Torregrosa, J.R.; Vasileva, M.P. Iterative methods with memory for solving systemes of nonlinear equations using second order approximation. Mathematics 2019, 7, 1069. [Google Scholar] [CrossRef] [Green Version]
  11. Narang, M.; Bhatia, S.; Kanwar, V. New two-parameter Chebyshev-Halley like family of fourth and sixth-order methods for systems of nonlinear equations. Appl. Math. Comput. 2016, 275, 394–403. [Google Scholar] [CrossRef]
  12. Sharma, J.R.; Sharma, R.; Karla, N. A novel family of composite Newton-Traub methods for solving systems of nonlinear equations. Appl. Math. Comput. 2015, 269, 520–535. [Google Scholar] [CrossRef]
  13. Hueso, J.L.; Martinez, E.; Teruel, C. Convergence, efficiency and dynamics of new fourth and sixth order families of iterative methods for nonlinear systems. J. Comput. Apple. Math. 2015, 275, 412–420. [Google Scholar] [CrossRef]
  14. Montazeri, H.; Soleymani, F.; Shateyi, S.; Motsa, S.S. On a new method for computing the numerical solution of systems of nonlinear equations. J. Appl. Math. 2012, 2012, 751975. [Google Scholar] [CrossRef] [Green Version]
  15. Sharma, J.R.; Arora, H. Efficient Jarratt-like methods for solving systems of nonlinear equations. Calcolo 2014, 51, 193–210. [Google Scholar] [CrossRef]
  16. Ezquerro, J.A.; Hernández, M.A.; Romero, N.; Velasco, A.I. On Steffensen’s method on Banach spaces. J. Comput. Appl. Math. 2013, 249, 9–23. [Google Scholar] [CrossRef]
  17. Ramandeep, B.; Cordero, A.; Motsa, S.S.; Torregrosa, J.R. Stable high-order iterative methods for solving nonlinear models. Appl. Math. Comput. 2017, 303, 70–88. [Google Scholar]
  18. Jangveladze, T.; Kiguradze, Z.; Neta, B. Large time behavior of solutions and finite difference scheme to a nonlinear integro-differential equation. Comput. Math. Appl. 2009, 57, 799–811. [Google Scholar] [CrossRef] [Green Version]
  19. Ortega, J.M.; Rheinboldt, W.C. Iterative Solution of Nonlinear Equations in Several Variables; Academic Press: Cambridge, MA, USA, 1970. [Google Scholar]
Table 1. Number of function evaluations used by each method for each example.
Table 1. Number of function evaluations used by each method for each example.
ExampleDimensionSteffensenTraub
12Div7
23188
33DivDiv
43227
53247
652217
79167
8102217
999Div5
107Div6
Table 2. CPU time (in seconds) used by each method for each example.
Table 2. CPU time (in seconds) used by each method for each example.
ExampleDimensionSteffensenTraub
12Div0.079
230.1720.116
33DivDiv
430.0620.076
530.1560.25
650.1720.307
790.780.133
8100.2030.594
999Div459.172
107Div0.137
Table 3. Accuracy of methods for each example.
Table 3. Accuracy of methods for each example.
ExampleDimensionSteffensenTraub
12Div1.27(−20)
231.89(−16)2.13(−21)
33DivDiv
431.13(−24)2.04(−19)
539.93(−18)7.12(−28)
651.45(−20)2.47(−20)
791.03(−16)1.75(−30)
8102.05(−20)2.47(−20)
999Div2.06(−20)
107Div1.47(−22)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Neta, B. A Note on Traub’s Method for Systems of Nonlinear Equations. Mathematics 2021, 9, 3073. https://doi.org/10.3390/math9233073

AMA Style

Neta B. A Note on Traub’s Method for Systems of Nonlinear Equations. Mathematics. 2021; 9(23):3073. https://doi.org/10.3390/math9233073

Chicago/Turabian Style

Neta, Beny. 2021. "A Note on Traub’s Method for Systems of Nonlinear Equations" Mathematics 9, no. 23: 3073. https://doi.org/10.3390/math9233073

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop