Article

A New Adaptive Levenberg–Marquardt Method for Nonlinear Equations and Its Convergence Rate under the Hölderian Local Error Bound Condition

Department of Mathematics, Faculty of Mathematics and Statistics, Huaibei Normal University, Huaibei 235000, China
* Author to whom correspondence should be addressed.
Symmetry 2024, 16(6), 674; https://doi.org/10.3390/sym16060674
Submission received: 29 April 2024 / Revised: 25 May 2024 / Accepted: 27 May 2024 / Published: 30 May 2024
(This article belongs to the Special Issue Computational Mathematics and Its Applications in Numerical Analysis)

Abstract
The Levenberg–Marquardt (LM) method is one of the most significant methods for solving nonlinear equations as well as symmetric and asymmetric linear equations. To improve the method, this paper proposes a new adaptive LM algorithm that modifies the LM parameter and combines the trust region technique with the non-monotone technique. Notably, the new algorithm can be continually tuned by adaptively choosing the LM parameter. To evaluate the effectiveness of the new algorithm, we conduct tests using various examples. To extend the convergence results, we prove the convergence of the new algorithm under the Hölderian local error bound condition rather than the commonly used local error bound condition. Theoretical analysis and numerical results show that the new algorithm is stable and effective.

1. Introduction

Nonlinear equations are widely used in key fields such as electricity, optics, mechanics, economic management, engineering technology, biomedicine, and alternative energy [1,2,3,4,5,6]. This paper discusses the following nonlinear equations:
$$\begin{cases} f_1(x_1, x_2, \ldots, x_n) = 0, \\ f_2(x_1, x_2, \ldots, x_n) = 0, \\ \quad\vdots \\ f_n(x_1, x_2, \ldots, x_n) = 0, \end{cases}$$
which can be written in vector form as
$$F(x) = 0, \qquad (1)$$
where $F(x) : \mathbb{R}^n \to \mathbb{R}^n$ is continuously differentiable and $x = (x_1, x_2, \ldots, x_n)^T$. We denote the solution set of Equation (1) by $X^*$ and assume that $X^*$ is nonempty.
Several promising numerical methods [7,8,9,10,11] have been proposed for solving nonlinear equations. One of the classical methods for solving Equation (1) is the Gauss–Newton method, which at each iteration computes the trial step
$$d_k = -\left(J_k^T J_k\right)^{-1} J_k^T F_k,$$
where $F_k = F(x_k)$ and $J_k = F'(x_k)$ is the Jacobian matrix of $F(x)$ at $x_k$.
However, in actual computation, the trial step of the Gauss–Newton method may not be well defined when $J(x)$ is singular or nearly singular. To overcome this difficulty, the Levenberg–Marquardt (LM) method [10,11] was proposed. At the $k$th iteration, the LM method computes the trial step
$$d_k = -\left(J_k^T J_k + \lambda_k I\right)^{-1} J_k^T F_k, \qquad (2)$$
where $\lambda_k \geq 0$ is the LM parameter and $I \in \mathbb{R}^{n \times n}$ is the identity matrix. The LM trial step is a modification of the Gauss–Newton trial step in which the parameter $\lambda_k$ is introduced to prevent the step from being undefined or too large when $J(x)$ is singular or nearly singular.
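Since the shifted matrix $J_k^T J_k + \lambda_k I$ is symmetric positive definite for $\lambda_k > 0$, the LM step can be obtained by a direct linear solve. The following minimal numpy sketch illustrates Equation (2); the helper name `lm_step` and the use of a dense solver are our illustrative choices, not part of the paper:

```python
import numpy as np

def lm_step(J, F, lam):
    """Compute the LM trial step d = -(J^T J + lam*I)^{-1} J^T F of Equation (2).

    The shifted normal matrix is symmetric positive definite for lam > 0,
    so the solve is well defined even when J itself is singular.
    """
    n = J.shape[1]
    return np.linalg.solve(J.T @ J + lam * np.eye(n), -(J.T @ F))
```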
The LM method converges quadratically when $J(x)$ is Lipschitz continuous and nonsingular at the solution of Equation (1) [12]. Nevertheless, theoretical research shows that nonsingularity of $J(x)$ is too strong a condition. To relax it, several authors [13,14,15,16,17,18,19] have analysed the convergence of the LM method under the following local error bound condition, which is weaker than nonsingularity of $J(x)$:
$$c \cdot \mathrm{dist}\left(x, X^*\right) \leq \|F(x)\|, \quad \forall x \in N(x^*), \qquad (3)$$
where $c > 0$ is a positive constant, $\mathrm{dist}(x, X^*)$ is the distance from $x$ to $X^*$, and $N(x^*)$ is some neighbourhood of $x^* \in X^*$. Throughout this paper, $\|\cdot\|$ denotes the 2-norm.
Although the local error bound condition is weaker than nonsingularity of $J(x)$, it is still not satisfied by some ill-conditioned nonlinear equations arising in biochemical systems and other applications. Recently, several authors [20,21,22,23] have studied the convergence of the LM method under the following Hölderian local error bound condition, which is weaker still:
$$c \cdot \mathrm{dist}\left(x, X^*\right) \leq \|F(x)\|^{\gamma}, \quad \forall x \in N(x^*), \qquad (4)$$
where $c$ is a positive constant and $\gamma \in (0, 1]$. Clearly, the Hölderian error bound condition (4) generalizes the local error bound condition (3) by extending the exponent $\gamma$ of $\|F(x)\|$ from $1$ to the interval $(0, 1]$; for instance, the scalar equation $F(x) = x^2$ satisfies (4) with $\gamma = 1/2$ near $x^* = 0$ but does not satisfy (3). In this paper, we study the convergence of the new algorithm under the Hölderian error bound condition.
The LM parameter $\lambda_k$ is vital to the efficiency of LM algorithms, and several authors have investigated its choice [13,14,15,16,17,18,19,21,22,23]. Yamashita and Fukushima [13] took $\lambda_k = \|F_k\|^2$; the disadvantage of this choice is that $\lambda_k$ may become too small to be effective when the sequence $\{x_k\}$ is close to the solution set of Equation (1), which affects the local convergence rate. To reduce this effect, Fan and Yuan [14] chose $\lambda_k = \|F_k\|^{\delta}$, a generalization in which the exponent $\delta$ ranges over the interval $[1, 2]$. Numerical results on some equations showed better performance for $\delta = 1$; however, this choice may make $\lambda_k = \|F_k\|$ too large and the step $d_k$ too small when $\{x_k\}$ is far away from the solution set, causing the sequence to move slowly towards it and affecting the global convergence rate. To compensate for this flaw, Fan [17] used $\lambda_k = \mu_k \|F_k\|$, where $\mu_k$ is updated at every iteration by the trust region technique; numerical results showed that this change improved the performance of the algorithm. Chen and Ma [23] took $\lambda_k = \theta \|F_k\|^{\delta} + (1 - \theta)\|J_k^T F_k\|^{\delta}$ with $\theta \in [0, 1]$ and $\delta \in [1, 2]$, which further improved the numerical results of the LM algorithm. Recently, Li et al. [24] proposed a new adaptive accelerated LM algorithm with the parameter $\lambda_{k+1} = \mu_{k+1} \frac{\|F_{k+1}\|}{1 + \|F_{k+1}\|}$, with numerical results showing that the algorithm is efficient for solving symmetric and asymmetric linear equations.
Inspired by the above literature, we adopt a new adaptive LM parameter to enhance the computational performance of the LM algorithm:
$$\lambda_k = \begin{cases} \mu_k \left( \theta \dfrac{\|F_k\|^{\delta}}{1 + \|F_k\|^{\delta}} + (1 - \theta)\|F_k\|^{\delta} \right), & \text{if } \|F_k\| \leq 1, \\[2mm] \mu_k \left( \theta \dfrac{\|F_k\|^{\delta}}{1 + \|F_k\|^{\delta}} + (1 - \theta)\|F_k\|^{-\delta} \right), & \text{otherwise}, \end{cases} \qquad (0 \leq \theta \leq 1,\ 1 \leq \delta \leq 2),$$
where $\mu_k$ is updated at every iteration by the trust region technique. When $\{x_k\}$ is close to the solution set, $\|F_k\|$ is close to $0$; thus, $\lambda_k$ is close to $\mu_k \|F_k\|^{\delta}$, which for $\delta = 1$ recovers the choice used in [17]. Conversely, when $\{x_k\}$ is far from the solution set, $\|F_k\|$ may be very large; thus, $\lambda_k$ approaches $\mu_k \theta$. This effectively regulates the range of $\lambda_k$ and prevents the LM step from becoming excessively small, thereby enhancing computational efficiency. This choice of $\lambda_k$ therefore promises to be more effective for the LM algorithm.
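As a concrete illustration, the following sketch evaluates this parameter. The function name and default arguments are hypothetical, and the formula follows the two-branch definition above as reconstructed:

```python
def adaptive_lm_parameter(mu, normF, theta=0.5, delta=1.0):
    """Evaluate the new adaptive LM parameter (a sketch).

    With t = ||F_k||**delta:
      lambda_k = mu * (theta*t/(1+t) + (1-theta)*t)   if ||F_k|| <= 1,
      lambda_k = mu * (theta*t/(1+t) + (1-theta)/t)   otherwise,
    for 0 <= theta <= 1 and 1 <= delta <= 2.
    """
    t = normF ** delta
    if normF <= 1.0:
        return mu * (theta * t / (1.0 + t) + (1.0 - theta) * t)
    return mu * (theta * t / (1.0 + t) + (1.0 - theta) / t)
```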
The remainder of this paper is organized as follows. In Section 2, we present the new algorithm with the new LM parameter in detail and prove its global convergence. In Section 3, we analyse the convergence rate of the new algorithm. In Section 4, we present numerical results verifying that the new algorithm is effective. Finally, some key conclusions are put forward in Section 5.

2. The New Adaptive LM Algorithm and Its Global Convergence

In this section, we introduce our new adaptive algorithm and establish its global convergence.
If we define the merit function for Equation (1) as
$$\phi(x) = \|F(x)\|^2,$$
then, at the $k$th iteration, the actual reduction of $\phi(x)$ is given by
$$Ared_k = \|F_k\|^2 - \|F(x_k + d_k)\|^2 \qquad (5)$$
and the predicted reduction of $\phi(x)$ by
$$Pred_k = \|F_k\|^2 - \|F_k + J_k d_k\|^2, \qquad (6)$$
where $d_k$ is computed by Equation (2). The ratio of $Ared_k$ to $Pred_k$ is
$$r_k = \frac{Ared_k}{Pred_k}, \qquad (7)$$
which determines whether to accept $d_k$ and how to update $\mu_k$. Several studies have suggested that algorithms employing non-monotone strategies outperform those with monotone strategies [18,25,26,27,28]. To implement the non-monotone strategy, Amini et al. [18] used the following actual reduction in place of Equation (5):
$$\overline{Ared}_k = \|F_{l(k)}\|^2 - \|F(x_k + d_k)\|^2, \qquad (8)$$
where
$$\|F_{l(k)}\| = \max_{0 \leq j \leq n(k)} \|F_{k-j}\|, \quad k = 0, 1, 2, \ldots, \qquad (9)$$
$n(k) = \min\{N_0, k\}$, and $N_0$ is a positive integer constant. With this change, $\|F(x_{k+1})\|$ is compared with $\max_{0 \leq j \leq n(k)} \|F_{k-j}\|$ at each iteration. To combine the non-monotone strategy with the new adaptive LM parameter, we use the ratio
$$\hat{r}_k = \frac{\overline{Ared}_k}{Pred_k}$$
in place of the ratio $r_k$ in the algorithm.
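For illustration, the non-monotone reference value $\|F_{l(k)}\|$ of Equation (9) can be computed from the stored residual norms. The following small helper is a sketch with hypothetical names; it is reused in the algorithm sketch later in this section:

```python
def nonmonotone_reference(norm_history, N0=5):
    """Return ||F_{l(k)}|| = max of the last min(N0, k) + 1 residual norms.

    norm_history is the list [||F_0||, ..., ||F_k||]; this realises
    Equation (9) with n(k) = min(N0, k).
    """
    k = len(norm_history) - 1
    n_k = min(N0, k)
    return max(norm_history[k - n_k:])
```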
Next, we present the new adaptive LM algorithm, named the ALLM algorithm (Algorithm 1).
Algorithm 1 (ALLM Algorithm)
Step 1. Given $x_0 \in \mathbb{R}^n$, $N_0 > 0$, $\mu_0 > m > 0$, $\varepsilon > 0$, $0 < p_0 \leq p_1 \leq p_2 < 1$. Set $k := 0$.
Step 2. If $\|J_k^T F_k\| \leq \varepsilon$, stop. Otherwise, let
$$\lambda_k = \begin{cases} \mu_k \left( \theta \dfrac{\|F_k\|^{\delta}}{1 + \|F_k\|^{\delta}} + (1 - \theta)\|F_k\|^{\delta} \right), & \text{if } \|F_k\| \leq 1, \\[2mm] \mu_k \left( \theta \dfrac{\|F_k\|^{\delta}}{1 + \|F_k\|^{\delta}} + (1 - \theta)\|F_k\|^{-\delta} \right), & \text{otherwise}, \end{cases} \qquad (0 \leq \theta \leq 1,\ 1 \leq \delta \leq 2). \qquad (10)$$
Step 3. Compute $d_k$ by solving
$$\left(J_k^T J_k + \lambda_k I\right) d = -J_k^T F_k. \qquad (11)$$
Step 4. Compute $\|F_{l(k)}\|$, $Pred_k$ and $\overline{Ared}_k$ by Equations (9), (6) and (8). Set
$$\hat{r}_k = \frac{\overline{Ared}_k}{Pred_k}. \qquad (12)$$
Step 5. Set
$$x_{k+1} = \begin{cases} x_k + d_k, & \text{if } \hat{r}_k \geq p_0, \\ x_k, & \text{otherwise}. \end{cases} \qquad (13)$$
Step 6. Choose $\mu_{k+1}$ as
$$\mu_{k+1} = \begin{cases} 4\mu_k, & \text{if } \hat{r}_k < p_1, \\ \mu_k, & \text{if } \hat{r}_k \in [p_1, p_2], \\ \max\left\{\frac{\mu_k}{4}, m\right\}, & \text{otherwise}. \end{cases} \qquad (14)$$
Step 7. Set $k := k + 1$ and return to Step 2.
To prevent excessively large steps, we impose the following condition:
$$\mu_k \geq m, \quad \forall k \in \mathbb{N}, \qquad (15)$$
where $m$ is a positive constant.
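Putting the pieces together, the following is a minimal, unoptimized sketch of one possible implementation of Algorithm 1 in Python/numpy; it reuses the `adaptive_lm_parameter` and `nonmonotone_reference` helpers sketched above. All names are ours, and the defaults mirror the parameter values reported in Section 4:

```python
import numpy as np

def allm(F, J, x0, theta=0.5, delta=1.0, N0=5, mu0=1e-2, m=1e-8,
         eps=1e-5, p0=1e-4, p1=0.25, p2=0.75, max_iter=1000):
    """A minimal sketch of Algorithm 1 (ALLM).

    F and J are callables returning the residual vector and the Jacobian.
    """
    x = np.asarray(x0, dtype=float)
    mu = mu0
    history = [np.linalg.norm(F(x))]                 # ||F_0||, ..., ||F_k||
    for k in range(max_iter):
        Fk, Jk = F(x), J(x)
        g = Jk.T @ Fk
        if np.linalg.norm(g) <= eps:                 # Step 2: stopping test
            break
        lam = adaptive_lm_parameter(mu, np.linalg.norm(Fk), theta, delta)
        d = np.linalg.solve(Jk.T @ Jk + lam * np.eye(x.size), -g)  # Step 3
        pred = np.linalg.norm(Fk) ** 2 - np.linalg.norm(Fk + Jk @ d) ** 2
        Flk = nonmonotone_reference(history, N0)     # Step 4: ||F_{l(k)}||
        ared = Flk ** 2 - np.linalg.norm(F(x + d)) ** 2
        r_hat = ared / pred                          # pred > 0 by Lemma 1
        if r_hat >= p0:                              # Step 5: accept/reject
            x = x + d
        history.append(np.linalg.norm(F(x)))
        if r_hat < p1:                               # Step 6: update mu_k
            mu = 4.0 * mu
        elif r_hat > p2:
            mu = max(mu / 4.0, m)
    return x
```

A production implementation would add safeguards (for example, guarding against a non-positive predicted reduction caused by round-off), which we omit for brevity.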
Lemma 1.
$Pred_k \geq \|J_k^T F_k\| \min\left\{\|d_k\|, \dfrac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\}$ for all $k \in \mathbb{N}$.
Proof. 
This follows from the well-known result of Powell in [29]. □
Lemma 2
([18]). Assume that the sequence $\{x_k\}$ is generated by the ALLM algorithm; then, the sequence $\{\|F_{l(k)}\|\}$ converges.
Assumption 1.
(a) $J(x)$ is Hölderian continuous, i.e., there exists a constant $\kappa_{hj} > 0$ such that
$$\|J(x) - J(y)\| \leq \kappa_{hj} \|x - y\|^{v}, \quad \forall x, y \in \mathbb{R}^n, \qquad (16)$$
where the exponent $v \in (0, 1]$.
(b) $J(x)$ is bounded, i.e., there exists a constant $\kappa_{bj} > 0$ such that
$$\|J(x)\| \leq \kappa_{bj}, \quad \forall x \in \mathbb{R}^n. \qquad (17)$$
It follows from Equation (16) that
$$\|F(y) - F(x) - J(x)(y - x)\| \leq \frac{\kappa_{hj}}{1 + v} \|y - x\|^{1 + v}. \qquad (18)$$
Thus, there exists a constant $\kappa_{bf} > 0$ such that
$$\|F(y) - F(x)\| \leq \kappa_{bf} \|y - x\|. \qquad (19)$$
Theorem 1.
Under Assumption 1, the ALLM algorithm satisfies
$$\liminf_{k \to \infty} \|J_k^T F_k\| = 0.$$
Proof. 
Suppose, to the contrary, that Theorem 1 does not hold. Then
$$\|J_k^T F_k\| \geq \epsilon_0, \quad \forall k \geq k_0, \qquad (20)$$
where $\epsilon_0$ is a positive constant and $k_0 \in \mathbb{N}$.
If $d_k$ is accepted by the ALLM algorithm, then
$$\|F_{l(k)}\|^2 - \|F(x_k + d_k)\|^2 \geq p_0\, Pred_k. \qquad (21)$$
Per Lemma 1 and Equations (17), (20) and (21), for all $k \geq k_0$,
$$\|F_{l(k)}\|^2 - \|F_{k+1}\|^2 \geq p_0 \|J_k^T F_k\| \min\left\{\|d_k\|, \frac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\} \geq p_0 \epsilon_0 \min\left\{\|d_k\|, \frac{\epsilon_0}{\kappa_{bj}^2}\right\}.$$
Then, substituting $l(k) - 1$ for $k$, we find that
$$\|F_{l(l(k)-1)}\|^2 - \|F_{l(k)}\|^2 \geq p_0 \epsilon_0 \min\left\{\|d_{l(k)-1}\|, \frac{\epsilon_0}{\kappa_{bj}^2}\right\}$$
holds for all sufficiently large $k$.
Per Lemma 2, we obtain
$$\lim_{k \to \infty} \left( \|F_{l(l(k)-1)}\|^2 - \|F_{l(k)}\|^2 \right) = 0;$$
thus,
$$\lim_{k \to \infty} \min\left\{\|d_{l(k)-1}\|, \frac{\epsilon_0}{\kappa_{bj}^2}\right\} = 0,$$
and since $\epsilon_0 / \kappa_{bj}^2$ is a positive constant,
$$\lim_{k \to \infty} \|d_{l(k)-1}\| = 0.$$
Per Equation (19), the last equality implies that
$$\lim_{k \to \infty} \|F(x_{l(k)})\| = \lim_{k \to \infty} \|F(x_{l(k)-1})\|.$$
Next, following the proof of Theorem 2.4 in [18], we can prove that
$$\lim_{k \to \infty} \|d_k\| = 0. \qquad (22)$$
Along with Equations (10), (11), (17) and (21), this implies that
$$\mu_k \to \infty \quad \text{as } k \to \infty. \qquad (23)$$
Next, per Equation (18), we obtain
$$\|F(x_k + d_k) - F_k - J_k d_k\| \leq \frac{\kappa_{hj}}{1 + v} \|d_k\|^{1 + v},$$
which yields
$$\left| \|F(x_k + d_k)\|^2 - \|F_k + J_k d_k\|^2 \right| \leq \frac{2\kappa_{hj}}{1 + v} \|F_k + J_k d_k\| \|d_k\|^{1 + v} + \frac{\kappa_{hj}^2}{(1 + v)^2} \|d_k\|^{2 + 2v}.$$
From Lemma 1, Equations (17), (21) and (22), and $\|F_k + J_k d_k\| \leq \|F_k\| \leq \|F_1\|$, we obtain
$$|r_k - 1| = \left| \frac{Ared_k - Pred_k}{Pred_k} \right| \leq \frac{\frac{2\kappa_{hj}}{1 + v} \|F_k + J_k d_k\| \|d_k\|^{1 + v} + \frac{\kappa_{hj}^2}{(1 + v)^2} \|d_k\|^{2 + 2v}}{\|J_k^T F_k\| \min\left\{\|d_k\|, \frac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\}} \to 0;$$
thus,
$$\lim_{k \to \infty} r_k = 1.$$
Combined with Equations (6), (8), (9) and (12), this gives
$$\hat{r}_k = \frac{\overline{Ared}_k}{Pred_k} = \frac{\|F_{l(k)}\|^2 - \|F(x_k + d_k)\|^2}{Pred_k} \geq \frac{\|F_k\|^2 - \|F_{k+1}\|^2}{Pred_k} = r_k \to 1.$$
In view of the updating rule of the ALLM algorithm, there then exists a positive constant $\bar{\mu} > m$ such that $\mu_k \leq \bar{\mu}$ for all large $k$, which contradicts Equation (23). Thus, Theorem 1 holds. □

3. Convergence Rate

This section discusses the convergence rate of the ALLM algorithm. Here, we assume that the sequence $\{x_k\}$ generated by the ALLM algorithm lies within a neighbourhood of $x^* \in X^*$ and converges to the solution set $X^*$ of Equation (1).
Assumption 2.
(a) $\|F(x)\|$ provides a Hölderian local error bound, i.e., there exist constants $c > 0$ and $0 < b < 1$ such that
$$c \cdot \mathrm{dist}\left(x, X^*\right) \leq \|F(x)\|^{\gamma}, \quad \forall x \in N(x^*, b), \qquad (24)$$
where the exponent $\gamma \in (0, 1]$ and $N(x^*, b) = \{x \in \mathbb{R}^n \mid \|x - x^*\| \leq b\}$.
(b) $J(x)$ is Hölderian continuous, i.e., there exists a constant $\kappa_{hj} > 0$ such that
$$\|J(x) - J(y)\| \leq \kappa_{hj} \|x - y\|^{v}, \quad \forall x, y \in N(x^*, b), \qquad (25)$$
where the exponent $v \in (0, 1]$.
From Equation (25), we have
$$\|F(y) - F(x) - J(x)(y - x)\| \leq \frac{\kappa_{hj}}{1 + v}\|y - x\|^{1 + v}, \quad \forall x, y \in N(x^*, b). \qquad (26)$$
Thus,
$$\|F(y) - F(x)\| \leq \kappa_{bf}\|y - x\|, \quad \forall x, y \in N(x^*, b), \qquad (27)$$
where $\kappa_{bf}$ is a positive constant.
Let $\bar{x}_k \in X^*$ satisfy
$$\|\bar{x}_k - x_k\| = \mathrm{dist}\left(x_k, X^*\right),$$
i.e., $\bar{x}_k$ is a point of $X^*$ closest to $x_k$.
Next, we discuss important properties of $d_k$ and $\mu_k$; finally, we study the convergence rate of the ALLM algorithm using the singular value decomposition (SVD) technique. Without loss of generality, we assume that $x_k \in N(x^*, \frac{b}{4})$.
Lemma 3.
Under Assumption 2, we have
(1) If F k 1 ; then, the following relationship holds:
d k c ¯ dist x k , X min { 1 , 1 + v δ 2 γ }
where c ¯ is a positive constant.
(2) If F k > 1 , then the following relationship holds:
d k c ˜ dist x k , X
where c ˜ is a positive constant.
Proof. 
(1) As $x_k \in N(x^*, \frac{b}{4})$, we have
$$\|\bar{x}_k - x^*\| \leq \|\bar{x}_k - x_k\| + \|x_k - x^*\| \leq \frac{b}{2};$$
thus, $\bar{x}_k \in N(x^*, \frac{b}{2})$.
We define
$$\varphi_k(d) = \|F_k + J_k d\|^2 + \lambda_k \|d\|^2.$$
It can be concluded from (11) that $d_k$ is the minimizer of $\varphi_k(d)$. From (26) and $F(\bar{x}_k) = 0$, we have
$$\|d_k\|^2 \leq \frac{\varphi_k(d_k)}{\lambda_k} \leq \frac{\varphi_k(\bar{x}_k - x_k)}{\lambda_k} = \frac{\|F_k + J_k(\bar{x}_k - x_k)\|^2 + \lambda_k \|\bar{x}_k - x_k\|^2}{\lambda_k} = \frac{\|F(\bar{x}_k) - F_k - J_k(\bar{x}_k - x_k)\|^2 + \lambda_k\|\bar{x}_k - x_k\|^2}{\lambda_k} \leq \frac{1}{\lambda_k}\left(\frac{\kappa_{hj}}{1 + v}\right)^2 \|\bar{x}_k - x_k\|^{2 + 2v} + \|\bar{x}_k - x_k\|^2.$$
If $\|F_k\| \leq 1$, then $\|F_k\|^{\delta} \leq 1$ and $1 + \|F_k\|^{\delta} \leq 2$. In conjunction with (15) and (24), this yields
$$\lambda_k = \mu_k\left(\theta\frac{\|F_k\|^{\delta}}{1 + \|F_k\|^{\delta}} + (1 - \theta)\|F_k\|^{\delta}\right) \geq m\left(\frac{\theta\, c^{\delta/\gamma}\|x_k - \bar{x}_k\|^{\delta/\gamma}}{2} + (1 - \theta)\, c^{\delta/\gamma}\|x_k - \bar{x}_k\|^{\delta/\gamma}\right) \geq \frac{1}{2}(2m - m\theta)\, c^{\delta/\gamma}\|x_k - \bar{x}_k\|^{\delta/\gamma}.$$
Thus,
$$\|d_k\|^2 \leq \frac{1}{\lambda_k}\left(\frac{\kappa_{hj}}{1 + v}\right)^2 \|\bar{x}_k - x_k\|^{2 + 2v} + \|\bar{x}_k - x_k\|^2 \leq \frac{2\kappa_{hj}^2\, c^{-\delta/\gamma}}{(2m - m\theta)(1 + v)^2}\|\bar{x}_k - x_k\|^{2 + 2v - \frac{\delta}{\gamma}} + \|\bar{x}_k - x_k\|^2 \leq \left(\frac{2\kappa_{hj}^2\, c^{-\delta/\gamma}}{(2m - m\theta)(1 + v)^2} + 1\right)\|\bar{x}_k - x_k\|^{2\min\left\{1,\, 1 + v - \frac{\delta}{2\gamma}\right\}}.$$
Setting $\bar{c} = \left(\frac{2\kappa_{hj}^2\, c^{-\delta/\gamma}}{(2m - m\theta)(1 + v)^2} + 1\right)^{1/2}$, we obtain Equation (28).
(2) If $\|F_k\| > 1$, then $\|F_k\|^{\delta} > 1$ and $1 + \|F_k\|^{\delta} \leq 2\|F_k\|^{\delta}$. Along with (27), this allows us to conclude that
$$\lambda_k = \mu_k\left(\theta\frac{\|F_k\|^{\delta}}{1 + \|F_k\|^{\delta}} + (1 - \theta)\|F_k\|^{-\delta}\right) \geq m\left(\frac{\theta\|F_k\|^{\delta}}{2\|F_k\|^{\delta}} + (1 - \theta)\,\kappa_{bf}^{-\delta}\|x_k - \bar{x}_k\|^{-\delta}\right) = \frac{m\theta}{2} + m(1 - \theta)\,\kappa_{bf}^{-\delta}\|x_k - \bar{x}_k\|^{-\delta}.$$
Thus, there exists a constant $\tilde{c} > 0$ such that
$$\|d_k\|^2 \leq \tilde{c}^2\, \mathrm{dist}\left(x_k, X^*\right)^2,$$
and therefore $\|d_k\| \leq \tilde{c}\, \mathrm{dist}(x_k, X^*)$. □
Lemma 4.
Under Assumption 2, we have the following:
(1) If $\|F_k\| \leq 1$ and $v > \max\left\{\frac{1}{\gamma} - 1,\ \frac{1}{\gamma} - \frac{(1 + v)\delta}{2} - 1,\ \frac{1}{\gamma} - \frac{\gamma(1 + v)\delta}{2}\right\}$, then $\mu_k$ is bounded above, i.e., there exists a positive constant $M_1$ such that $\mu_k \leq M_1$ holds for all large $k$.
(2) If $\|F_k\| > 1$ and $v > \frac{1}{\gamma} - 1$, then $\mu_k$ is bounded above, i.e., there exists a positive constant $M_2$ such that $\mu_k \leq M_2$ holds for all large $k$.
Proof. 
(1) Proceeding as in Lemma 3.3 of [21], we can see that
$$|r_k - 1| = \left|\frac{Ared_k - Pred_k}{Pred_k}\right| = \frac{\left|\|F_k + J_k d_k\|^2 - \|F(x_k + d_k)\|^2\right|}{Pred_k} \leq \frac{\frac{\kappa_{hj}^2}{(1 + v)^2}\|d_k\|^{2 + 2v} + \frac{2\kappa_{hj}}{1 + v}\|F_k + J_k d_k\|\|d_k\|^{1 + v}}{c_4 \|F_k\| \|d_k\|^{\max\left\{\frac{1}{\gamma},\ \frac{1}{\gamma} - \frac{(1 + v)\delta}{2},\ \frac{1}{\gamma} - \frac{\gamma(1 + v)\delta}{2} + 1\right\}}} \to 0;$$
thus,
$$\lim_{k \to \infty} r_k = 1.$$
This, along with Equations (6), (8), (9) and (12), yields
$$\hat{r}_k = \frac{\overline{Ared}_k}{Pred_k} = \frac{\|F_{l(k)}\|^2 - \|F(x_k + d_k)\|^2}{Pred_k} \geq \frac{\|F_k\|^2 - \|F_{k+1}\|^2}{Pred_k} = r_k \to 1.$$
Considering the updating rule (14), we can then ascertain the existence of a positive constant $M_1 > m$ such that $\mu_k \leq M_1$ holds for sufficiently large $k$.
(2) Consider the following two cases.
Case 1: $\|\bar{x}_k - x_k\| \leq \|d_k\|$. Per Lemma 3 (2), Equations (24) and (26), and $v > \frac{1}{\gamma} - 1$, we have
$$\|F_k\| - \|F_k + J_k d_k\| \geq \|F_k\| - \|F_k + J_k(\bar{x}_k - x_k)\| \geq c^{\frac{1}{\gamma}}\|\bar{x}_k - x_k\|^{\frac{1}{\gamma}} - \frac{\kappa_{hj}}{1 + v}\|\bar{x}_k - x_k\|^{1 + v} \geq c_1\|\bar{x}_k - x_k\|^{\frac{1}{\gamma}} \geq c_2\|d_k\|^{\frac{1}{\gamma}}, \qquad (30)$$
which holds for some $c_1, c_2 > 0$.
Case 2: $\|\bar{x}_k - x_k\| > \|d_k\|$. It follows from Equation (30) that
$$\|F_k\| - \|F_k + J_k d_k\| \geq \|F_k\| - \left\|F_k + \frac{\|d_k\|}{\|\bar{x}_k - x_k\|}J_k(\bar{x}_k - x_k)\right\| \geq \|F_k\| - \left(1 - \frac{\|d_k\|}{\|\bar{x}_k - x_k\|}\right)\|F_k\| - \frac{\|d_k\|}{\|\bar{x}_k - x_k\|}\|F_k + J_k(\bar{x}_k - x_k)\| = \frac{\|d_k\|}{\|\bar{x}_k - x_k\|}\left(\|F_k\| - \|F_k + J_k(\bar{x}_k - x_k)\|\right) \geq c_1\|d_k\|\|\bar{x}_k - x_k\|^{\frac{1}{\gamma} - 1} \geq c_3\|d_k\|^{\frac{1}{\gamma}} \qquad (31)$$
holds for some $c_3 > 0$.
Therefore, from Equations (30) and (31), we have
$$Pred_k = \left(\|F_k\| + \|F_k + J_k d_k\|\right)\left(\|F_k\| - \|F_k + J_k d_k\|\right) \geq \|F_k\|\left(\|F_k\| - \|F_k + J_k d_k\|\right) \geq c_4\|F_k\|\|d_k\|^{\frac{1}{\gamma}}, \qquad (32)$$
which holds for some $c_4 > 0$.
Because $\|F_k + J_k d_k\| \leq \|F_k\|$ and $v > \frac{1}{\gamma} - 1$, from Equations (26) and (32) we have
$$|r_k - 1| = \left|\frac{Ared_k - Pred_k}{Pred_k}\right| = \frac{\left|\|F_k + J_k d_k\|^2 - \|F(x_k + d_k)\|^2\right|}{Pred_k} \leq \frac{\frac{\kappa_{hj}^2}{(1 + v)^2}\|d_k\|^{2 + 2v} + \frac{2\kappa_{hj}}{1 + v}\|F_k + J_k d_k\|\|d_k\|^{1 + v}}{c_4\|F_k\|\|d_k\|^{\frac{1}{\gamma}}} \to 0;$$
thus,
$$\lim_{k \to \infty} r_k = 1.$$
This, along with Equations (6), (8), (9) and (12), yields
$$\hat{r}_k = \frac{\overline{Ared}_k}{Pred_k} = \frac{\|F_{l(k)}\|^2 - \|F(x_k + d_k)\|^2}{Pred_k} \geq \frac{\|F_k\|^2 - \|F_{k+1}\|^2}{Pred_k} = r_k \to 1.$$
Therefore, there exists a positive constant $M_2 > m$ such that $\mu_k \leq M_2$ holds for sufficiently large $k$. □
Next, we consider the SVD technique. In view of the findings of Behling and Iusem in [30], without loss of generality we set $\mathrm{rank}(J(\bar{x})) = r$ for all $\bar{x} \in N(x^*, b) \cap X^*$. Suppose that the SVD of $J(\bar{x}_k)$ is
$$J(\bar{x}_k) = \bar{U}_k \bar{\Sigma}_k \bar{V}_k^T = \left[\bar{U}_{k,1}, \bar{U}_{k,2}\right] \begin{bmatrix} \bar{\Sigma}_{k,1} & \\ & 0 \end{bmatrix} \begin{bmatrix} \bar{V}_{k,1}^T \\ \bar{V}_{k,2}^T \end{bmatrix} = \bar{U}_{k,1}\bar{\Sigma}_{k,1}\bar{V}_{k,1}^T,$$
where $\bar{\Sigma}_{k,1} = \mathrm{diag}\left(\bar{\sigma}_{k,1}, \ldots, \bar{\sigma}_{k,r}\right) > 0$.
Correspondingly,
$$J_k = \left[U_{k,1}, U_{k,2}\right] \begin{bmatrix} \Sigma_{k,1} & \\ & \Sigma_{k,2} \end{bmatrix} \begin{bmatrix} V_{k,1}^T \\ V_{k,2}^T \end{bmatrix} = U_{k,1}\Sigma_{k,1}V_{k,1}^T + U_{k,2}\Sigma_{k,2}V_{k,2}^T,$$
where $\Sigma_{k,2} = \mathrm{diag}\left(\sigma_{k,r+1}, \ldots, \sigma_{k,n}\right) \geq 0$.
For clarity, we write
$$J_k = U_1\Sigma_1V_1^T + U_2\Sigma_2V_2^T,$$
neglecting the subscript $k$ in $U_{k,i}$, $\Sigma_{k,i}$ and $V_{k,i}$.
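For readers who wish to reproduce the quantities used in the analysis, the splitting of the SVD of $J_k$ at a known rank $r$ can be sketched in numpy as follows. The helper name is ours, and it relies on numpy returning the singular values in decreasing order:

```python
import numpy as np

def svd_split(Jk, r):
    """Split the SVD of a square J_k into the r dominant singular triplets
    (U1, Sigma1, V1) and the remainder (U2, Sigma2, V2), as in the analysis."""
    U, s, Vt = np.linalg.svd(Jk)          # singular values in decreasing order
    U1, U2 = U[:, :r], U[:, r:]
    S1, S2 = np.diag(s[:r]), np.diag(s[r:])
    V1, V2 = Vt[:r, :].T, Vt[r:, :].T
    return (U1, S1, V1), (U2, S2, V2)
```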
Lemma 5
([21]). Under Assumption 2, the following relationships hold:
(1) $\|U_1U_1^TF_k\| \leq \kappa_{bf}\|\bar{x}_k - x_k\|$;
(2) $\|U_2U_2^TF_k\| \leq 2\kappa_{hj}\|\bar{x}_k - x_k\|^{1+v}$.
Theorem 2.
Under the conditions of Lemma 4, we have the following:
(1) If $\|F_k\| \leq 1$, then the sequence $\{x_k\}$ generated by the ALLM algorithm converges to the solution set of Equation (1) with order $\min\left\{\gamma(1 + \delta),\ \gamma(1 + v),\ \gamma(1 + v)\left(1 + v - \frac{\delta}{2\gamma}\right)\right\}$.
(2) If $\|F_k\| > 1$, then the sequence $\{x_k\}$ generated by the ALLM algorithm converges to the solution set of Equation (1) with order $\gamma$.
Proof. 
(1) It follows from the SVD of $J_k$ that
$$d_k = -V_1\left(\Sigma_1^2 + \lambda_kI\right)^{-1}\Sigma_1U_1^TF_k - V_2\left(\Sigma_2^2 + \lambda_kI\right)^{-1}\Sigma_2U_2^TF_k$$
and
$$F_k + J_kd_k = F_k - U_1\Sigma_1\left(\Sigma_1^2 + \lambda_kI\right)^{-1}\Sigma_1U_1^TF_k - U_2\Sigma_2\left(\Sigma_2^2 + \lambda_kI\right)^{-1}\Sigma_2U_2^TF_k = \lambda_kU_1\left(\Sigma_1^2 + \lambda_kI\right)^{-1}U_1^TF_k + \lambda_kU_2\left(\Sigma_2^2 + \lambda_kI\right)^{-1}U_2^TF_k.$$
According to the theory of matrix perturbation [31] and Equation (25), we have
$$\left\|\mathrm{diag}\left(\Sigma_1 - \bar{\Sigma}_{k,1},\ \Sigma_2\right)\right\| \leq \|J_k - J(\bar{x}_k)\| \leq \kappa_{hj}\|\bar{x}_k - x_k\|^{v},$$
which indicates that
$$\|\Sigma_1 - \bar{\Sigma}_{k,1}\| \leq \kappa_{hj}\|\bar{x}_k - x_k\|^{v}, \qquad \|\Sigma_2\| \leq \kappa_{hj}\|\bar{x}_k - x_k\|^{v}. \qquad (34)$$
As $\{x_k\}$ converges to $X^*$, without loss of generality we may let $\kappa_{hj}\|\bar{x}_k - x_k\|^{v} \leq \bar{\sigma}/2$ hold for all large $k$, where $\bar{\sigma}$ denotes a positive lower bound on the smallest singular value of $\bar{\Sigma}_{k,1}$. From Equation (34), we have
$$\left\|\left(\Sigma_1^2 + \lambda_kI\right)^{-1}\right\| \leq \left\|\Sigma_1^{-2}\right\| \leq \left(\bar{\sigma} - \kappa_{hj}\|\bar{x}_k - x_k\|^{v}\right)^{-2} \leq \frac{4}{\bar{\sigma}^2}. \qquad (35)$$
From Equations (34) and (35), Lemma 5, and $\left\|\left(\Sigma_2^2 + \lambda_kI\right)^{-1}\right\| \leq \lambda_k^{-1}$, we have
$$\|F_k + J_kd_k\| \leq \frac{4\lambda_k\kappa_{bf}}{\bar{\sigma}^2}\|\bar{x}_k - x_k\| + 2\kappa_{hj}\|\bar{x}_k - x_k\|^{1+v}. \qquad (36)$$
If $\|F_k\| \leq 1$, then $\|F_k\|^{\delta} \leq 1$, while from Equation (27) and Lemma 4 we have
$$\lambda_k = \mu_k\left(\theta\frac{\|F_k\|^{\delta}}{1 + \|F_k\|^{\delta}} + (1 - \theta)\|F_k\|^{\delta}\right) \leq M_1\left(\theta\|F_k\|^{\delta} + (1 - \theta)\|F_k\|^{\delta}\right) \leq M_1\kappa_{bf}^{\delta}\|x_k - \bar{x}_k\|^{\delta}.$$
This, along with Equation (36), yields
$$\|F_k + J_kd_k\| \leq \frac{4M_1\kappa_{bf}^{1+\delta}}{\bar{\sigma}^2}\|\bar{x}_k - x_k\|^{1+\delta} + 2\kappa_{hj}\|\bar{x}_k - x_k\|^{1+v} \leq \left(\frac{4M_1\kappa_{bf}^{1+\delta}}{\bar{\sigma}^2} + 2\kappa_{hj}\right)\|\bar{x}_k - x_k\|^{\min\{1+\delta,\, 1+v\}}. \qquad (37)$$
Letting $c_5 = \frac{4M_1\kappa_{bf}^{1+\delta}}{\bar{\sigma}^2} + 2\kappa_{hj}$, from Equations (24), (26), (28) and (37) we obtain
$$c^{\frac{1}{\gamma}}\|\bar{x}_{k+1} - x_{k+1}\|^{\frac{1}{\gamma}} \leq \|F(x_k + d_k)\| \leq \|F_k + J_kd_k\| + \kappa_{hj}\|d_k\|^{1+v} \leq c_5\|\bar{x}_k - x_k\|^{\min\{1+\delta,\, 1+v\}} + \kappa_{hj}\bar{c}^{1+v}\|\bar{x}_k - x_k\|^{\min\left\{1+v,\, (1+v)\left(1 + v - \frac{\delta}{2\gamma}\right)\right\}} \leq \left(c_5 + \kappa_{hj}\bar{c}^{1+v}\right)\|\bar{x}_k - x_k\|^{\min\left\{1+\delta,\, 1+v,\, (1+v)\left(1 + v - \frac{\delta}{2\gamma}\right)\right\}}.$$
Thus,
$$c\|\bar{x}_{k+1} - x_{k+1}\| \leq \left(c_5 + \kappa_{hj}\bar{c}^{1+v}\right)^{\gamma}\|\bar{x}_k - x_k\|^{\min\left\{\gamma(1+\delta),\, \gamma(1+v),\, \gamma(1+v)\left(1 + v - \frac{\delta}{2\gamma}\right)\right\}},$$
which indicates that $\{x_k\}$ converges to the solution set $X^*$ of Equation (1) with rate $\min\left\{\gamma(1+\delta),\, \gamma(1+v),\, \gamma(1+v)\left(1 + v - \frac{\delta}{2\gamma}\right)\right\}$.
(2) The proof for $\|F_k\| > 1$ is similar to that for $\|F_k\| \leq 1$. We obtain
$$c\|\bar{x}_{k+1} - x_{k+1}\| \leq \left(c_6 + \kappa_{hj}\tilde{c}^{1+v}\right)^{\gamma}\|\bar{x}_k - x_k\|^{\gamma};$$
thus, $\{x_k\}$ converges to the solution set $X^*$ of Equation (1) with order $\gamma$. □
Theorem 3.
Under Assumption 2, we have the following:
(1) If $\|F_k\| \leq 1$, $v > \frac{1}{\gamma} - 1$ and $\frac{1}{\gamma} - 1 < \delta \leq 2\gamma v$, then the sequence $\{x_k\}$ generated by the ALLM algorithm converges to some solution of Equation (1) with order $\min\{\gamma(1+\delta),\, \gamma(1+v)\}$.
(2) If $\|F_k\| > 1$ and $v > \frac{1}{\gamma} - 1$, then the sequence $\{x_k\}$ generated by the ALLM algorithm converges to some solution of Equation (1) with order $\gamma$.
Proof. 
(1) If $\|F_k\| \leq 1$ and $v \geq \frac{\delta}{2\gamma}$, then $\min\left\{1,\, 1 + v - \frac{\delta}{2\gamma}\right\} = 1$, so from Equation (28) we obtain
$$\|d_k\| \leq \bar{c}\|\bar{x}_k - x_k\|. \qquad (38)$$
It follows from $v > \frac{1}{\gamma} - 1$ and $v \geq \frac{\delta}{2\gamma}$ that
$$\left(\frac{1}{\gamma} - 1\right) - \left(\frac{1}{\gamma} - \frac{\gamma(1+v)\delta}{2}\right) \geq 0$$
and
$$\left(\frac{1}{\gamma} - \frac{\gamma(1+v)\delta}{2}\right) - \left(\frac{1}{\gamma} - \frac{(1+v)\delta}{2} - 1\right) \geq 0.$$
Therefore, the conditions of Lemma 4 (1) hold. In conjunction with $\delta > \frac{1}{\gamma} - 1$ and $\delta \leq 2\gamma v$, this yields
$$\min\left\{\gamma(1+\delta),\ \gamma(1+v),\ \gamma(1+v)\left(1 + v - \frac{\delta}{2\gamma}\right)\right\} = \min\{\gamma(1+\delta),\ \gamma(1+v)\} > 1. \qquad (39)$$
Thus, $\{x_k\}$ converges superlinearly to $X^*$.
By the triangle inequality,
$$\|\bar{x}_k - x_k\| \leq \|\bar{x}_{k+1} - x_{k+1}\| + \|d_k\|. \qquad (40)$$
In view of Equations (38) and (40), there exists a constant $M > 0$ such that
$$\|\bar{x}_k - x_k\| \leq M\|d_k\| \qquad (42)$$
holds for all large $k$. Thus, from Equations (38), (39), (40) and (42), we have
$$\|d_{k+1}\| \leq O\left(\|d_k\|^{\min\{\gamma(1+\delta),\, \gamma(1+v)\}}\right),$$
which means that the ALLM algorithm converges with order $\min\{\gamma(1+\delta),\, \gamma(1+v)\}$.
(2) The proof for $\|F_k\| > 1$ is similar to that for $\|F_k\| \leq 1$. We obtain
$$\|d_{k+1}\| \leq O\left(\|d_k\|^{\gamma}\right);$$
thus, the ALLM algorithm converges with order $\gamma$. □

4. Numerical Experiments

In this section, we verify the effectiveness of the ALLM algorithm through numerical experiments. Algorithm 1 (named the AELM algorithm) from [22] is used for comparison. All algorithms were implemented in MATLAB R2022b on a personal computer with an Intel i7-7500U CPU at 2.7 GHz. The parameters of the AELM algorithm were $p_0 = 10^{-4}$, $p_1 = 0.25$, $p_2 = 0.75$, $N_0 = 5$, $\mu_1 = 0.01$, $m = 10^{-8}$. The parameters of the ALLM algorithm were $p_0 = 10^{-4}$, $p_1 = 0.25$, $p_2 = 0.75$, $N_0 = 5$, $\mu_1 = 0.01$, $m = 10^{-8}$, with $\theta = 0, 0.5, 1$ and $\delta = 1, 2$. All algorithms were terminated when $\|J_k^T F_k\| \leq 10^{-5}$ or when the number of iterations exceeded 1000.
Example 1.
We consider four special functions [22] to verify that the ALLM algorithm applies in a wider range of theoretical settings. Functions 1–4 satisfy the Hölderian local error bound condition around the zero point but do not satisfy the local error bound condition. The Jacobians $J(x)$ of Functions 3–4 are Hölderian continuous but not Lipschitz continuous, while those of Functions 1–2 are both Lipschitz continuous and Hölderian continuous.
Function 1
$$f_1(x) = x_1 + 10x_2, \quad f_2(x) = \sqrt{5}\,(x_3 - x_4), \quad f_3(x) = (x_2 - 2x_3)^2, \quad f_4(x) = \sqrt{10}\,(x_1 - x_4)^2.$$
Initial point: $x_0 = (3, -1, 0, 1)^T$; zero point: $(0, 0, 0, 0)^T$.
Function 2
$$f_1(x) = x_1 - x_2, \quad f_2(x) = x_1^2 + x_2^2.$$
Initial point: $x_0 = (1, 1)^T$; zero point: $(0, 0)^T$.
Function 3
$$f_1(x) = x_1 + 10x_2, \quad f_2(x) = x_3 - x_4, \quad f_3(x) = (x_2 - 2x_3)^{3/2}, \quad f_4(x) = (x_1 - x_4)^{3/2}.$$
Initial point: $x_0 = (3, -1, 0, 1)^T$; zero point: $(0, 0, 0, 0)^T$.
Function 4
$$f_1(x) = x_1 + 10x_2, \quad f_2(x) = x_3 - x_4, \quad f_3(x) = (x_2 - 2x_3)^{4/3}, \quad f_4(x) = (x_1 - x_4)^{4/3}.$$
Initial point: $x_0 = (3, -1, 0, 1)^T$; zero point: $(0, 0, 0, 0)^T$.
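As an illustration, Functions 1 and 3 can be coded as residual maps as follows. This is a sketch under our reconstruction of the formulas above; in particular, absolute values are used inside the fractional powers to keep the residuals real-valued for negative arguments:

```python
import numpy as np

def function1(x):
    """Function 1 (a Powell-singular-type function), as reconstructed above."""
    return np.array([
        x[0] + 10.0 * x[1],
        np.sqrt(5.0) * (x[2] - x[3]),
        (x[1] - 2.0 * x[2]) ** 2,
        np.sqrt(10.0) * (x[0] - x[3]) ** 2,
    ])

def function3(x):
    """Function 3: the 3/2 powers make J Hölder continuous (v = 1/2)
    but not Lipschitz continuous near the zero point."""
    return np.array([
        x[0] + 10.0 * x[1],
        x[2] - x[3],
        np.abs(x[1] - 2.0 * x[2]) ** 1.5,
        np.abs(x[0] - x[3]) ** 1.5,
    ])

x0 = np.array([3.0, -1.0, 0.0, 1.0])   # common initial point for Functions 1, 3, 4
```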
We tested each function from three starting points, $x_0$, $10x_0$ and $100x_0$, to study the global convergence of the ALLM algorithm. Table 1 lists the numerical results achieved by the AELM and ALLM algorithms on the four test functions. The symbols in Table 1 have the following meanings:
  • NF: the number of function evaluations.
  • NJ: the number of Jacobian evaluations.
  • NT: the total computational effort, measured as NT = NF + NJ × n.
Table 1. Numerical results of the AELM and ALLM algorithms with various choices of δ and θ (NF/NJ/NT).

| Function | n | x0 | AELM | ALLM δ=1, θ=0 | δ=1, θ=0.5 | δ=1, θ=1 | δ=2, θ=0 | δ=2, θ=0.5 | δ=2, θ=1 |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 4 | 1 | 10/10/50 | 10/10/50 | 10/10/50 | 10/10/50 | 10/10/50 | 10/10/50 | 10/10/50 |
| | | 10 | 13/13/65 | 13/13/65 | 13/13/65 | 13/13/65 | 13/13/65 | 13/13/65 | 13/13/65 |
| | | 100 | 16/16/80 | 16/16/80 | 16/16/80 | 16/16/80 | 16/16/80 | 16/16/80 | 16/16/80 |
| 2 | 2 | 1 | 8/8/24 | 8/8/24 | 8/8/24 | 8/8/24 | 8/8/24 | 8/8/24 | 8/8/24 |
| | | 10 | 11/11/33 | 11/11/33 | 11/11/33 | 11/11/33 | 11/11/33 | 11/11/33 | 11/11/33 |
| | | 100 | 15/15/45 | 15/15/45 | 15/15/45 | 15/15/45 | 15/15/45 | 15/15/45 | 15/15/45 |
| 3 | 4 | 1 | 8/8/40 | 8/8/40 | 8/8/40 | 8/8/40 | 8/8/40 | 8/8/40 | 8/8/40 |
| | | 10 | 10/10/50 | 10/10/50 | 10/10/50 | 10/10/50 | 10/10/50 | 10/10/50 | 10/10/50 |
| | | 100 | 12/12/60 | 12/12/60 | 12/12/60 | 12/12/60 | 12/12/60 | 12/12/60 | 12/12/60 |
| 4 | 4 | 1 | 13/13/65 | 7/7/35 | 7/7/35 | 7/7/35 | 7/7/35 | 7/7/35 | 7/7/35 |
| | | 10 | 16/16/80 | 9/9/45 | 9/9/45 | 9/9/45 | 9/9/45 | 9/9/45 | 9/9/45 |
| | | 100 | 61/50/261 | 11/11/55 | 11/11/55 | 11/11/55 | 11/11/55 | 11/11/55 | 11/11/55 |
As can be seen from Table 1, the ALLM algorithm is clearly superior to the AELM algorithm on Function 4, while the two algorithms produce identical results on Functions 1–3.
Example 2.
We consider some singular problems created in the following form [32]:
$$\hat{F}(x) = F(x) - J(x^*)A\left(A^TA\right)^{-1}A^T\left(x - x^*\right),$$
where the test function $F(x)$ is provided by Moré, Garbow, and Hillstrom in [33], $x^*$ is a root of $F(x)$, and $A \in \mathbb{R}^{n \times k}$ has full column rank. It is clear that the Jacobian of $\hat{F}(x)$ at $x^*$ is
$$\hat{J}(x^*) = J(x^*)\left(I - A\left(A^TA\right)^{-1}A^T\right),$$
which has rank $n - k$ ($1 \leq k \leq n$), and that $\hat{F}(x^*) = 0$. Similar to [33], we choose
$$A = [1, 1, \ldots, 1]^T \in \mathbb{R}^{n \times 1},$$
which implies $\mathrm{rank}(\hat{J}(x^*)) = n - 1$.
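The construction can be sketched as follows; `make_singular_problem` is a hypothetical helper, and `F` and `J` are callables for a Moré–Garbow–Hillstrom test pair:

```python
import numpy as np

def make_singular_problem(F, J, x_star):
    """Create F_hat and J_hat from a test pair (F, J) via the construction
    above, with A = ones((n, 1)) so that rank(J_hat(x*)) = n - 1."""
    n = x_star.size
    A = np.ones((n, 1))
    P = A @ np.linalg.inv(A.T @ A) @ A.T   # orthogonal projector onto span(A)
    JsP = J(x_star) @ P                    # fixed rank-one correction term
    def F_hat(x):
        return F(x) - JsP @ (x - x_star)
    def J_hat(x):
        return J(x) - JsP
    return F_hat, J_hat
```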
Next, we ran all test problems from five starting points, $-10x_0$, $-x_0$, $x_0$, $10x_0$ and $100x_0$, where $x_0$ is taken from [33]. Table 2 and Table 3 display the numerical results achieved by the algorithms on all test functions. The symbols in Table 2 and Table 3 have the following meanings:
  • Iters: the number of iterations.
  • F: the final value of the norm of the function.
  • Time: the CPU time in seconds.
From Table 2 and Table 3, it is evident that the ALLM algorithm generally outperforms the AELM algorithm in terms of CPU time across most test functions. The ALLM algorithm performs best with $\theta = 0$ and $\delta = 2$, winning approximately 90% of the CPU time comparisons; in about 4% of cases the iteration counts of the two algorithms coincide. In particular, for certain test functions the ALLM algorithm consistently outperforms the AELM algorithm in both iteration count and CPU time when the initial point is distant from the solution set. From Table 2, for the extended Helical valley function with $n = 501$ and initial point $-10x_0$, both the number of iterations and the CPU time of the ALLM algorithm are better than those of the AELM algorithm. From Table 3, for the discrete boundary value function with $n = 1000$, the number of iterations and CPU time of the ALLM algorithm are better than those of the AELM algorithm for all five starting points.
To compare the numerical performance of the AELM and ALLM algorithms, we use the performance profile analysis proposed by Dolan and Moré [34]. As can be seen from Figure 1, the ALLM algorithm with $\theta = 0$ and $\delta = 2$ demonstrates the best performance in terms of iteration count, while with $\theta = 1$ and $\delta = 1$ the two algorithms perform almost identically in terms of the number of iterations. As can be seen from Figure 2, the ALLM algorithm with $\theta = 0$ and $\delta = 2$ performs best in terms of CPU time, and for the other values of $\theta$ and $\delta$ the ALLM algorithm still maintains an advantage in CPU time.
In general, the ALLM algorithm proves more effective than the AELM algorithm in solving nonlinear equations; it performs particularly well when $\delta$ is larger and $\theta$ is smaller. In practical applications, the choice of $\lambda_k$ can be continuously tuned by adjusting the values of $\delta$ and $\theta$.
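For completeness, the performance profile of Dolan and Moré [34] used in Figures 1 and 2 can be computed as in the following sketch, where `T[i, s]` holds the cost (iterations or CPU time) of solver `s` on problem `i`; the helper name is ours:

```python
import numpy as np

def performance_profile(T, taus):
    """Return rho(tau) for each solver: the fraction of problems solved
    within tau times the best observed cost (Dolan-Moré profile)."""
    ratios = T / T.min(axis=1, keepdims=True)    # performance ratios r_{i,s}
    return np.array([[np.mean(ratios[:, s] <= tau)
                      for s in range(T.shape[1])] for tau in taus])
```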
Table 2. Numerical results of the AELM and ALLM algorithms with δ = 1 and various choices of θ (Iters/F/Time).

| Function | n | x0 | AELM | ALLM (θ = 0) | ALLM (θ = 0.5) | ALLM (θ = 1) |
|---|---|---|---|---|---|---|
| Extended Rosenbrock | 500 | −10 | 19/2.7631e−7/0.30 | 19/2.7858e−7/0.27 | 19/2.7744e−7/0.29 | 19/2.7631e−7/0.28 |
| | | −1 | 16/1.7341e−7/0.23 | 14/2.5509e−7/0.19 | 15/3.3852e−7/0.20 | 16/1.7341e−7/0.25 |
| | | 1 | 17/2.2186e−7/0.24 | 17/1.9553e−7/0.25 | 17/2.0901e−7/0.22 | 17/2.2186e−7/0.23 |
| | | 10 | 19/3.9252e−7/0.29 | 19/3.8895e−7/0.25 | 19/3.9066e−7/0.27 | 19/3.9252e−7/0.29 |
| | | 100 | 23/1.3154e−7/0.42 | 23/1.3142e−7/0.30 | 23/1.3150e−7/0.34 | 23/1.3154e−7/0.37 |
| | 1000 | −10 | 19/3.9156e−7/1.63 | 19/3.9442e−7/1.52 | 19/3.9299e−7/1.53 | 19/3.9156e−7/1.64 |
| | | −1 | 16/2.5034e−7/1.43 | 14/3.8204e−7/1.10 | 16/1.2047e−7/1.28 | 16/2.5034e−7/1.34 |
| | | 1 | 17/3.1868e−7/1.46 | 17/2.8057e−7/1.40 | 17/2.9943e−7/1.36 | 17/3.1868e−7/1.46 |
| | | 10 | 20/1.3911e−7/1.67 | 20/1.3781e−7/1.93 | 20/1.3866e−7/1.65 | 20/1.3911e−7/1.57 |
| | | 100 | 23/1.8652e−7/2.06 | 23/1.8646e−7/1.90 | 23/1.8638e−7/1.90 | 23/1.8652e−7/1.98 |
| Extended Helical valley | 501 | −10 | 42/1.3356e−6/0.68 | 3/5.1316e−7/0.03 | 13/3.7044e−7/0.17 | 14/3.2573e−7/0.23 |
| | | −1 | 1/0/0.01 | 1/0/0.01 | 1/0/0.01 | 1/0/0.01 |
| | | 1 | 8/3.1758e−7/0.13 | 8/1.4137e−7/0.09 | 8/2.1648e−7/0.10 | 8/3.1758e−7/0.12 |
| | | 10 | 8/1.8981e−9/0.12 | 8/8.0024e−10/0.10 | 8/1.2568e−9/0.10 | 8/1.8981e−9/0.11 |
| | | 100 | 8/5.9124e−10/0.12 | 8/3.9747e−10/0.11 | 8/4.8399e−10/0.11 | 8/5.9124e−10/0.12 |
| | 1000 | −10 | 7/2.3766e−13/0.53 | 6/3.0134e−13/0.45 | 6/1.5143e−9/0.44 | 7/2.3766e−13/0.55 |
| | | −1 | 1/0/0.04 | 1/0/0.04 | 1/0/0.04 | 1/0/0.04 |
| | | 1 | 8/1.8337e−8/0.69 | 8/7.2043e−9/0.68 | 8/1.1804e−8/0.62 | 8/1.8337e−8/0.67 |
| | | 10 | 8/1.6817e−11/0.66 | 8/1.0758e−11/0.61 | 8/1.4744e−11/0.60 | 8/1.6817e−11/0.65 |
| | | 100 | 26/9.3824e−13/2.51 | 35/1.0739e−7/3.29 | 26/6.6675e−8/2.18 | 26/6.9835e−11/2.16 |
| Discrete boundary value | 500 | −10 | 6/3.3487e−3/0.11 | 6/3.3666e−3/0.07 | 6/3.3631e−3/0.09 | 6/3.3487e−3/0.12 |
| | | −1 | 4/1.2234e−3/0.06 | 4/1.2579e−3/0.04 | 4/1.2417e−3/0.05 | 4/1.2234e−3/0.04 |
| | | 1 | 3/3.5633e−4/0.04 | 3/3.6008e−4/0.03 | 3/3.5823e−4/0.03 | 3/3.5633e−4/0.03 |
| | | 10 | 5/6.7290e−3/0.08 | 5/6.7739e−3/0.06 | 5/6.7614e−3/0.06 | 5/6.7290e−3/0.08 |
| | | 100 | 12/1.3651e−4/0.23 | 13/1.3834e−5/0.16 | 12/1.5752e−4/0.17 | 12/1.3651e−4/0.16 |
| | 1000 | −10 | 6/3.6656e−3/0.52 | 6/3.6804e−3/0.50 | 6/3.6780e−3/0.47 | 6/3.6656e−3/0.48 |
| | | −1 | 4/1.4253e−3/0.30 | 4/1.4669e−3/0.30 | 4/1.4474e−3/0.30 | 4/1.4253e−3/0.31 |
| | | 1 | 3/1.3022e−4/0.22 | 3/1.3092e−4/0.20 | 3/1.3058e−4/0.21 | 3/1.3022e−4/0.21 |
| | | 10 | 5/6.5900e−3/0.40 | 5/6.6346e−3/0.38 | 5/6.6209e−3/0.35 | 5/6.5900e−3/0.39 |
| | | 100 | 13/9.9458e−5/1.09 | 13/1.0869e−4/1.06 | 13/1.0505e−4/1.07 | 13/9.9458e−5/1.08 |
| Discrete integral equation | 500 | −10 | 12/1.2304e−5/1.06 | 12/1.2171e−5/1.03 | 12/1.2238e−5/1.04 | 12/1.2304e−5/1.06 |
| | | −1 | 9/1.5928e−5/0.76 | 9/1.4153e−5/0.76 | 9/1.5162e−5/0.75 | 9/1.5928e−5/0.76 |
| | | 1 | 7/1.3357e−5/0.59 | 7/1.3770e−5/0.58 | 7/1.3592e−5/0.58 | 7/1.3357e−5/0.58 |
| | | 10 | 10/9.3502e−6/0.86 | 8/9.0151e−6/0.67 | 9/1.5419e−5/0.76 | 10/9.3502e−6/0.86 |
| | | 100 | 10/4.5155e−9/0.91 | 10/4.5463e−9/0.89 | 10/4.5306e−9/0.88 | 10/4.5155e−9/0.91 |
| | 1000 | −10 | 12/1.7452e−5/4.50 | 12/1.7265e−5/4.48 | 12/1.7358e−5/4.50 | 12/1.7452e−5/4.51 |
| | | −1 | 10/6.0308e−6/3.71 | 10/5.2005e−6/3.73 | 10/5.6998e−6/3.70 | 10/6.0308e−6/3.67 |
| | | 1 | 8/5.1495e−6/2.86 | 8/5.3838e−6/2.90 | 8/5.2754e−6/2.50 | 8/5.1495e−6/2.86 |
| | | 10 | 10/1.4251e−5/3.73 | 9/5.0675e−6/3.30 | 10/5.7297e−6/3.59 | 10/1.4251e−5/3.66 |
| | | 100 | 10/6.3828e−9/3.86 | 10/6.4261e−9/3.83 | 10/6.4040e−9/3.81 | 10/6.3828e−9/3.83 |
| Broyden banded | 500 | −10 | 10/3.8446e−12/0.17 | 10/3.9166e−12/0.16 | 10/4.3882e−12/0.14 | 10/3.8446e−12/0.17 |
| | | −1 | 26/6.9212e−6/0.52 | 31/1.2468e−5/0.53 | 28/1.6756e−5/0.50 | 25/1.2128e−5/0.50 |
| | | 1 | 12/1.5063e−5/0.20 | 12/1.5060e−5/0.20 | 12/1.5061e−5/0.19 | 12/1.5063e−5/0.20 |
| | | 10 | 18/1.7636e−5/0.33 | 18/1.7636e−5/0.30 | 18/1.7636e−5/0.28 | 18/1.7636e−5/0.28 |
| | | 100 | 24/1.0280e−5/0.44 | 24/1.0280e−5/0.36 | 24/1.0280e−5/0.37 | 24/1.0280e−5/0.37 |
| | 1000 | −10 | 10/3.5499e−12/0.90 | 10/3.8124e−12/0.91 | 10/4.6220e−12/0.90 | 10/3.5499e−12/0.99 |
| | | −1 | 33/9.9927e−6/3.35 | 27/9.6110e−6/2.54 | 33/2.6949e−5/3.04 | 28/9.7912e−6/2.62 |
| | | 1 | 12/2.1201e−5/1.22 | 12/2.1196e−5/1.08 | 12/2.1199e−5/1.09 | 12/2.1201e−5/1.17 |
| | | 10 | 18/2.4886e−5/1.82 | 18/2.4886e−5/1.59 | 18/2.4886e−5/1.68 | 18/2.4886e−5/1.68 |
| | | 100 | 24/1.4499e−5/2.80 | 24/1.4499e−5/2.42 | 24/1.4499e−5/2.37 | 24/1.4499e−5/2.54 |
Table 3. Numerical results of the AELM and ALLM algorithms with δ = 2 and various choices of θ (Iters/F/Time).

| Function | n | x0 | AELM | ALLM (θ = 0) | ALLM (θ = 0.5) | ALLM (θ = 1) |
|---|---|---|---|---|---|---|
| Extended Rosenbrock | 500 | −10 | 19/2.7631e−7/0.30 | 19/2.7841e−7/0.26 | 19/2.7734e−7/0.26 | 19/2.7635e−7/0.30 |
| | | −1 | 16/1.7341e−7/0.23 | 14/1.7729e−7/0.17 | 15/3.2343e−7/0.21 | 16/1.7288e−7/0.20 |
| | | 1 | 17/2.2186e−7/0.24 | 17/1.8677e−7/0.25 | 17/2.0435e−7/0.25 | 17/2.2121e−7/0.22 |
| | | 10 | 19/3.9252e−7/0.29 | 19/3.8875e−7/0.24 | 19/3.9061e−7/0.26 | 19/3.9243e−7/0.25 |
| | | 100 | 23/1.3154e−7/0.42 | 23/1.3134e−7/0.29 | 23/1.3145e−7/0.33 | 23/1.3149e−7/0.33 |
| | 1000 | −10 | 19/3.9156e−7/1.63 | 19/3.9423e−7/1.50 | 19/3.9284e−7/1.54 | 19/3.9140e−7/1.56 |
| | | −1 | 16/2.5034e−7/1.43 | 14/2.3878e−7/1.05 | 15/4.5237e−7/1.24 | 16/2.4969e−7/1.32 |
| | | 1 | 17/3.1868e−7/1.46 | 17/2.7048e−7/1.33 | 17/2.9369e−7/1.36 | 17/3.1809e−7/1.33 |
| | | 10 | 20/1.3911e−7/1.67 | 20/1.3786e−7/1.53 | 20/1.3857e−7/1.57 | 20/1.3919e−7/1.59 |
| | | 100 | 23/1.8652e−7/2.06 | 23/1.8649e−7/1.83 | 23/1.8655e−7/1.80 | 23/1.8633e−7/1.81 |
| Extended Helical valley | 501 | −10 | 42/1.3356e−6/0.68 | 3/7.5853e−13/0.04 | 13/1.7643e−7/0.18 | 14/2.0341e−7/0.19 |
| | | −1 | 1/0/0.01 | 1/0/0.01 | 1/0/0.01 | 1/0/0.01 |
| | | 1 | 8/3.1758e−7/0.13 | 8/1.0595e−7/0.11 | 8/1.9230e−7/0.11 | 8/3.2101e−7/0.09 |
| | | 10 | 8/1.8981e−9/0.12 | 8/4.7841e−10/0.11 | 8/9.5278e−10/0.10 | 8/1.6935e−9/0.10 |
| | | 100 | 8/5.9124e−10/0.12 | 8/2.0055e−10/0.10 | 8/3.2729e−10/0.13 | 8/4.9161e−10/0.13 |
| | 1000 | −10 | 7/2.3766e−13/0.53 | 5/7.4481e−10/0.36 | 6/1.7998e−12/0.51 | 6/4.6416e−7/0.42 |
| | | −1 | 1/0/0.04 | 1/0/0.04 | 1/0/0.04 | 1/0/0.04 |
| | | 1 | 8/1.8337e−8/0.69 | 8/4.1226e−9/0.59 | 8/9.1462e−9/0.62 | 8/1.7753e−8/0.63 |
| | | 10 | 8/1.6817e−11/0.66 | 8/2.2703e−11/0.59 | 8/3.2386e−11/0.60 | 8/4.0803e−11/0.64 |
| | | 100 | 26/9.3824e−13/2.51 | 46/2.6770e−11/3.70 | 26/8.5493e−9/2.09 | 26/2.8466e−10/2.15 |
| Discrete boundary value | 500 | −10 | 6/3.3487e−3/0.11 | 4/3.1555e−3/0.04 | 4/4.2613e−3/0.04 | 4/4.7919e−3/0.05 |
| | | −1 | 4/1.2234e−3/0.06 | 3/7.2588e−4/0.03 | 3/7.1034e−4/0.03 | 3/6.9394e−4/0.04 |
| | | 1 | 3/3.5633e−4/0.04 | 3/4.1479e−6/0.03 | 3/4.1402e−6/0.04 | 3/4.1324e−6/0.04 |
| | | 10 | 5/6.7290e−3/0.08 | 4/2.4328e−3/0.05 | 4/3.0758e−3/0.05 | 4/3.4218e−3/0.05 |
| | | 100 | 12/1.3651e−4/0.23 | 11/4.2188e−5/0.16 | 12/1.5591e−5/0.18 | 12/1.3814e−5/0.18 |
| | 1000 | −10 | 6/3.6656e−3/0.52 | 4/3.5180e−3/0.29 | 4/4.5349e−3/0.29 | 4/5.0429e−3/0.28 |
| | | −1 | 4/1.4253e−3/0.30 | 3/9.3230e−4/0.21 | 3/9.1090e−4/0.21 | 3/8.8813e−4/0.23 |
| | | 1 | 3/1.3022e−4/0.22 | 2/2.6311e−4/0.13 | 2/2.6303e−4/0.13 | 2/2.6296e−4/0.13 |
| | | 10 | 5/6.5900e−3/0.40 | 4/2.5604e−3/0.29 | 4/3.0932e−3/0.30 | 4/3.4004e−3/0.27 |
| | | 100 | 13/9.9458e−5/1.09 | 11/1.2743e−4/0.88 | 11/2.1884e−4/0.93 | 11/2.1536e−4/0.85 |
| Discrete integral equation | 500 | −10 | 12/1.2304e−5/1.06 | 12/1.2047e−5/1.03 | 12/1.2171e−5/1.05 | 12/1.2294e−5/1.04 |
| | | −1 | 9/1.5928e−5/0.76 | 9/1.0655e−5/0.75 | 9/1.2735e−5/0.76 | 9/1.4195e−5/0.76 |
| | | 1 | 7/1.3357e−5/0.59 | 7/1.0869e−5/0.57 | 7/1.0772e−5/0.58 | 7/1.0633e−5/0.57 |
| | | 10 | 10/9.3502e−6/0.86 | 9/4.8669e−6/0.75 | 9/1.3758e−5/0.76 | 10/9.7578e−6/0.84 |
| | | 100 | 10/4.5155e−9/0.91 | 10/4.5453e−9/0.89 | 10/4.5300e−9/0.90 | 10/4.5154e−9/0.88 |
| | 1000 | −10 | 12/1.7452e−5/4.50 | 12/1.7133e−5/4.41 | 12/1.7285e−5/4.46 | 12/1.7441e−5/4.47 |
| | | −1 | 10/6.0308e−6/3.71 | 9/1.5092e−5/3.25 | 9/1.8533e−5/3.27 | 10/5.2150e−6/3.66 |
| | | 1 | 8/5.1495e−6/2.86 | 7/1.5749e−5/2.48 | 7/1.5610e−5/2.50 | 7/1.5367e−5/2.62 |
| | | 10 | 10/1.4251e−5/3.73 | 9/8.5398e−6/3.25 | 10/5.1845e−6/3.62 | 10/1.4626e−5/3.71 |
| | | 100 | 10/6.3828e−9/3.86 | 10/6.4246e−9/3.76 | 10/6.4031e−9/3.79 | 10/6.3825e−9/3.77 |
| Broyden banded | 500 | −10 | 10/3.8446e−12/0.17 | 10/3.7950e−12/0.16 | 10/4.3814e−12/0.17 | 10/3.8105e−12/0.16 |
| | | −1 | 26/6.9212e−6/0.52 | 29/6.5177e−6/0.45 | 28/1.6459e−5/0.44 | 25/1.1907e−5/0.42 |
| | | 1 | 12/1.5063e−5/0.20 | 12/1.5059e−5/0.20 | 12/1.5061e−5/0.20 | 12/1.5063e−5/0.18 |
| | | 10 | 18/1.7636e−5/0.33 | 18/1.7636e−5/0.30 | 18/1.7636e−5/0.33 | 18/1.7636e−5/0.31 |
| | | 100 | 24/1.0280e−5/0.44 | 24/1.0280e−5/0.36 | 24/1.0280e−5/0.40 | 24/1.0280e−5/0.36 |
| | 1000 | −10 | 10/3.5499e−12/0.90 | 10/4.5936e−12/0.86 | 10/3.8100e−12/0.89 | 10/5.7143e−12/0.93 |
| | | −1 | 33/9.9927e−6/3.35 | 29/1.5374e−5/2.76 | 31/1.8408e−5/2.84 | 28/9.7968e−6/2.61 |
| | | 1 | 12/2.1201e−5/1.22 | 12/2.1194e−5/1.07 | 12/2.1198e−5/1.08 | 12/2.1201e−5/1.13 |
| | | 10 | 18/2.4886e−5/1.82 | 18/2.4886e−5/1.69 | 18/2.4886e−5/1.64 | 18/2.4886e−5/1.65 |
| | | 100 | 24/1.4499e−5/2.80 | 24/1.4499e−5/2.30 | 24/1.4499e−5/2.33 | 24/1.4499e−5/2.51 |
Figure 1. Performance profile of the AELM and ALLM algorithms based on the number of iterations for the ten test problems.
Figure 2. Performance profile of the AELM and ALLM algorithms based on CPU time for the ten test problems.

5. Conclusions

In this paper, inspired by the Hölderian local error bound condition, we studied the convergence properties of our ALLM algorithm under different conditions. We used a new modified adaptive LM parameter and incorporated the trust region and non-monotone techniques into the Levenberg–Marquardt algorithm. Theoretical analysis and numerical results show that the new algorithm is efficient and stable.

Author Contributions

Conceptualization, Y.H. and S.R.; methodology, Y.H. and S.R.; Software, Y.H.; validation, Y.H. and S.R.; formal analysis, Y.H. and S.R.; investigation, Y.H. and S.R.; resources, Y.H. and S.R.; data curation, Y.H. and S.R.; Writing—original draft, Y.H.; writing—review and editing, Y.H. and S.R.; visualization, Y.H.; supervision, S.R.; project administration, S.R.; funding acquisition, S.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Science Foundation of the Anhui Higher Education Institutions of China, 2023AH050348.

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors would like to thank S.R. and everyone for their valuable comments and suggestions, which helped us improve the quality of this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Leonov, E.; Polbin, A. Numerical Search for a Global Solution in a Two-Mode Economy Model with an Exhaustible Resource of Hydrocarbons. Math. Model. Comput. Simul. 2022, 14, 213–223.
  2. Xu, D.; Bai, Z.; Jin, X.; Yang, X.; Chen, S.; Zhou, M. A mean-variance portfolio optimization approach for high-renewable energy hub. Appl. Energy 2022, 325, 119888.
  3. Manzoor, Z.; Iqbal, M.S.; Hussain, S.; Ashraf, F.; Inc, M.; Tarar, M.A.; Momani, S. A study of propagation of the ultra-short femtosecond pulses in an optical fiber by using the extended generalized Riccati equation mapping method. Opt. Quantum Electron. 2023, 55, 717.
  4. Vu, D.T.S.; Gharbia, I.B.; Haddou, M.; Tran, Q.H. A new approach for solving nonlinear algebraic systems with complementarity conditions. Application to compositional multiphase equilibrium problems. Math. Comput. Simul. 2021, 190, 1243–1274.
  5. Maia, L.; Nornberg, G.; Pacella, F. A dynamical system approach to a class of radial weighted fully nonlinear equations. Commun. Partial Differ. Equ. 2021, 46, 573–610.
  6. Vasin, V.V.; Skorik, G.G. Two-stage method for solving systems of nonlinear equations and its applications to the inverse atmospheric sounding problem. Dokl. Math. 2020, 102, 367–370.
  7. Luo, X.-L.; Xiao, H.; Lv, J.-H. Continuation Newton methods with the residual trust-region time-stepping scheme for nonlinear equations. Numer. Algorithms 2022, 89, 223–247.
  8. Waziri, M.Y.; Ahmed, K. Two descent Dai-Yuan conjugate gradient methods for systems of monotone nonlinear equations. J. Sci. Comput. 2022, 90, 1–53.
  9. Pes, F.; Rodriguez, G. A doubly relaxed minimal-norm Gauss–Newton method for underdetermined nonlinear least-squares problems. Appl. Numer. Math. 2022, 171, 233–248.
  10. Levenberg, K. A method for the solution of certain non-linear problems in least squares. Q. Appl. Math. 1944, 2, 164–168.
  11. Marquardt, D.W. An algorithm for least-squares estimation of nonlinear parameters. J. Soc. Ind. Appl. Math. 1963, 11, 431–441.
  12. Nocedal, J.; Wright, S.J. Numerical Optimization; Springer: New York, NY, USA, 1999.
  13. Yamashita, N.; Fukushima, M. On the rate of convergence of the Levenberg-Marquardt method. In Topics in Numerical Analysis: With Special Emphasis on Nonlinear Problems; Springer: New York, NY, USA, 2001; pp. 239–249.
  14. Fan, J.Y.; Yuan, Y.X. On the quadratic convergence of the Levenberg-Marquardt method without nonsingularity assumption. Computing 2005, 74, 23–39.
  15. Fischer, A. Local behavior of an iterative framework for generalized equations with nonisolated solutions. Math. Program. 2002, 94, 91–124.
  16. Ma, C.; Jiang, L. Some research on Levenberg–Marquardt method for the nonlinear equations. Appl. Math. Comput. 2007, 184, 1032–1040.
  17. Fan, J.Y. A modified Levenberg-Marquardt algorithm for singular system of nonlinear equations. J. Comput. Math. 2003, 21, 625–636.
  18. Amini, K.; Rostami, F.; Caristi, G. An efficient Levenberg–Marquardt method with a new LM parameter for systems of nonlinear equations. Optimization 2018, 67, 637–650.
  19. Rezaeiparsa, Z.; Ashrafi, A. A new adaptive Levenberg–Marquardt parameter with a nonmonotone and trust region strategies for the system of nonlinear equations. Math. Sci. 2023, 1–13.
  20. Ahookhosh, M.; Aragón Artacho, F.J.; Fleming, R.M.; Vuong, P.T. Local convergence of the Levenberg–Marquardt method under Hölder metric subregularity. Adv. Comput. Math. 2019, 45, 2771–2806.
  21. Wang, H.Y.; Fan, J.Y. Convergence rate of the Levenberg-Marquardt method under Hölderian local error bound. Optim. Methods Softw. 2020, 35, 767–786.
  22. Zeng, M.; Zhou, G. Improved convergence results of an efficient Levenberg–Marquardt method for nonlinear equations. J. Appl. Math. Comput. 2022, 68, 3655–3671.
  23. Chen, L.; Ma, Y. A modified Levenberg–Marquardt method for solving system of nonlinear equations. J. Appl. Math. Comput. 2023, 69, 2019–2040.
  24. Li, R.; Cao, M.; Zhou, G. A New Adaptive Accelerated Levenberg–Marquardt Method for Solving Nonlinear Equations and Its Applications in Supply Chain Problems. Symmetry 2023, 15, 588.
  25. Grippo, L.; Lampariello, F.; Lucidi, S. A nonmonotone line search technique for Newton’s method. SIAM J. Numer. Anal. 1986, 23, 707–716.
  26. Ahookhosh, M.; Amini, K. A nonmonotone trust region method with adaptive radius for unconstrained optimization problems. Comput. Math. Appl. 2010, 60, 411–422.
  27. Ahookhosh, M.; Amini, K. An efficient nonmonotone trust-region method for unconstrained optimization. Numer. Algorithms 2012, 59, 523–540.
  28. Wang, P.; Zhu, D. A derivative-free affine scaling trust region methods based on probabilistic models with new nonmonotone line search technique for linear inequality constrained minimization without strict complementarity. Int. J. Comput. Math. 2019, 96, 663–691.
  29. Powell, M.J.D. Convergence properties of a class of minimization algorithms. In Nonlinear Programming 2; Elsevier: Amsterdam, The Netherlands, 1975; pp. 1–27.
  30. Behling, R.; Iusem, A. The effect of calmness on the solution set of systems of nonlinear equations. Math. Program. 2013, 137, 155–165.
  31. Stewart, G.; Sun, J. Matrix Perturbation Theory; Academic Press: San Diego, CA, USA, 1990.
  32. Schnabel, R.B.; Frank, P.D. Tensor methods for nonlinear equations. SIAM J. Numer. Anal. 1984, 21, 815–843.
  33. Moré, J.J.; Garbow, B.S.; Hillstrom, K.E. Testing unconstrained optimization software. ACM Trans. Math. Softw. 1981, 7, 17–41.
  34. Dolan, E.D.; Moré, J.J. Benchmarking optimization software with performance profiles. Math. Program. 2002, 91, 201–213.