1. Introduction
Consider the system of nonlinear equations

F(x) = 0, (1.1)

where F: Rⁿ → Rⁿ is a continuously differentiable function. In this paper, we assume that the solution set of (1.1) is nonempty and denote it by X∗. Moreover, we write the Jacobian F′(x) as J(x) and use the abbreviations Fk = F(xk), Jk = J(xk).
The Levenberg-Marquardt (LM) method is one of the most important algorithms for solving (1.1). At every iteration, the LM method computes the trial step dk by solving the linear system

(JkᵀJk + μkI)dk = −JkᵀFk, (1.2)

where μk is the LM parameter, which plays an important role in the analysis of the convergence rate of the LM method. For example, Yamashita and Fukushima [13] proved that the LM method taking μk = ‖Fk‖² converges quadratically under the local error bound condition, which is weaker than nonsingularity. Fan and Yuan [6] proved that the LM method taking μk = ‖Fk‖^δ with δ ∈ [1,2] still achieves quadratic convergence under the local error bound condition. More research on the LM method can be found in [1,11,14,15,16] and the references therein.
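To make the iteration concrete, the following minimal numpy sketch performs one exact LM step with the choice μk = ‖Fk‖² of [13]; the callables F and J and all names are illustrative, not taken from the original paper.

```python
import numpy as np

def lm_step(F, J, x):
    """One exact Levenberg-Marquardt step with mu_k = ||F_k||^2 as in [13].

    F and J are callables returning F(x) in R^n and the Jacobian J(x) in
    R^{n x n}; the trial step d_k solves (J_k^T J_k + mu_k I) d = -J_k^T F_k.
    """
    Fk = F(x)
    Jk = J(x)
    mu = np.linalg.norm(Fk) ** 2          # LM parameter mu_k = ||F_k||^2
    A = Jk.T @ Jk + mu * np.eye(x.size)   # positive definite when mu > 0
    d = np.linalg.solve(A, -Jk.T @ Fk)    # exact solve of the system (1.2)
    return x + d
```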
The LM method solves the linear system (1.2) exactly at every iteration, which may be very expensive for large-scale nonlinear equations. The inexact approach is one way to overcome this difficulty. In the inexact LM method, the direction dk is given by the solution of the perturbed system

(JkᵀJk + μkI)dk = −JkᵀFk + pk, (1.3)

where pk ∈ Rⁿ is a perturbation vector which measures how inexactly the linear system (1.2) is solved. Under the nonsingularity assumption, Facchinei and Kanzow [3] proved that if μk → 0 and ‖pk‖ = o(‖JkᵀFk‖), then the inexact LM method converges superlinearly, and that if μk = O(‖JkᵀFk‖) and ‖pk‖ = O(‖JkᵀFk‖²), then it converges quadratically. Suppose that

μk = ‖Fk‖^α, ‖pk‖ = ‖Fk‖^(α+θ) or μk = ‖JkᵀFk‖^α, ‖pk‖ = ‖JkᵀFk‖^(α+θ),

where α > 0 and θ > 0 are constants. Under the local error bound condition, many researchers (e.g., [2,4,5,7]) investigated the convergence rate of the inexact LM method for different values of α and θ. Recently, Wang and Fan [12] studied the convergence rate of the inexact LM method taking μk = ‖Fk‖^α with ‖pk‖ = ‖Fk‖^(α+θ), and μk = ‖JkᵀFk‖^α with ‖pk‖ = ‖JkᵀFk‖^(α+θ), under the Hölderian local error bound condition and the Hölderian continuity of the Jacobian, which are more general than the local error bound condition and the Lipschitz continuity of the Jacobian used in [1,2,4,5,7].
In this paper, we study the convergence rate of a family of inexact LM methods with more general LM parameters and perturbation vectors. We consider

μk = σ‖Fk‖^α + (1 − σ)‖JkᵀFk‖^α, (1.4)

‖pk‖ = τ‖Fk‖^(α+θ) + (1 − τ)‖JkᵀFk‖^(α+θ), (1.5)

where σ, τ ∈ [0,1] and α, θ > 0. We derive an explicit formula for the convergence order under the Hölderian local error bound condition and the Hölderian continuity of the Jacobian. Moreover, we develop a family of inexact LM methods with a nonmonotone line search and prove its global convergence. We also investigate the numerical performance of these inexact LM methods by solving the nonlinear equations arising from the linear complementarity problem.
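The next sketch illustrates how (1.3)-(1.5) can be realized in practice: the LM parameter is the convex combination (1.4), and the normal system is solved by a conjugate-gradient loop whose final residual plays the role of pk, stopped once the combination (1.5) is met. The analysis below takes ‖pk‖ equal to that combination; enforcing it only as an upper bound is our simplification.

```python
import numpy as np

def inexact_lm_step(Fk, Jk, sigma=0.5, tau=0.5, alpha=1.0, theta=1.0):
    """One inexact LM step: solve (J_k^T J_k + mu_k I) d = -J_k^T F_k + p_k
    with mu_k from (1.4), stopping CG once the residual (= -p_k) satisfies
    the combination from (1.5) as an upper bound."""
    g = Jk.T @ Fk                                        # J_k^T F_k
    mu = sigma * np.linalg.norm(Fk) ** alpha \
        + (1.0 - sigma) * np.linalg.norm(g) ** alpha     # (1.4)
    tol = tau * np.linalg.norm(Fk) ** (alpha + theta) \
        + (1.0 - tau) * np.linalg.norm(g) ** (alpha + theta)   # (1.5), as a cap
    A = Jk.T @ Jk + mu * np.eye(len(g))                  # SPD since mu > 0

    # Conjugate gradients on A d = -g; on exit p_k = A d + g = -r, so
    # ||p_k|| = ||r|| <= tol.
    d = np.zeros_like(g)
    r = -g.copy()
    p = r.copy()
    while np.linalg.norm(r) > tol:
        Ap = A @ p
        step = (r @ r) / (p @ Ap)
        d = d + step * p
        r_new = r - step * Ap
        p = r_new + ((r_new @ r_new) / (r @ r)) * p
        r = r_new
    return d, mu, np.linalg.norm(r)
```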
The organization of this paper is as follows. In the next section, we investigate the convergence order under the Hölderian local error bound condition and the Hölderian continuity of the Jacobian. In Section 3, we present a family of inexact LM methods with a nonmonotone line search and prove that it is globally convergent. In Section 4, we apply these inexact LM methods to the nonlinear equations arising from the linear complementarity problem and report some numerical results.
2. Convergence rate of the inexact LM methods
In this section, we study the convergence rate of the inexact LM methods with the iteration

xk+1 = xk + dk, (2.1)

where dk is obtained from (1.3) with μk, pk satisfying (1.4) and (1.5). We suppose that the generated sequence {xk} converges to the solution set X∗ and lies in some neighbourhood of x∗ ∈ X∗.
Assumption 2.1. (a) F(x) provides a Hölderian local error bound of order γ ∈ (0,1] on some neighbourhood of x∗ ∈ X∗, i.e., there exist constants κ > 0 and 0 < r < 1 such that

κ dist(x, X∗) ≤ ‖F(x)‖^γ, ∀x ∈ N(x∗, r). (2.2)

(b) J(x) is Hölderian continuous of order υ ∈ (0,1], i.e., there exists a constant L > 0 such that

‖J(y) − J(x)‖ ≤ L‖y − x‖^υ, ∀x, y ∈ N(x∗, r). (2.3)

Here N(x∗, r) := {x ∈ Rⁿ : ‖x − x∗‖ ≤ r}.
It is worth pointing out that if γ = υ = 1, then Assumption 2.1(a) is the local error bound condition and Assumption 2.1(b) is the Lipschitz continuity of the Jacobian. Moreover, by (2.3), we have (see [12])

‖F(y) − F(x) − J(x)(y − x)‖ ≤ (L/(1+υ))‖y − x‖^(1+υ), ∀x, y ∈ N(x∗, r). (2.4)

Furthermore, there exists a constant M > 0 such that

‖J(x)‖ ≤ M and ‖F(x) − F(y)‖ ≤ M‖x − y‖, ∀x, y ∈ N(x∗, r). (2.5)
In the following, we denote by x̄k a vector in X∗ that is closest to xk, i.e.,

‖x̄k − xk‖ = dist(xk, X∗). (2.6)
Now we suppose that the singular value decomposition (SVD) of J(x̄k) is

J(x̄k) = Ūk Σ̄k V̄kᵀ = Ūk,1 Σ̄k,1 V̄k,1ᵀ,

where Σ̄k,1 = diag(σ̄k,1, ..., σ̄k,r) with σ̄k,1 ≥ ⋯ ≥ σ̄k,r > 0. The corresponding SVD of Jk is

Jk = Uk Σk Vkᵀ = Uk,1 Σk,1 Vk,1ᵀ + Uk,2 Σk,2 Vk,2ᵀ,

where Σk,1 = diag(σk,1, ..., σk,r) with σk,1 ≥ ⋯ ≥ σk,r > 0 and Σk,2 = diag(σk,r+1, ..., σk,n) with σk,r+1 ≥ ⋯ ≥ σk,n ≥ 0. For simplicity, we omit the subscript k in Uk,i, Σk,i, Vk,i (i = 1, 2) and write J(x̄k) and Jk as

J(x̄k) = Ū1Σ̄1V̄1ᵀ, Jk = U1Σ1V1ᵀ + U2Σ2V2ᵀ.
By the matrix perturbation theory [8] and (2.3), we have

‖diag(Σ̄1 − Σ1, −Σ2)‖ ≤ ‖J(x̄k) − Jk‖ ≤ L‖x̄k − xk‖^υ,

which gives

‖Σ̄1 − Σ1‖ ≤ L‖x̄k − xk‖^υ and ‖Σ2‖ ≤ L‖x̄k − xk‖^υ. (2.7)
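The perturbation bound invoked here is Mirsky's theorem: the singular values of two matrices differ by no more than the spectral norm of their difference. A small numerical check, with random data standing in for Jk and J(x̄k), illustrates it:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((6, 6))            # stands in for J(x_bar_k)
E = 1e-3 * rng.standard_normal((6, 6))     # stands in for J_k - J(x_bar_k)

s_A = np.linalg.svd(A, compute_uv=False)
s_AE = np.linalg.svd(A + E, compute_uv=False)

# Mirsky/Weyl: max_i |sigma_i(A+E) - sigma_i(A)| <= ||E||_2. Combined with
# (2.3), the singular values of J_k and J(x_bar_k) differ by at most
# L*||x_bar_k - x_k||^v, which is the content of (2.7).
assert np.max(np.abs(s_AE - s_A)) <= np.linalg.norm(E, 2) + 1e-12
```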
Moreover, by (1.3) we have

dk = −V1(Σ1² + μkI)⁻¹Σ1U1ᵀFk − V2(Σ2² + μkI)⁻¹Σ2U2ᵀFk + (JkᵀJk + μkI)⁻¹pk, (2.8)

and

Fk + Jkdk = μkU1(Σ1² + μkI)⁻¹U1ᵀFk + μkU2(Σ2² + μkI)⁻¹U2ᵀFk + Jk(JkᵀJk + μkI)⁻¹pk. (2.9)
In the following, we suppose without loss of generality that xk lies in N(x∗, r/2).
Lemma 2.1. Under the conditions of Assumption 2.1, if υ > 2/γ − 2, then there exist positive constants a1, a2, b1, b2 such that

a1‖x̄k − xk‖^((2/γ−1)α) ≤ μk ≤ a2‖x̄k − xk‖^α, (2.10)

b1‖x̄k − xk‖^((2/γ−1)(α+θ)) ≤ ‖pk‖ ≤ b2‖x̄k − xk‖^(α+θ). (2.11)
Proof. Since ‖x̄k − x∗‖ ≤ ‖x̄k − xk‖ + ‖xk − x∗‖ ≤ 2‖xk − x∗‖ ≤ r, we have x̄k ∈ N(x∗, r). Hence, by (2.2) and (2.5),

κ^(1/γ)‖x̄k − xk‖^(1/γ) ≤ ‖Fk‖ ≤ M‖x̄k − xk‖. (2.12)

By (2.5), we have

‖JkᵀFk‖ ≤ M‖Fk‖ ≤ M²‖x̄k − xk‖. (2.13)

Let Tk := Fk − F(x̄k) − Jk(xk − x̄k). Then, by (2.4),

‖Tk‖ ≤ (L/(1+υ))‖x̄k − xk‖^(1+υ). (2.14)

It follows from (2.4), (2.12) and (2.14) that

‖JkᵀFk‖ ≥ ‖Fk‖(‖Fk‖ − ‖Tk‖)/‖x̄k − xk‖,

which together with υ > 2/γ − 2 gives

‖JkᵀFk‖ ≥ c̃‖x̄k − xk‖^(2/γ−1), (2.15)

where c̃ > 0 is some constant. By (2.13) and (2.15), we have

c̃‖x̄k − xk‖^(2/γ−1) ≤ ‖JkᵀFk‖ ≤ M²‖x̄k − xk‖. (2.16)

Since μk = σ‖Fk‖^α + (1−σ)‖JkᵀFk‖^α, by (2.12) and (2.16), we have

σκ^(α/γ)‖x̄k − xk‖^(α/γ) + (1−σ)c̃^α‖x̄k − xk‖^((2/γ−1)α) ≤ μk ≤ a2‖x̄k − xk‖^α,

where a1 := σκ^(α/γ) + (1−σ)c̃^α and a2 := σM^α + (1−σ)M^(2α), which together with 2/γ − 1 ≥ 1/γ gives

a1‖x̄k − xk‖^((2/γ−1)α) ≤ μk ≤ a2‖x̄k − xk‖^α.

This proves (2.10). Moreover, since ‖pk‖ = τ‖Fk‖^(α+θ) + (1−τ)‖JkᵀFk‖^(α+θ), according to (2.12) and (2.16), we have

τκ^((α+θ)/γ)‖x̄k − xk‖^((α+θ)/γ) + (1−τ)c̃^(α+θ)‖x̄k − xk‖^((2/γ−1)(α+θ)) ≤ ‖pk‖ ≤ b2‖x̄k − xk‖^(α+θ),

where b1 := τκ^((α+θ)/γ) + (1−τ)c̃^(α+θ) and b2 := τM^(α+θ) + (1−τ)M^(2(α+θ)), which together with 2/γ − 1 ≥ 1/γ gives

b1‖x̄k − xk‖^((2/γ−1)(α+θ)) ≤ ‖pk‖ ≤ b2‖x̄k − xk‖^(α+θ).

This proves (2.11). □
Lemma 2.2. Under the conditions of Assumption 2.1, if υ > 2/γ − 2, 0 < α < 2γ(1+υ)/(2−γ) and θ > (2−2γ)α/γ, then there exists a constant c > 0 such that
Proof. Let

d̄k := −(JkᵀJk + μkI)⁻¹JkᵀFk. (2.18)

Then d̄k is the LM step computed by solving (1.2) exactly. Moreover, by (1.3) we have

dk = d̄k + (JkᵀJk + μkI)⁻¹pk. (2.19)

Now we define

φk(d) := ‖Fk + Jkd‖² + μk‖d‖². (2.20)

Then the LM step d̄k defined by (2.18) is the minimizer of φk(d). By (2.4) and the left inequality in (2.10), we have
where c1 := (L/(1+υ))²a1⁻¹ + 1. Thus, by (2.19), (2.21), the left inequality in (2.10) and the right inequality in (2.11), we have
where c = √(c1 + b2/a1). □
Lemma 2.3. [12, Lemma 2.3] Under the conditions of Assumption 2.1, we have
(i) ‖U1U1ᵀFk‖ ≤ M‖x̄k − xk‖; (ii) ‖U2U2ᵀFk‖ ≤ 2L‖x̄k − xk‖^(1+υ).
Theorem 2.1. Under the conditions of Assumption 2.1, if υ > 2/γ − 2, 0 < α < 2γ(1+υ)/(2−γ) and θ > (2−2γ)α/γ, then the sequence {xk} converges to the solution set X∗ with the order h(α,θ,γ,υ), where
Proof. Since xk converges to X∗, we may assume that L‖x̄k − xk‖^υ ≤ σ̄/2 holds for all sufficiently large k. Then, it follows from (2.7) that
On the other hand, by (2.7) and the left inequality in (2.10), for all sufficiently large k,
Hence, it follows from (2.9)-(2.11), (2.24), (2.25), ‖(Σ2² + μkI)⁻¹‖ ≤ μk⁻¹ and Lemma 2.3 that
where c̄ := 4Ma2/σ̄² + 2L + (2/σ̄)b2 + La1⁻¹b2. Therefore, by (2.2), (2.4), (2.26) and Lemma 2.2, we have
where h(α,θ,γ,υ) is given in (2.23). □
Corollary 2.1. Under the conditions of Assumption 2.1 with γ = υ = 1, if 0 < α < 4, then the sequence {xk} converges to the solution set X∗ with the order h(α,θ), where
More precisely,
and
As is known from [5], for any α ∈ (0,2] and θ ≥ 1, the sequence generated by the inexact LM method (1.3) converges to some solution of (1.1), which is a stronger result than convergence to the solution set. We show that this convergence result also holds for the inexact LM methods studied in this paper.
Theorem 2.2. Under the conditions of Assumption 2.1 with γ = υ = 1, if α ∈ (0,2] and θ ≥ 1, then the sequence {xk} converges to a solution of (1.1) with the order g(α), where
Proof. By Corollary 2.1, when α∈(0,2] and θ≥1, it holds that
where g(α) is defined by (2.27). Then, for all sufficiently large k,
which together with g(α)>1 gives
Hence, we deduce from (2.17), (2.28) and (2.29) that
This implies that the sequence {xk} converges to some solution of (1.1) with the order g(α). □
3. Globally convergent inexact LM methods
In this section, we study a family of globally convergent inexact LM methods. We define the merit function ψ: Rⁿ → R as

ψ(x) := (1/2)‖F(x)‖². (3.1)

Obviously, ψ(x) is continuously differentiable at any x ∈ Rⁿ with ∇ψ(x) = J(x)ᵀF(x). Our method is described as follows.
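As a quick sanity check, the gradient formula ∇ψ(x) = J(x)ᵀF(x) can be verified against central finite differences; the sketch below assumes the least-squares form ψ(x) = ½‖F(x)‖² stated above, with illustrative callables F and J.

```python
import numpy as np

def grad_psi_check(F, J, x, h=1e-6):
    """Compare grad psi(x) = J(x)^T F(x) with central finite differences of
    the assumed merit psi(x) = 0.5*||F(x)||^2; returns the max deviation."""
    psi = lambda y: 0.5 * float(F(y) @ F(y))
    g = J(x).T @ F(x)
    g_fd = np.array([(psi(x + h * e) - psi(x - h * e)) / (2.0 * h)
                     for e in np.eye(x.size)])
    return float(np.max(np.abs(g - g_fd)))
```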
Algorithm 3.1. Choose parameters σ, τ ∈ [0,1], α ∈ (0,4], θ > 0, ρ, ξ, χ, δ, ζ ∈ (0,1) and x0 ∈ Rⁿ. Let Θ0 := ψ(x0). Set k := 0.
Step 1: If ‖∇ψ(xk)‖=0, then stop.
Step 2: Set

μk := σ‖Fk‖^α + (1 − σ)‖∇ψ(xk)‖^α, (3.2)

ηk := τ‖Fk‖^(α+θ) + (1 − τ)‖∇ψ(xk)‖^(α+θ). (3.3)

Find a search direction dk ∈ Rⁿ which satisfies

(JkᵀJk + μkI)dk = −∇ψ(xk) + pk, (3.4)

where

‖pk‖ ≤ min{ρ‖∇ψ(xk)‖, ηk}. (3.5)

If dk satisfies

‖F(xk + dk)‖ ≤ ξ‖F(xk)‖, (3.6)

then set λk := 1 and go to Step 4.
Step 3: If the descent condition

∇ψ(xk)ᵀdk ≤ −χ‖dk‖² (3.7)

is not satisfied, then set dk := −∇ψ(xk). Let lk be the smallest nonnegative integer l satisfying

ψ(xk + δ^l dk) ≤ Θk + ζδ^l∇ψ(xk)ᵀdk. (3.8)

Set λk := δ^(lk) and go to Step 4.
Step 4: Set xk+1 := xk + λkdk and update Θk+1 by (3.9).
Set k:=k+1 and go to Step 1.
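The following Python sketch assembles the steps above into a runnable routine. It is only a partial reconstruction: conditions quoted verbatim in the proofs ((3.4)-(3.8)) are used as stated, the linear system is solved exactly (so pk = 0 trivially satisfies (3.5)), and the update (3.9) for Θk, which we could not recover from the text, is replaced by an assumed convex-combination rule flagged in the comments.

```python
import numpy as np

def psi(Fx):
    # ASSUMED merit: psi(x) = 0.5*||F(x)||^2, consistent with grad psi = J^T F.
    return 0.5 * (Fx @ Fx)

def algorithm_3_1(F, J, x0, sigma=1.0, alpha=1.0,
                  rho=1e-3, xi=0.5, chi=1e-5, delta=0.8, zeta=1e-5,
                  eps=1e-5, max_iter=500):
    """Sketch of Algorithm 3.1 with an exact inner solve, so p_k = 0,
    which satisfies ||p_k|| <= rho*||grad psi(x_k)|| trivially."""
    x = np.asarray(x0, dtype=float)
    Fx = F(x)
    Theta = psi(Fx)                                   # Theta_0 := psi(x_0)
    for _ in range(max_iter):
        if np.linalg.norm(Fx) <= eps:                 # stopping rule of Section 4
            break
        Jx = J(x)
        grad = Jx.T @ Fx                              # grad psi(x_k) = J_k^T F_k
        if np.linalg.norm(grad) == 0.0:               # Step 1
            break
        # Step 2: LM parameter (3.2) and trial step from (3.4).
        mu = sigma * np.linalg.norm(Fx) ** alpha \
            + (1.0 - sigma) * np.linalg.norm(grad) ** alpha
        d = np.linalg.solve(Jx.T @ Jx + mu * np.eye(x.size), -grad)
        if np.linalg.norm(F(x + d)) <= xi * np.linalg.norm(Fx):   # (3.6)
            lam = 1.0
        else:
            # Step 3: descent condition (3.7), else fall back to -grad,
            # then the nonmonotone backtracking search (3.8).
            if grad @ d > -chi * (d @ d):
                d = -grad
            lam = 1.0
            while psi(F(x + lam * d)) > Theta + zeta * lam * (grad @ d):
                lam *= delta
        # Step 4. The update (3.9) is not recoverable from the text; ASSUMED:
        # Theta_{k+1} = (1 - zeta)*psi(x_{k+1}) + zeta*Theta_k, which keeps
        # psi(x_{k+1}) <= Theta_{k+1} as required by Theorem 3.1.
        x = x + lam * d
        Fx = F(x)
        Theta = (1.0 - zeta) * psi(Fx) + zeta * Theta
    return x
```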
Algorithm 3.1 is designed based on the inexact LM method of [2] and the nonmonotone smoothing Newton method of [10]. Its main feature is that it takes a more general LM parameter μk and perturbation vector pk, and adopts a nonmonotone line search technique to ensure global convergence.
Theorem 3.1. Algorithm 3.1 generates an infinite sequence {xk} which satisfies ψ(xk)≤Θk for all k≥0.
Proof. Suppose ψ(xk) ≤ Θk holds for some k. If ∇ψ(xk) ≠ 0, then F(xk) ≠ 0 and hence μk > 0. So the matrix JkᵀJk + μkI is positive definite and the search direction dk in Step 2 is always well defined. Notice that dk ≠ 0. In fact, if dk = 0, then by (3.4) we have ‖pk − ∇ψ(xk)‖ = 0. Since ‖pk‖ ≤ ρ‖∇ψ(xk)‖ with ρ ∈ (0,1), it follows that ‖∇ψ(xk)‖ = ‖pk‖ = 0, which contradicts ∇ψ(xk) ≠ 0. In Step 3, if the descent condition (3.7) holds, then ∇ψ(xk)ᵀdk ≤ −χ‖dk‖² < 0. Otherwise, dk = −∇ψ(xk), which gives ∇ψ(xk)ᵀdk = −‖∇ψ(xk)‖² < 0. Thus, the direction dk used in the line search (3.8) is always a descent direction of ψ. Next we show that there exists at least one nonnegative integer l satisfying (3.8). On the contrary, suppose that for all nonnegative integers l,

ψ(xk + δ^l dk) > Θk + ζδ^l∇ψ(xk)ᵀdk,

which together with ψ(xk) ≤ Θk gives

(ψ(xk + δ^l dk) − ψ(xk))/δ^l > ζ∇ψ(xk)ᵀdk. (3.10)

By letting l → ∞ in (3.10), we have ∇ψ(xk)ᵀdk ≥ ζ∇ψ(xk)ᵀdk, and hence ∇ψ(xk)ᵀdk ≥ 0, which contradicts ∇ψ(xk)ᵀdk < 0. So we can obtain xk+1 in Step 4. Now we show ψ(xk+1) ≤ Θk+1. In fact, if the condition (3.6) holds, then

ψ(xk+1) = (1/2)‖F(xk + dk)‖² ≤ ξ²ψ(xk) ≤ Θk.

Otherwise, by Step 3, we also have ψ(xk+1) ≤ Θk. Hence, from (3.9) it holds that ψ(xk+1) ≤ Θk+1. Therefore, we can conclude that if ψ(xk) ≤ Θk and ∇ψ(xk) ≠ 0 for some k, then xk+1 can be generated by Algorithm 3.1 with ψ(xk+1) ≤ Θk+1. Since ψ(x0) = Θ0, the theorem follows by induction on k. □
Theorem 3.2. Every accumulation point x∗ of a sequence {xk} generated by Algorithm 3.1 is a stationary point of ψ(x), i.e., ∇ψ(x∗)=0.
Proof. By Steps 2 and 3, we have ψ(xk+1) ≤ Θk for all k ≥ 0. This and (3.9) yield that the sequence {Θk} is nonincreasing and bounded below by zero. Thus there exists Θ∗ ≥ 0 such that limk→∞ Θk = Θ∗. Further, by (3.9) we have limk→∞ (Θk − ψ(xk+1)) = 0, and so

limk→∞ ψ(xk) = limk→∞ Θk = Θ∗.

So, if there are infinitely many k for which ‖F(xk + dk)‖ ≤ ξ‖F(xk)‖ holds, then √(2Θ∗) ≤ ξ√(2Θ∗), which together with ξ ∈ (0,1) yields Θ∗ = 0, i.e., limk→∞ F(xk) = 0. By continuity, we have the desired result. Next, we assume that there exists an index k̄ such that ‖F(xk + dk)‖ > ξ‖F(xk)‖ for all k ≥ k̄, i.e., λk is determined by (3.8) for all k ≥ k̄. Since x∗ is an accumulation point of {xk}, there exists a subsequence {xk}k∈K, where K ⊂ {0,1,...}, such that lim_{K∋k→∞} xk = x∗. We assume that ∇ψ(x∗) ≠ 0 and will derive a contradiction. Since ∇ψ(x∗) = J(x∗)ᵀF(x∗) ≠ 0, we have ‖J(x∗)‖ > 0 and ‖F(x∗)‖ > 0. Moreover, by continuity, we have

lim_{K∋k→∞} μk = σ‖F(x∗)‖^α + (1 − σ)‖∇ψ(x∗)‖^α =: μ∗.

Obviously, μ∗ > 0. So there exists a positive constant μ̄ such that μk ≥ μ̄ > 0 for all k ∈ K. Since {∇ψ(xk)} is bounded on any convergent subsequence {xk}k∈K, for any k ∈ K, either

‖dk‖ = ‖(JkᵀJk + μkI)⁻¹(pk − ∇ψ(xk))‖ ≤ (1 + ρ)‖∇ψ(xk)‖/μ̄ < ∞,

or ‖dk‖ = ‖−∇ψ(xk)‖ < ∞. Hence, the sequence {‖dk‖}k∈K is bounded. Passing to a further subsequence, we suppose lim_{K1∋k→∞} dk = d∗, where K1 ⊂ K is an infinite subset. In the following, we prove ∇ψ(x∗)ᵀd∗ = 0. By (3.8) we have

ψ(xk+1) ≤ Θk + ζλk∇ψ(xk)ᵀdk ≤ Θk − ζχλk‖dk‖².

This and limk→∞ ψ(xk) = limk→∞ Θk = Θ∗ yield limk→∞ λk‖dk‖ = 0. Hence, if λk ≥ λ̄ > 0 for all k ∈ K1, where λ̄ > 0 is a constant, then lim_{K1∋k→∞} dk = d∗ = 0 and hence ∇ψ(x∗)ᵀd∗ = 0. Otherwise, {λk}k∈K1 has a subsequence converging to zero and we suppose lim_{K2∋k→∞} λk = 0, where K2 ⊂ K1 is an infinite set. From (3.8), for all k ≥ k̄ and k ∈ K2,

ψ(xk + (λk/δ)dk) > Θk + ζ(λk/δ)∇ψ(xk)ᵀdk ≥ ψ(xk) + ζ(λk/δ)∇ψ(xk)ᵀdk,

which gives

(ψ(xk + (λk/δ)dk) − ψ(xk))/(λk/δ) > ζ∇ψ(xk)ᵀdk. (3.12)

Since ψ is continuously differentiable at x∗, letting k → ∞ with k ∈ K2 in (3.12), we obtain ∇ψ(x∗)ᵀd∗ ≥ ζ∇ψ(x∗)ᵀd∗, and hence ∇ψ(x∗)ᵀd∗ ≥ 0. On the other hand, since dk is a sufficient descent direction of ψ, we have ∇ψ(x∗)ᵀd∗ = lim_{K2∋k→∞} ∇ψ(xk)ᵀdk ≤ 0. Hence, we conclude that ∇ψ(x∗)ᵀd∗ = 0. Let K̄ := {k ∈ K1 | dk = −∇ψ(xk)}. If K̄ is an infinite set, then

0 = ∇ψ(x∗)ᵀd∗ = −lim_{K̄∋k→∞} ‖∇ψ(xk)‖² = −‖∇ψ(x∗)‖²,

which contradicts the assumption ∇ψ(x∗) ≠ 0. Otherwise, K̄ is a finite set and dk satisfies (3.7) for all sufficiently large k ∈ K1. Then, by (3.7) we have

0 = ∇ψ(x∗)ᵀd∗ ≤ −χ‖d∗‖²,

which gives d∗ = 0. By (3.4), we have for all k ∈ K1,

(JkᵀJk + μkI)dk = −∇ψ(xk) + pk. (3.13)

Since lim_{K1∋k→∞} (JkᵀJk + μkI) = J(x∗)ᵀJ(x∗) + μ∗I, by (3.13) and d∗ = 0, we have

lim_{K1∋k→∞} (pk − ∇ψ(xk)) = 0. (3.14)

Since ‖pk‖ ≤ ρ‖∇ψ(xk)‖, we can deduce from (3.14) that

(1 − ρ)‖∇ψ(x∗)‖ ≤ lim_{K1∋k→∞} ‖pk − ∇ψ(xk)‖ = 0,

which also contradicts the assumption ∇ψ(x∗) ≠ 0. This completes the proof. □
Next, we analyze the convergence rate of Algorithm 3.1. Suppose that the generated iteration sequence {xk} has an accumulation point x∗ such that F(x∗) = 0 and that Assumption 2.1 holds at x∗ with γ = υ = 1. We will show that the whole sequence {xk} converges to x∗ at least superlinearly for any α ∈ (0,2] and θ > 1.
Lemma 3.1. Assume that Assumption 2.1 holds with γ = υ = 1. Let α ∈ (0,2] and θ > 1. If xk, xk + dk ∈ N(x∗, r/2), then there exists ĉ > 0 such that

dist(xk + dk, X∗) ≤ ĉ dist(xk, X∗)^min{α/2+1, θ}. (3.15)
Proof. Since d̄k defined by (2.18) is the minimizer of φk(d) in (2.20), by (2.4) and (2.10),
It holds from (2.20) and (3.16) that
Since dk = d̄k + (JkᵀJk + μkI)⁻¹pk, we have from (2.5), (2.10), (2.11) and (3.17) that
where C̃ = √(L²/4 + a2) + Mb2a1⁻¹. Moreover, by (2.17), α ∈ (0,2] and θ > 1, we have
Thus, by (2.4), (2.17), (3.18) and (3.19), we have
which together with (2.2) gives
Letting ĉ := (C̃ + Lc²/2)κ⁻¹, we complete the proof. □
Lemma 3.2. Under the same conditions as in Lemma 3.1, there exists an index k̄ such that for all k ≥ k̄ it holds that: (i) xk, xk + dk ∈ N(x∗, r/2); (ii) ‖F(xk + dk)‖ ≤ ξ‖F(xk)‖.
Proof. By Lemma 3.1, the result can be proved similarly to the proof of [9, Lemma 11]. □
Theorem 3.3. Under the same conditions as in Lemma 3.1, the whole sequence {xk} converges to x∗ with

‖xk+1 − x∗‖ = O(‖xk − x∗‖^min{α/2+1, θ}).
Proof. By Lemma 3.2 and Step 2 of Algorithm 3.1, we have xk+1 = xk + dk and ‖F(xk+1)‖ ≤ ξ‖F(xk)‖ for all k ≥ k̄. It follows that limk→∞ ‖F(xk)‖ = 0, which together with (2.2) yields limk→∞ dist(xk, X∗) = 0. Thus, from Lemma 3.1, for all sufficiently large k,

dist(xk+1, X∗) ≤ ĉ dist(xk, X∗)^min{α/2+1, θ}. (3.20)

Since min{α/2+1, θ} > 1 and limk→∞ dist(xk, X∗) = 0, we have ĉ dist(xk, X∗)^(min{α/2+1, θ}−1) < 1/2 for all sufficiently large k. It follows that for all sufficiently large k,

dist(xk+1, X∗) ≤ (1/2) dist(xk, X∗). (3.21)

This implies that for all sufficiently large k,
that is,
By (3.19) and Lemma 3.1, for all sufficiently large k,
which together with limk→∞ dist(xk, X∗) = 0 gives limk→∞ dk = 0 and
By (3.20) and (3.22), for all sufficiently large k,
So, when k is sufficiently large, (3.23) gives
This and limk→∞ dk = 0 yield limk→∞ xk = x∗. Further, by (3.21) and (3.24) we have
The proof is complete. □
4. Numerical results
We apply Algorithm 3.1 to solve the nonlinear equations arising from the well-known linear complementarity problem (LCP): find v ∈ Rⁿ such that

v ≥ 0, Mv + q ≥ 0, vᵀ(Mv + q) = 0, (4.1)

where M ∈ R^(n×n) and q ∈ Rⁿ are a given matrix and vector. To reformulate the LCP as an equivalent system of equations, we define the function ϕ: R² → R as
where sgn(t) := 1 if t > 0, sgn(t) := 0 if t = 0, and sgn(t) := −1 if t < 0.
Proposition 4.1. (i) ϕ(a,b)=0⟺a≥0,b≥0,ab=0.
(ii) ϕ is continuously differentiable at any (a,b)∈R2 whose gradient is given by
Proof. Let f(t) := sgn(t)t². Since f is a bijective function, it follows that
The result (ii) holds because f(t) is continuously differentiable everywhere with f′(t) = 2|t|. □
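The display defining ϕ is not recoverable from this version of the text. The sketch below assumes the Mangasarian-type choice ϕ(a,b) = f(|a − b|) − f(a) − f(b) with f(t) = sgn(t)t², which matches the proof's use of f and satisfies both parts of Proposition 4.1; this specific form is our assumption, not necessarily the authors' formula.

```python
import numpy as np

def f(t):
    # f(t) = sgn(t)*t^2 = t*|t|; strictly increasing with f(0) = 0, f'(t) = 2|t|.
    return t * np.abs(t)

def phi(a, b):
    # ASSUMED Mangasarian-type form: phi(a,b) = f(|a-b|) - f(a) - f(b);
    # one checks phi(a,b) = 0  <=>  a >= 0, b >= 0, ab = 0 (Proposition 4.1(i)).
    return f(np.abs(a - b)) - f(a) - f(b)

def grad_phi(a, b):
    # Since f(|a-b|) = (a-b)^2, the partial derivatives are
    # d/da = 2(a-b) - 2|a| and d/db = 2(b-a) - 2|b|, both continuous,
    # in line with Proposition 4.1(ii).
    return 2.0 * (a - b) - 2.0 * np.abs(a), 2.0 * (b - a) - 2.0 * np.abs(b)
```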
Let x := (u, v). By using the function ϕ, solving the LCP is equivalent to computing a solution of the smooth nonlinear equations (4.3). In the following, we apply Algorithm 3.1 to solve the nonlinear equations (4.3). The parameters are chosen as ρ = 10⁻³, ξ = 0.5, χ = 10⁻⁵, ζ = 10⁻⁵, δ = 0.8, θ = 1, τ = 0.5, and σ, α are specified in the individual experiments. In Step 2, GMRES is used as the linear solver to find the inexact direction dk. Moreover, we use ‖F(xk)‖ ≤ 10⁻⁵ as the stopping criterion.
We test two classes of LCPs defined as follows:
LCP (i): Let M be the block diagonal matrix with N1ᵀN1/‖N1ᵀN1‖, ..., N4ᵀN4/‖N4ᵀN4‖ as diagonal blocks, i.e., M = diag(NiᵀNi/‖NiᵀNi‖) with Ni = rand(n/4, n/4) for i = 1,...,4. Take q = rand(n,1). Obviously, the matrix M is positive semidefinite.
LCP (ii): Let M = diag(Ni/‖Ni‖ − eye(n/4)) with Ni = rand(n/4, n/4) for i = 1,...,4. Take q = rand(n,1).
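The two test classes translate directly into numpy, where we read rand(·,·) as MATLAB's uniform (0,1) matrix and ‖·‖ as the spectral norm (both assumptions on our part). The function lcp_residual assembles an assumed form of the system (4.3) from the components described above, reusing phi and grad_phi from the previous sketch and algorithm_3_1 from Section 3.

```python
import numpy as np

def make_lcp(n, variant=1, seed=0):
    """Build M and q for LCP (i) (variant=1) or LCP (ii) (variant=2)."""
    rng = np.random.default_rng(seed)
    m = n // 4
    M = np.zeros((n, n))
    for i in range(4):
        N = rng.random((m, m))
        if variant == 1:
            B = N.T @ N
            B = B / np.linalg.norm(B, 2)               # N_i^T N_i / ||N_i^T N_i||
        else:
            B = N / np.linalg.norm(N, 2) - np.eye(m)   # N_i/||N_i|| - eye(n/4)
        M[i * m:(i + 1) * m, i * m:(i + 1) * m] = B
    return M, rng.random(n)

def lcp_residual(M, q, x):
    # ASSUMED form of (4.3): with x = (u, v), F stacks u - Mv - q and the
    # componentwise values phi(u_i, v_i), so a root satisfies u = Mv + q
    # together with u >= 0, v >= 0 and u_i v_i = 0.
    n = len(q)
    u, v = x[:n], x[n:]
    return np.concatenate([u - M @ v - q, phi(u, v)])

def lcp_jacobian(M, q, x):
    # Jacobian of the assumed residual, using grad_phi from the sketch above.
    n = len(q)
    u, v = x[:n], x[n:]
    da, db = grad_phi(u, v)
    return np.vstack([np.hstack([np.eye(n), -M]),
                      np.hstack([np.diag(da), np.diag(db)])])

# Initial point as in the experiments: v0 = (1,0,...,0)^T, u0 = M v0 + q.
n = 100
M, q = make_lcp(n, variant=1)
v0 = np.zeros(n); v0[0] = 1.0
x0 = np.concatenate([M @ v0 + q, v0])
sol = algorithm_3_1(lambda y: lcp_residual(M, q, y),
                    lambda y: lcp_jacobian(M, q, y), x0, sigma=1.0, alpha=1.0)
print(np.linalg.norm(lcp_residual(M, q, sol)))   # <= 1e-5 on success
```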
We use v0 = (1,0,...,0)ᵀ and u0 = Mv0 + q as the initial point. Tables 1 and 2 show the numerical results of Algorithm 3.1 with different values of σ and α, in which IT denotes the number of iterations, CPU denotes the CPU time in seconds, Fx denotes the value of ‖F(xk)‖ at the final iterate, and "–" indicates that the algorithm fails to find a solution. These numerical results show that Algorithm 3.1 is efficient for solving LCPs: it finds a solution meeting the desired accuracy in very few iterations and in a short CPU time. Moreover, our numerical experience indicates that Algorithm 3.1 with σ = 1, i.e., μk = ‖Fk‖^α, has an advantage over σ = 0, i.e., μk = ‖JkᵀFk‖^α. Finally, we point out that we tested Algorithm 3.1 with different values of τ and found that the numerical performance is essentially the same.
5. Conclusions
We have presented a family of inexact Levenberg-Marquardt methods for solving nonlinear equations. The presented LM methods take more general LM parameters and perturbation vectors, namely convex combinations of ‖Fk‖^α and ‖JkᵀFk‖^α, and of ‖Fk‖^(α+θ) and ‖JkᵀFk‖^(α+θ), respectively. Under the Hölderian local error bound condition and the Hölderian continuity of the Jacobian, we have derived an explicit formula for the convergence order of these inexact LM methods. Moreover, we have developed a family of globally convergent inexact LM methods and demonstrated its effectiveness through numerical experiments.
Use of AI tools declaration
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
Acknowledgments
This research was supported by the Natural Science Foundation of Henan Province (222300420520) and the Key Scientific Research Projects of Higher Education of Henan Province (22A110020).
Conflict of interest
All authors declare no conflicts of interest in this paper.