Research article

A two-step Ulm-Chebyshev-like Cayley transform method for inverse eigenvalue problems with multiple eigenvalues

  • Received: 27 May 2024 Revised: 12 July 2024 Accepted: 15 July 2024 Published: 25 July 2024
  • MSC : 15A18, 65F15, 65F18

  • Our focus in this study was on examining the convergence of a novel method, inspired by the Ulm-Chebyshev-like Cayley transform method, designed to solve inverse eigenvalue problems (IEPs) with multiple eigenvalues. Compared with other existing methods, the proposed method has a higher convergence order and/or requires fewer operations. Under the assumption that the relative generalized Jacobian matrices at a solution are nonsingular, the proposed method was proved to be cubically convergent. Experimental findings demonstrated the practicality and efficiency of the suggested approach.

    Citation: Wei Ma, Zhenhao Li, Yuxin Zhang. A two-step Ulm-Chebyshev-like Cayley transform method for inverse eigenvalue problems with multiple eigenvalues[J]. AIMS Mathematics, 2024, 9(8): 22986-23011. doi: 10.3934/math.20241117




    Let A_0, A_1,\ldots,A_n be n+1 real symmetric n -by- n matrices. For any {\bf c} = (c_1,c_2,\ldots,c_n)^T\in\mathbb{R}^n , let \{\lambda_i(A( {\bf c}))\}_{i = 1}^{n} denote the eigenvalues of the matrix

    A( {\bf c}) := A_0+\sum\limits_{i = 1}^{n}c_iA_i (1.1)

    arranged in the order \lambda_1(A( {\bf c}))\leq\lambda_2(A( {\bf c}))\leq\cdots\leq\lambda_n(A( {\bf c})) . In this note, the inverse eigenvalue problem (IEP) is defined as follows: given n real numbers \{\lambda_i^*\}_{i = 1}^{n} with the order \lambda_1^*\leq\lambda_2^*\leq\cdots\leq\lambda_n^* , find a vector {\bf c}^*\in\mathbb{R}^n such that

    \lambda_i(A( {\bf c}^*)) = \lambda_i^*,\quad i = 1,\ldots,n. (1.2)

    The IEP is utilized in a wide range of fields including the inverse Toeplitz eigenvalue problem [1,2,3], structural dynamics [4], molecular spectroscopy [5], the pole assignment problem [6], the inverse Sturm-Liouville problem [7], and also problems in mechanics applications [8,9], structural integrity assessments [10], geophysical studies [11], particle physics research [12], numerical analysis [13], and dynamical systems [14]. For further insights into the diverse practical uses, underlying mathematical principles, and computational techniques of general IEPs, readers may consult the comprehensive review articles [15,16] and the relevant literature [17,18].

    The IEP (1.2) can be represented mathematically through a set of nonlinear equations:

    {\bf f}( {\bf c}) := (\lambda_1(A( {\bf c}))-\lambda_1^*,\ \lambda_2(A( {\bf c}))-\lambda_2^*,\ \ldots,\ \lambda_n(A( {\bf c}))-\lambda_n^*)^T = 0. (1.3)
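    To make the setting concrete, the following is a minimal MATLAB sketch (MATLAB is also the environment used for the experiments in the final section) that assembles A( {\bf c}) from (1.1) and evaluates the residual {\bf f}( {\bf c}) of (1.3). The function name and the cell-array storage of the basis matrices are our own illustrative choices, not the authors' code.

```matlab
% Minimal sketch: assemble A(c) of (1.1) and evaluate f(c) of (1.3).
% Acell = {A0, A1, ..., An} stores the n+1 symmetric basis matrices;
% lamstar holds the prescribed eigenvalues in ascending order.
function F = iep_residual(c, Acell, lamstar)
    n = numel(c);
    Ac = Acell{1};                     % A_0
    for i = 1:n
        Ac = Ac + c(i)*Acell{i+1};     % A(c) = A_0 + sum_i c_i A_i
    end
    lam = sort(eig((Ac+Ac')/2));       % eigenvalues, ascending order
    F = lam - lamstar(:);              % f(c); a solution gives F = 0
end
```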

    In situations where the given eigenvalues are distinct, i.e.,

    \lambda_1^*<\lambda_2^*<\cdots<\lambda_n^*. (1.4)

    Newton's method can be applied to the nonlinear equation (1.3) under (1.4). However, as noted in [19,20,21], Newton's method has the following two disadvantages: (ⅰ) it requires computing the exact eigenvectors at each iteration; (ⅱ) it requires solving a Jacobian equation at each iteration. These two facts make it inefficient from the point of view of numerical computation, especially when the problem size n is large. Thus, efforts have focused on avoiding these disadvantages: (ⅰ) the Newton-like method proposed in [22,23] computes approximate eigenvectors instead of the exact ones, and the quadratic convergence rate of this type of Newton-like method was re-proved in [24]; to alleviate the over-solving problem, Chen et al. proposed in [25] an inexact Newton-like method, which stops the inner iterations before convergence. (ⅱ) Shen et al. proposed in [26,27] an Ulm-type method, which avoids solving an approximate Jacobian equation at each outer iteration and hence reduces the instability caused by the possible ill-conditioning in solving an approximate Jacobian equation.

    Note that all of the methods mentioned above are quadratically convergent. In order to speed up the convergence rate, Chen et al. [28] proposed a super-quadratically convergent two-step Newton-type method in which the approximate Jacobian equations are solved by inexact methods. In view of this difficulty, Wen et al. proposed, in [29], a two-step inexact Newton-Chebyshev-like method with cubic root-convergence rate, in which the approximate eigenvectors are obtained by applying the one-step inverse power method, and solving the approximate Jacobian equations is avoided by using the Chebyshev method to approximate the inverse of the Jacobian matrix. In 2022, Wei Ma designed a two-step Ulm-Chebyshev-like Cayley transform method [30], which utilizes a Cayley transform to find the approximate eigenvectors. However, the convergence analyses of the above methods become ineffective in the absence of distinct eigenvalues, due to the breakdown of the differentiability of {\bf f} and of the continuity of the eigenvectors at multiple eigenvalues [22]. When multiple eigenvalues are present, the numerical methods in the references mentioned above that extend to this case are only quadratically convergent.

    In this paper, motivated by [30], we propose a two-step Ulm-Chebyshev-like Cayley transform method for solving the IEP (1.2), and we analyze the performance of the new method in the presence of multiple eigenvalues. Under an assumption similar to the one used in a previous study, by using the Rayleigh quotient as an approximate eigenvalue of a symmetric matrix together with estimates of the eigenvalues, eigenvectors, and the relative generalized Jacobian, we show that the proposed method is still cubically convergent. Numerical experiments show the efficiency of our method, and comparisons with some known methods are made.

    The structure of this paper is as follows. We give some notations and preliminary results of the relative generalized Jacobian and some useful lemmas in Section 2. A novel method, the two-step Ulm-Chebyshev-like Cayley transform approach, is introduced in Section 3 and our main convergence theorems are established for the new method in Section 4. Experimental results are presented in the final section.

    Let n be a positive integer. Let \mathbb{R}^n represent an n -dimensional Euclidean space, S be a subset of \mathbb{R}^n , and clS represent the closure of S . We use B( {\bf x},\delta_1) to represent the open ball of \mathbb{R}^n with center {\bf x}\in\mathbb{R}^n and radius \delta_1>0 . Let \|\cdot\| and \|\cdot\|_F represent the Euclidean vector norm (or its corresponding induced matrix norm) and the Frobenius norm, respectively. I is the identity matrix of appropriate dimensions. Then, by (2.3.7) in [31], we have

    \|A\|\leq\|A\|_F\leq\sqrt{n}\,\|A\|,\quad\text{for each } A\in\mathbb{R}^{n\times n}.
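    A quick numerical illustration of this norm inequality (a sanity check of our own, not part of the analysis):

```matlab
% Illustrate ||A|| <= ||A||_F <= sqrt(n)*||A|| on a random matrix.
n = 5; A = randn(n);
fprintf('%.4f <= %.4f <= %.4f\n', norm(A), norm(A,'fro'), sqrt(n)*norm(A));
```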

    We define

    K = 6\beta^2\|\boldsymbol\lambda^*\|,\quad N = \sqrt{n^2-t^2}\max\limits_{t\leq i\leq n-1}\frac{1}{\lambda_{i+1}^*-\lambda_i^*},\quad H_1 = \frac{8n^{\frac{3}{2}}\xi\beta\rho_0\max\limits_{1\leq j\leq n}\|A_j\|}{1-(\delta/\tau)^{2}}, (2.1)
    C = \max\Big\{2+2\beta+12N\beta\|\boldsymbol\lambda^*\|,\ 2N\max\limits_{1\leq j\leq n}\|A_j\|\Big\},\quad \rho = \max\Big\{2\sqrt{n}\Big(2\beta+\beta^2+2NK+\frac{1}{2}\beta C\Big),\ 3\sqrt{n}C\Big\}, (2.2)
    \alpha_1 = 8n^{\frac{3}{2}}\beta\rho_0\max\limits_{1\leq j\leq n}\|A_j\|,\quad \gamma = \frac{\xi}{1-H_1\delta},\quad \alpha_2 = 1+8n^{\frac{3}{2}}\gamma K\rho_0^2, (2.3)
    \alpha_3 = \rho(\alpha_2+4n\rho_0^2),\quad \alpha_4 = \alpha_3+\rho_0\alpha_2,\quad \alpha_6 = 1+2\gamma\alpha_1,\quad \alpha_5 = 6\sqrt{n}\beta^2\Big(\sqrt{n}\max\limits_{1\leq j\leq n}\|A_j\|+K\sqrt{n}+\|\boldsymbol\lambda^*\|\Big)\alpha_3^2,\quad \alpha_7 = 2\gamma\alpha_5+\alpha_2\alpha_6,\quad \delta_2 = \min\Big\{\epsilon_0,\ \frac{1}{\rho},\ \frac{1}{\beta}\Big\}, (2.4)
    \tau = \min\Big\{1,\ \frac{1}{\alpha_7},\ \frac{\sqrt{n}\rho_0}{\rho_3(\alpha_7+\alpha_3^2)},\ \frac{1}{(1+2\gamma\alpha_1)^3},\ \frac{2}{H_1}\Big\} (2.5)

    and

    0<\delta = \min\Big\{\mu,\ \delta_0,\ \delta_2,\ \frac{\tau}{2},\ \frac{\delta_2}{2\sqrt{n}\rho_0},\ \frac{\delta_2}{\alpha_2},\ \frac{\delta_2}{\alpha_3},\ \frac{\delta_2}{\alpha_7},\ \frac{1}{\gamma\alpha_1}\Big\}, (2.6)

    where \delta_0 and \rho_0 are defined in Lemma 2.1, \beta and \epsilon_0 are defined in Lemma 2.2, and \boldsymbol\lambda^* is defined in (2.9).

    Consider a locally Lipschitz continuous function h:\mathbb{R}^n\to\mathbb{R}^m . The Jacobian of h , whenever it exists, is denoted by h' , and D_h represents the set of differentiable points of h . Moreover, the B-differential of h at {\bf x}\in\mathbb{R}^n is defined according to [32] by

    \partial_B h( {\bf x}) := \{U\in\mathbb{R}^{m\times n}\ |\ U = \lim\limits_{ {\bf x}^k\to {\bf x}}h'( {\bf x}^k),\ {\bf x}^k\in D_h\}.

    Considering the composite nonsmooth function h := \varphi\circ\psi , in which \varphi:\mathbb{R}^t\to\mathbb{R}^m is nonsmooth and \psi:\mathbb{R}^n\to\mathbb{R}^t is continuously differentiable, the generalized Jacobian \partial_Q h( {\bf x}) [33] and the relative generalized Jacobian \partial_{Q|S} h( {\bf x}) [34] are respectively defined by

    \partial_Q h( {\bf x}) := \partial_B\varphi(\psi( {\bf x}))\,\psi'( {\bf x})

    and

    \partial_{Q|S} h( {\bf x}) := \{U\ |\ U \text{ is a limit of } U_i\in\partial_Q h( {\bf y}^i),\ {\bf y}^i\in S,\ {\bf y}^i\to {\bf x}\}.

    For {\bf c}\in\mathbb{R}^n , write

    \Lambda( {\bf c}) := {\rm diag}(\lambda_1( {\bf c}),\ldots,\lambda_n( {\bf c})),

    and define

    \mathcal{Q}( {\bf c}) := \{Q( {\bf c})\ |\ Q( {\bf c})^TQ( {\bf c}) = I \text{ and } Q( {\bf c})^TA( {\bf c})Q( {\bf c}) = \Lambda( {\bf c})\}. (2.7)

    By (1.3) and applying the concept of a generalized Jacobian to {\bf f} , we have [34]

    \partial_Q {\bf f}( {\bf c}) = \{J\ |\ [J]_{ij} = {\bf q}_i( {\bf c})^TA_j {\bf q}_i( {\bf c}), \text{ where } [ {\bf q}_1( {\bf c}),\ldots, {\bf q}_n( {\bf c})]\in\mathcal{Q}( {\bf c})\}.

    In particular, if \partial_Q {\bf f}( {\bf c}) is a singleton, we write \partial_Q {\bf f}( {\bf c}) = \{J( {\bf c})\} . Let

    S := \{ {\bf c}\in\mathbb{R}^n\ |\ A( {\bf c}) \text{ has distinct eigenvalues}\}.

    Then, for each {\bf c}\in S , {\bf f} is continuously differentiable at {\bf c} and

    \partial_Q {\bf f}( {\bf c}) = \{J( {\bf c})\} = \{ {\bf f}'( {\bf c})\}.

    Thus, we get the following relative generalized Jacobian [34]:

    \partial_{Q|S} {\bf f}( {\bf c}) = \{J\ |\ J = \lim\limits_{k\to+\infty}J( {\bf y}^k) \text{ with } \{ {\bf y}^k\}\subset S \text{ and } {\bf y}^k\to {\bf c}\}.
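    In computations, an element of \partial_Q {\bf f}( {\bf c}) is obtained by fixing one orthonormal eigenbasis of A( {\bf c}) and forming [J]_{ij} = {\bf q}_i( {\bf c})^TA_j {\bf q}_i( {\bf c}) . The following MATLAB sketch (our own illustration, with our own function name) does exactly this; note that, at a point with multiple eigenvalues, different admissible eigenbases yield different elements of \partial_Q {\bf f}( {\bf c}) .

```matlab
% Sketch: form one element J of the generalized Jacobian at c.
% Acell = {A0, A1, ..., An}; the columns of Q are one orthonormal eigenbasis.
function J = generalized_jacobian(c, Acell)
    n = numel(c);
    Ac = Acell{1};
    for i = 1:n, Ac = Ac + c(i)*Acell{i+1}; end
    [Q, ~] = eig((Ac+Ac')/2);        % one admissible choice in Q(c)
    J = zeros(n);
    for i = 1:n
        for j = 1:n
            J(i,j) = Q(:,i)' * Acell{j+1} * Q(:,i);  % [J]_ij = q_i' A_j q_i
        end
    end
end
```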

    Throughout this paper, let the given eigenvalues \{\lambda_i^*\}_{i = 1}^{n} satisfy \lambda_1^*\leq\lambda_2^*\leq\cdots\leq\lambda_n^* . For simplicity, without loss of generality, we assume that

    \lambda_1^* = \lambda_2^* = \cdots = \lambda_t^*<\lambda_{t+1}^*<\cdots<\lambda_n^*, (2.8)

    where 1\leq t\leq n . Write

    \Lambda^* = {\rm diag}(\lambda_1^*,\lambda_2^*,\ldots,\lambda_n^*)\quad\text{and}\quad\boldsymbol\lambda^* = (\lambda_1^*,\lambda_2^*,\ldots,\lambda_n^*)^T. (2.9)

    Then a solution of the IEP (1.2) can be written in terms of {\bf c}^* and Q( {\bf c}^*) as

    Q( {\bf c}^*)^TA( {\bf c}^*)Q( {\bf c}^*) = \Lambda^*, (2.10)

    where Q( {\bf c}^*) is an orthogonal matrix. Recall that \mathcal{Q}( {\bf c}^*) is defined by (2.7). Let Q( {\bf c}^*)\in\mathcal{Q}( {\bf c}^*) and write Q( {\bf c}^*) = [Q^{(1)}( {\bf c}^*),Q^{(2)}( {\bf c}^*)] , in which Q^{(1)}( {\bf c}^*)\in\mathbb{R}^{n\times t} and Q^{(2)}( {\bf c}^*)\in\mathbb{R}^{n\times(n-t)} . Let {\bf c}^* be the solution of the IEP with (2.8). Define

    \Pi = Q^{(1)}( {\bf c}^*)Q^{(1)}( {\bf c}^*)^T.

    Clearly, \Pi is the eigenprojection of A( {\bf c}^*) corresponding to \lambda_1^* in (2.8). Given an orthogonal matrix P = [P^{(1)},P^{(2)}] , where P^{(1)}\in\mathbb{R}^{n\times t} and P^{(2)}\in\mathbb{R}^{n\times(n-t)} , we obtain the QR factorization of \Pi P^{(1)} by

    \Pi P^{(1)} = \tilde{Q}^{(1)}( {\bf c}^*)R, (2.11)

    where R is a t\times t nonsingular upper triangular matrix and \tilde{Q}^{(1)}( {\bf c}^*) is an n\times t matrix whose columns are orthonormal. Let

    \tilde{Q}( {\bf c}^*) := [\tilde{Q}^{(1)}( {\bf c}^*),Q^{(2)}( {\bf c}^*)]. (2.12)

    Obviously, \tilde{Q}( {\bf c}^*)\in\mathcal{Q}( {\bf c}^*) . Moreover, we define the error matrix

    E := [P^{(1)}-\Pi P^{(1)},\ P^{(2)}-Q^{(2)}( {\bf c}^*)].
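    The construction (2.11)-(2.12) is straightforward to realize numerically. Below is a hedged MATLAB sketch, assuming the blocks Q1 (= Q^{(1)}( {\bf c}^*) ), Q2star (= Q^{(2)}( {\bf c}^*) ), P1, and P2 are already available; the variable names are our own.

```matlab
% Sketch of (2.11)-(2.12): eigenprojection, QR factor, and error matrix E.
Pi = Q1*Q1';                    % eigenprojection for the multiple eigenvalue
[Qt1, R] = qr(Pi*P1, 0);        % economy-size QR: Pi*P1 = Qt1*R (Qt1 is n x t)
Qtilde = [Qt1, Q2star];         % tilde{Q}(c*), an element of Q(c*)
E = [P1 - Pi*P1, P2 - Q2star];  % error matrix measuring the quality of P
```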

    Now, we state the following two lemmas, which are useful for our proof.

    Lemma 2.1. [35,36] Let {\bf c}^*\in\mathbb{R}^n and let the eigenvalues of the matrix A( {\bf c}^*) satisfy (2.8). Then, there exist two positive numbers \delta_0 and \rho_0 such that, for each {\bf c}\in B( {\bf c}^*,\delta_0) and [Q^{(1)}( {\bf c}),\ Q^{(2)}( {\bf c})]\in\mathcal{Q}( {\bf c}) , we get

    \|A( {\bf c})-A( {\bf c}^*)\|\leq\max\limits_{1\leq j\leq n}\|A_j\|\,\| {\bf c}- {\bf c}^*\|,
    \|\Lambda( {\bf c})-\Lambda^*\|\leq\rho_0\| {\bf c}- {\bf c}^*\|,
    \|Q^{(2)}( {\bf c})-Q^{(2)}( {\bf c}^*)\|\leq\rho_0\| {\bf c}- {\bf c}^*\|

    and

    \|(I-\Pi)Q^{(1)}( {\bf c})\|\leq\rho_0\| {\bf c}- {\bf c}^*\|.

    Lemma 2.2. There exist two positive numbers \epsilon_0 and \beta such that, for any orthogonal matrix P = [P^{(1)},P^{(2)}] , if E = [P^{(1)}-\Pi P^{(1)},\ P^{(2)}-Q^{(2)}( {\bf c}^*)] satisfies \|E\|\leq\epsilon_0 and the skew-symmetric matrix X is defined by e^{X} = P^T\tilde{Q}( {\bf c}^*) , then we get

    \|X\|_F\leq\beta\|E\|\quad\text{and}\quad\|X^{(11)}\|_F\leq\beta\|E\|^2, (2.13)

    in which X^{(11)} is the t -by- t leading block of X . Moreover, if \|X\|_F\leq1 , then

    \Big\|\sum\limits_{l = 2}^{\infty}\frac{X^{l-2}}{l!}\Big\|_F\leq1,\quad \Big\|\sum\limits_{l = 2}^{\infty}\frac{(-X)^{l-2}}{l!}\Big\|_F\leq1,\quad \Big\|\sum\limits_{l = 1}^{\infty}\frac{(-X)^{l-1}}{l!}\Big\|_F\leq2,\quad\text{and}\quad \Big\|\sum\limits_{l = 0}^{\infty}\frac{(-X)^{l}}{l!}\Big\|_F\leq3. (2.14)

    Proof. (2.13) can be found in [22,35,36]. Noting that

    \sum\limits_{l = 2}^{\infty}\frac{1}{l!}\leq\sum\limits_{l = 2}^{\infty}\frac{1}{l(l-1)} = 1,

    if \|X\|_F\leq1 , we get

    \Big\|\sum\limits_{l = 2}^{\infty}\frac{X^{l-2}}{l!}\Big\|_F\leq1\quad\text{and}\quad\Big\|\sum\limits_{l = 2}^{\infty}\frac{(-X)^{l-2}}{l!}\Big\|_F\leq1,

    which implies that

    \Big\|\sum\limits_{l = 1}^{\infty}\frac{(-X)^{l-1}}{l!}\Big\|_F\leq2\quad\text{and}\quad\Big\|\sum\limits_{l = 0}^{\infty}\frac{(-X)^{l}}{l!}\Big\|_F\leq3.

    We first recall the given eigenvalues in (2.8). Suppose that P_k is the current estimate of Q( {\bf c}^*) and Y_k is a skew-symmetric matrix, i.e., Y_k^T = -Y_k . Let us write Q( {\bf c}^*) = P_ke^{Y_k} . Then, by using the Taylor series of the exponential function, we can express (2.10) as

    P_k^TA( {\bf c}^*)P_k = e^{Y_k}\Lambda^*e^{-Y_k} = \Big(I+Y_k+\frac{1}{2}Y_k^2+\cdots\Big)\Lambda^*\Big(I-Y_k+\frac{1}{2}Y_k^2-\cdots\Big).

    The vector {\bf c}^k is updated to {\bf c}^{k+1} by neglecting the second-order terms in Y_k in the above equality:

    P_k^TA( {\bf c}^{k+1})P_k = \Lambda^*+Y_k\Lambda^*-\Lambda^*Y_k. (3.1)

    We obtain {\bf c}^{k+1} by equating the diagonal elements in (3.1):

    J_k {\bf c}^{k+1} = \boldsymbol\lambda^*- {\bf b}^k,

    in which J_k and {\bf b}^k are defined by

    [J_k]_{ij} = ( {\bf p}_i^k)^TA_j {\bf p}_i^k,\quad 1\leq i,j\leq n,\quad\text{and}\quad [ {\bf b}^k]_i = ( {\bf p}_i^k)^TA_0 {\bf p}_i^k,\quad 1\leq i\leq n.

    On the other hand, equating the off-diagonal elements in (3.1) gives

    ( {\bf p}_i^k)^TA( {\bf c}^{k+1}) {\bf p}_j^k = [Y_k]_{ij}(\lambda_j^*-\lambda_i^*),\quad\text{for each } i,j\in[1,n] \text{ and } i\neq j, (3.2)

    and, recalling that the given eigenvalues satisfy (2.8), we obtain the skew-symmetric matrix Y_k as

    [Y_k]_{ij} = \begin{cases} 0, & \text{for each } 1\leq i,j\leq t, \text{ or } i = j;\\ \dfrac{( {\bf p}_i^k)^TA( {\bf c}^{k+1}) {\bf p}_j^k}{\lambda_j^*-\lambda_i^*}, & \text{for each } t+1\leq i\leq n \text{ or } t+1\leq j\leq n, \text{ and } i\neq j. \end{cases} (3.3)

    Furthermore, by using the Cayley transform, we calculate the matrix P_{k+1} as

    P_{k+1} = P_k\Big(I+\frac{1}{2}Y_k\Big)\Big(I-\frac{1}{2}Y_k\Big)^{-1}. (3.4)
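    The Cayley transform maps skew-symmetric matrices to orthogonal matrices, which is why (3.4) preserves the orthogonality of the iterates. A quick MATLAB check of this fact (our own illustration):

```matlab
% For skew-symmetric Y, (I + Y/2)(I - Y/2)^{-1} is orthogonal, so (3.4)
% keeps P_{k+1} orthogonal whenever P_k is.
n = 6; S = randn(n); Y = S - S';          % random skew-symmetric matrix
Cay = (eye(n) + Y/2) / (eye(n) - Y/2);    % Cayley transform
fprintf('||Cay''*Cay - I|| = %.2e\n', norm(Cay'*Cay - eye(n)));
```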

    Finally, by (3.3), (3.4), and the two-step Ulm-Chebyshev iterative procedure [30], we can propose the following two-step Ulm-Chebyshev-like Cayley transform method for solving the IEP with multiple eigenvalues.

    Algorithm Ⅰ: The two-step Ulm-Chebyshev-like Cayley transform method

    Step 1. Given {\bf c}^0\in\mathbb{R}^n , calculate the orthogonal eigenvectors \{ {\bf q}_i( {\bf c}^0)\}_{i = 1}^{n} of A( {\bf c}^0) . Let

    P_0 = [ {\bf p}_1^0, {\bf p}_2^0,\ldots, {\bf p}_n^0] = [ {\bf q}_1( {\bf c}^0), {\bf q}_2( {\bf c}^0),\ldots, {\bf q}_n( {\bf c}^0)],

    and let J_0 = J( {\bf c}^0) and the vector {\bf b}^0 be defined as follows:

    [J_0]_{ij} = ( {\bf p}_i^0)^TA_j {\bf p}_i^0,\quad 1\leq i,j\leq n,\qquad [ {\bf b}^0]_i = ( {\bf p}_i^0)^TA_0 {\bf p}_i^0,\quad 1\leq i\leq n. (3.5)

    Let B_0\in\mathbb{R}^{n\times n} satisfy

    \|I-B_0J( {\bf c}^0)\|\leq\mu,

    where μ is a positive constant.

    Step 2. For k = 0,1,2,\ldots , until convergence, do:

    (a) Calculate {\bf y}^k by

    {\bf y}^k = {\bf c}^k-B_k(J_k {\bf c}^k+ {\bf b}^k-\boldsymbol\lambda^*). (3.6)

    (b) Form the skew-symmetric matrix Y_k :

    [Y_k]_{ij} = \begin{cases} 0, & \text{for } 1\leq i,j\leq t, \text{ or } i = j;\\ \dfrac{( {\bf p}_i^k)^TA( {\bf y}^k) {\bf p}_j^k}{\lambda_j^*-\lambda_i^*}, & \text{for } t+1\leq i\leq n \text{ or } t+1\leq j\leq n, \text{ and } i\neq j, \end{cases} (3.7)

    where the matrix A( {\bf y}^k) is defined by (1.1).

    (c) Calculate P( {\bf y}^k) = [ {\bf p}_1( {\bf y}^k), {\bf p}_2( {\bf y}^k),\ldots, {\bf p}_n( {\bf y}^k)]^T = [ {\bf v}_1^k, {\bf v}_2^k,\ldots, {\bf v}_n^k]^T by solving

    \Big(I+\frac{1}{2}Y_k\Big) {\bf v}_j^k = {\bf h}_j^k,\quad\text{for } 1\leq j\leq n, (3.8)

    where {\bf h}_j^k is the j th column of H_k = \Big(I-\frac{1}{2}Y_k\Big)P_k^T .

    (d) Calculate the approximate eigenvalues of A( {\bf y}^k) via

    \hat{\lambda}_i( {\bf y}^k) = ( {\bf p}_i( {\bf y}^k))^TA( {\bf y}^k) {\bf p}_i( {\bf y}^k),\quad\text{for } 1\leq i\leq n.

    (e) Calculate {\bf c}^{k+1} by

    {\bf c}^{k+1} = {\bf y}^k-B_k(\hat{\boldsymbol\lambda}( {\bf y}^k)-\boldsymbol\lambda^*). (3.9)

    (f) Form the skew-symmetric matrix \hat{Y}_k :

    [\hat{Y}_k]_{ij} = \begin{cases} 0, & \text{for } 1\leq i,j\leq t, \text{ or } i = j;\\ \dfrac{( {\bf p}_i( {\bf y}^k))^TA( {\bf c}^{k+1}) {\bf p}_j( {\bf y}^k)}{\lambda_j^*-\lambda_i^*}, & \text{for } t+1\leq i\leq n \text{ or } t+1\leq j\leq n, \text{ and } i\neq j, \end{cases}

    where the matrix A( {\bf c}^{k+1}) is defined by (1.1).

    (g) Calculate P_{k+1} = [ {\bf p}_1^{k+1}, {\bf p}_2^{k+1},\ldots, {\bf p}_n^{k+1}]^T = [\hat{ {\bf v}}_1^k,\hat{ {\bf v}}_2^k,\ldots,\hat{ {\bf v}}_n^k]^T by solving

    \Big(I+\frac{1}{2}\hat{Y}_k\Big)\hat{ {\bf v}}_j^k = \hat{ {\bf h}}_j^k,\quad\text{for } 1\leq j\leq n, (3.10)

    where \hat{ {\bf h}}_j^k is the j th column of \hat{H}_k = \Big(I-\frac{1}{2}\hat{Y}_k\Big)P( {\bf y}^k)^T .

    (h) Form the matrix J_{k+1} and the vector {\bf b}^{k+1} :

    [J_{k+1}]_{ij} = ( {\bf p}_i^{k+1})^TA_j {\bf p}_i^{k+1},\quad 1\leq i,j\leq n,
    [ {\bf b}^{k+1}]_i = ( {\bf p}_i^{k+1})^TA_0 {\bf p}_i^{k+1},\quad 1\leq i\leq n.

    (i) Calculate the Chebyshev matrix B_{k+1} by

    B_{k+1} = B_k+B_k(2I-J( {\bf c}^{k+1})B_k)(I-J( {\bf c}^{k+1})B_k).
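    To fix ideas, the following MATLAB function is a simplified sketch of one pass of Step 2 for the distinct-eigenvalue case ( t = 1 ), with dense backslash solves standing in for the QMR inner iterations used in the experiments below; all function and variable names are our own, and the transposed solves follow Remark 3.1. It is an illustration under these assumptions, not the authors' implementation.

```matlab
% Simplified sketch of one pass of Step 2 (distinct eigenvalues, t = 1).
function [c, P, B] = ulm_chebyshev_step(c, P, B, Acell, lamstar)
    n = numel(c);
    [Jk, bk] = form_Jb(P, Acell);
    y  = c - B*(Jk*c + bk - lamstar);                   % (a), Eq. (3.6)
    Ay = assembleA(y, Acell);
    Yk = skew_from(P, Ay, lamstar);                     % (b), Eq. (3.7)
    Py = ((eye(n) + Yk/2) \ ((eye(n) - Yk/2)*P'))';     % (c), Eq. (3.8)
    lamhat = diag(Py'*Ay*Py);                           % (d) Rayleigh quotients
    c  = y - B*(lamhat - lamstar);                      % (e), Eq. (3.9)
    Ac = assembleA(c, Acell);
    Yh = skew_from(Py, Ac, lamstar);                    % (f)
    P  = ((eye(n) + Yh/2) \ ((eye(n) - Yh/2)*Py'))';    % (g), Eq. (3.10)
    [Jn, ~] = form_Jb(P, Acell);                        % (h)
    B  = B + B*(2*eye(n) - Jn*B)*(eye(n) - Jn*B);       % (i) Chebyshev update
end

function A = assembleA(c, Acell)
    A = Acell{1};
    for i = 1:numel(c), A = A + c(i)*Acell{i+1}; end    % A(c) of (1.1)
end

function [J, b] = form_Jb(P, Acell)
    n = size(P,1); J = zeros(n); b = zeros(n,1);
    for i = 1:n
        b(i) = P(:,i)' * Acell{1} * P(:,i);
        for j = 1:n, J(i,j) = P(:,i)' * Acell{j+1} * P(:,i); end
    end
end

function Y = skew_from(P, A, lam)
    n = numel(lam); Y = zeros(n);
    for i = 1:n
        for j = 1:n
            if i ~= j   % t = 1: only the diagonal entries vanish, cf. (3.3)
                Y(i,j) = (P(:,i)' * A * P(:,j)) / (lam(j) - lam(i));
            end
        end
    end
end
```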

    Remark 3.1. For k = 0,1,2,\ldots , from (c) and (g) in Step 2 of Algorithm Ⅰ, we have

    P( {\bf y}^k) = P_k\Big(I+\frac{1}{2}Y_k\Big)\Big(I-\frac{1}{2}Y_k\Big)^{-1} (3.11)

    and

    P_{k+1} = P( {\bf y}^k)\Big(I+\frac{1}{2}\hat{Y}_k\Big)\Big(I-\frac{1}{2}\hat{Y}_k\Big)^{-1}. (3.12)

    Remark 3.2. When the given eigenvalues are not distinct, the convergence analysis of the two-step Ulm-Chebyshev-like method in [30] no longer applies, owing to the loss of differentiability of {\bf f} and the discontinuity of the eigenvectors corresponding to multiple eigenvalues [22]. Based on the relative generalized Jacobian of the eigenvalue function [32], we propose the improved method for solving the IEP (1.2) with multiple eigenvalues. Clearly, in the case when t = 1 , the method presented above reduces to the two-step Ulm-Chebyshev-like method proposed in [30] for the distinct case.

    In this section, we shall analyze the convergence of Algorithm Ⅰ. To ensure cubic convergence, one natural assumption would be that all J\in\partial_Q {\bf f}( {\bf c}^*) are nonsingular; this requirement is stronger than necessary, however, since a suitable choice of eigenvectors generally renders J nonsingular. Therefore, we assume that all J\in\partial_{Q|S} {\bf f}( {\bf c}^*) are nonsingular.

    Let {\bf c}^k , {\bf y}^k , Y_k , \hat{Y}_k , P_k , P( {\bf y}^k) , J_k , and B_k be generated by Algorithm Ⅰ with initial point {\bf c}^0 . For k = 0,1,2,\ldots , let

    E_k := [P_k^{(1)}-\Pi P_k^{(1)},\ P_k^{(2)}-Q^{(2)}( {\bf c}^*)] (4.1)

    and

    E( {\bf y}^k) := [P( {\bf y}^k)^{(1)}-\Pi P( {\bf y}^k)^{(1)},\ P( {\bf y}^k)^{(2)}-Q^{(2)}( {\bf c}^*)]. (4.2)

    Then, we can get the following lemmas.

    Lemma 4.1. Let the given eigenvalues \{\lambda_i^*\}_{i = 1}^{n} be defined as in (2.8). Then, in Algorithm Ⅰ, there exists a number 0<\delta_2\leq1 such that, for k = 0,1,2,\ldots , if \| {\bf y}^k- {\bf c}^*\|\leq\delta_2 , \| {\bf c}^{k+1}- {\bf c}^*\|\leq\delta_2 , \|E_k\|\leq\delta_2 , and \|E( {\bf y}^k)\|\leq\delta_2 , then

    \|\Lambda^*+X_k\Lambda^*-\Lambda^*X_k-P_k^TA( {\bf c}^*)P_k\|\leq K\|E_k\|^2, (4.3)
    \|P( {\bf y}^k)-P_k\|\leq\rho(\| {\bf y}^k- {\bf c}^*\|+\|E_k\|), (4.4)
    \|E( {\bf y}^k)\|\leq\rho(\| {\bf y}^k- {\bf c}^*\|+\|E_k\|^2), (4.5)
    \|\Lambda^*+Y_k'\Lambda^*-\Lambda^*Y_k'-P( {\bf y}^k)^TA( {\bf c}^*)P( {\bf y}^k)\|<K\|E( {\bf y}^k)\|^2, (4.6)
    \|P_{k+1}-P( {\bf y}^k)\|\leq\rho(\| {\bf c}^{k+1}- {\bf c}^*\|+\|E( {\bf y}^k)\|) (4.7)

    and

    \|E_{k+1}\|\leq\rho(\| {\bf c}^{k+1}- {\bf c}^*\|+\|E( {\bf y}^k)\|^2), (4.8)

    where X_k and Y_k' are the skew-symmetric matrices defined by e^{X_k} = P_k^T\tilde{Q}( {\bf c}^*) and e^{Y_k'} = P( {\bf y}^k)^T\tilde{Q}( {\bf c}^*) , respectively, and K and \rho are defined by (2.1) and (2.2), respectively.

    Proof. Let e^{X_k} := P_k^T\tilde{Q}( {\bf c}^*) , where X_k is the skew-symmetric matrix and \tilde{Q}( {\bf c}^*) is defined by (2.11) and (2.12) with P = P_k . By \|E_k\|\leq\delta_2\leq\epsilon_0 and Lemma 2.2, we have

    \|X_k\|_F\leq\beta\|E_k\|\quad\text{and}\quad\|X_k^{(11)}\|_F\leq\beta\|E_k\|^2, (4.9)

    where \beta is a positive number and X_k^{(11)} is the t -by- t leading block of X_k . On the other hand, by \tilde{Q}( {\bf c}^*)\in\mathcal{Q}( {\bf c}^*) , we derive

    e^{X_k}\Lambda^*e^{-X_k} = P_k^TA( {\bf c}^*)P_k. (4.10)

    Thus, by the fact that e^{X_k} = \sum_{l = 0}^{\infty}\frac{X_k^l}{l!} , we can express (4.10) as

    \Lambda^*+X_k\Lambda^*-\Lambda^*X_k = P_k^TA( {\bf c}^*)P_k+R(X_k), (4.11)

    where

    R(X_k) = -X_k^2\Big(\sum\limits_{l = 2}^{\infty}\frac{X_k^{l-2}}{l!}\Big)\Lambda^*\Big(\sum\limits_{l = 0}^{\infty}\frac{(-X_k)^l}{l!}\Big)-\Lambda^*X_k^2\sum\limits_{l = 2}^{\infty}\frac{(-X_k)^{l-2}}{l!}+X_k\Lambda^*X_k\sum\limits_{l = 1}^{\infty}\frac{(-X_k)^{l-1}}{l!}.

    By (2.1) and Lemma 2.2, we get

    \|R(X_k)\|_F\leq6\|X_k\|_F^2\|\Lambda^*\|_F = 6\|X_k\|_F^2\|\boldsymbol\lambda^*\|\leq6\beta^2\|\boldsymbol\lambda^*\|\,\|E_k\|^2\leq K\|E_k\|^2. (4.12)

    Thus, (4.3) is seen to hold by (4.11) and (4.12).

    In order to prove (4.4) and (4.5), assume that \| {\bf y}^k- {\bf c}^*\|\leq\delta_2 . We note by (4.11) that

    [X_k]_{ij} = \frac{1}{\lambda_j^*-\lambda_i^*}\Big[( {\bf p}_i^k)^TA( {\bf c}^*) {\bf p}_j^k+[R(X_k)]_{ij}\Big],

    where t+1\leq i\leq n , 1\leq j\leq n , i>j . Combining this with (3.7), we have

    [X_k]_{ij}-[Y_k]_{ij} = \frac{1}{\lambda_j^*-\lambda_i^*}\Big[( {\bf p}_i^k)^T[A( {\bf c}^*)-A( {\bf y}^k)] {\bf p}_j^k+[R(X_k)]_{ij}\Big],

    in which t+1\leq i\leq n , 1\leq j\leq n , i>j . By (4.12), Lemma 2.1, and the fact that \{ {\bf p}_i^k\}_{i = 1}^{n} are orthonormal, we get

    \max\limits_{t+1\leq i\leq n,\ 1\leq j\leq n,\ i\neq j}\big|[X_k]_{ij}-[Y_k]_{ij}\big|\leq\max\limits_{t\leq i\leq n-1}\frac{1}{\lambda_{i+1}^*-\lambda_i^*}\times\Big(\max\limits_{1\leq j\leq n}\|A_j\|\,\| {\bf y}^k- {\bf c}^*\|+K\|E_k\|^2\Big).

    In addition, by the fact that [Y_k]_{ij} = 0 for each i,j\in[1,t] , we have

    \|X_k-Y_k\|\leq\|X_k-Y_k\|_F\leq\|X_k^{(11)}\|_F+\sqrt{n^2-t^2}\max\limits_{i\in[t+1,n],\ j\in[1,n],\ i\neq j}\big|[X_k]_{ij}-[Y_k]_{ij}\big|.

    Then, it follows from (2.1) and (4.9) that

    \|X_k-Y_k\|\leq\beta\|E_k\|^2+N\Big(\max\limits_{1\leq j\leq n}\|A_j\|\,\| {\bf y}^k- {\bf c}^*\|+K\|E_k\|^2\Big), (4.13)

    and so

    \|Y_k\|\leq\beta\|E_k\|+\beta\|E_k\|^2+N\Big(\max\limits_{1\leq j\leq n}\|A_j\|\,\| {\bf y}^k- {\bf c}^*\|+K\|E_k\|^2\Big).

    Thus, thanks to the fact that \beta\|E_k\|\leq\beta\delta_2\leq1 and (2.2), one has

    \|Y_k\|\leq N\max\limits_{1\leq j\leq n}\|A_j\|\,\| {\bf y}^k- {\bf c}^*\|+(1+\beta+6N\beta\|\boldsymbol\lambda^*\|)\|E_k\|\leq\frac{C}{2}(\| {\bf y}^k- {\bf c}^*\|+\|E_k\|)\leq\frac{\rho}{2}(\| {\bf y}^k- {\bf c}^*\|+\|E_k\|). (4.14)

    Since \|E_k\|\leq\delta_2 and \| {\bf y}^k- {\bf c}^*\|\leq\delta_2 , it follows from (2.4) and (4.14) that \|Y_k\|\leq1 . Consequently,

    \Big\|\Big(I-\frac{1}{2}Y_k\Big)^{-1}\Big\|\leq\frac{1}{1-\frac{1}{2}\|Y_k\|}\leq2. (4.15)

    Therefore, in the following, we estimate \|P( {\bf y}^k)-P_k\| and \|E( {\bf y}^k)\| . Indeed, by (3.11),

    P( {\bf y}^k)-P_k = P_k\Big[\Big(I+\frac{1}{2}Y_k\Big)-\Big(I-\frac{1}{2}Y_k\Big)\Big]\Big(I-\frac{1}{2}Y_k\Big)^{-1} = P_kY_k\Big(I-\frac{1}{2}Y_k\Big)^{-1}.

    This together with (4.14), (4.15), and the orthogonality of P_k indicates that (4.4) holds.

    As for (4.5), we note by (3.11) and the definition of X_k that

    P( {\bf y}^k)-\tilde{Q}( {\bf c}^*) = P_k\Big[\Big(I+\frac{1}{2}Y_k\Big)\Big(I-\frac{1}{2}Y_k\Big)^{-1}-e^{X_k}\Big] = P_k\Big[\Big(I+\frac{1}{2}Y_k\Big)-e^{X_k}\Big(I-\frac{1}{2}Y_k\Big)\Big]\Big(I-\frac{1}{2}Y_k\Big)^{-1}.

    Combining this with e^{X_k} = \sum_{l = 0}^{\infty}\frac{X_k^l}{l!} , we get

    P( {\bf y}^k)-\tilde{Q}( {\bf c}^*) = P_k\Big[Y_k-X_k+\frac{1}{2}X_kY_k-\Big(X_k^2\sum\limits_{m = 2}^{\infty}\frac{X_k^{m-2}}{m!}\Big)\Big(I-\frac{1}{2}Y_k\Big)\Big]\Big(I-\frac{1}{2}Y_k\Big)^{-1} = P_k\Big(Y_k-X_k+\frac{1}{2}X_kY_k\Big)\Big(I-\frac{1}{2}Y_k\Big)^{-1}-P_kX_k^2\sum\limits_{m = 2}^{\infty}\frac{X_k^{m-2}}{m!}.

    Since P_k is orthogonal, we note by (2.14) and (4.15) that

    \|P( {\bf y}^k)-\tilde{Q}( {\bf c}^*)\|\leq2\|Y_k-X_k\|+\|X_k\|\,\|Y_k\|+\|X_k\|^2.

    Thus, we derive by using (2.13), (4.13), and (4.14) that

    \|P( {\bf y}^k)-\tilde{Q}( {\bf c}^*)\|\leq2\beta\|E_k\|^2+2N\Big(\max\limits_{1\leq j\leq n}\|A_j\|\,\| {\bf y}^k- {\bf c}^*\|+K\|E_k\|^2\Big)+\frac{1}{2}\beta C\|E_k\|(\| {\bf y}^k- {\bf c}^*\|+\|E_k\|)+\beta^2\|E_k\|^2\leq\Big(2\beta+\beta^2+2NK+\frac{1}{2}\beta C\Big)\|E_k\|^2+\Big(2N\max\limits_{1\leq j\leq n}\|A_j\|+\frac{1}{2}C\Big)\| {\bf y}^k- {\bf c}^*\|\leq\Big(2\beta+\beta^2+2NK+\frac{1}{2}\beta C\Big)\|E_k\|^2+\frac{3}{2}C\| {\bf y}^k- {\bf c}^*\|, (4.16)

    where the second inequality holds because of the fact that \beta\|E_k\|\leq1 , while the last inequality holds because of the definition of C . Write P( {\bf y}^k) = [P( {\bf y}^k)^{(1)},\ P( {\bf y}^k)^{(2)}] , where P( {\bf y}^k)^{(1)}\in\mathbb{R}^{n\times t} and P( {\bf y}^k)^{(2)}\in\mathbb{R}^{n\times(n-t)} . Since (I-\Pi)\tilde{Q}^{(1)}( {\bf c}^*) = 0 , where 0 is a zero matrix, we have

    (I-\Pi)P( {\bf y}^k)^{(1)} = (I-\Pi)\big(P( {\bf y}^k)^{(1)}-\tilde{Q}^{(1)}( {\bf c}^*)+\tilde{Q}^{(1)}( {\bf c}^*)\big) = (I-\Pi)\big(P( {\bf y}^k)^{(1)}-\tilde{Q}^{(1)}( {\bf c}^*)\big)

    and

    \|P( {\bf y}^k)^{(2)}-\tilde{Q}^{(2)}( {\bf c}^*)\|\leq\|P( {\bf y}^k)-\tilde{Q}( {\bf c}^*)\|.

    Hence, by (4.2) and (4.16), we obtain

    \|E( {\bf y}^k)\|\leq\|(I-\Pi)P( {\bf y}^k)^{(1)}\|+\|P( {\bf y}^k)^{(2)}-\tilde{Q}^{(2)}( {\bf c}^*)\|\leq2\sqrt{n}\|P( {\bf y}^k)-\tilde{Q}( {\bf c}^*)\|\leq2\sqrt{n}\Big[\Big(2\beta+\beta^2+2NK+\frac{1}{2}\beta C\Big)\|E_k\|^2+\frac{3}{2}C\| {\bf y}^k- {\bf c}^*\|\Big]. (4.17)

    Therefore, (4.5) is proved by (2.2) and (4.17). We define e^{Y_k'} := P( {\bf y}^k)^T\tilde{Q}( {\bf c}^*) , where Y_k' is a skew-symmetric matrix. Similarly, (4.6)–(4.8) also hold.

    Lemma 4.2. Let ρ0 and δ0 be defined in Lemma 2.1. If ykcδ0 and ˆλ(yk)=(ˆλ1(yk),ˆλ2(yk),,ˆλn(yk))T, in which ˆλi(yk)=(pi(yk))TA(yk)pi(yk), 1in, then

    ˆλ(yk)nmax1jnAjykc+KnE(yk)2+λ. (4.18)

    Proof. Considering the diagonal elements of \Lambda^*+Y_k'\Lambda^*-\Lambda^*Y_k'-P( {\bf y}^k)^TA( {\bf c}^*)P( {\bf y}^k) , we obtain from (4.6) that

    |( {\bf p}_i( {\bf y}^k))^TA( {\bf c}^*) {\bf p}_i( {\bf y}^k)-\lambda_i^*|\leq K\|E( {\bf y}^k)\|^2,\quad\text{for } 1\leq i\leq n,

    which together with Lemma 2.1 gives

    |( {\bf p}_i( {\bf y}^k))^TA( {\bf y}^k) {\bf p}_i( {\bf y}^k)-\lambda_i^*|\leq|( {\bf p}_i( {\bf y}^k))^T(A( {\bf y}^k)-A( {\bf c}^*)) {\bf p}_i( {\bf y}^k)|+|( {\bf p}_i( {\bf y}^k))^TA( {\bf c}^*) {\bf p}_i( {\bf y}^k)-\lambda_i^*|\leq\max\limits_{1\leq j\leq n}\|A_j\|\,\| {\bf y}^k- {\bf c}^*\|+K\|E( {\bf y}^k)\|^2,\quad 1\leq i\leq n.

    Therefore,

    \|\hat{\boldsymbol\lambda}( {\bf y}^k)-\boldsymbol\lambda^*\|\leq\sqrt{n}\max\limits_{1\leq j\leq n}\|A_j\|\,\| {\bf y}^k- {\bf c}^*\|+K\sqrt{n}\|E( {\bf y}^k)\|^2,

    which together with the fact that \|\hat{\boldsymbol\lambda}( {\bf y}^k)\|\leq\|\hat{\boldsymbol\lambda}( {\bf y}^k)-\boldsymbol\lambda^*\|+\|\boldsymbol\lambda^*\| gives (4.18).

    Lemma 4.3. Let the Jacobian matrix \tilde{J}( {\bf c}^*) and the vector \tilde{ {\bf b}} be defined as follows:

    [\tilde{J}( {\bf c}^*)]_{ij} = \tilde{ {\bf q}}_i( {\bf c}^*)^TA_j\tilde{ {\bf q}}_i( {\bf c}^*),\quad 1\leq i,j\leq n,\quad\text{and}\quad [\tilde{ {\bf b}}]_i = \tilde{ {\bf q}}_i( {\bf c}^*)^TA_0\tilde{ {\bf q}}_i( {\bf c}^*),\quad 1\leq i\leq n.

    Then, we have

    \|\tilde{J}( {\bf c}^*) {\bf y}^k+\tilde{ {\bf b}}-\hat{\boldsymbol\lambda}( {\bf y}^k)\|\leq6\sqrt{n}\beta^2\|\hat{\boldsymbol\lambda}( {\bf y}^k)\|\,\|E( {\bf y}^k)\|^2. (4.19)

    Proof. Let e^{Y_k'} := P( {\bf y}^k)^T\tilde{Q}( {\bf c}^*) , where Y_k' is the skew-symmetric matrix and \tilde{Q}( {\bf c}^*) is defined by (2.11) and (2.12) with P = P( {\bf y}^k) . By \|E( {\bf y}^k)\|\leq\delta_2\leq\epsilon_0 and Lemma 2.2, we get

    \|Y_k'\|_F\leq\beta\|E( {\bf y}^k)\|,

    where \beta is a positive number. By \hat{\Lambda}( {\bf y}^k) = P( {\bf y}^k)^TA( {\bf y}^k)P( {\bf y}^k) and e^{-Y_k'}\hat{\Lambda}( {\bf y}^k)e^{Y_k'} = \tilde{Q}( {\bf c}^*)^TA( {\bf y}^k)\tilde{Q}( {\bf c}^*) , we have

    \hat{\Lambda}( {\bf y}^k)-Y_k'\hat{\Lambda}( {\bf y}^k)+\hat{\Lambda}( {\bf y}^k)Y_k' = \tilde{Q}( {\bf c}^*)^TA( {\bf y}^k)\tilde{Q}( {\bf c}^*)+R(Y_k'),

    where

    R(Y_k') = -(Y_k')^2\Big(\sum\limits_{l = 2}^{\infty}\frac{(-Y_k')^{l-2}}{l!}\Big)\hat{\Lambda}( {\bf y}^k)\Big(\sum\limits_{l = 0}^{\infty}\frac{(Y_k')^{l}}{l!}\Big)-\hat{\Lambda}( {\bf y}^k)(Y_k')^2\sum\limits_{l = 2}^{\infty}\frac{(Y_k')^{l-2}}{l!}+Y_k'\hat{\Lambda}( {\bf y}^k)Y_k'\sum\limits_{l = 1}^{\infty}\frac{(Y_k')^{l-1}}{l!}.

    By Lemma 2.2, we get

    \|R(Y_k')\|_F\leq6\|Y_k'\|_F^2\|\hat{\Lambda}( {\bf y}^k)\|_F = 6\|Y_k'\|_F^2\|\hat{\boldsymbol\lambda}( {\bf y}^k)\|\leq6\beta^2\|\hat{\boldsymbol\lambda}( {\bf y}^k)\|\,\|E( {\bf y}^k)\|^2.

    Thus,

    \|\tilde{Q}( {\bf c}^*)^TA( {\bf y}^k)\tilde{Q}( {\bf c}^*)+Y_k'\hat{\Lambda}( {\bf y}^k)-\hat{\Lambda}( {\bf y}^k)Y_k'-\hat{\Lambda}( {\bf y}^k)\|\leq6\beta^2\|\hat{\boldsymbol\lambda}( {\bf y}^k)\|\,\|E( {\bf y}^k)\|^2. (4.20)

    By considering the diagonal entries of \tilde{Q}( {\bf c}^*)^TA( {\bf y}^k)\tilde{Q}( {\bf c}^*)+Y_k'\hat{\Lambda}( {\bf y}^k)-\hat{\Lambda}( {\bf y}^k)Y_k'-\hat{\Lambda}( {\bf y}^k) and (4.20), we get

    |\tilde{ {\bf q}}_i( {\bf c}^*)^TA( {\bf y}^k)\tilde{ {\bf q}}_i( {\bf c}^*)-\hat{\lambda}_i( {\bf y}^k)|<6\beta^2\|\hat{\boldsymbol\lambda}( {\bf y}^k)\|\,\|E( {\bf y}^k)\|^2,\quad\text{for } 1\leq i\leq n.

    Therefore, by the definitions of \tilde{J}( {\bf c}^*) , \hat{\boldsymbol\lambda}( {\bf y}^k) , \tilde{ {\bf b}} , and A( {\bf y}^k) , we can get (4.19).

    Lemma 4.4. [35] Let J_0 be defined in (3.5) and suppose that J_0 is invertible. Let k\geq1 be such that

    2n\|J_0^{-1}\|\max\limits_{1\leq j\leq n}\|A_j\|\,\|P_k-P_0\|<1. (4.21)

    Then, the matrix J_k is nonsingular and

    \|J_k^{-1}\|\leq\frac{\|J_0^{-1}\|}{1-2n\|J_0^{-1}\|\max\limits_{1\leq j\leq n}\|A_j\|\,\|P_k-P_0\|}.

    Lemma 4.5. Let the vector {\bf c}^*\in clS and let the given eigenvalues of the matrix A( {\bf c}^*) satisfy (2.8). Let all J\in\partial_{Q|S} {\bf f}( {\bf c}^*) be nonsingular. Then there exist three numbers 0<\tau\leq1 , 0<\delta<\tau , and 0\leq\mu\leq\delta such that, for {\bf c}^0\in B( {\bf c}^*,\delta)\cap S with \| {\bf c}^0- {\bf c}^*\|\leq\delta and k = 0,1,2,\ldots , the conditions

    \|E_k\|\leq2\sqrt{n}\rho_0\tau\Big(\frac{\delta}{\tau}\Big)^{3^k}, (4.22)
    \| {\bf c}^k- {\bf c}^*\|\leq\tau\Big(\frac{\delta}{\tau}\Big)^{3^k}, (4.23)

    and

    \|I-B_kJ_k\|\leq\tau\Big(\frac{\delta}{\tau}\Big)^{3^k} (4.24)

    imply

    \|J_k^{-1}\|\leq\gamma,\quad\|B_k\|\leq2\gamma,\quad\text{and}\quad\| {\bf c}^{k+1}- {\bf c}^*\|\leq\tau\Big(\frac{\delta}{\tau}\Big)^{3^{k+1}}. (4.25)

    Proof. Since all J\in\partial_{Q|S} {\bf f}( {\bf c}^*) are nonsingular, it follows from Theorem 3.2 in [34] that there exist \xi>0 and \delta_0>0 such that, for each {\bf c}^0\in B( {\bf c}^*,\delta_0)\cap S ,

    \sup\limits_{J\in\partial_{Q|S} {\bf f}( {\bf c}^0)}\|J^{-1}\|\leq\xi.

    From (2.3), (2.5), and (2.6), we know that

    \tau\leq1. (4.26)

    By (2.6) and (4.22), we get

    \|E_k\|\leq2\sqrt{n}\rho_0\tau\Big(\frac{\delta}{\tau}\Big)^{3^k}\leq2\sqrt{n}\rho_0\delta\leq\delta_2,

    and then, from Lemma 2.2, we obtain

    \|X_k\|_F\leq\beta\|E_k\|, (4.27)

    where \beta>0 . By the definition of X_k and the Taylor series of the exponential function e^{X_k} , we have

    P_k-\tilde{Q}( {\bf c}^*) = P_k(I-e^{X_k}) = -P_kX_k\sum\limits_{l = 1}^{\infty}\frac{X_k^{l-1}}{l!}.

    Since P_k is orthogonal, we note by (2.14) and (4.27) that

    \|P_k-\tilde{Q}( {\bf c}^*)\|\leq2\|X_k\|\leq2\|X_k\|_F\leq2\beta\|E_k\|. (4.28)

    Similarly, we also have

    \|P_{k-1}-\tilde{Q}( {\bf c}^*)\|\leq2\|X_{k-1}\|\leq2\|X_{k-1}\|_F\leq2\beta\|E_{k-1}\|.

    Thus, by (2.5), (4.22), and (4.28), we have

    \|P_k-P_{k-1}\|\leq\|P_k-\tilde{Q}( {\bf c}^*)\|+\|P_{k-1}-\tilde{Q}( {\bf c}^*)\|\leq2\beta\|E_k\|+2\beta\|E_{k-1}\|\leq2\beta\Big(2\sqrt{n}\rho_0\tau\Big(\frac{\delta}{\tau}\Big)^{3^k}+2\sqrt{n}\rho_0\tau\Big(\frac{\delta}{\tau}\Big)^{3^{k-1}}\Big)\leq4\sqrt{n}\beta\rho_0\tau\Big(\frac{\delta}{\tau}\Big)^{3^{k-1}}. (4.29)

    Therefore, we further have

    \|P_m-P_0\|\leq\sum\limits_{k = 1}^{m}\|P_k-P_{k-1}\|\leq4\sqrt{n}\beta\rho_0\tau\Big[\Big(\frac{\delta}{\tau}\Big)+\Big(\frac{\delta}{\tau}\Big)^{3}+\Big(\frac{\delta}{\tau}\Big)^{3^2}+\cdots+\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}}\Big]. (4.30)

    Since 3^n\geq2n+1 for each n\geq0 , we obtain from (4.30) that

    \|P_m-P_0\|\leq4\sqrt{n}\beta\rho_0\tau\Big[\Big(\frac{\delta}{\tau}\Big)+\Big(\frac{\delta}{\tau}\Big)^{3}+\Big(\frac{\delta}{\tau}\Big)^{5}+\cdots+\Big(\frac{\delta}{\tau}\Big)^{2m-1}\Big]\leq\frac{4\sqrt{n}\beta\rho_0\delta\big[1-(\delta/\tau)^{2m}\big]}{1-(\delta/\tau)^{2}},

    which together with (2.1) and (2.6) yields

    2n\xi\max\limits_{1\leq j\leq n}\|A_j\|\,\|P_m-P_0\|\leq\frac{8n^{\frac{3}{2}}\xi\beta\rho_0\delta\max\limits_{1\leq j\leq n}\|A_j\|}{1-(\delta/\tau)^{2}} = H_1\delta<\frac{1}{2}H_1\tau\leq1.

    Consequently, using Lemma 4.4, we can derive that J_m is nonsingular and, moreover,

    \|J_m^{-1}\|\leq\frac{\xi}{1-2n\xi\max\limits_{1\leq j\leq n}\|A_j\|\,\|P_m-P_0\|}\leq\frac{\xi}{1-H_1\delta} = \gamma. (4.31)

    Furthermore, by (2.5), (2.6), and (4.24), we have

    \|B_k\|\leq\|B_kJ_k\|\,\|J_k^{-1}\|\leq(\|I\|+\|I-B_kJ_k\|)\|J_k^{-1}\|\leq\Big(1+\tau\Big(\frac{\delta}{\tau}\Big)^{3^k}\Big)\gamma\leq(1+\tau)\gamma\leq2\gamma. (4.32)

    On the other hand, considering the diagonal elements of \Lambda^*+X_k\Lambda^*-\Lambda^*X_k-P_k^TA( {\bf c}^*)P_k , we obtain from (4.3) that

    |( {\bf p}_i^k)^TA( {\bf c}^*) {\bf p}_i^k-\lambda_i^*|\leq K\|E_k\|^2,\quad\text{for } 1\leq i\leq n.

    Therefore, by the definitions of \boldsymbol\lambda^* , J_k , {\bf b}^k , and A( {\bf c}^*) , we have

    \|J_k {\bf c}^*-\boldsymbol\lambda^*+ {\bf b}^k\|\leq\sqrt{n}K\|E_k\|^2. (4.33)

    From (3.6), we get

    {\bf y}^k- {\bf c}^* = B_k(\boldsymbol\lambda^*- {\bf b}^k-J_k {\bf c}^*)+(I-B_kJ_k)( {\bf c}^k- {\bf c}^*).

    It follows that

    \| {\bf y}^k- {\bf c}^*\|\leq\|B_k\|\,\|J_k {\bf c}^*-\boldsymbol\lambda^*+ {\bf b}^k\|+\|I-B_kJ_k\|\,\| {\bf c}^k- {\bf c}^*\|,

    which together with (4.22)–(4.24), (4.32), and (4.33) gives

    \| {\bf y}^k- {\bf c}^*\|\leq2\gamma\sqrt{n}K\|E_k\|^2+\tau\Big(\frac{\delta}{\tau}\Big)^{3^k}\cdot\tau\Big(\frac{\delta}{\tau}\Big)^{3^k}\leq(1+8\gamma Kn^{\frac{3}{2}}\rho_0^2)\tau^2\Big(\frac{\delta}{\tau}\Big)^{2\cdot3^k} := \alpha_2\tau^2\Big(\frac{\delta}{\tau}\Big)^{2\cdot3^k}. (4.34)

    By (2.6), (4.22), and (4.34), we have

    \| {\bf y}^k- {\bf c}^*\|\leq\alpha_2\delta\leq\delta_2\leq1 (4.35)

    and

    \|E_k\|\leq2\sqrt{n}\rho_0\tau\Big(\frac{\delta}{\tau}\Big)^{3^k}\leq2\sqrt{n}\rho_0\delta\leq\delta_2\leq1, (4.36)

    which together with (4.22) and (4.28) yields

    \| {\bf p}_i^k-\tilde{ {\bf q}}_i( {\bf c}^*)\|\leq\|P_k-\tilde{Q}( {\bf c}^*)\|\leq4\sqrt{n}\beta\rho_0\tau\Big(\frac{\delta}{\tau}\Big)^{3^k},\quad 1\leq i\leq n.

    This together with the orthogonality of P_k and \tilde{Q}( {\bf c}^*) and the Cauchy-Schwarz inequality indicates that

    |[J_k]_{ij}-[\tilde{J}( {\bf c}^*)]_{ij}| = |( {\bf p}_i^k)^TA_j {\bf p}_i^k-\tilde{ {\bf q}}_i( {\bf c}^*)^TA_j\tilde{ {\bf q}}_i( {\bf c}^*)| = |( {\bf p}_i^k-\tilde{ {\bf q}}_i( {\bf c}^*))^TA_j {\bf p}_i^k+\tilde{ {\bf q}}_i( {\bf c}^*)^TA_j( {\bf p}_i^k-\tilde{ {\bf q}}_i( {\bf c}^*))|\leq2\|A_j\|\,\| {\bf p}_i^k-\tilde{ {\bf q}}_i( {\bf c}^*)\|\leq8\sqrt{n}\beta\rho_0\|A_j\|\tau\Big(\frac{\delta}{\tau}\Big)^{3^k},\quad 1\leq i,j\leq n.

    Thus, we get

    \|J_k-\tilde{J}( {\bf c}^*)\|\leq\|J_k-\tilde{J}( {\bf c}^*)\|_F\leq8n^{\frac{3}{2}}\beta\rho_0\max\limits_{1\leq j\leq n}\|A_j\|\tau\Big(\frac{\delta}{\tau}\Big)^{3^k} = \alpha_1\tau\Big(\frac{\delta}{\tau}\Big)^{3^k}. (4.37)

    By (4.24), (4.32), and (4.37), we have

    \begin{eqnarray} \|I-B_{k}\tilde{J}( {\bf c}^{*})\| &\leq& \|I-B_{k}J_{k}\|+\|B_{k}\|\|J_{k}-\tilde{J}( {\bf c}^{*})\| \\ &\leq& \tau\Big(\frac{\delta}{\tau}\Big)^{3^{k}}+2\gamma\alpha_{1}\tau\Big(\frac{\delta}{\tau}\Big)^{3^{k}}\\ &\leq& (1+2\gamma\alpha_{1})\tau\Big(\frac{\delta}{\tau}\Big)^{3^{k}}\\ &: = &\alpha_{6}\tau\Big(\frac{\delta}{\tau}\Big)^{3^{k}}. \end{eqnarray} (4.38)

    By (4.34)–(4.36) and Lemma 4.1, we obtain

    \begin{eqnarray} \|E( {\bf y}^{k})\| &\leq&\rho\Big(\alpha_{2}\tau^{2}\Big(\frac{\delta}{\tau}\Big)^{2\cdot3^{k}}+4n\rho_{0}^{2}\tau^{2}\Big(\frac{\delta}{\tau}\Big)^{2\cdot3^{k}}\Big)\\ &\leq&\rho(\alpha_{2}+4n\rho_{0}^{2})\tau^{2}\Big(\frac{\delta}{\tau}\Big)^{2\cdot3^{k}}\\&: = &\alpha_{3}\tau^{2}\Big(\frac{\delta}{\tau}\Big)^{2\cdot3^{k}}, \end{eqnarray} (4.39)

    which together with (2.6), (4.22), and (4.34) gives

    \begin{eqnarray*} \|E( {\bf y}^{k})\|\leq \alpha_{3}\delta\leq\delta_{2}\leq 1, \end{eqnarray*}

    and together with (4.35) and Lemmas 4.2 and 4.3, we get

    \begin{eqnarray} \|\tilde{J}( {\bf c}^{*}) {\bf y}^{k}+\tilde{ {\bf b}}-\hat{\boldsymbol\lambda}( {\bf y}^{k})\| &\leq& 6\sqrt{n}\beta^{2}\:\|{\hat{\boldsymbol\lambda}}( {\bf y}^{k})\|\|E( {\bf y}^{k})\|^{2}\\ &\leq& 6\sqrt{n}\beta^{2}\Big(\sqrt{n}\max\limits_{1\leq j\leq n}\|A_{j}\|\| {\bf y}^{k}- {\bf c}^{*}\|+ K\sqrt{n}\|E( {\bf y}^{k})\|^{2}+\|\boldsymbol\lambda^{*}\|\Big)\|E( {\bf y}^{k})\|^{2} \\ &\leq& 6\sqrt{n}\beta^{2}\Big(\sqrt{n}\max\limits_{1\leq j\leq n}\|A_{j}\|+ K\sqrt{n}+\|\boldsymbol\lambda^{*}\|\Big)\alpha_{3}^{2}\tau^{4}\Big(\frac{\delta}{\tau}\Big)^{4\cdot3^{k}} \\ &\leq& 6\sqrt{n}\beta^{2}\Big(\sqrt{n}\max\limits_{1\leq j\leq n}\|A_{j}\|+ K\sqrt{n}+\|\boldsymbol\lambda^{*}\|\Big)\alpha_{3}^{2}\tau^{3}\Big(\frac{\delta}{\tau}\Big)^{3^{k+1}} \\ &: = &\alpha_{5}\tau^{3}\Big(\frac{\delta}{\tau}\Big)^{3^{k+1}}. \end{eqnarray} (4.40)

    Together with (3.9) and {\boldsymbol \lambda}^{*} = \tilde{J}({\bf c}^{*}) {\bf c}^{*}+\tilde{ {\bf b}} , we get

    {\bf c}^{k+1}- {\bf c}^{*} = B_{k}(\tilde{J}( {\bf c}^{*}) {\bf y}^{k}+\tilde{ {\bf b}}-\hat{\boldsymbol\lambda}( {\bf y}^{k}))+(I-B_{k}\tilde{J}( {\bf c}^{*}))( {\bf y}^{k}- {\bf c}^{*}).

    It follows from (4.26), (4.32), (4.34), (4.38), and (4.40) that

    \begin{eqnarray} \| {\bf c}^{k+1}- {\bf c}^{*}\| &\leq& \|B_{k}\|\|\tilde{J}( {\bf c}^{*}) {\bf y}^{k}+\tilde{ {\bf b}}-\hat{\boldsymbol\lambda}( {\bf y}^{k})\|+\|I-B_{k}\tilde{J}( {\bf c}^{*})\|\| {\bf y}^{k}- {\bf c}^{*}\| \\ &\leq& 2\gamma\alpha_{5}\tau^{3}\Big(\frac{\delta}{\tau}\Big)^{3^{k+1}}+\alpha_{6}\tau\Big(\frac{\delta}{\tau}\Big)^{3^{k}}\cdot\alpha_{2}\tau^{2}\Big(\frac{\delta}{\tau}\Big)^{2\cdot3^{k}} \\ &\leq& (2\gamma\alpha_{5}+\alpha_{2}\alpha_{6})\tau^{2}\Big(\frac{\delta}{\tau}\Big)^{3^{k+1}} = \alpha_{7}\tau^{2}\Big(\frac{\delta}{\tau}\Big)^{3^{k+1}}\leq\tau\Big(\frac{\delta}{\tau}\Big)^{3^{k+1}}. \end{eqnarray} (4.41)

    Finally, by (4.31), (4.32), and (4.41), we have (4.25).

    Next, we can analyze the convergence of Algorithm Ⅰ.

    Theorem 4.1. Let the vector {\bf c}^{*}\in clS and the given eigenvalues \{\lambda_{i}^{*}\}_{i = 1}^{n} satisfy (2.8), and suppose that all J\in \partial_{Q|S} {\bf f}({\bf c}^*) are nonsingular. Then Algorithm Ⅰ is locally cubically convergent.

    Proof. We prove by mathematical induction that (4.22)–(4.24) are true for all k\geq0 . Clearly, by the assumptions \mu\leq\delta and \| {\bf c}^{0}- {\bf c}^{*}\|\leq\delta , (4.23) and (4.24) for k = 0 are trivial. From Lemma 2.1, we have

    \begin{eqnarray*} \|E_{0}\|\leq\|E_{0}\|_{F}\leq\|(I-\Pi)Q^{(1)}( {\bf c}^{0})\|_{F}+\|Q^{(2)}( {\bf c}^{0})-Q^{(2)}( {\bf c}^{*})\|_{F}\leq2\sqrt{n}\rho_{0}\| {\bf c}^{0}- {\bf c}^{*}\|\leq2\sqrt{n}\rho_{0}\delta, \end{eqnarray*}

    and this gives that (4.22) is true for k = 0 .

    Now assume that (4.22)–(4.24) are true for all k\leq m-1 . Recalling (2.5) and (2.6), we get

    \|E_{m-1}\|\leq 2\sqrt{n}\rho_{0}\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}}\leq2\sqrt{n}\rho_{0}\delta\leq\delta_{2},

    which together with Lemma 4.1, (2.5), (2.6), (4.34), and (4.39) with k = m-1 gives

    \|E( {\bf y}^{m-1})\|\leq\alpha_{3}\delta\leq\delta_{2}.

    It follows from Lemma 4.1, (2.5), (4.39), and (4.41) with k = m-1 that

    \begin{eqnarray*} \| {\bf c}^{m}- {\bf c}^{*}\|\leq\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m}}\leq\delta\leq\delta_{2} \end{eqnarray*}

    and

    \begin{eqnarray*} \|E_{m}\| \leq \rho_{3}\Big(\alpha_{7}\tau^{2}\Big(\frac{\delta}{\tau}\Big)^{3^{m}}+\alpha_{3}^{2}\tau^{4}\Big(\frac{\delta}{\tau}\Big)^{4\cdot3^{m-1}}\Big) \leq \rho_{3}(\alpha_{7}+\alpha_{3}^{2})\tau^{2}\Big(\frac{\delta}{\tau}\Big)^{3^{m}} \leq 2\sqrt{n}\rho_{0}\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m}}\leq2\sqrt{n}\rho_{0}\delta. \end{eqnarray*}

    Thus, (4.22) and (4.23) hold for k = m . From Lemma 4.5 with k = m-1 , we get \|I-B_{m-1}J_{m-1}\|\leq\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}} and \|B_{m-1}\|\leq2\gamma . Thanks to (4.29) (with k = m ), one can see that

    \begin{eqnarray*} \|P_{m}-P_{m-1}\|\leq 4\sqrt{n}\beta\rho_{0}\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}}, \end{eqnarray*}

    which implies that

    \begin{eqnarray*} \| {\bf p}_{i}^{m}- {\bf p}_{i}^{m-1}\|\leq 4\sqrt{n}\beta\rho_{0}\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}},\quad 1\leq i\leq n. \end{eqnarray*}

    Consequently,

    \begin{eqnarray*} |[J_{m}]_{ij}-[J_{m-1}]_{ij}| & = &|( {\bf p}_{i}^{m}- {\bf p}_{i}^{m-1})^{T}A_{j} {\bf p}_{i}^{m}+( {\bf p}_{i}^{m-1})^{T}A_{j}( {\bf p}_{i}^{m}- {\bf p}_{i}^{m-1})|\\ &\leq& 2\|A_{j}\|\| {\bf p}_{i}^{m}- {\bf p}_{i}^{m-1}\|\\ &\leq& 8\|A_{j}\|\sqrt{n}\beta\rho_{0}\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}},\ 1\leq i,j\leq n. \end{eqnarray*}

    Hence,

    \begin{eqnarray*} \|J_{m}-J_{m-1}\|\leq\|J_{m}-J_{m-1}\|_{F}\leq 8n^{\frac{3}{2}}\beta\rho_{0}\max\limits_{1\leq j\leq n}\|A_{j}\|\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}}: = \alpha_{1}\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}}. \end{eqnarray*}

    It follows that

    \begin{eqnarray} \|I-B_{m-1}J_{m}\| &\leq& \|I-B_{m-1}J_{m-1}\|+\|B_{m-1}\|\|J_{m-1}-J_{m}\|\\ &\leq& \tau\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}}+2\gamma\cdot\alpha_{1}\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}}\leq(1+2\gamma\alpha_{1})\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m-1}}. \end{eqnarray} (4.42)

    Notice that B_{m} = B_{m-1}+B_{m-1}(2I-J({\bf c}^{m})B_{m-1})(I-J({\bf c}^{m})B_{m-1}) , and we obtain

    I-B_{m}J_{m} = I-(B_{m-1}+B_{m-1}(2I-J( {\bf c}^{m})B_{m-1})(I-J( {\bf c}^{m})B_{m-1}))J_{m} = (I-B_{m-1}J_{m})^{3}.

    Together with (2.5), (4.26), and (4.42), one has

    \begin{eqnarray*} \|I-B_{m}J_{m}\|\leq\|I-B_{m-1}J_{m}\|^{3}\leq(1+2\gamma\alpha_{1})^{3}\tau^{3}\Big(\frac{\delta}{\tau}\Big)^{3^{m}}\leq\tau\Big(\frac{\delta}{\tau}\Big)^{3^{m}}, \end{eqnarray*}

    and therefore, (4.24) is true for k = m and the proof is complete.

    In this section, we present the computational performance of Algorithm Ⅰ in addressing IEPs with multiple eigenvalues. Algorithm Ⅰ is compared with the two-step Ulm-Chebyshev-like Cayley transform method (TUCT method), the inexact Cayley transform method (ICT method), and the Ulm-like Cayley transform method (UCT method), as presented in [30,35,36], respectively. The tests were carried out in MATLAB 7.10 running on a PC with an Intel Pentium Ⅳ 3.0 GHz CPU. We first consider the inverse eigenvalue problem for a matrix with Toeplitz structure, as previously investigated in [35,36].

    The QMR method [37] was utilized to solve all linear systems in all algorithms, using the MATLAB QMR function with the maximum number of iterations set to 1000 . For the approximate Jacobian equations arising in the inexact Cayley transform method, the preconditioned QMR method was employed with a specific stopping tolerance, with MATLAB's incomplete LU factorization serving as the preconditioner; specifically, the drop tolerance in LUINC( A , drop-tolerance) is fixed at 0.001 . Furthermore, the precision level for the linear systems involved in all computational methods is set to machine accuracy, ensuring the attainment of the desired solutions. The termination criterion for the outer (Newton) iterations is met in every algorithm when

    \|P_{k}^{T}A(\mathbf{c}^{k})P_{k}-\Lambda^{*}\|\leq10^{-12}.
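    As an illustration of this setup (not the authors' exact script), the inner solve and the outer test can be realized in MATLAB as follows, where Jk, rhs, Pk, Ac, and LambdaStar are assumed to be available from the current iteration; ilu is the modern replacement for LUINC.

```matlab
% Sketch: ILU-preconditioned QMR inner solve and the outer stopping test.
opts = struct('type','ilutp','droptol',1e-3);      % drop tolerance 0.001
[Lp, Up] = ilu(sparse(Jk), opts);                  % incomplete LU preconditioner
dk = qmr(sparse(Jk), rhs, 1e-12, 1000, Lp, Up);    % at most 1000 inner iterations
done = norm(Pk'*Ac*Pk - LambdaStar) <= 1e-12;      % outer termination criterion
```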

    Example 5.1. (See [35,36]) Referring to the Toeplitz matrices stated in [35,36], consider \{A_{i}\}_{i = 0}^{n} as

    A_0 = O,\ \ A_1 = I,\ \ A_2 = \left[ \begin{array}{rrrrr} 0 & 1 & 0 & \cdots & 0\\ 1 & 0 & 1 & \ddots &\vdots\\ 0 & 1 & \ddots&\ddots&0\\ \vdots&\ddots&\ddots&0 &1\\ 0 &\cdots& 0 & 1 & 0\\\end{array} \right],\cdots, A_n = \left[ \begin{array}{rrrrr} 0 & 0 & \cdots &0 & 1\\ 0 & \ddots & \ddots & \ddots &0\\ \vdots & \ddots & \ddots&\ddots&\vdots\\ 0&\cdots&\ddots&\ddots &0\\ 1 &0& \cdots & 0 & 0\\\end{array} \right].

    Hence, the matrix A({\bf c}) can be characterized as a symmetric Toeplitz matrix where the first column is identical to the vector {\bf c} .
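    A hedged MATLAB sketch of this construction (the comparison against toeplitz(c) is our own sanity check):

```matlab
% Build the basis of Example 5.1: A_k has ones on the diagonals +/-(k-1),
% so A(c) = sum_k c_k A_k is the symmetric Toeplitz matrix with first column c.
n = 100;
Acell = cell(1, n+1);
Acell{1} = zeros(n);                 % A_0 = O
for k = 1:n
    v = zeros(n,1); v(k) = 1;
    Acell{k+1} = toeplitz(v);        % A_1 = I, A_2, ..., A_n as displayed above
end
c = randn(n,1);
Ac = Acell{1};
for i = 1:n, Ac = Ac + c(i)*Acell{i+1}; end
fprintf('||A(c) - toeplitz(c)|| = %.2e\n', norm(Ac - toeplitz(c)));
```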

    Here, we consider three cases: n = 100,200,300 . For any n , we generate a vector \tilde{ {\bf c}}^{*} such that |\lambda_{k+1}(\tilde{ {\bf c}}^{*})-\lambda_{k}(\tilde{ {\bf c}}^{*})| < \eta for some 1\leq k\leq n-1 , where

    \eta = \begin{cases} 5\times 10^{-5},\quad&\text{ n = 100 };\\ 1\times 10^{-5},\quad&\text{ n = 200 };\\ 1\times 10^{-6},\quad&\text{ n = 300 }. \end{cases}

    Set

    \lambda_{i}^{*} = \begin{cases} \lambda_{k}(\tilde{ {\bf c}}^{*}),\quad&\text{ i = k, k+1 };\\ \lambda_{i}(\tilde{ {\bf c}}^{*}),\quad&\text{otherwise}. \end{cases}

    Subsequently, we choose \{\lambda_{i}^{*}\}_{i = 1}^{n} as the prescribed eigenvalues. It is clear that, with this way of selecting \{\lambda_{i}^{*}\}_{i = 1}^{n} , multiple eigenvalues are present. Since all of the algorithms are locally convergent, the initial guess {\bf c}^{0} is formed by chopping the components of \tilde{ {\bf c}}^{*} to six decimal places for n = 100,\ 200,\ 300 . Table 1 presents the averaged values of \|P_{k}^{T}A(\mathbf{c}^{k})P_{k}-\Lambda^{*}\| , where "it." denotes the averaged number of outer iterations. Table 2 presents the averaged total numbers of outer iterations N_{0} over the ten test scenarios and the averaged total numbers of inner iterations N_{i} required for solving the IEPs. A comparison of the averaged CPU time of all of the algorithms for the ten tests with different n is shown in Table 3.

    Table 1.  Averaged values of \|P_{k}^{T}A(\mathbf{c}^{k})P_{k}-\Lambda^{*}\| for the ten tests. (An en dash "–" marks iterations at which the method had already converged or had failed to converge.)

    n    it.  ICT (\beta=1.5)  ICT (\beta=1.8)  UCT method   Algorithm Ⅰ  TUCT method
    100  0    1.8973e-5        1.8973e-5        1.8973e-5    1.8973e-5    1.8973e-5
         1    1.0006e-6        9.5329e-5        3.4389e-5    7.2881e-9    2.3548e+3
         2    3.4623e-7        5.2316e-7        1.5623e-7    4.6864e-14   2.5421e+15
         3    1.5456e-9        2.5695e-9        9.2635e-9    –            –
         4    8.6994e-12       9.9999e-11       8.2312e-12   –            –
         5    9.5684e-15       1.0001e-14       6.2351e-15   –            –
    200  0    2.6051e-5        2.6051e-5        2.6051e-5    2.6051e-5    2.6051e-5
         1    1.0003e-7        4.1858e-6        3.1889e-6    9.9999e-8    4.2151e+4
         2    9.9998e-8        3.9856e-8        4.4668e-8    6.2392e-13   8.2357e+21
         3    6.1254e-9        2.2254e-9        3.5563e-9    –            –
         4    1.5684e-12       9.5695e-11       1.5623e-12   –            –
         5    2.3564e-14       3.2356e-13       8.8563e-14   –            –
    300  0    4.0904e-5        4.0904e-5        4.0904e-5    4.0904e-5    4.0904e-5
         1    9.1364e-7        4.2864e-7        7.2789e-7    1.9604e-9    3.2546e+3
         2    3.3874e-8        2.9856e-8        3.3668e-8    2.2275e-13   6.2534e+22
         3    4.2356e-9        8.5695e-9        4.5564e-9    –            –
         4    1.0012e-12       9.9898e-11       2.5873e-11   –            –
         5    6.5684e-14       1.2325e-13       8.2344e-13   –            –
    Table 2.  Averaged total numbers of outer and inner iterations for the ten tests.

    n          ICT (\beta=1.5)  ICT (\beta=1.8)  Algorithm Ⅰ  UCT method
    100  N_0   4.9              4.9              2            4.9
         N_i   32.1             32.9             29.2         32.5
    200  N_0   5.1              5.1              2            5.1
         N_i   47.2             47.8             38.9         47.3
    300  N_0   5.1              5.1              2            5.1
         N_i   71.2             71.8             60.1         71.6
    Table 3.  Averaged CPU time in seconds of all algorithms for the ten tests.

    n                        50     100    150    200     250     300
    UCT method               0.66   2.14   7.23   18.11   40.56   98.88
    ICT method (\beta=1.5)   0.64   1.91   8.12   17.68   32.43   86.37
    Algorithm Ⅰ              0.33   1.18   5.53   12.52   28.99   55.67

    From the data presented in Table 1, we see that Algorithm Ⅰ requires fewer iterations than the ICT and UCT methods, while the TUCT method fails to converge. It is evident from the data presented in Table 2 that Algorithm Ⅰ outperforms the inexact Cayley transform method and the Ulm-like Cayley transform method. Table 3 shows that Algorithm Ⅰ demands less CPU time than the other approaches.

    Example 5.2. [22] This is an inverse problem with multiple eigenvalues and n = 8 . We are given that B = I+WW^T , where

    W = \left[ \begin{array}{rrrrr} 1 & -1 & -3 & -5 & -6 \\ 1 & 1 & -2 & -5 & -17 \\ 1 & -1 & -1 & 5 & 18 \\ 1 & 1 & 1 & 2 & 0 \\ 1 & -1 & 2 & 0 & 1 \\ 1 & 1 & 3 & 0 & -1 \\ 2.5 & 0.2 & 0.3 & 0.5 & 0.6 \\ 2 & -0.2 & 0.3 & 0.5 & 0.8 \\ \end{array} \right]_{8\times 5}.

    Define

    A_0 = 0,\quad A_i: = b_{ii} {\bf e}_i {\bf e}_i^T+\sum\limits_{j = 1}^{i-1}b_{ij} ( {\bf e}_i {\bf e}_j^T+ {\bf e}_j {\bf e}_i^T),\quad i = 1,\ldots,8.
    \boldsymbol \lambda^* = (1,1,1,2.120754,9.218868,17.28137,35.70822,722.6808)^T,
    {\bf c}^* = (1,1,1,1,1,1,1,1)^T.
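    A short MATLAB sketch of this construction; note that summing all A_i with {\bf c}^* = (1,\ldots,1)^T reproduces B , which is why the prescribed eigenvalues are the spectrum of B (the displayed check is our own illustration, and \boldsymbol\lambda^* above is rounded):

```matlab
% Build the basis of Example 5.2 from B = I + W*W'.
W = [ 1 -1 -3 -5 -6;  1 1 -2 -5 -17;  1 -1 -1 5 18;  1 1 1 2 0; ...
      1 -1 2 0 1;  1 1 3 0 -1;  2.5 0.2 0.3 0.5 0.6;  2 -0.2 0.3 0.5 0.8 ];
B = eye(8) + W*W';
Id = eye(8);
Acell = cell(1, 9);
Acell{1} = zeros(8);                        % A_0 = 0
for i = 1:8
    ei = Id(:,i);
    Ai = B(i,i)*(ei*ei');                   % diagonal contribution b_ii e_i e_i'
    for j = 1:i-1
        ej = Id(:,j);
        Ai = Ai + B(i,j)*(ei*ej' + ej*ei'); % symmetric off-diagonal part
    end
    Acell{i+1} = Ai;
end
disp(sort(eig(B))');   % matches the prescribed lambda* for c* = ones(8,1)
```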

    We report our numerical results for different starting points: (a) {\bf c}^0 = 10^{-5}\cdot(1, 1, 1, 1, 1, 1, 1, 1)^T ; (b) {\bf c}^0 = (0, 0, 0, 0, 0, 0, 0, 0)^T .

    Table 4 presents the averaged values of \|P_{k}^{T}A(\mathbf{c}^{k})P_{k}-\Lambda^{*}\| , where "it." denotes the averaged number of outer iterations.

    Table 4.  Averaged values of \|P_{k}^{T}A(\mathbf{c}^{k})P_{k}-\Lambda^{*}\| for the ten tests. (An en dash "–" marks iterations at which the method had already converged or had failed to converge.)

         it.  ICT (\beta=1.5)  ICT (\beta=1.8)  UCT method   Algorithm Ⅰ  TUCT method
    (a)  0    2.2113e-5        2.2113e-5        2.2113e-5    2.2113e-5    2.2113e-5
         1    3.2154e-6        4.2568e-6        8.2135e-5    9.889e-14    4.2356e+3
         2    8.3549e-8        4.2648e-8        2.1350e-8    –            3.2589e+15
         3    3.2111e-10       1.2309e-10       8.2156e-10   –            –
         4    8.2543e-13       5.2236e-13       9.9998e-14   –            –
    (b)  0    7.2383e+2        7.2383e+2        7.2383e+2    7.2383e+2    7.2383e+2
         1    5.2469e+5        3.2699e+6        5.3216e+6    8.3269e+18   3.2156e+14
         2    5.2148e+12       7.2584e+12       5.2318e+13   –            –

    It can be observed from Table 4 that, for the starting point (a), Algorithm Ⅰ requires fewer iterations than the ICT and UCT methods; for the starting point (b), which is far from the solution, no method converges. This illustrates that Algorithm Ⅰ is locally cubically convergent but not globally convergent.

    To further illustrate the effectiveness of Algorithm Ⅰ, we present a practical engineering application in vibrations [15,16]. We consider the vibration of a taut string with n beads. Figure 1 shows such a model for the case where n = 4 . Here, we assume that the n beads are placed along the string, where the ends of the string are clamped. The mass of the j th bead is denoted by m_j . The horizontal lengths between masses m_j and m_{j+1} (and between each bead at each end and the clamped support) are set to be a constant L . The horizontal tension is set to be a constant T . Then the equation of motion is governed by

    \begin{equation} m_jy_j''(t) = T\frac{y_{j+1}-y_j}{L}-T\frac{y_{j}-y_{j-1}}{L},\quad j = 1,\ldots,n, \end{equation} (5.1)
    Figure 1.  A string with n = 4 beads.

    where y_0 = y_{n+1} = 0 . That is, the ends of the string are fixed. The matrix form of (5.1) is given by

    \begin{equation} y''(t) = -CJy(t), \end{equation} (5.2)

    where y(t) = (y_1(t), y_2(t), \ldots, y_n(t))^T , C = {{\rm{diag}}}(c_1, c_2, \ldots, c_n) with c_j = \frac{T}{m_jL} , and J is the discrete Laplacian matrix

    J = \left[ \begin{array}{ccccc} 2 & -1 & & & \\ -1 & 2 & -1 &&\\ & \ddots & \ddots & \ddots & \\ && -1 & 2 & -1\\ &&& -1 & 2\\ \end{array} \right]\in\mathcal{S}^{n}.

    The general solution of (5.2) is given in terms of the eigenvalue problem

    CJy = \lambda y,

    where \lambda is the square of the natural frequency of the vibration system and the nonzero vector y accounts for the interplay between the masses. The inverse problem for the beaded string is to compute the masses \{m_j\}_{j = 1}^n so that the resulting system has a prescribed set of natural frequencies.

    It is easy to check that the eigenvalues of J are given by

    \lambda_j(J) = 4\left(\sin\frac{j\pi}{2(n+1)}\right)^2,\quad j = 1,2,\ldots,n.

    Thus, J is symmetric and positive definite, and CJ is similar to L^TCL , where L is the Cholesky factor of J = LL^T [31]. Then, the inverse problem is converted into the form of the IEP, where A_0 = 0 and A_j = L^TE_jL with E_j = {{\rm{diag}}}( {\bf e}_j) for j = 1,2,\ldots,n . The beaded string data in Examples 5.3 and 5.4 comes from the website http://www.caam.rice.edu/~beads .
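    The conversion is easy to reproduce; below is a MATLAB sketch for n = 4 (the eigenvalue check against the sine formula is our own illustration, and Lc denotes the Cholesky factor to avoid clashing with the length L in the text):

```matlab
% Beaded-string conversion: discrete Laplacian J, Cholesky factor, and basis.
n = 4;
J = 2*eye(n) - diag(ones(n-1,1),1) - diag(ones(n-1,1),-1);
Lc = chol(J, 'lower');                       % J = Lc*Lc'
Id = eye(n);
Acell = cell(1, n+1);
Acell{1} = zeros(n);                         % A_0 = 0
for j = 1:n
    Acell{j+1} = Lc' * diag(Id(:,j)) * Lc;   % A_j = L' E_j L, E_j = diag(e_j)
end
% Sanity check of the eigenvalue formula for J:
err = max(abs(sort(4*sin((1:n)'*pi/(2*(n+1))).^2) - sort(eig(J))));
fprintf('max deviation from 4 sin^2(j*pi/(2(n+1))): %.2e\n', err);
```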

    Example 5.3. This is an inverse problem for the beaded string with n = 4 beads, where

    \begin{array}{c} (m_1,m_2,m_3,m_4) = (0.030783, 0.017804, 0.017804, 0.030783) \, (kg = kilogram),\\ (n+1)L = 1.12395 \,(meter),\quad T = 191.8199\, (Newton),\\ \boldsymbol \lambda^* = (15041.90, 42344.26, 88328.78, 15041.90)^T,\\ c^* = (27720.80, 47929.08, 47929.08, 27720.80)^T. \end{array}

    We report our numerical result for the starting point {\bf c}^0 = 10^{-5}\cdot(27720.80, 47929.08, 47929.08, 27720.80)^T .

    Example 5.4. This is an inverse problem for the beaded string with n = 6 beads, where

    \begin{array}{c} (m_1,m_2,m_3,m_4,m_5,m_6) = (0.017804, 0.030783, 0.017804, 0.017804, 0.030783, 0.017804) (kg),\\ (n+1)L = 1.12395 \,(meter),\quad T = 166.0370\, (Newton),\\ \boldsymbol \lambda^* = (9113.978, 30746.32, 83621.69, 148694.4, 148694.4, 193537.0)^T,\\ c^* = (58081.57, 33592.71, 58081.57, 58081.57, 33592.71, 58081.57)^T. \end{array}

    We report our numerical results for the starting point {\bf c}^0 = 10^{-5}\cdot(58081.57, 33592.71, 58081.57, 58081.57, 33592.71, 58081.57)^T .

    Table 5 presents the averaged values of \|P_{k}^{T}A(\mathbf{c}^{k})P_{k}-\Lambda^{*}\| , and Table 6 displays the computed masses for the beaded string.

    Table 5.  Averaged values of \|P_{k}^{T}A(\mathbf{c}^{k})P_{k}-\Lambda^{*}\| in the ten tests for Examples 5.3 and 5.4.

    Example 5.3       \|P_{k}^{T}A(\mathbf{c}^{k})P_{k}-\Lambda^{*}\| -value (last 3 iterations)
    ICT method        (6.0235\times 10^{-8},\ 3.6564\times 10^{-10},\ 9.3356\times 10^{-13})
    UCT method        (1.9587\times 10^{-8},\ 8.3985\times 10^{-10},\ 2.8872\times 10^{-14})
    Algorithm Ⅰ       (5.9254\times 10^{-5},\ 2.8123\times 10^{-11},\ 6.5865\times 10^{-21})

    Example 5.4       \|P_{k}^{T}A(\mathbf{c}^{k})P_{k}-\Lambda^{*}\| -value (last 3 iterations)
    ICT method        (1.0562\times 10^{-7},\ 5.0420\times 10^{-9},\ 7.4001\times 10^{-13})
    UCT method        (2.9125\times 10^{-7},\ 9.0859\times 10^{-9},\ 3.7523\times 10^{-13})
    Algorithm Ⅰ       (1.9862\times 10^{-5},\ 2.9546\times 10^{-11},\ 5.5555\times 10^{-21})
    Table 6.  Recovered masses for Examples 5.3 and 5.4.

    Example 5.3   m_1        m_2        m_3        m_4
    true          0.030783   0.017804   0.017804   0.030783
    recovered     0.030783   0.017804   0.017804   0.030783

    Example 5.4   m_1        m_2        m_3        m_4        m_5        m_6
    true          0.017804   0.030783   0.017804   0.017804   0.030783   0.017804
    recovered     0.017804   0.030783   0.017804   0.017804   0.030783   0.017804

    From Table 5, we see that Algorithm Ⅰ requires fewer iterations than the ICT and UCT methods, and the TUCT method fails to converge. Table 6 shows that the desired masses are recovered. All of these numerical observations agree with our prediction and further validate our theoretical results.

    In this paper, we have proposed a two-step Ulm-Chebyshev-like Cayley transform method for solving the IEP (1.2) with multiple eigenvalues, which avoids solving (approximate) Jacobian equations in each outer iteration. Furthermore, the proposed algorithm is proved to be cubically convergent under the following nonsingularity condition in terms of the relative generalized Jacobian evaluated at a solution {\bf c}^{*} : each J\in \partial_{Q|S} {\bf f}({\bf c}^*) is nonsingular. The method can also handle larger problems of Toeplitz inverse eigenvalue type and is efficient for small and medium-sized problems of the general form. Nevertheless, effective methods for general large-scale inverse eigenvalue problems should be studied in the future.

    Wei Ma, Zhenhao Li and Yuxin Zhang: Algorithms, Software, Numerical examples, Writing-original draft, Writing-review & editing. All authors have contributed equally to this article. All authors have read and approved the final version of the manuscript for publication.

    The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

    This work was supported by the Henan Province College Student Innovation and Entrepreneurship Training Program Project (No: 202410481003).

    The authors declare no conflicts of interest.



    [1] F. Diele, T. Laudadio, N. Mastronardi, On some inverse eigenvalue problems with Toeplitz-related structure, SIAM J. Matrix Anal. Appl., 26 (2004), 285–294. https://doi.org/10.1137/S0895479803430680
    [2] J. Peinado, A. M. Vidal, A new parallel approach to the Toeplitz inverse eigenproblem using Newton-like methods, In: Vector and parallel processing–VECPAR 2000, Springer, 2001. https://doi.org/10.1007/3-540-44942-6_29
    [3] W. F. Trench, Numerical solution of the inverse eigenvalue problem for real symmetric Toeplitz matrices, SIAM J. Sci. Comput., 18 (1997), 1722–1736. https://doi.org/10.1137/S1064827595280673
    [4] N. Wagner, Inverse eigenvalue problems in structural dynamics, Proc. Appl. Math. Mech., 6 (2006), 339–340. https://doi.org/10.1002/pamm.200610151
    [5] P. J. Brussard, P. W. M. Glaudemans, Shell model applications in nuclear spectroscopy, North-Holland, 1977.
    [6] M. S. Ravi, J. Rosenthal, X. A. Wang, On decentralized dynamic pole placement and feedback stabilization, IEEE Trans. Automat. Control, 40 (1995), 1603–1614. https://doi.org/10.1109/9.412629
    [7] O. Hald, On discrete and numerical Sturm-Liouville problems, New York University, 1972.
    [8] G. M. L. Gladwell, Inverse problems in vibration, Appl. Mech. Rev., 39 (1986), 1013–1018. https://doi.org/10.1115/1.3149517
    [9] G. M. L. Gladwell, Inverse problems in vibration-Ⅱ, Appl. Mech. Rev., 49 (1996), S25–S34. https://doi.org/10.1115/1.3101973
    [10] K. T. Joseph, Inverse eigenvalue problem in structural design, AIAA J., 30 (1992), 2890–2896. https://doi.org/10.2514/3.11634
    [11] R. L. Parker, K. A. Whaler, Numerical methods for establishing solutions to the inverse problem of electromagnetic induction, J. Geophys. Res., 86 (1981), 9574–9584. https://doi.org/10.1029/JB086iB10p09574
    [12] N. Li, A matrix inverse eigenvalue problem and its application, Linear Algebra Appl., 266 (1997), 143–152. https://doi.org/10.1016/S0024-3795(96)00639-8
    [13] M. Müller, An inverse eigenvalue problem: computing B-stable Runge-Kutta methods having real poles, BIT Numer. Math., 32 (1992), 676–688. https://doi.org/10.1007/BF01994850
    [14] S. Elhay, Y. M. Ram, An affine inverse eigenvalue problem, Inverse Problems, 18 (2002), 455. https://doi.org/10.1088/0266-5611/18/2/311
    [15] M. T. Chu, Inverse eigenvalue problems, SIAM Rev., 40 (1998), 1–39. https://doi.org/10.1137/S0036144596303984
    [16] M. T. Chu, G. H. Golub, Structured inverse eigenvalue problems, Acta Numer., 11 (2002), 1–71. https://doi.org/10.1017/S0962492902000016
    [17] M. T. Chu, G. H. Golub, Inverse eigenvalue problems: theory, algorithms, and applications, Oxford: Oxford University Press, 2005. https://doi.org/10.1093/acprof:oso/9780198566649.001.0001
    [18] S. F. Xu, An introduction to inverse algebraic eigenvalue problems, Beijing: Peking University Press, 1998.
    [19] Z. J. Bai, Inexact Newton methods for inverse eigenvalue problems, Appl. Math. Comput., 172 (2006), 682–689. https://doi.org/10.1016/j.amc.2004.11.023
    [20] V. N. Kublanovskaja, On one approach to the solution of the inverse eigenvalue problem, In: Automatic programming and numerical methods of analysis, Springer, 1972, 80–86. https://doi.org/10.1007/978-1-4615-8588-6_10
    [21] Y. Wang, W. Shen, An extended two-step method for inverse eigenvalue problems with multiple eigenvalues, Numer. Math. Theor. Methods Appl., 16 (2023), 968–992.
    [22] S. Friedland, J. Nocedal, M. L. Overton, The formulation and analysis of numerical methods for inverse eigenvalue problems, SIAM J. Numer. Anal., 24 (1987), 634–667. https://doi.org/10.1137/0724043
    [23] Z. J. Bai, R. H. Chan, B. Morini, An inexact Cayley transform method for inverse eigenvalue problems, Inverse Problems, 20 (2004), 1675. https://doi.org/10.1088/0266-5611/20/5/022
    [24] R. H. Chan, S. F. Xu, H. M. Zhou, On the convergence rate of a quasi-Newton method for inverse eigenvalue problems, SIAM J. Numer. Anal., 36 (1999), 436–441. https://doi.org/10.1137/S0036142997327051
    [25] R. H. Chan, H. L. Chung, S. F. Xu, The inexact Newton-like method for inverse eigenvalue problem, BIT Numer. Math., 43 (2003), 7–20. https://doi.org/10.1023/a:1023611931016
    [26] W. P. Shen, C. Li, X. Q. Jin, A Ulm-like method for inverse eigenvalue problems, Appl. Numer. Math., 61 (2011), 356–367. https://doi.org/10.1016/j.apnum.2010.11.001
    [27] W. P. Shen, C. Li, An Ulm-like Cayley transform method for inverse eigenvalue problems, Taiwanese J. Math., 16 (2012), 367–386. https://doi.org/10.11650/twjm/1500406546
    [28] X. S. Chen, C. T. Wen, H. W. Sun, Two-step Newton-type methods for solving inverse eigenvalue problems, Numer. Linear Algebra Appl., 25 (2018), e2185. https://doi.org/10.1002/nla.2185
    [29] C. T. Wen, X. S. Chen, H. W. Sun, A two-step inexact Newton-Chebyshev-like method for inverse eigenvalue problems, Linear Algebra Appl., 585 (2020), 241–262. https://doi.org/10.1016/j.laa.2019.10.004
    [30] W. Ma, Two-step Ulm-Chebyshev-like Cayley transform method for inverse eigenvalue problems, Int. J. Comput. Math., 99 (2022), 391–406. https://doi.org/10.1080/00207160.2021.1913728
    [31] G. H. Golub, C. F. Van Loan, Matrix computations, 3 Eds., Johns Hopkins University Press, 1996.
    [32] L. Q. Qi, Convergence analysis of some algorithms for solving nonsmooth equations, Math. Oper. Res., 18 (1993), 227–244. https://doi.org/10.1287/moor.18.1.227
    [33] F. A. Potra, L. Q. Qi, D. F. Sun, Secant methods for semismooth equations, Numer. Math., 80 (1998), 305–324. https://doi.org/10.1007/s002110050369
    [34] D. F. Sun, J. Sun, Strong semismoothness of eigenvalues of symmetric matrices and its application to inverse eigenvalue problems, SIAM J. Numer. Anal., 40 (2003), 2352–2367. https://doi.org/10.1137/s0036142901393814
    [35] W. P. Shen, C. Li, X. Q. Jin, An inexact Cayley transform method for inverse eigenvalue problems with multiple eigenvalues, Inverse Problems, 31 (2015), 085007. https://doi.org/10.1088/0266-5611/31/8/085007
    [36] W. P. Shen, C. Li, X. Q. Jin, An Ulm-like Cayley transform method for inverse eigenvalue problems with multiple eigenvalues, Numer. Math. Theor. Methods Appl., 9 (2016), 664–685. https://doi.org/10.4208/nmtma.2016.y15030
    [37] R. W. Freund, N. M. Nachtigal, QMR: a quasi-minimal residual method for non-Hermitian linear systems, Numer. Math., 60 (1991), 315–339. https://doi.org/10.1007/BF01385726
  • © 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)