Research article

Conjugate gradient algorithm for consistent generalized Sylvester-transpose matrix equations

  • Received: 26 October 2021 Revised: 13 December 2021 Accepted: 21 December 2021 Published: 06 January 2022
  • MSC : 15A60, 15A69, 65F45

  • We develop an effective algorithm to find a good approximate solution of a generalized Sylvester-transpose matrix equation in which all coefficient matrices and the unknown matrix are rectangular. The algorithm constructs a finite sequence of approximate solutions from any given initial matrix. It turns out that the associated residual matrices are orthogonal, and thus the desired solution comes out in the final step with a satisfactory error. We provide numerical experiments to show the capability and performance of the algorithm.

    Citation: Kanjanaporn Tansri, Sarawanee Choomklang, Pattrawut Chansangiam. Conjugate gradient algorithm for consistent generalized Sylvester-transpose matrix equations[J]. AIMS Mathematics, 2022, 7(4): 5386-5407. doi: 10.3934/math.2022299




    Sylvester-type matrix equations show up naturally in several branches of mathematics and engineering. Indeed, many problems in vibration and structural analysis, robotics control and spacecraft control can be represented by the following general dynamical linear model:

    $$\sum_{i=0}^{s_1} A_i x^{(i)} + \sum_{j=0}^{s_2} B_j u^{(j)} = 0 \tag{1.1}$$

    where $x \in \mathbb{R}^{m\times 1}$ and $u \in \mathbb{R}^{n\times 1}$ are the state vector and the input vector, respectively, and $A_i \in \mathbb{R}^{m\times m}$ and $B_j \in \mathbb{R}^{m\times n}$ are the system coefficient matrices; see e.g., [1,2]. The dynamical linear system (1.1) includes

    $$A_1\dot{x} + A_0 x + B_0 u = 0, \quad \text{the descriptor linear system}, \tag{1.2}$$
    $$A_2\ddot{x} + A_1\dot{x} + A_0 x + B_0 u = 0, \quad \text{the second-order linear system}, \tag{1.3}$$
    $$A_k x^{(k)} + A_{k-1} x^{(k-1)} + \cdots + A_0 x = Bu, \quad \text{the high-order dynamical linear system}, \tag{1.4}$$

    as special cases. Certain problems in control engineering, such as pole/eigenstructure assignment and observer design for the system (1.1), are closely related to the Lyapunov matrix equation $AX + XA^T = B$, the Sylvester matrix equation $AX + XB = C$, and other related equations; see e.g., [2,3,4,5,6,7]. In particular, the Sylvester matrix equation also plays a fundamental role in signal processing and model reduction; see e.g., [8,9,10]. These equations are special cases of a generalized Sylvester-transpose matrix equation:

    $$\sum_{i=1}^{s} A_i X B_i + \sum_{j=1}^{t} C_j X^T D_j = E. \tag{1.5}$$

    A traditional way to solve Eq (1.5) for an exact solution is to transform it into an equivalent linear system via the Kronecker linearization; see Section 3 for details. However, this approach is only suitable when the dimensions of the coefficient matrices are small. In practice, for large dimensions, it suffices to find a good approximate solution via an iterative procedure, which does not require as much memory as the traditional methods. Several articles consider the problem of approximating solutions of generalized Sylvester-type matrix equations by constructing a finite sequence of approximate solutions from a given initial matrix. In the last five years, many researchers have developed iterative methods for solving Sylvester-type matrix equations related to Eq (1.5). A group of Hermitian and skew-Hermitian splitting (HSS) methods is based on decomposing a square matrix into the sum of its Hermitian part and skew-Hermitian part. There are several variations of HSS, namely, GMHSS [11], preconditioned HSS [12], FPPSS [13], and ADSS [14]. A group of gradient-based iterative (GI) algorithms aims to construct a sequence of approximate solutions converging to the exact solution, based on the gradients of quadratic norm-error functions. The original GI method for a generalized Sylvester matrix equation was developed by Ding et al. [15]. Then Niu et al. [16] adjusted the GI method by introducing a weighted factor. After that, a half-step-update variant of the GI method, called the MGI method, was introduced by Wang et al. [17]. The idea of the GI algorithm can be used in conjunction with matrix diagonal-extraction to obtain the AJGI [18] and MJGI [19] algorithms. See [20,21,22] for more GI algorithms for generalized Sylvester matrix equations. For the generalized Sylvester-transpose matrix equation $AXB + CX^TD = E$, there are a GI algorithm [23] and an accelerated gradient-based iterative (AGBI) algorithm [24] for constructing approximate solutions. There are also GI techniques based on optimization, e.g., [25,26,27]. See the survey [28] for more computational methods for linear matrix equations. Iterative procedures can also be used to find solutions of certain nonlinear matrix equations; see e.g., [29,30,31]. There are also applications of such techniques to parameter estimation in dynamical systems; see e.g., [32,33,34].

    The idea of the conjugate gradient (CG) method is a remarkable technique that constructs an orthogonal basis from the gradients of the associated quadratic function. There are several variations of CG for solving such matrix equations, e.g., BiCG [35], the Bi-conjugate residual method [35], CGS [36], the preconditioned nested splitting CG method [37], the generalized conjugate direction (GCD) method [38], the conjugate gradient least-squares (CGLS) method [39], and GPBiCG [40].

    In this paper, we propose a conjugate gradient algorithm to solve the generalized Sylvester-transpose matrix Eq (1.5) in the consistent case, where all given coefficient matrices and the unknown matrix are rectangular (see Section 4). The algorithm constructs a sequence of approximate solutions of (1.5) from any given initial value. It turns out that the desired solution comes out in the final step of the iterations with a satisfactory error (see Section 4). To validate the theory, we provide numerical experiments to show the applicability and performance of the algorithm (see Section 5). In particular, the performance of the algorithm is significantly better than those of the direct Kronecker linearization and recent gradient-based iterative algorithms.

    In this section, we recall useful tools and facts from matrix analysis that are used in later discussions. Throughout, we denote the set of all $m$-by-$n$ real matrices by $\mathbb{R}^{m\times n}$.

    Recall that the Kronecker product of $A = [a_{ij}] \in \mathbb{R}^{m\times n}$ and $B \in \mathbb{R}^{p\times q}$ is defined by

    $$A \otimes B = [a_{ij}B] \in \mathbb{R}^{mp\times nq}.$$

    Lemma 1 (see, e.g., [41]). The following properties hold for any compatible matrices $A, B, C$:

    1) $(A \otimes B)^T = A^T \otimes B^T$,

    2) $(A + B) \otimes C = A \otimes C + B \otimes C$,

    3) $A \otimes (B + C) = A \otimes B + A \otimes C$.

    The vector operator $\operatorname{Vec}(\cdot)$ assigns to each matrix $A = [a_{ij}] \in \mathbb{R}^{m\times n}$ the column vector

    $$\operatorname{Vec} A = [a_{11} \cdots a_{m1} \;\; a_{12} \cdots a_{m2} \;\; \cdots \;\; a_{1n} \cdots a_{mn}]^T \in \mathbb{R}^{mn}.$$

    This operator is bijective, linear, and compatible with the usual matrix multiplication in the following sense.

    Lemma 2 (see, e.g., [41]). For any $A \in \mathbb{R}^{m\times n}$, $B \in \mathbb{R}^{p\times q}$ and $X \in \mathbb{R}^{n\times p}$, we have

    $$\operatorname{Vec}(AXB) = (B^T \otimes A)\operatorname{Vec} X.$$
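    Lemma 2 is what makes vectorization convenient in matrix software: MATLAB's column-major `X(:)` is precisely $\operatorname{Vec} X$. The following snippet is a minimal numerical check of Lemma 2; the dimensions and random matrices are our own illustrative choices, not from the paper:

```matlab
% Numerical check of Lemma 2: Vec(A*X*B) = kron(B', A) * Vec(X).
% In MATLAB, X(:) stacks the columns of X, which is exactly Vec(X).
A = rand(4, 3);  X = rand(3, 5);  B = rand(5, 2);
Y = A * X * B;
err = norm(Y(:) - kron(B', A) * X(:));   % should be of order 1e-15
disp(err)
```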

    Recall that the commutation matrix $P(m,n)$ is the permutation matrix defined by

    $$P(m,n) = \sum_{i=1}^{m}\sum_{j=1}^{n} E_{ij} \otimes E_{ij}^T \in \mathbb{R}^{mn\times mn} \tag{2.1}$$

    where each $E_{ij} \in \mathbb{R}^{m\times n}$ has entry 1 in the $(i,j)$-th position and all other entries 0.

    Lemma 3 (see, e.g., [41]). For any $A \in \mathbb{R}^{m\times n}$ and $B \in \mathbb{R}^{p\times q}$, we have

    $$B \otimes A = P(m,p)^T (A \otimes B) P(n,q). \tag{2.2}$$

    Lemma 4 (see, e.g., [41]). For any matrix $X \in \mathbb{R}^{m\times n}$, we have

    $$\operatorname{Vec}(X^T) = P(m,n)\operatorname{Vec}(X). \tag{2.3}$$
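    Concretely, $P(m,n)$ is a row permutation of the identity matrix determined by the index map of the transpose. A minimal MATLAB sketch of this construction (our own, for illustration; not taken from the paper):

```matlab
% Commutation matrix P(m,n): P * Vec(X) = Vec(X') for X in R^{m x n},
% built by permuting the rows of the identity matrix.
m = 3; n = 4;
idx = reshape(1:m*n, m, n);        % idx(i,j) = position of x_ij within Vec(X)
I = eye(m*n);
P = I(reshape(idx', [], 1), :);    % reorder rows so that P*X(:) = Vec(X')
X = rand(m, n);  Xt = X';
disp(norm(P * X(:) - Xt(:)))       % should be exactly 0
```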

    Lemma 5 (see, e.g., [41]). For any matrices $A, B, X, Y$ of compatible dimensions, we have

    $$(\operatorname{Vec}(Y))^T (A \otimes B)\operatorname{Vec}(X) = \operatorname{tr}(A^T Y^T B X). \tag{2.4}$$

    Recall that the Frobenius norm of $A \in \mathbb{R}^{m\times n}$ is defined by

    $$\|A\| = \Big(\sum_{i=1}^{m}\sum_{j=1}^{n} a_{ij}^2\Big)^{1/2} = \big(\operatorname{tr}(A^T A)\big)^{1/2}.$$

    From now on, let $m, n, p, q, s, t \in \mathbb{N}$ be such that $mq = np$. Consider the generalized Sylvester-transpose matrix Eq (1.5), where $A_i \in \mathbb{R}^{m\times n}$, $B_i \in \mathbb{R}^{p\times q}$, $C_j \in \mathbb{R}^{m\times p}$, $D_j \in \mathbb{R}^{n\times q}$, $E \in \mathbb{R}^{m\times q}$ are given matrices, and $X \in \mathbb{R}^{n\times p}$ is unknown. Eq (1.5) includes the Lyapunov equation, the Sylvester equation, the equation $AXB + CXD = E$, and the equation $AXB + CX^TD = E$ as special cases.

    A direct method for solving Eq (1.5) is to transform it into an equivalent linear system. For convenience, denote $P = P(n,p)$. Applying the vector operator to (1.5) and utilizing Lemmas 2 and 4, we get

    $$\begin{aligned}
    \operatorname{Vec} E &= \operatorname{Vec}\Big(\sum_{i=1}^{s} A_i X B_i + \sum_{j=1}^{t} C_j X^T D_j\Big) = \sum_{i=1}^{s}(B_i^T \otimes A_i)\operatorname{Vec} X + \sum_{j=1}^{t}(D_j^T \otimes C_j)\operatorname{Vec} X^T \\
    &= \sum_{i=1}^{s}(B_i^T \otimes A_i)\operatorname{Vec} X + \sum_{j=1}^{t}(D_j^T \otimes C_j) P \operatorname{Vec} X = \Big(\sum_{i=1}^{s}(B_i^T \otimes A_i) + \sum_{j=1}^{t}(D_j^T \otimes C_j) P\Big)\operatorname{Vec} X.
    \end{aligned}\tag{3.1}$$

    Let us denote $x = \operatorname{Vec} X$, $b = \operatorname{Vec} E$, and

    $$K = \sum_{i=1}^{s}(B_i^T \otimes A_i) + \sum_{j=1}^{t}(D_j^T \otimes C_j) P \in \mathbb{R}^{mq\times np}. \tag{3.2}$$

    Thus, Eq (3.1) is equivalent to the linear algebraic system $Kx = b$. Hence, Eq (1.5) is consistent if and only if the associated linear system is consistent (i.e., $\operatorname{rank}[K \;\, b] = \operatorname{rank} K$). Once we solve for $x$, we recover the unknown matrix $X$ by the injectivity of the vector operator. However, if the coefficient matrices $A_i, B_i, C_j, D_j$ are of large sizes, then $K$ can be very large due to the Kronecker multiplication. Thus, traditional methods such as Gaussian elimination and LU factorization require a large amount of memory to solve the linear system for an exact solution. The direct method is therefore suitable for matrices of small sizes. For matrices of moderate/large sizes, it suffices to find a good approximate solution of Eq (1.5) via an iterative procedure.
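    As a concrete sketch of this direct method, consider the case $s = t = 1$, i.e., $A_1XB_1 + C_1X^TD_1 = E$, with illustrative dimensions satisfying $mq = np$; the commutation matrix is built as in Section 2, and a consistent right-hand side is manufactured from a known $X$:

```matlab
% Direct Kronecker linearization of A1*X*B1 + C1*X'*D1 = E (case s = t = 1).
m = 6; n = 3; p = 4; q = 2;                  % note mq = np = 12
A1 = rand(m, n);  B1 = rand(p, q);
C1 = rand(m, p);  D1 = rand(n, q);
Xtrue = rand(n, p);                          % manufacture a consistent E
E = A1*Xtrue*B1 + C1*Xtrue'*D1;
idx = reshape(1:n*p, n, p);                  % commutation matrix P = P(n,p)
I = eye(n*p);  P = I(reshape(idx', [], 1), :);
K = kron(B1', A1) + kron(D1', C1) * P;       % the matrix K of Eq (3.2)
x = K \ E(:);                                % solve K x = b
X = reshape(x, n, p);
disp(norm(E - A1*X*B1 - C1*X'*D1, 'fro'))    % residual, ~1e-15
```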

    The main task is to find a good approximate solution of the matrix Eq (1.5):

    Problem 4.1. Let $m, n, p, q, s, t \in \mathbb{N}$ be such that $mq = np$. Consider the generalized Sylvester-transpose matrix Eq (1.5), where $A_i \in \mathbb{R}^{m\times n}$, $B_i \in \mathbb{R}^{p\times q}$, $C_j \in \mathbb{R}^{m\times p}$, $D_j \in \mathbb{R}^{n\times q}$, $E \in \mathbb{R}^{m\times q}$ are given matrices, and $X \in \mathbb{R}^{n\times p}$ is unknown. Suppose that Eq (1.5) has a solution. Given an error tolerance $\epsilon > 0$, find $\tilde{X} \in \mathbb{R}^{n\times p}$ such that

    $$\Big\| E - \sum_{i=1}^{s} A_i \tilde{X} B_i - \sum_{j=1}^{t} C_j \tilde{X}^T D_j \Big\| < \epsilon.$$

    We will solve Problem 4.1 under the additional assumption that the matrix $K$ in (3.2) is symmetric. We propose the following algorithm:

    Algorithm 1: A CG algorithm for a generalized Sylvester-transpose matrix equation

    Input: $A_i \in \mathbb{R}^{m\times n}$, $B_i \in \mathbb{R}^{p\times q}$, $C_j \in \mathbb{R}^{m\times p}$, $D_j \in \mathbb{R}^{n\times q}$ for $i = 1, 2, \dots, s$, $j = 1, 2, \dots, t$, and $E \in \mathbb{R}^{m\times q}$.

    Initialization: Given $\epsilon > 0$, set $k = 0$ and $U_0 = 0$. Choose $X_0 \in \mathbb{R}^{n\times p}$ and compute
    $$R_0 = E - \sum_{i=1}^{s} A_i X_0 B_i - \sum_{j=1}^{t} C_j X_0^T D_j.$$

    Iteration: While $\|R_k\| > \epsilon$, compute
    $$U_{k+1} = R_k + \frac{\|R_k\|^2}{\|R_{k-1}\|^2} U_k \quad (k \ge 1; \;\; U_1 = R_0),$$
    $$V_{k+1} = \sum_{i=1}^{s} A_i U_{k+1} B_i + \sum_{j=1}^{t} C_j U_{k+1}^T D_j, \qquad \alpha_{k+1} = \operatorname{tr}(V_{k+1}^T U_{k+1}),$$
    $$X_{k+1} = X_k + \frac{\|R_k\|^2}{\alpha_{k+1}} U_{k+1}, \qquad R_{k+1} = R_k - \frac{\|R_k\|^2}{\alpha_{k+1}} V_{k+1},$$
    and set $k \leftarrow k + 1$.

    We call $X_k$ the approximate solution at the $k$-th step. The main computation

    $$X_{k+1} = X_k + \frac{\|R_k\|^2}{\alpha_{k+1}} U_{k+1}$$

    means that the next approximation $X_{k+1}$ is the sum of the current one $X_k$ and the search direction $U_{k+1}$ scaled by the step size $\|R_k\|^2/\alpha_{k+1}$.

    Remark 4.2. The stopping rule of Algorithm 1 is based on the size of the residual matrix $R_k$. One can impose another stopping criterion besides $\|R_k\| \le \epsilon$, e.g., that the norm of the difference $\|X_{k+1} - X_k\|$ between successive iterates is small enough.

    Remark 4.3. Let us discuss the complexity of Algorithm 1. For convenience, suppose that all matrices in Eq (1.5) are of size $n \times n$. Each step of the algorithm requires matrix additions ($O(n^2)$), matrix multiplications ($O(n^3)$), matrix transpositions ($O(n^2)$), matrix norms ($O(n^2)$), and matrix traces ($O(n)$). In summary, the cost of each step is $O(n^3)$, and thus the per-iteration runtime complexity of the algorithm is cubic.
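    To make the steps concrete, the following is a minimal MATLAB sketch of Algorithm 1 as stated above. The function name, the cell-array interface, and the helper `lhs` are our own illustrative choices, not the authors' code:

```matlab
function X = cg_sylvester_transpose(A, B, C, D, E, X0, tol)
% Minimal sketch of Algorithm 1. A, B, C, D are cell arrays
% {A_1,...,A_s}, {B_1,...,B_s}, {C_1,...,C_t}, {D_1,...,D_t}.
    X = X0;
    R = E - lhs(A, B, C, D, X);          % residual R_0
    U = zeros(size(X));                  % U_0 = 0, so the first U below is R_0
    rho_prev = 1;                        % dummy ||R_{-1}||^2 (harmless: U_0 = 0)
    for k = 0:numel(X)                   % Theorem 4.5: at most np iterations
        rho = norm(R, 'fro')^2;          % ||R_k||^2
        if sqrt(rho) <= tol, break; end
        U = R + (rho / rho_prev) * U;    % search direction U_{k+1}
        V = lhs(A, B, C, D, U);          % V_{k+1}
        alpha = trace(V' * U);           % alpha_{k+1}
        X = X + (rho / alpha) * U;       % X_{k+1}
        R = R - (rho / alpha) * V;       % R_{k+1}
        rho_prev = rho;
    end
end

function Y = lhs(A, B, C, D, X)
% Evaluates sum_i A_i*X*B_i + sum_j C_j*X'*D_j.
    Y = zeros(size(C{1}, 1), size(D{1}, 2));
    for i = 1:numel(A), Y = Y + A{i} * X * B{i}; end
    for j = 1:numel(C), Y = Y + C{j} * X' * D{j}; end
end
```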

    Next, we will show that, for any given initial matrix $X_0$, Algorithm 1 produces approximate solutions such that the set of residual matrices is orthogonal, and thus we get the desired solution in finitely many steps. We divide the proof into several lemmas.

    Lemma 6. Consider Problem 4.1. Suppose that the sequence $\{R_i\}$ is generated by Algorithm 1. We have

    $$R_{k+1} = R_k - \frac{\|R_k\|^2}{\alpha_{k+1}} V_{k+1} \quad \text{for } k = 0, 1, 2, \dots \tag{4.1}$$

    Proof. From Algorithm 1, we have that for any $k$,

    $$\begin{aligned}
    R_{k+1} &= E - \sum_{i=1}^{s} A_i X_{k+1} B_i - \sum_{j=1}^{t} C_j X_{k+1}^T D_j \\
    &= E - \sum_{i=1}^{s} A_i\Big(X_k + \frac{\|R_k\|^2}{\alpha_{k+1}} U_{k+1}\Big)B_i - \sum_{j=1}^{t} C_j\Big(X_k^T + \frac{\|R_k\|^2}{\alpha_{k+1}} U_{k+1}^T\Big)D_j \\
    &= E - \sum_{i=1}^{s} A_i X_k B_i - \sum_{j=1}^{t} C_j X_k^T D_j - \frac{\|R_k\|^2}{\alpha_{k+1}}\Big(\sum_{i=1}^{s} A_i U_{k+1} B_i + \sum_{j=1}^{t} C_j U_{k+1}^T D_j\Big) \\
    &= R_k - \frac{\|R_k\|^2}{\alpha_{k+1}} V_{k+1}.
    \end{aligned}$$

    Lemma 7. Assume that the matrix $K$ in (3.2) is symmetric. The sequences $\{U_i\}$ and $\{V_i\}$ generated by Algorithm 1 satisfy

    $$\operatorname{tr}(U_m^T V_n) = \operatorname{tr}(V_m^T U_n) \quad \text{for any } m, n. \tag{4.2}$$

    Proof. Applying Lemmas 1–5 and the symmetry of $K$, we have

    $$\begin{aligned}
    \operatorname{tr}(V_m^T U_n) &= (\operatorname{Vec} V_m)^T \operatorname{Vec} U_n = \Big[\sum_{i=1}^{s}(B_i^T \otimes A_i)\operatorname{Vec} U_m + \sum_{j=1}^{t}(D_j^T \otimes C_j)\operatorname{Vec} U_m^T\Big]^T \operatorname{Vec} U_n \\
    &= [K \operatorname{Vec} U_m]^T \operatorname{Vec} U_n = (\operatorname{Vec} U_m)^T K \operatorname{Vec} U_n \\
    &= (\operatorname{Vec} U_m)^T \Big[\operatorname{Vec}\Big(\sum_{i=1}^{s} A_i U_n B_i + \sum_{j=1}^{t} C_j U_n^T D_j\Big)\Big] = (\operatorname{Vec} U_m)^T \operatorname{Vec} V_n = \operatorname{tr}(U_m^T V_n)
    \end{aligned}$$

    for any $m, n$.

    Lemma 8. Assume that the matrix $K$ is symmetric. The sequences $\{R_i\}$, $\{U_i\}$ and $\{V_i\}$ generated by Algorithm 1 satisfy

    $$\operatorname{tr}(R_m^T R_{m-1}) = 0 \quad \text{and} \quad \operatorname{tr}(U_{m+1}^T V_m) = 0 \quad \text{for any } m. \tag{4.3}$$

    Proof. We proceed by the induction principle, computing the related terms via Lemmas 6 and 7. For $m = 1$, we get

    $$\operatorname{tr}(R_1^T R_0) = \operatorname{tr}\Big[\Big(R_0 - \frac{\|R_0\|^2}{\alpha_1} V_1\Big)^T R_0\Big] = \operatorname{tr}(R_0^T R_0) - \frac{\|R_0\|^2}{\alpha_1}\operatorname{tr}(V_1^T R_0) = \|R_0\|^2 - \|R_0\|^2 = 0,$$

    and

    $$\operatorname{tr}(U_2^T V_1) = \operatorname{tr}\Big[\Big(R_1 + \frac{\|R_1\|^2}{\|R_0\|^2} U_1\Big)^T V_1\Big] = \operatorname{tr}(R_1^T V_1) + \frac{\|R_1\|^2}{\|R_0\|^2}\operatorname{tr}(U_1^T V_1) = -\frac{\alpha_1}{\|R_0\|^2}\operatorname{tr}(R_1^T R_1) + \frac{\alpha_1 \|R_1\|^2}{\|R_0\|^2} = 0.$$

    These imply that (4.3) holds for $m = 1$.

    In the inductive step, for $m = k$ we assume that $\operatorname{tr}(R_k^T R_{k-1}) = 0$ and $\operatorname{tr}(U_{k+1}^T V_k) = 0$. Then

    $$\begin{aligned}
    \operatorname{tr}(R_{k+1}^T R_k) &= \operatorname{tr}\Big[\Big(R_k - \frac{\|R_k\|^2}{\alpha_{k+1}} V_{k+1}\Big)^T R_k\Big] = \operatorname{tr}(R_k^T R_k) - \frac{\|R_k\|^2}{\alpha_{k+1}}\operatorname{tr}(V_{k+1}^T R_k) \\
    &= \operatorname{tr}(R_k^T R_k) - \frac{\|R_k\|^2}{\alpha_{k+1}}\operatorname{tr}\Big(V_{k+1}^T\Big(U_{k+1} - \frac{\|R_k\|^2}{\|R_{k-1}\|^2} U_k\Big)\Big) \\
    &= \|R_k\|^2 - \frac{\|R_k\|^2}{\alpha_{k+1}}\operatorname{tr}(V_{k+1}^T U_{k+1}) = 0,
    \end{aligned}$$

    and

    $$\begin{aligned}
    \operatorname{tr}(U_{k+2}^T V_{k+1}) &= \operatorname{tr}\Big[\Big(R_{k+1} + \frac{\|R_{k+1}\|^2}{\|R_k\|^2} U_{k+1}\Big)^T V_{k+1}\Big] = \operatorname{tr}(R_{k+1}^T V_{k+1}) + \frac{\|R_{k+1}\|^2}{\|R_k\|^2}\operatorname{tr}(U_{k+1}^T V_{k+1}) \\
    &= \operatorname{tr}\Big(R_{k+1}^T\Big(\frac{\alpha_{k+1}}{\|R_k\|^2}(R_k - R_{k+1})\Big)\Big) + \frac{\|R_{k+1}\|^2}{\|R_k\|^2}\alpha_{k+1} \\
    &= \frac{\alpha_{k+1}}{\|R_k\|^2}\big[\operatorname{tr}(R_{k+1}^T R_k) - \operatorname{tr}(R_{k+1}^T R_{k+1})\big] + \frac{\|R_{k+1}\|^2}{\|R_k\|^2}\alpha_{k+1} = 0.
    \end{aligned}$$

    Hence, Eq (4.3) holds for any $m$.

    Lemma 9. Assume that the matrix $K$ is symmetric. Suppose the sequences $\{R_i\}$, $\{U_i\}$ and $\{V_i\}$ are generated by Algorithm 1. Then

    $$\operatorname{tr}(R_m^T R_0) = 0, \quad \operatorname{tr}(U_{m+1}^T V_1) = 0 \quad \text{for any } m. \tag{4.4}$$

    Proof. From Lemma 8 with $m = 1$, we get $\operatorname{tr}(R_1^T R_0) = 0$ and $\operatorname{tr}(U_2^T V_1) = 0$. Now, suppose that Eq (4.4) is true for all $m = 1, \dots, k$. From Lemmas 6 and 7, for $m = k+1$ we write

    $$\begin{aligned}
    \operatorname{tr}(R_{k+1}^T R_0) &= \operatorname{tr}\Big[\Big(R_k - \frac{\|R_k\|^2}{\alpha_{k+1}} V_{k+1}\Big)^T R_0\Big] = \operatorname{tr}(R_k^T R_0) - \frac{\|R_k\|^2}{\alpha_{k+1}}\operatorname{tr}(V_{k+1}^T R_0) \\
    &= -\frac{\|R_k\|^2}{\alpha_{k+1}}\operatorname{tr}(V_{k+1}^T U_1) = -\frac{\|R_k\|^2}{\alpha_{k+1}}\operatorname{tr}(U_{k+1}^T V_1) = 0,
    \end{aligned}$$

    and

    $$\begin{aligned}
    \operatorname{tr}(U_{k+2}^T V_1) &= \operatorname{tr}(V_{k+2}^T U_1) = \operatorname{tr}\Big[\Big(\frac{\alpha_{k+2}}{\|R_{k+1}\|^2}(R_{k+1} - R_{k+2})\Big)^T U_1\Big] \\
    &= \frac{\alpha_{k+2}}{\|R_{k+1}\|^2}\big[\operatorname{tr}(R_{k+1}^T U_1) - \operatorname{tr}(R_{k+2}^T U_1)\big] = \frac{\alpha_{k+2}}{\|R_{k+1}\|^2}\big[\operatorname{tr}(R_{k+1}^T R_0) - \operatorname{tr}(R_{k+2}^T R_0)\big] = 0.
    \end{aligned}$$

    Hence, Eq (4.4) holds for any $m$.

    Theorem 4.4. Assume that $K$ is symmetric. Suppose the sequences $\{R_i\}$, $\{U_i\}$ and $\{V_i\}$ are generated by Algorithm 1. Then for any $m, n$ such that $m \neq n$, we have

    $$\operatorname{tr}(R_{m-1}^T R_{n-1}) = 0 \quad \text{and} \quad \operatorname{tr}(U_m^T V_n) = 0. \tag{4.5}$$

    Proof. By Lemma 7 and the fact that $\operatorname{tr}(R_{m-1}^T R_{n-1}) = \operatorname{tr}(R_{n-1}^T R_{m-1})$ for any $m, n$, it suffices to prove (4.5) for any $m, n$ such that $m > n$. By Lemma 8, Eq (4.5) holds for $m = n+1$. For $m = n+2$, we have

    $$\begin{aligned}
    \operatorname{tr}(R_{n+2}^T R_n) &= \operatorname{tr}\Big[\Big(R_{n+1} - \frac{\|R_{n+1}\|^2}{\alpha_{n+2}} V_{n+2}\Big)^T R_n\Big] = -\frac{\|R_{n+1}\|^2}{\alpha_{n+2}}\operatorname{tr}\Big[V_{n+2}^T\Big(U_{n+1} - \frac{\|R_n\|^2}{\|R_{n-1}\|^2} U_n\Big)\Big] \\
    &= \frac{\|R_{n+1}\|^2}{\alpha_{n+2}} \cdot \frac{\|R_n\|^2}{\|R_{n-1}\|^2}\operatorname{tr}\Big[\Big(R_{n+1} + \frac{\|R_{n+1}\|^2}{\|R_n\|^2} U_{n+1}\Big)^T V_n\Big] \\
    &= \frac{\|R_{n+1}\|^2}{\alpha_{n+2}} \cdot \frac{\|R_n\|^2}{\|R_{n-1}\|^2}\operatorname{tr}\Big[\frac{\alpha_n}{\|R_{n-1}\|^2} R_{n+1}^T (R_{n-1} - R_n)\Big] \\
    &= \frac{\|R_{n+1}\|^2}{\alpha_{n+2}} \cdot \frac{\|R_n\|^2}{\|R_{n-1}\|^2} \cdot \frac{\alpha_n}{\|R_{n-1}\|^2}\operatorname{tr}(R_{n+1}^T R_{n-1}),
    \end{aligned}$$
    $$\begin{aligned}
    \operatorname{tr}(R_{n+1}^T R_{n-1}) &= \operatorname{tr}\Big[\Big(R_n - \frac{\|R_n\|^2}{\alpha_{n+1}} V_{n+1}\Big)^T R_{n-1}\Big] = -\frac{\|R_n\|^2}{\alpha_{n+1}}\operatorname{tr}\Big[V_{n+1}^T\Big(U_n - \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2} U_{n-1}\Big)\Big] \\
    &= \frac{\|R_n\|^2}{\alpha_{n+1}} \cdot \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2}\operatorname{tr}\Big[\Big(R_n + \frac{\|R_n\|^2}{\|R_{n-1}\|^2} U_n\Big)^T V_{n-1}\Big] \\
    &= \frac{\|R_n\|^2}{\alpha_{n+1}} \cdot \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2}\operatorname{tr}\Big[\frac{\alpha_{n-1}}{\|R_{n-2}\|^2} R_n^T (R_{n-2} - R_{n-1})\Big] \\
    &= \frac{\|R_n\|^2}{\alpha_{n+1}} \cdot \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2} \cdot \frac{\alpha_{n-1}}{\|R_{n-2}\|^2}\operatorname{tr}(R_n^T R_{n-2}),
    \end{aligned}$$
    $$\begin{aligned}
    \operatorname{tr}(U_{n+2}^T V_n) &= \operatorname{tr}\Big[\Big(R_{n+1} + \frac{\|R_{n+1}\|^2}{\|R_n\|^2} U_{n+1}\Big)^T V_n\Big] = \operatorname{tr}\Big[R_{n+1}^T\Big(\frac{\alpha_n}{\|R_{n-1}\|^2}(R_{n-1} - R_n)\Big)\Big] \\
    &= \frac{\alpha_n}{\|R_{n-1}\|^2}\operatorname{tr}\Big[\Big(R_n - \frac{\|R_n\|^2}{\alpha_{n+1}} V_{n+1}\Big)^T R_{n-1}\Big] \\
    &= -\frac{\alpha_n}{\|R_{n-1}\|^2} \cdot \frac{\|R_n\|^2}{\alpha_{n+1}}\Big[\operatorname{tr}(V_{n+1}^T U_n) - \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2}\operatorname{tr}(V_{n+1}^T U_{n-1})\Big] \\
    &= \frac{\alpha_n}{\|R_{n-1}\|^2} \cdot \frac{\|R_n\|^2}{\alpha_{n+1}} \cdot \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2}\operatorname{tr}(U_{n+1}^T V_{n-1}),
    \end{aligned}$$

    and

    $$\begin{aligned}
    \operatorname{tr}(U_{n+1}^T V_{n-1}) &= \operatorname{tr}\Big[\Big(R_n + \frac{\|R_n\|^2}{\|R_{n-1}\|^2} U_n\Big)^T V_{n-1}\Big] = \operatorname{tr}\Big[R_n^T\Big(\frac{\alpha_{n-1}}{\|R_{n-2}\|^2}(R_{n-2} - R_{n-1})\Big)\Big] \\
    &= \frac{\alpha_{n-1}}{\|R_{n-2}\|^2}\operatorname{tr}\Big[\Big(R_{n-1} - \frac{\|R_{n-1}\|^2}{\alpha_n} V_n\Big)^T R_{n-2}\Big] \\
    &= -\frac{\alpha_{n-1}}{\|R_{n-2}\|^2} \cdot \frac{\|R_{n-1}\|^2}{\alpha_n}\Big[\operatorname{tr}(V_n^T U_{n-1}) - \frac{\|R_{n-2}\|^2}{\|R_{n-3}\|^2}\operatorname{tr}(V_n^T U_{n-2})\Big] \\
    &= \frac{\alpha_{n-1}}{\|R_{n-2}\|^2} \cdot \frac{\|R_{n-1}\|^2}{\alpha_n} \cdot \frac{\|R_{n-2}\|^2}{\|R_{n-3}\|^2}\operatorname{tr}(U_n^T V_{n-2}).
    \end{aligned}$$

    Thus, we can write $\operatorname{tr}(R_{n+2}^T R_n)$ and $\operatorname{tr}(U_{n+2}^T V_n)$ in terms of $\operatorname{tr}(R_{n+1}^T R_{n-1})$ and $\operatorname{tr}(U_{n+1}^T V_{n-1})$, respectively. Repeating this process, the terms $\operatorname{tr}(R_2^T R_0)$ and $\operatorname{tr}(U_3^T V_1)$ eventually show up. By Lemma 9, we get $\operatorname{tr}(R_{n+2}^T R_n) = 0$ and $\operatorname{tr}(U_{n+2}^T V_n) = 0$.

    Next, for $m = n+3$, we have

    $$\begin{aligned}
    \operatorname{tr}(R_{n+3}^T R_n) &= \operatorname{tr}\Big[\Big(R_{n+2} - \frac{\|R_{n+2}\|^2}{\alpha_{n+3}} V_{n+3}\Big)^T R_n\Big] = -\frac{\|R_{n+2}\|^2}{\alpha_{n+3}}\operatorname{tr}\Big[V_{n+3}^T\Big(U_{n+1} - \frac{\|R_n\|^2}{\|R_{n-1}\|^2} U_n\Big)\Big] \\
    &= \frac{\|R_{n+2}\|^2}{\alpha_{n+3}} \cdot \frac{\|R_n\|^2}{\|R_{n-1}\|^2}\operatorname{tr}\Big[\Big(R_{n+2} + \frac{\|R_{n+2}\|^2}{\|R_{n+1}\|^2} U_{n+2}\Big)^T V_n\Big] \\
    &= \frac{\|R_{n+2}\|^2}{\alpha_{n+3}} \cdot \frac{\|R_n\|^2}{\|R_{n-1}\|^2}\operatorname{tr}\Big[\frac{\alpha_n}{\|R_{n-1}\|^2} R_{n+2}^T (R_{n-1} - R_n)\Big] \\
    &= \frac{\|R_{n+2}\|^2}{\alpha_{n+3}} \cdot \frac{\|R_n\|^2}{\|R_{n-1}\|^2} \cdot \frac{\alpha_n}{\|R_{n-1}\|^2}\operatorname{tr}(R_{n+2}^T R_{n-1}),
    \end{aligned}$$
    $$\begin{aligned}
    \operatorname{tr}(R_{n+2}^T R_{n-1}) &= \operatorname{tr}\Big[\Big(R_{n+1} - \frac{\|R_{n+1}\|^2}{\alpha_{n+2}} V_{n+2}\Big)^T R_{n-1}\Big] = -\frac{\|R_{n+1}\|^2}{\alpha_{n+2}}\operatorname{tr}\Big[V_{n+2}^T\Big(U_n - \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2} U_{n-1}\Big)\Big] \\
    &= \frac{\|R_{n+1}\|^2}{\alpha_{n+2}} \cdot \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2}\operatorname{tr}\Big[\Big(R_{n+1} + \frac{\|R_{n+1}\|^2}{\|R_n\|^2} U_{n+1}\Big)^T V_{n-1}\Big] \\
    &= \frac{\|R_{n+1}\|^2}{\alpha_{n+2}} \cdot \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2}\operatorname{tr}\Big[\frac{\alpha_{n-1}}{\|R_{n-2}\|^2} R_{n+1}^T (R_{n-2} - R_{n-1})\Big] \\
    &= \frac{\|R_{n+1}\|^2}{\alpha_{n+2}} \cdot \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2} \cdot \frac{\alpha_{n-1}}{\|R_{n-2}\|^2}\operatorname{tr}(R_{n+1}^T R_{n-2}),
    \end{aligned}$$
    $$\begin{aligned}
    \operatorname{tr}(U_{n+3}^T V_n) &= \operatorname{tr}\Big[\Big(R_{n+2} + \frac{\|R_{n+2}\|^2}{\|R_{n+1}\|^2} U_{n+2}\Big)^T V_n\Big] = \operatorname{tr}\Big[R_{n+2}^T\Big(\frac{\alpha_n}{\|R_{n-1}\|^2}(R_{n-1} - R_n)\Big)\Big] \\
    &= \frac{\alpha_n}{\|R_{n-1}\|^2}\operatorname{tr}\Big[\Big(R_{n+1} - \frac{\|R_{n+1}\|^2}{\alpha_{n+2}} V_{n+2}\Big)^T R_{n-1}\Big] \\
    &= -\frac{\alpha_n}{\|R_{n-1}\|^2} \cdot \frac{\|R_{n+1}\|^2}{\alpha_{n+2}}\Big[\operatorname{tr}(V_{n+2}^T U_n) - \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2}\operatorname{tr}(V_{n+2}^T U_{n-1})\Big] \\
    &= \frac{\alpha_n}{\|R_{n-1}\|^2} \cdot \frac{\|R_{n+1}\|^2}{\alpha_{n+2}} \cdot \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2}\operatorname{tr}(U_{n+2}^T V_{n-1}),
    \end{aligned}$$

    and

    $$\begin{aligned}
    \operatorname{tr}(U_{n+2}^T V_{n-1}) &= \operatorname{tr}\Big[\Big(R_{n+1} + \frac{\|R_{n+1}\|^2}{\|R_n\|^2} U_{n+1}\Big)^T V_{n-1}\Big] = \operatorname{tr}\Big[R_{n+1}^T\Big(\frac{\alpha_{n-1}}{\|R_{n-2}\|^2}(R_{n-2} - R_{n-1})\Big)\Big] \\
    &= \frac{\alpha_{n-1}}{\|R_{n-2}\|^2}\operatorname{tr}\Big[\Big(R_n - \frac{\|R_n\|^2}{\alpha_{n+1}} V_{n+1}\Big)^T R_{n-2}\Big] \\
    &= -\frac{\alpha_{n-1}}{\|R_{n-2}\|^2} \cdot \frac{\|R_n\|^2}{\alpha_{n+1}}\Big[\operatorname{tr}(V_{n+1}^T U_{n-1}) - \frac{\|R_{n-2}\|^2}{\|R_{n-3}\|^2}\operatorname{tr}(V_{n+1}^T U_{n-2})\Big] \\
    &= \frac{\alpha_{n-1}}{\|R_{n-2}\|^2} \cdot \frac{\|R_n\|^2}{\alpha_{n+1}} \cdot \frac{\|R_{n-2}\|^2}{\|R_{n-3}\|^2}\operatorname{tr}(U_{n+1}^T V_{n-2}).
    \end{aligned}$$

    Hence, we can write $\operatorname{tr}(R_{n+3}^T R_n)$ and $\operatorname{tr}(U_{n+3}^T V_n)$ in terms of $\operatorname{tr}(R_{n+2}^T R_{n-1})$ and $\operatorname{tr}(U_{n+2}^T V_{n-1})$, respectively. Repeating this process, the terms $\operatorname{tr}(R_3^T R_0)$ and $\operatorname{tr}(U_4^T V_1)$ eventually show up; by Lemma 9, we get $\operatorname{tr}(R_{n+3}^T R_n) = 0$ and $\operatorname{tr}(U_{n+3}^T V_n) = 0$.

    Now suppose that $\operatorname{tr}(R_{m-1}^T R_{n-1}) = \operatorname{tr}(U_m^T V_n) = 0$ for $m = n+1, \dots, k$. Then for $m = k+1$, we have

    $$\begin{aligned}
    \operatorname{tr}(R_k^T R_{n-1}) &= \operatorname{tr}\Big[\Big(R_{k-1} - \frac{\|R_{k-1}\|^2}{\alpha_k} V_k\Big)^T R_{n-1}\Big] = \operatorname{tr}(R_{k-1}^T R_{n-1}) - \frac{\|R_{k-1}\|^2}{\alpha_k}\operatorname{tr}(V_k^T R_{n-1}) \\
    &= -\frac{\|R_{k-1}\|^2}{\alpha_k}\operatorname{tr}\Big[V_k^T\Big(U_n - \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2} U_{n-1}\Big)\Big] \\
    &= -\frac{\|R_{k-1}\|^2}{\alpha_k}\Big[\operatorname{tr}(V_k^T U_n) - \frac{\|R_{n-1}\|^2}{\|R_{n-2}\|^2}\operatorname{tr}(V_k^T U_{n-1})\Big] = 0,
    \end{aligned}$$

    and

    $$\begin{aligned}
    \operatorname{tr}(U_{k+1}^T V_{n-1}) &= \operatorname{tr}\Big[\Big(R_k + \frac{\|R_k\|^2}{\|R_{k-1}\|^2} U_k\Big)^T V_{n-1}\Big] = \operatorname{tr}(R_k^T V_{n-1}) + \frac{\|R_k\|^2}{\|R_{k-1}\|^2}\operatorname{tr}(U_k^T V_{n-1}) \\
    &= \operatorname{tr}\Big[R_k^T\Big(\frac{\alpha_{n-1}}{\|R_{n-2}\|^2}(R_{n-2} - R_{n-1})\Big)\Big] = \frac{\alpha_{n-1}}{\|R_{n-2}\|^2}\operatorname{tr}\big(R_k^T R_{n-2} - R_k^T R_{n-1}\big) = 0.
    \end{aligned}$$

    Hence, $\operatorname{tr}(R_{m-1}^T R_{n-1}) = 0$ and $\operatorname{tr}(U_m^T V_n) = 0$ for any $m, n$ such that $m \neq n$.

    Theorem 4.5. Consider Problem 4.1 under the assumption that the matrix $K$ is symmetric. Suppose that the sequence $\{X_i\}$ is generated by Algorithm 1. Then for any given initial matrix $X_0 \in \mathbb{R}^{n\times p}$, an exact solution $X$ can be obtained in at most $np$ iteration steps.

    Proof. Suppose that $R_i \neq 0$ for $i = 0, 1, \dots, np-1$. Then we compute $X_{np}$ according to Algorithm 1. Assume that $R_{np} \neq 0$. By Theorem 4.4, the set $\{R_0, R_1, \dots, R_{np}\}$ is orthogonal in $\mathbb{R}^{n\times p}$, and hence linearly independent. Since the dimension of $\mathbb{R}^{n\times p}$ is $np$, any linearly independent subset of $\mathbb{R}^{n\times p}$ has at most $np$ elements. This contradicts the fact that the set $\{R_0, R_1, \dots, R_{np}\}$ has $np+1$ elements. Thus $R_{np} = 0$, and hence $X_{np}$ is a solution of the equation.

    In this section, we report numerical results to illustrate the applicability and effectiveness of Algorithm 1. All iterations were carried out in MATLAB R2021a on macOS (Apple M1: 8-core CPU, 8-core GPU, 8 GB RAM, 512 GB storage). We perform experiments for several generalized Sylvester-transpose matrix equations and an interesting special case, namely, the Sylvester equation. We vary the given coefficient matrices so that they are square/non-square sparse/dense matrices of moderate/large sizes. The dense matrices considered here involve the matrix whose entries are all 1, denoted by ones. The identity matrix of size $n \times n$ is denoted by $I_n$. For each experiment, we set the stopping rule to $\|R_k\| \le \epsilon$ where $\epsilon = 10^{-3}$. We discuss the performance of the algorithm through the norm of the residual matrices, the iteration number, and the computational time (CT). The CT (in seconds) is measured by the tic-toc function in MATLAB.
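    The examples below write tridiagonal test matrices as tridiag(a,b,c). A plausible MATLAB helper matching this notation is sketched here; our reading (a on the subdiagonal, b on the main diagonal, c on the superdiagonal) is an assumption, since the paper does not spell out the convention:

```matlab
% Hypothetical helper for the tridiag(a,b,c) notation used in the examples:
% an n-by-n matrix with b on the main diagonal and a, c on the sub- and
% superdiagonals, respectively.
tridiag = @(a, b, c, n) full(spdiags(repmat([a b c], n, 1), -1:1, n, n));
```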

    In the following three examples, we examine the applicability of Algorithm 1 as well as its performance compared to the direct Kronecker linearization described in Section 3.

    Example 1. Consider a moderate-scaled generalized Sylvester-transpose equation

    $$A_1XB_1 + A_2XB_2 + C_1X^TD_1 + C_2X^TD_2 = E$$

    where all matrices are 50×50 tridiagonal matrices given by

    $$\begin{aligned}
    &A_1 = \operatorname{tridiag}(1,2,1), \quad A_2 = \operatorname{tridiag}(1,1,1), \quad B_1 = \operatorname{tridiag}(2,0,2), \\
    &B_2 = \operatorname{tridiag}(2,1,2), \quad C_1 = \operatorname{tridiag}(0,2,0), \quad C_2 = \operatorname{tridiag}(1,2,1), \\
    &D_1 = \operatorname{tridiag}(0,4,0), \quad D_2 = \operatorname{tridiag}(2,4,2), \quad E = \operatorname{tridiag}(1,1,9).
    \end{aligned}$$

    We run Algorithm 1 using the initial matrix $X_0 = 0.25 \times \texttt{ones} \in \mathbb{R}^{50\times 50}$. According to Theorem 4.5, Algorithm 1 will produce a solution of the equation within $10^4$ iterations. The resulting simulation, illustrated in Figure 1, shows the norms of the residual matrices $R_k$ at each iteration.
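    For illustration, a hypothetical driver for this experiment, reusing the `tridiag` helper and the `cg_sylvester_transpose` sketch given earlier (coefficient values as printed above):

```matlab
% Hypothetical driver for Example 1 (not the authors' script).
n = 50;
A1 = tridiag(1,2,1,n);  A2 = tridiag(1,1,1,n);
B1 = tridiag(2,0,2,n);  B2 = tridiag(2,1,2,n);
C1 = tridiag(0,2,0,n);  C2 = tridiag(1,2,1,n);
D1 = tridiag(0,4,0,n);  D2 = tridiag(2,4,2,n);
E  = tridiag(1,1,9,n);
X0 = 0.25 * ones(n);
tic;
X = cg_sylvester_transpose({A1,A2}, {B1,B2}, {C1,C2}, {D1,D2}, E, X0, 1e-3);
toc
```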

    Figure 1.  Relative error for Example 1.

    Although the errors $\|R_k\|$ go up and down during the iterations, they generally decrease to zero. The algorithm takes 138 iterations to get the desired solution (so that $\|R_k\| \le 10^{-3}$), which is significantly fewer than the theoretical bound ($10^4$ iterations). For the computational time, Algorithm 1 spends 0.131079 seconds in total to get the desired solution, while the direct Kronecker linearization consumes 1.581769 seconds to obtain the exact solution. Thus, the performance of Algorithm 1 is significantly better than that of the direct method. Moreover, for sparse coefficient matrices, Algorithm 1 can produce the desired solution in far fewer iterations (here, 138) than the theoretical bound ($10^4$ in this case).

    Example 2. Consider a generalized Sylvester-transpose matrix equation

    $$A_1XB_1 + A_2XB_2 + A_3XB_3 + C_1X^TD_1 = E$$

    with moderately sized rectangular coefficient matrices as follows:

    $$\begin{aligned}
    &A_1 = \operatorname{tridiag}(1,3,1), \quad A_2 = \operatorname{tridiag}(1,2,1), \quad A_3 = \operatorname{tridiag}(1,1,1) \in \mathbb{R}^{40\times 40}, \\
    &B_1 = \operatorname{tridiag}(2,1,2), \quad B_2 = \operatorname{tridiag}(1,3,1), \quad B_3 = \operatorname{tridiag}(2,3,2) \in \mathbb{R}^{50\times 50}, \\
    &C_1 = 3 \times \texttt{ones} \in \mathbb{R}^{40\times 50}, \quad D_1 = 3 \times \texttt{ones} \in \mathbb{R}^{40\times 50}, \quad E = 0.9 \times \texttt{ones} \in \mathbb{R}^{40\times 50}.
    \end{aligned}$$

    Taking an initial matrix $X_0 \in \mathbb{R}^{40\times 50}$, we get an approximate solution $X_k \in \mathbb{R}^{40\times 50}$ with a satisfactory error $\|R_k\| \le 10^{-3}$ in 164 steps, using 0.196250 seconds. We see in Figure 2 that, although the errors $\|R_k\|$ go up and down during the iterations, they generally decrease to zero. On the other hand, the direct Kronecker linearization consumes 0.811170 seconds to get an exact solution. Thus, Algorithm 1 is applicable and effective.

    Figure 2.  Relative error for Example 2.

    Example 3. Consider a large-scaled generalized Sylvester-transpose equation

    $$A_1XB_1 + C_1X^TD_1 + C_2X^TD_2 = E$$

    where all matrices are 100×100 tridiagonal matrices given by

    $$\begin{aligned}
    &A_1 = \operatorname{tridiag}(2,6,2), \quad B_1 = \operatorname{tridiag}(2,1,2), \quad C_1 = \operatorname{tridiag}(0,1,0), \quad C_2 = \operatorname{tridiag}(1,2,1), \\
    &D_1 = \operatorname{tridiag}(0,2,0), \quad D_2 = \operatorname{tridiag}(2,4,2), \quad E = \operatorname{tridiag}(1,8,1).
    \end{aligned}$$

    The resulting simulation of Algorithm 1 using the initial matrix $X_0 = 0.5 \times \texttt{ones} \in \mathbb{R}^{100\times 100}$ is shown in the next figure.

    Figure 3 shows the error gradually decreasing below $\epsilon = 10^{-3}$ in 774 steps, consuming around 2 seconds.

    Figure 3.  Relative error for Example 3.

    Next, we investigate the effect of changing the initial point. We run experiments for the initial matrices $X_0 = 5 \times \texttt{ones}$, $X_0 = 0$, and $X_0 = -5 \times \texttt{ones}$. Table 1 shows that, regardless of the initial matrix, we get the desired solution in around 2 seconds. On the other hand, the direct method consumes around 70 seconds to get an exact solution. Thus, Algorithm 1 significantly outperforms the direct method.

    Table 1.  Relative errors and CTs for Example 3.

    Initial matrix     Iterations    CT (seconds)    Relative error
    Direct             -             69.500953       0
    X0 = 5×ones        830           2.247507        8.9145×10^{-4}
    X0 = 0.5×ones      774           1.986782        7.5755×10^{-4}
    X0 = 0             16            0.106482        5.3862×10^{-4}
    X0 = -5×ones       830           2.190269        9.1232×10^{-4}

    In the remaining numerical examples, we compare the performance of Algorithm 1 with the direct method as well as with the recent gradient-based iterative algorithms mentioned in the Introduction.

    Example 4. Consider a large-scaled generalized Sylvester-transpose matrix equation

    $$AXB + CX^TD = E,$$

    where A,B,C,D,E are 100×100 matrices as follows:

    $$A = \operatorname{tridiag}(1,3,1), \quad B = \operatorname{tridiag}(1,7,1), \quad C = 6 \times \texttt{ones}, \quad D = 3 \times \texttt{ones}, \quad E = 0.7 \times I_{100}.$$

    In fact, this equation has a unique solution. Besides the direct method, we compare the performance of Algorithm 1 with the GI [23] and AGBI [24] algorithms. All iterative algorithms are implemented using the initial matrix $X_0 = 0.001 \times I_{100} \in \mathbb{R}^{100\times 100}$.

    According to [23], the GI algorithm is applicable as long as a convergent factor μ satisfies

    $$0 < \mu < \frac{2}{\lambda_{\max}(AA^T)\lambda_{\max}(B^TB) + \lambda_{\max}(CC^T)\lambda_{\max}(D^TD)},$$

    where $\lambda_{\max}(AA^T)$ denotes the largest eigenvalue of $AA^T$. We run the GI algorithm with 3 different convergent factors, namely, $m_1 = 6.1728 \times 10^{-12}$, $m_2 = 8.8183 \times 10^{-12}$ and $m_3 = 3.0864 \times 10^{-11}$. We implement the AGBI algorithm with a convergent factor of 0.000988 and a weighted factor of $10^{-8}$. Figure 4 shows that the CG algorithm (Algorithm 1) converges faster than the GI algorithm with $m_1$, $m_2$, or $m_3$ and the AGBI algorithm. Table 2 shows that, after 30 iterations, the GI and AGBI algorithms still have large errors and the direct method consumes a large amount of time to get the exact solution, while Algorithm 1 produces a small-error solution in a small time (0.073613 seconds).
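    For reference, the upper bound on $\mu$ can be evaluated directly. A small hypothetical check using the coefficients of this example and the `tridiag` helper from above ($\lambda_{\max}$ computed via `max(eig(...))` on the symmetric products):

```matlab
% Hypothetical evaluation of the GI convergence-factor bound for Example 4.
n = 100;
A = tridiag(1,3,1,n);  B = tridiag(1,7,1,n);
C = 6 * ones(n);       D = 3 * ones(n);
mu_max = 2 / ( max(eig(A*A')) * max(eig(B'*B)) ...
             + max(eig(C*C')) * max(eig(D'*D)) );
disp(mu_max)
```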

    Figure 4.  Relative error for Example 4.
    Table 2.  Relative errors and CTs for Example 4.

    Method          Iterations    CT (seconds)    Relative error
    CG              30            0.073613        0.000001
    GI with m1      30            0.109370        18.788266
    GI with m2      30            0.115890        16.853503
    GI with m3      30            0.109668        16.724640
    AGBI            30            0.118294        16.724926
    Direct          -             66.928143       0

    Example 5. Consider a consistent generalized Sylvester-transpose matrix equation

    $$AXB + CX^TD = E,$$

    with 100×100 coefficient matrices:

    $$A = \operatorname{tridiag}(1,2,1), \quad B = 13 \times \texttt{ones}, \quad C = 3 \times \texttt{ones}, \quad D = \operatorname{tridiag}(3,6,3), \quad E = 1.2 \times \texttt{ones}.$$

    In fact, this matrix equation has a solution, which is not unique. We seek a solution of the equation using Algorithm 1 and the GI and AGBI algorithms with the same initial matrix $X_0 = 0.4 \times \texttt{ones} \in \mathbb{R}^{100\times 100}$.

    We carry out the GI algorithm with three different convergent factors, namely, $m_1 = 1.7132 \times 10^{-8}$, $m_2 = 3.0837 \times 10^{-8}$ and $m_3 = 1.5418 \times 10^{-7}$. We implement the AGBI algorithm with the convergent factor 0.000112 and the weighted factor 0.00005. Table 3 and Figure 5 report the computational times and the errors for 200 iterations of the simulations. We see that the computational time of the CG algorithm is slightly less than those of the GI (with parameters $m_1$, $m_2$, $m_3$) and AGBI algorithms. Moreover, the final error produced by the CG algorithm is significantly smaller than those of the other algorithms.

    Table 3.  Relative errors and CTs for Example 5.

    Method          Iterations    CT (seconds)    Relative error
    CG              200           0.395984        0.361597
    GI with m1      200           0.654457        1473.481117
    GI with m2      200           0.591405        1186.764341
    GI with m3      200           0.595052        645.799529
    AGBI            200           0.599059        1718.220885
    Figure 5.  Relative error for Example 5.

    Example 6. Consider the following Sylvester matrix equation

    $$AX + XB = C,$$

    where all coefficient matrices are 100×100 tridiagonal matrices given by

    $$A = \operatorname{tridiag}(1,6,1), \quad B = \operatorname{tridiag}(3,0,3), \quad C = \operatorname{tridiag}(1,1,9).$$

    We compare the performance of the CG algorithm (Algorithm 1) to the GI [23], RGI [16], MGI [17] and AGBI [24] algorithms, with parameters as shown in Table 4. We implement the algorithms with the same initial matrix $X_0 = 5 \times \texttt{ones} \in \mathbb{R}^{100\times 100}$. Table 4 shows that the computational times for implementing 10 iterations of the CG and the other algorithms are close together. However, the relative errors in Figure 6 and Table 4 show that the CG algorithm produces a sequence of good approximate solutions within a few iterations, with the lowest error compared to the other gradient-based algorithms.

    Table 4.  Relative errors and CTs for Example 6.

    Method    Convergent factor    Weighted factor    Iterations    CT (seconds)    Relative error
    CG        -                    -                  10            0.018412        0.000000
    GI        μ = 0.034482         -                  10            0.019581        7.365534
    RGI       μ = 0.266489         ω = 0.05           10            0.015338        8.786233
    MGI       μ = 0.025316         -                  10            0.019361        2.544848
    AGBI      μ = 0.233918         ω = 0.05           10            0.022275        8.791699
    Figure 6.  Relative error for Example 6.

    We propose an iterative procedure (Algorithm 1) to construct a sequence of approximate solutions for the generalized Sylvester-transpose matrix Eq (1.5) with rectangular coefficient matrices. The algorithm is applicable whenever the matrix $K$, defined by Eq (3.2), is symmetric. In fact, the residual matrices $R_k$ produced by the algorithm form an orthogonal set with respect to the usual inner product for matrices. Thus, we obtain the desired solution within finitely many steps, namely, at most $np$ steps. Numerical simulations have verified the applicability of the algorithm for square/non-square sparse/dense matrices of moderate/large sizes. The algorithm is applicable no matter how we choose the initial matrix. Moreover, for sparse coefficient matrices of large size, the number of iterations needed to get the desired solution can be dramatically less than $np$. The performance of the algorithm is significantly better than those of the direct Kronecker linearization and recent gradient-based iterative algorithms when the coefficient matrices are of moderate/large sizes.

    This research project is supported by the National Research Council of Thailand (NRCT): (N41A640234).

    All authors declare that they have no conflict of interest.



    [1] Y. Kim, H. S. Kim, J. Junkins, Eigenstructure assignment algorithm for second order systems, J. Guid. Control Dyn., 22 (1999), 729–731. http://dx.doi.org/10.2514/2.4444 doi: 10.2514/2.4444
    [2] B. Zhou, G. R. Duan, On the generalized Sylvester mapping and matrix equations, Syst. Control Lett., 57 (2008), 200–208. http://dx.doi.org/10.1016/j.sysconle.2007.08.010 doi: 10.1016/j.sysconle.2007.08.010
    [3] L. Dai, Singular control systems, Berlin: Springer, 1989.
    [4] G. R. Duan, Eigenstructure assignment in descriptor systems via output feedback: A new complete parametric approach, Int. J. Control., 72 (1999), 345–364. http://dx.doi.org/10.1080/002071799221154 doi: 10.1080/002071799221154
    [5] F. Lewis, A survey of linear singular systems, Circ. Syst. Signal Process., 5 (1986), 3–36. http://dx.doi.org/10.1007/BF01600184 doi: 10.1007/BF01600184
    [6] G. R. Duan, Parametric approaches for eigenstructure assignment in high-order linear systems, Int. J. Control Autom. Syst., 3 (2005), 419–429.
    [7] K. Nouri, S. Beik, L. Torkzadeh, D. Baleanu, An iterative algorithm for robust simulation of the Sylvester matrix differential equations, Adv. Differ. Equ., 2020 (2020). http://dx.doi.org/10.1186/s13662-020-02757-z doi: 10.1186/s13662-020-02757-z
    [8] M. Epton, Methods for the solution of AXD - BXC = E and its applications in the numerical solution of implicit ordinary differential equations, BIT., 20 (1980), 341–345. http://dx.doi.org/10.1007/BF01932775 doi: 10.1007/BF01932775
    [9] D. Hyland, D. Bernstein, The optimal projection equations for fixed order dynamic compensation, IEEE Trans. Control., 29 (1984), 1034–1037. http://dx.doi.org/10.1109/TAC.1984.1103418 doi: 10.1109/TAC.1984.1103418
    [10] D. Calvetti, L. Reichel, Application of ADI iterative methods to the restoration of noisy images, SIAM J. Matrix Anal. Appl., 17 (1996), 165–186. http://dx.doi.org/10.1137/S0895479894273687 doi: 10.1137/S0895479894273687
    [11] M. Dehghan, A. Shirilord, A generalized modified Hermitian and skew-Hermitian splitting (GMHSS) method for solving complex Sylvester matrix equation, Appl. Math. Comput., 348 (2019), 632–651. http://dx.doi.org/10.1016/j.amc.2018.11.064 doi: 10.1016/j.amc.2018.11.064
    [12] S. Y. Li, H. L. Shen, X. H. Shao, PHSS Iterative method for solving generalized Lyapunov equations, Mathematics, 7 (2019), 38. http://dx.doi.org/10.3390/math7010038 doi: 10.3390/math7010038
    [13] H. L. Shen, Y. R. Li, X. H. Shao, The four-parameter PSS method for solving the Sylvester equation, Mathematics, 7 (2019), 105. http://dx.doi.org/10.3390/math7010105 doi: 10.3390/math7010105
    [14] M. Dehghan, A. Shirilord, Solving complex Sylvester matrix equation by accelerated double-step scale splitting (ADSS) method, Engineering with Computers, 37 (2021), 489–508. http://dx.doi.org/10.1007/s00366-019-00838-6 doi: 10.1007/s00366-019-00838-6
    [15] F. Ding, T. Chen, Gradient based iterative algorithms for solving a class of matrix equations, IEEE Trans. Automat. Comtr., 50 (2005), 1216–1221. http://dx.doi.org/10.1109/TAC.2005.852558 doi: 10.1109/TAC.2005.852558
    [16] Q. Niu, X. Wang, L. Z. Lu, A relaxed gradient based algorithm for solving Sylvester equation, Asian J. Control, 13 (2011), 461–464. http://dx.doi.org/10.1002/asjc.328 doi: 10.1002/asjc.328
    [17] X. Wang, L. Dai, D. Liao, A modified gradient based algorithm for solving Sylvester equation, Appl. Math. Comput., 218 (2012), 5620–5628. http://dx.doi.org/10.1016/j.amc.2011.11.055 doi: 10.1016/j.amc.2011.11.055
    [18] Z. Tian, M. Tian, C. Gu, X. Hao, An accelerated Jacobi-gradient based iterative algorithm for solving Sylvester matrix equations, Filomat, 31 (2017), 2381–2390. http://dx.doi.org/10.2298/FIL1708381T doi: 10.2298/FIL1708381T
    [19] N. Sasaki, P. Chansangiam, Modified Jacobi-gradient iterative method for generalized Sylvester matrix equation, Symmetry, 12 (2020), 1831. http://dx.doi.org/10.3390/sym12111831 doi: 10.3390/sym12111831
    [20] X. Zhang, X. Sheng, The relaxed gradient based iterative algorithm for the symmetric (skew symmetric) solution of the Sylvester equation AX + XB = C, Math. Probl. Eng., 2017 (2017), 1624969. http://dx.doi.org/10.1155/2017/1624969 doi: 10.1155/2017/1624969
    [21] A. Kittisopaporn, P. Chansangiam, W. Lewkeeratiyukul, Convergence analysis of gradient-based iterative algorithms for a class of rectangular Sylvester matrix equation based on Banach contraction principle, Adv. Differ. Equ., 2021 (2021), 17. http://dx.doi.org/10.1186/s13662-020-03185-9 doi: 10.1186/s13662-020-03185-9
    [22] N. Boonruangkan, P. Chansangiam, Convergence analysis of a gradient iterative algorithm with optimal convergence factor for a generalized Sylvester-transpose matrix equation, AIMS Mathematics, 6 (2021), 8477–8496. http://dx.doi.org/10.3934/math.2021492 doi: 10.3934/math.2021492
    [23] L. Xie, J. Ding, F. Ding, Gradient based iterative solutions for general linear matrix equations, Comput. Math. Appl., 58 (2009), 1441–1448. http://dx.doi.org/10.1016/j.camwa.2009.06.047 doi: 10.1016/j.camwa.2009.06.047
    [24] Y. J. Xie, C. F. Ma, The accelerated gradient based iterative algorithm for solving a class of generalized Sylvester-transpose matrix equation, Appl. Math. Comp., 273 (2016), 1257–1269. http://dx.doi.org/10.1016/j.amc.2015.07.022 doi: 10.1016/j.amc.2015.07.022
    [25] A. Kittisopaporn, P. Chansangiam, Gradient-descent iterative algorithm for solving a class of linear matrix equations with applications to heat and Poisson equations, Adv. Differ. Equ., 2020 (2020), 324. http://dx.doi.org/10.1186/s13662-020-02785-9 doi: 10.1186/s13662-020-02785-9
    [26] A. Kittisopaporn, P. Chansangiam, The steepest descent of gradient-based iterative method for solving rectangular linear system with an application to Poisson's equation, Adv. Differ. Equ., 2020 (2020), 259. http://dx.doi.org/10.1186/s13662-020-02715-9 doi: 10.1186/s13662-020-02715-9
    [27] Y. Qi, L. Jin, H. Li, Y. Li, M. Liu, Discrete computational neural dynamics models for solving time-dependent Sylvester equation with applications to robotics and MIMO systems, IEEE Trans. Ind. Inform., 16 (2020), 6231–6241. http://dx.doi.org/10.1109/TII.2020.2966544 doi: 10.1109/TII.2020.2966544
    [28] V. Simoncini, Computational methods for linear matrix equations, SIAM Rev., 58 (2016), 377–441. http://dx.doi.org/10.1137/130912839 doi: 10.1137/130912839
    [29] H. Zhang, H. Yin, Refinements of the Hadamard and Cauchy Schwarz inequalities with two inequalities of the principal angles, J. Math. Inequal., 13 (2019), 423–435. http://dx.doi.org/10.7153/jmi-2019-13-28 doi: 10.7153/jmi-2019-13-28
    [30] H. Zhang, Quasi gradient-based inversion-free iterative algorithm for solving a class of the nonlinear matrix equations, Comput. Math. Appl., 77 (2019), 1233–1244. http://dx.doi.org/10.1016/j.camwa.2018.11.006 doi: 10.1016/j.camwa.2018.11.006
    [31] H. Zhang, L. Wan, Zeroing neural network methods for solving the Yang-Baxter-like matrix equation, Neurocomputing, 383 (2020), 409–418. http://dx.doi.org/10.1016/j.neucom.2019.11.101 doi: 10.1016/j.neucom.2019.11.101
    [32] F. Ding, G. Liu, X. Liu, Parameter estimation with scarce measurements, Automatica, 47 (2011), 1646–1655. http://dx.doi.org/10.1016/j.automatica.2011.05.007 doi: 10.1016/j.automatica.2011.05.007
    [33] F. Ding, Y. Liu, B. Bao, Gradient based and least squares based iterative estimation algorithms for multi-input multi-output systems, P. I. Mech. Eng. I-J. Sys., 226 (2012), 43–55. http://dx.doi.org/10.1177/0959651811409491 doi: 10.1177/0959651811409491
    [34] F. Ding, Combined state and least squares parameter estimation algorithms for dynamic systems, Appl. Math. Model., 38 (2014), 403–412. http://dx.doi.org/10.1016/j.apm.2013.06.007 doi: 10.1016/j.apm.2013.06.007
    [35] M. Hajarian, Developing BiCG and BiCR methods to solve generalized Sylvester-transpose matrix equations, Int. J. Autom. Comput., 11 (2014), 25–29. http://dx.doi.org/10.1007/s11633-014-0762-0 doi: 10.1007/s11633-014-0762-0
    [36] M. Hajarian, Matrix form of the CGS method for solving general coupled matrix equations, Appl. Math. Lett., 34 (2014), 37–42. http://dx.doi.org/10.1016/j.aml.2014.03.013 doi: 10.1016/j.aml.2014.03.013
    [37] Y. F. Ke, C. F. Ma, A preconditioned nested splitting conjugate gradient iterative method for the large sparse generalized Sylvester equation, Comput. Math. Appl., 68 (2014), 1409–1420. http://dx.doi.org/10.1016/j.camwa.2014.09.009 doi: 10.1016/j.camwa.2014.09.009
    [38] M. Hajarian, Generalized conjugate direction algorithm for solving the general coupled matrix equations over symmetric matrices, Numer. Algor., 73 (2016), 591–609. http://dx.doi.org/10.1007/s11075-016-0109-8 doi: 10.1007/s11075-016-0109-8
    [39] M. Hajarian, Extending the CGLS algorithm for least squares solutions of the generalized Sylvester-transpose matrix equations, J. Franklin Inst., 353 (2016), 1168–1185. http://dx.doi.org/10.1016/j.jfranklin.2015.05.024 doi: 10.1016/j.jfranklin.2015.05.024
    [40] M. Dehghan, R. Mohammadi-Arani, Generalized product-type methods based on Bi-conjugate gradient (GPBiCG) for solving shifted linear systems, Comput. Appl. Math., 36 (2017), 1591–1606. http://dx.doi.org/10.1007/s40314-016-0315-y doi: 10.1007/s40314-016-0315-y
    [41] R. Horn, C. Johnson, Topics in matrix analysis, Cambridge: Cambridge University Press, 1991. http://dx.doi.org/10.1017/CBO9780511840371
  • © 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)