We extend the notion of classical metric geometric mean (MGM) for positive definite matrices of the same dimension to those of arbitrary dimensions, so that usual matrix products are replaced by semi-tensor products. When the weights are arbitrary real numbers, the weighted MGMs possess not only nice properties as in the classical case, but also affine change of parameters, exponential law, and cancellability. Moreover, when the weights belong to the unit interval, the weighted MGM has remarkable properties, namely, monotonicity and continuity from above. Then we apply a continuity argument to extend the weighted MGM to positive semidefinite matrices, here the weights belong to the unit interval. It turns out that this matrix mean posses rich algebraic, order, and analytic properties, such as, monotonicity, continuity from above, congruent invariance, permutation invariance, affine change of parameters, and exponential law. Furthermore, we investigate certain equations concerning weighted MGMs of positive definite matrices. It turns out that such equations are always uniquely solvable with explicit solutions. The notion of MGMs can be applied to solve certain symmetric word equations in two letters.
Citation: Arnon Ploymukda, Pattrawut Chansangiam. Metric geometric means with arbitrary weights of positive definite matrices involving semi-tensor products[J]. AIMS Mathematics, 2023, 8(11): 26153-26167. doi: 10.3934/math.20231333
[1] | Pattrawut Chansangiam, Arnon Ploymukda . Riccati equation and metric geometric means of positive semidefinite matrices involving semi-tensor products. AIMS Mathematics, 2023, 8(10): 23519-23533. doi: 10.3934/math.20231195 |
[2] | Arnon Ploymukda, Kanjanaporn Tansri, Pattrawut Chansangiam . Weighted spectral geometric means and matrix equations of positive definite matrices involving semi-tensor products. AIMS Mathematics, 2024, 9(5): 11452-11467. doi: 10.3934/math.2024562 |
[3] | Tong Wu, Yong Wang . Super warped products with a semi-symmetric non-metric connection. AIMS Mathematics, 2022, 7(6): 10534-10553. doi: 10.3934/math.2022587 |
[4] | Mohd Danish Siddiqi, Meraj Ali Khan, Ibrahim Al-Dayel, Khalid Masood . Geometrization of string cloud spacetime in general relativity. AIMS Mathematics, 2023, 8(12): 29042-29057. doi: 10.3934/math.20231487 |
[5] | Yajun Xie, Changfeng Ma, Qingqing Zheng . On the nonlinear matrix equation $ X^{s}+A^{H}F(X)A = Q $. AIMS Mathematics, 2023, 8(8): 18392-18407. doi: 10.3934/math.2023935 |
[6] | Wenbin Gong, Yaqiang Wang . Some new criteria for judging $ \mathcal{H} $-tensors and their applications. AIMS Mathematics, 2023, 8(4): 7606-7617. doi: 10.3934/math.2023381 |
[7] | Rajesh Kumar, Sameh Shenawy, Lalnunenga Colney, Nasser Bin Turki . Certain results on tangent bundle endowed with generalized Tanaka Webster connection (GTWC) on Kenmotsu manifolds. AIMS Mathematics, 2024, 9(11): 30364-30383. doi: 10.3934/math.20241465 |
[8] | Tinglan Yao . An optimal $ Z $-eigenvalue inclusion interval for a sixth-order tensor and its an application. AIMS Mathematics, 2022, 7(1): 967-985. doi: 10.3934/math.2022058 |
[9] | Wenxv Ding, Ying Li, Anli Wei, Zhihong Liu . Solving reduced biquaternion matrices equation $ \sum\limits_{i = 1}^{k}A_iXB_i = C $ with special structure based on semi-tensor product of matrices. AIMS Mathematics, 2022, 7(3): 3258-3276. doi: 10.3934/math.2022181 |
[10] | Dongjian Bai, Feng Wang . New methods based $ \mathcal{H} $-tensors for identifying the positive definiteness of multivariate homogeneous forms. AIMS Mathematics, 2021, 6(9): 10281-10295. doi: 10.3934/math.2021595 |
We extend the notion of classical metric geometric mean (MGM) for positive definite matrices of the same dimension to those of arbitrary dimensions, so that usual matrix products are replaced by semi-tensor products. When the weights are arbitrary real numbers, the weighted MGMs possess not only nice properties as in the classical case, but also affine change of parameters, exponential law, and cancellability. Moreover, when the weights belong to the unit interval, the weighted MGM has remarkable properties, namely, monotonicity and continuity from above. Then we apply a continuity argument to extend the weighted MGM to positive semidefinite matrices, here the weights belong to the unit interval. It turns out that this matrix mean posses rich algebraic, order, and analytic properties, such as, monotonicity, continuity from above, congruent invariance, permutation invariance, affine change of parameters, and exponential law. Furthermore, we investigate certain equations concerning weighted MGMs of positive definite matrices. It turns out that such equations are always uniquely solvable with explicit solutions. The notion of MGMs can be applied to solve certain symmetric word equations in two letters.
The notion of metric geometric mean (MGM for short) of positive definite matrices involves many mathematical areas, e.g. matrix/operator theory, geometry, and group theory. For any positive definite matrices A and B of the same size, the MGM of A and B is defined as
A♯B=A1/2(A−1/2BA−1/2)1/2A1/2. | (1.1) |
From algebraic viewpoint, A♯B is a unique positive solution of the Riccati equation (see [1, Ch. 4])
XA−1X=B. | (1.2) |
In fact, the explicit formula (1.1) and the equation (1.2) are two equivalent ways to describe the geometric mean; see [2]. From differential-geometry viewpoint, A♯B is a unique midpoint of the Riemannian geodesic interpolated from A to B, called the weighted geometric mean of A and B:
γ(t):=A♯tB=A1/2(A−1/2BA−1/2)tA1/2,0⩽t⩽1. | (1.3) |
This midpoint is measured through a natural Riemannian metric; see a monograph [1, Ch. 6] or Section 2. The weighted MGMs (1.3) posses rich algebraic, order, and analytic properties, namely, positive homogeneity, congruent invariance, permutation invariance, self duality, monotonicity, and continuity from above; see [3, Sect. 3].
In operator-theoretic approach, the weighted MGMs (1.3) for positive operators on a Hilbert space are Kubo-Ando means [4] in the sense that they satisfy the monotonicity, transformer inequality, continuity from above, and normalization. More precisely, A♯tB is the Kubo-Ando mean associated with the operator monotone function f(x)=xt on the positive half-line, e.g. [3, Sect. 3]. The geometric mean serves as a tool for deriving matrix/operators inequalities; see the original idea in [5], see also [1, Ch. 4], [3, Sect. 3], and [6]. The cancellability of the weighted MGM together with spectral theory can be applied to solve operator mean equations; see [7]. Indeed, given two positive invertible operators A and B acting on the same Hilbert space (or positive definite matrices of the same dimension), the equation A♯tX=B is uniquely solvable in terms of the weighted MGM in which the weight can be any nonzero real number. See more development of geometric mean theory for matrices/operators in [8,9,10].
A series of Lawson and Lim works investigated the theory of (weighted) MGM in various frameworks. Indeed, the geometric mean (with weight 1/2) can be naturally defined on symmetric cones [11], symmetric sets [12], two-powered twisted subgroups [12,13], and Bruhat-Tits spaces [2]. The framework of reflection quasigroups (based on point-reflection geometry and quasigroup theory) [13] allows us to define weighted MGMs via geodesics, where the weights can be any dyadic rationals. On lineated symmetric spaces [14], the weights can be arbitrary real numbers, due to the density of the dyadic rationals on the real line. Their theory can be applied to solve certain symmetric word equations in two matrix letters; see [15]. Moreover, mean equations related MGMs were investigated in [13,16].
From the formulas (1.1) and (1.3), the matrix products are the usual products between matching-dimension matrices. We can extend the MGM theory by replacing the usual product to the semi-tensor product (STP) between square matrices of general dimension. The STP was introduced by Cheng [17], so that the STP reduces to the usual matrix product in matching-dimension case. The STP keeps various algebraic properties of the usual matrix product such as the distribution over the addition, the associativity, and compatibility with transposition, inversion and scalar multiplication. The STP is a useful tool when dealing with vectors and matrices in classical and fuzzy logic, lattices and universal algebra, and differential geometry; see [18,19]. The STPs turn out to have a variety of applications in other fields: networked evolutionary games [20], finite state machines [21], Boolean networks [19,22,23], physics [24], and engineering [25]. Many authors developed matrix equations based on STPs; e.g. Sylvester-type equations [26,27,28], and a quadratic equation A⋉X⋉X=B [29].
The present paper aims to develop further theory on weighted MGMs for positive definite matrices of arbitrary dimensions, where the matrix products are given by the STPs. In this case, the weights can be arbitrary real numbers; see Section 3. It turns out that this mean satisfies various properties as in the classical case. The most interesting case is when the weights belong to the unit interval. In this case, the weighted MGM satisfies the monotonicity, and the continuity from above. Then we investigate the theory when either A or B is not assumed to be invertible. In such case, the weights are restricted to be in [0,∞) or (−∞,1]. We also use a continuity argument to study the weighted MGMs, in which the weights belong to [0,1]. Moreover, we prove the cancellability of the weighted MGMs, and apply to solve certain nonlinear matrix equations concerning MGMs; see Section 4. In Section 5, we apply the theory to solve the Riccati equation and certain symmetric word equations in two matrix letters. Finally, we summarize the whole work in Section 6.
In the next section, we setup basic notation and provide preliminaries results on the Riemannian geometry of positive definite matrices, the tensor product, and the semi-tensor product.
Throughout, let Mm,n be the set of all m×n complex matrices, and abbreviate Mn,n to Mn. Define Cn=Mn,1, the set of n-dimensional complex vectors. Denote by AT and A∗ the transpose and conjugate transpose of a matrix A, respectively. The n×n identity matrix is denoted by In. The general linear group of n×n invertible complex matrices is denoted by GLn. The symbols Hn, and PSn stand for the vector space of n×n Hermitian matrices, and the cone of n×n positive semidefinite matrices, respectively. For a pair (A,B)∈Hn×Hn, the partial ordering A⩾B means that A−B lies in the positive cone PSn. In particular, A∈PSn if and only if A⩾0. Let us denote the set of n×n positive definite matrices by Pn. For each A∈Hn, the strict inequality A>0 indicates that A∈Pn.
A matrix pair (A,B)∈Mm,n×Mp,q is said to satisfy factor-dimension condition if n|p or p|n. In this case, we write A≻kB when n=kp, and A≺kB when p=kn.
Recall that Mn is a Hilbert space endowed with the Hilbert-Schmidt inner product ⟨A,B⟩HS=trA∗B and the associated norm ‖A‖HS=(trA∗A)1/2. The subset Pn, which is an open subset in Hn, is a Riemannian manifold endowed with the trace Riemannian
ds=‖A−1/2dAA−1/2‖HS=[tr(A−1dA)2]1/2. |
If γ:[a,b]→Pn is a (piecewise) differentiable path in Pn, we define the length of γ by
L(γ)=∫ba‖γ−1/2(t)γ′(t)γ−1/2(t)‖HSdt. |
For each X∈GLn, the congruence transformation
ΓX:Pn→Pn,A↦X∗AX | (2.1) |
is bijective and the composition ΓX∘γ:[a,b]→Pn is another path in Pn. For any A,B∈Pn, the distance between A and B is given by
δHS(A,B)=inf{ L(γ):γ is a path from A to B }. |
Lemma 2.1. (e.g. [1]). For each A,B∈Pn, there is a unique geodesic from A to B, parametrized by
γ(t)=A1/2(A−1/2BA−1/2)tA1/2,0⩽t⩽1. | (2.2) |
This geodesic is natural in the sense that δHS(A,γ(t))=tδHS(A,B) for each t.
For any A,B∈Pn, denote the geodesic from A to B by [A,B].
This subsection is a brief review on tensor products and semi-tensor products of matrices. Recall that for any matrices A=[aij]∈Mm,n and B∈Mp,q, their tensor product is defined by
A⊗B=[aijB]∈Mmp,nq. |
The tensor operation (A,B)↦A⊗B is bilinear and associative.
Lemma 2.2 (e.g. [3]). Let (A,B)∈Mm,n×Mp,q and (P,Q)∈Mm×Mn. Then we have
(1). A⊗B=0 if and only if either A=0 or B=0;
(2). (A⊗B)∗=A∗⊗B∗;
(3). if (P,Q)∈GLm×GLn, then (P⊗Q)−1=P−1⊗Q−1;
(4). if (P,Q)∈PSm×PSn, then P⊗Q∈PSmn and (P⊗Q)1/2=P1/2⊗Q1/2;
(5). if (P,Q)∈Pm×Pn, then P⊗Q∈Pmn;
(6). det(P⊗Q)=(detP)n(detQ)m.
To define the semi-tensor product, first consider a pair (X,Y)∈M1,m×Cn of row and column vectors, respectively. If X≻kY, then we split X into X1,X2,…,Xn∈M1,k and define the STP of X and Y as
X⋉Y=n∑i=1yiXi∈M1,k. |
If X≺kY, then we split Y into Y1,Y2,…,Ym∈Ck and define the STP of X and Y as
X⋉Y=m∑i=1xiYi∈Ck. |
In general, for a pair (A,B)∈Mm,n×Mp,q satisfying the factor-dimensional condition, we define
A⋉B=[Ai⋉Bj]m,qi,j=1, |
where Ai is i-th row of A and Bj is the j-th column of B. More generally, for an arbitrary matrix pair (A,B)∈Mm,n×Mp,q, we let α=lcm(n,p) and define
A⋉B=(A⊗Iα/n)(B⊗Iα/p)∈Mαmn,αqp. |
The operation (A,B)↦A⋉B turns out to be bilinear, associative, and continuous.
Lemma 2.3 (e.g. [18]). Let (A,B)∈Mm,n×Mp,q and (P,Q)∈Mm×Mn. Then we have
(1). (A⋉B)∗=B∗⋉A∗;
(2). if (P,Q)∈GLm×GLn, then (P⋉Q)−1=Q−1⋉P−1;
(3). det(P⋉Q)=(detP)α/m(detQ)α/n where α=lcm(m,n).
Proposition 2.4. Let A∈Mm,X∈Mn and S,T∈Hm.
(1). If A⩾0, then X∗⋉A⋉X⩾0.
(2). If S⩾T, then X∗⋉S⋉X⩾X∗⋉T⋉X.
(3). If A>0 and X∈GLn, then X∗⋉A⋉X>0.
(4). If S>T, then X∗⋉S⋉X>X∗⋉T⋉X.
Proof. 1) Since (X∗⋉A⋉X)∗=X∗⋉A⋉X, we have that X∗⋉A⋉X is Hermitian. Let α=lcm(m,n) and u∈Cα. Set v=X⋉u. Using Lemma 2.2, we obtain that A⊗Iα/p⩾0 and then, by Lemma 2.3,
u∗(X∗⋉A⋉X)u=(X⋉u)∗⋉A⋉(X⋉u)=v∗(A⊗Iα/n)v⩾0. |
This implies that X∗⋉A⋉X⩾0. 2) Since S⩾T, we have S−T⩾0. Applying the assertion 1, we get X∗⋉(S−T)⋉X⩾0, i.e., X∗⋉S⋉X⩾X∗⋉T⋉X. The proofs of the assertions 3)-4) are similar to the assertions 1)-2), respectively.
We extend the classical weighted MGM (1.3) for a pair of positive definite matrices of different sizes as follows.
Definition 3.1. Let (A,B)∈Pm×Pn. For any t∈R, the t-weighted metric geometric mean (MGM) of A and B is defined by
A♯tB=A1/2⋉(A−1/2⋉B⋉A−1/2)t⋉A1/2∈Mα, | (3.1) |
where α=lcm(m,n).
Note that when m=n, Eq (3.1) reduces to the classical one (1.3). In particular, A♯0B=A⊗Iα/m, A♯1B=B⊗Iα/n, A♯−1B=A⋉B−1⋉A, and A♯2B=B⋉A−1⋉B. Fundamental properties of the weighted MGMs (3.1) are as follows.
Theorem 3.2. Let (A,B)∈Pm×Pn. Let r,s,t∈R and α=lcm(m,n). Then
(1). Positivity: A♯tB>0.
(2). Fixed-point property: A♯tA=A.
(3). Positive homogeneity: c(A♯tB)=(cA)♯t(cB) for all c>0.
(4). Congruent invariance: C∗(A♯tB)C=(C∗⋉A⋉C)♯t(C∗⋉B⋉C) for all C∈GLα.
(5). Self duality: (A♯tB)−1=A−1♯tB−1.
(6). Permutation invariance: A♯1/2B=B♯1/2A. More generally, A♯tB=B♯1−tA.
(7). Affine change of parameters: (A♯rB)♯t(A♯sB)=A♯(1−t)r+tsB.
(8). Exponential law: A♯r(A♯sB)=A♯rsB.
(9). C♯−1(A♯tB)=(C♯−1A)♯t(C♯−1B) for any C∈Pm.
(10). Left cancellability: Let Y1,Y2∈Pn and t∈R−{0}. Then the equation A♯tY1=A♯tY2 implies Y1=Y2. In other words, for each t≠0, the map X↦A♯tX is an injective map from Pn to Pα.
(11). Right cancellability: Let X1,X2∈Pm and t∈R−{1}. Then the equation X1♯tB=X2♯tB implies X1=X2. In other words, for each t≠1, the map X↦X♯tB is an injective map from Pm to Pα.
(12). Determinantal identity: det(A♯tB)=(detA)(1−t)αm(detB)tαn.
Proof. The positivity of ♯t follows from Proposition 2.4(3). Properties 2 and 3 follow directly from the formula (3.1). Let γ be the natural parametrization of the geodesic [A⊗Iα/m,B⊗Iα/n] on the space Pα as discussed in Lemma 2.1. To prove the congruent invariance, let C∈GLα and consider the congruence transformation ΓC defined by (2.1). Then the path γC(t):=ΓC(γ(t)) joins the points γC(0)=C∗⋉A⋉C and γC(1)=C∗⋉B⋉C. By Lemma 2.1, we obtain
C∗(A♯tB)C=ΓC(γ(t))=γC(t)=(C∗⋉A⋉C)♯t(C∗⋉B⋉C). |
To prove the self duality, let β(t)=(γ(t))−1. By Lemma 2.2, we have β(0)=A−1⊗Iα/m and β(1)=B−1⊗Iα/n. Thus
(A♯tB)−1=(γ(t))−1=β(t)=A−1♯tB−1. |
To prove the 6th item, define f:R→R by f(t)=1−t. Let δ=γ∘f. Then δ(0)=B⊗Iα/n and δ(1)=A⊗Iα/m. By Lemma 2.1, we have δ(t)=B♯tA, and thus
A♯tB=γ(t)=γ(f(1−t))=δ(1−t)=B♯1−tA. |
To prove the 7th item, fix r,s and let t vary. Let δ(t)=(A♯rB)♯t(A♯sB). We have δ(0)=A♯rB and δ(1)=A♯sB. Define f:R→R,f(t)=(1−t)r+ts and β=γ∘f. We obtain
β(t)=γ((1−t)r+ts)=A♯(1−t)r+tsB |
and β(0)=A♯rB, β(1)=A♯sB. Hence, δ(t)=β(t), i.e., (A♯rB)♯t(A♯sB)=A♯(1−t)r+tsB. The exponential law is derived from the 7th item as follows: A♯r(A♯sB)=(A♯0B)♯r(A♯sB)=A♯rsB. The 9th item follows from the congruent invariance and the self duality. To prove the left cancellability, let t∈R−{0} and suppose that A♯tY1=A♯tY2. We have by the exponential law that
Y1⊗Iα/n=A♯1Y1=A♯1/t(A♯tY1)=A♯1/t(A♯tY2)=A♯1Y2=Y2⊗Iα/n. |
It follows that (Y1−Y2)⊗Iα/n=0, and thus by Lemma 2.2 we conclude that Y1=Y2. Hence, the map X↦A♯tX is injective. The right cancellability follows from the left cancellability together with the permutation inavariance. The determinantal identity follows directly from Lemmas 2.3 and 2.2:
det(A♯tB)=det(A1/2)α/mdet((A−1/2⋉B⋉A−1/2)t)(detA1/2)α/m=(detA)α/m(det(A−1/2⋉B⋉A−1/2)t)=(detA)α/m(detA)−tα/m(detB)tα/n=(detA)(1−t)α/m(detB)tα/n. |
This finishes the proof.
Remark 3.3. From Theorem 3.2, a particular case of the exponential law is that (Bs)r=Bsr for any B>0 and s,t∈R. The congruent invariance means that the operation ♯t is invariant under the congruence transformation ΓC for any C∈GLα.
Now, we focus on weighted MGMs in which the weight lies in the interval [0,1]. Let us write Ak→A when the matrix sequence (Ak) converges to the matrix A. If (Ak) is a sequence in Hn, the expression Ak↓A means that (Ak) is a decreasing sequence and Ak→A. Recall the following well known matrix inequality:
Lemma 3.4 (Löwner-Heinz inequality, e.g. [3]). Let S,T∈PSn and w∈[0,1]. If S⩽T, then Sw⩽Tw.
When the weights are in [0,1], this mean has remarkable order and analytic properties:
Theorem 3.5. Let (A,B),(C,D)∈Pm×Pn and w∈[0,1].
(1). Monotonicity: If A⩽C and B⩽D, then A♯wB⩽C♯wD.
(2). Continuity from above: Let (Ak,Bk)∈Pm×Pn for all k∈N. If Ak↓A and Bk↓B, then Ak♯wBk↓A♯wB.
Proof. To prove the monotonicity, suppose that A⩽C and B⩽D. By Proposition 2.4, we have that A−1/2⋉B⋉A−1/2⩽A−1/2⋉D⋉A−1/2. Using Lemma 3.4 and Proposition 2.4, we obtain
A♯wB=A1/2⋉(A−1/2⋉B⋉A−1/2)w⋉A1/2⩽A1/2⋉(A−1/2⋉D⋉A−1/2)w⋉A1/2=A♯wD. |
This shows the monotonicity of ♯w in the second argument. This property together with the permutation invariance in Theorem 3.2 yield
A♯wB=B♯1−wA⩽B♯1−wC=C♯wB⩽C♯wD. |
To prove the continuity from above, suppose that Ak↓A and Bk↓B. Applying the monotonicity and the positivity, we conclude that (Ak♯wBk) is a decreasing sequence of positive definite matrices. The continuity of the semi-tensor multiplication implies that A−1/2k⋉Bk⋉A−1/2k converges to A−1/2⋉B⋉A−1/2, and thus
A1/2k⋉(A−1/2k⋉Bk⋉A−1/2k)w⋉A1/2k→A1/2⋉(A−1/2⋉B⋉A−1/2)w⋉A1/2. |
Hence, Ak♯wBk↓A♯wB.
Now, we extend the weighted MGM to positive semidefinite matrices. Indeed, when the first matrix argument is positive definite but the second one is positive semidefinite, the weights can be any nonnegative real numbers.
Definition 3.6. Let (A,B)∈Pm×PSn. For any t∈[0,∞), the t-weighted MGM of A and B is defined by
A♯tB=A1/2⋉(A−1/2⋉B⋉A−1/2)t⋉A1/2. | (3.2) |
Here, we apply a convention X0=Iα for any X∈Mα.
This definition is well-defined since the matrix A−1/2⋉B⋉A−1/2 is positive semidefinite according to Proposition 2.4. The permutation invariance suggests the following definition.
Definition 3.7. Let (A,B)∈PSm×Pn. For any t∈(−∞,1], the t-weighted MGM of A and B is defined by
A♯tB=B♯1−tA=B1/2⋉(B−1/2⋉A⋉B−1/2)1−t⋉B1/2. | (3.3) |
Here, we apply the convention X0=Iα for any X∈Mα.
This definition is well-defined according to Definition 3.6 (since 1−t⩾0). Note that when A>0 and B>0, Definitions 3.1, 3.6, and 3.7 are coincide. Fundamental properties of the means (3.2) and (3.3) are as follows.
Theorem 3.8. Denote α=lcm(m,n). If either
(i) (A,B)∈Pm×PSn and r,s,t⩾0, or
(ii) (A,B)∈PSm×Pn and r,s,t⩽1,
then
(1). Positivity: A♯tB⩾0.
(2). Positive homogeneity: c(A♯tB)=(cA)♯t(cB) for all c>0.
(3). Congruent invariance: C∗(A♯tB)C=(C∗⋉A⋉C)♯t(C∗⋉B⋉C) for all C∈GLα.
(4). Affine change of parameters: (A♯rB)♯t(A♯sB)=A♯(1−t)r+tsB.
(5). Exponential law: A♯r(A♯sB)=A♯rsB.
(6). Determinantal identity: det(A♯tB)=(detA)(1−t)αm(detB)tαn.
Proof. The proof of each assertion is similar to that in Theorem 3.2. When A∈PSm, we consider A+ϵIm∈Pm and take limits when ϵ→0+. When B∈PSn, we consider B+ϵIn∈Pn and take limits when ϵ→0+.
It is natural to extend the weighted MGMs of positive definite matrices to those of positive semidefinite matrices by a limit process. Theorem 3.5 (or both Definitions 3.6 and 3.7) then suggests us that the weights must be in the interval [0,1].
Definition 3.9. Let (A,B)∈PSm×PSn. For any w∈[0,1], the w-weighted MGM of A and B is defined by
A♯wB=limε↓0+(A+εIm)♯w(B+εIn)∈Mα, | (3.4) |
where α=lcm(m,n). Here, we apply the convention X0=Iα for any X∈Mα.
Lemma 3.10. Definition 3.4 is well-defined. Moreover, if (A,B)∈PSm×PSn, then A♯wB∈PSα.
Proof. When ε↓0+, the nets A+εIm and B+εIn are decreasing nets of positive definite matrices. From the monotonicity property in Theorem 3.5, the net (A+εIm)♯w(B+εIn) is decreasing. Since this net is also bounded below by the zero matrix, the order-completeness of the matrix space guarantees an existence of the limit (3.4). Moreover, the matrix limit is positive semidefinite.
Fundamental properties of weighted MGMs are listed below.
Theorem 3.11. Let (A,B),(C,D)∈PSm×PSn. Let w,r,s∈[0,1] and α=lcm(m,n). Then
(1). Fixed-point property: A♯wA=A.
(2). Positive homogeneity: c(A♯wB)=(cA)♯w(cB) for all c⩾0.
(3). Congruent invariance: T∗⋉(A♯wB)⋉T=(T∗⋉A⋉T)♯w(T∗⋉B⋉T) for all T∈GLα.
(4). Permutation invariance: A♯1/2B=B♯1/2A. More generally, A♯wB=B♯1−wA.
(5). Affine change of parameters: (A♯rB)♯w(A♯sB)=A♯(1−w)r+wsB.
(6). Exponential law: A♯r(A♯sB)=A♯rsB.
(7). Determinantal identity: det(A♯wB)=(detA)(1−w)αm(detB)wαn.
(8). Monotonicity: If A⩽C and B⩽D, then A♯wB⩽C♯wD.
(9). Continuity from above: If Ak∈PSm and Bk∈PSn for all k∈N are such that Ak↓A and Bk↓B, then Ak♯wBk↓A♯wB.
Proof. When A∈PSm and B∈PSn, we can consider A+εIm∈Pm and B+εIn∈Pn, and then take limits when ε→0+. The 1st-7th items now follow from Theorem 3.2 (or Theorems 3.8). The 8th-9th items follow from Theorem 3.5. For the continuity from above, if Ak↓A and Bk↓B, then the monotonicity implies the decreasingness of the sequence Ak♯wBk when k→∞. Moreover,
limk→∞Ak♯wBk=limk→∞limε→0+(Ak+εIm)♯w(Bk+εIn)=limε→0+limk→∞(Ak+εIm)♯w(Bk+εIn)=limε→0+(A+εIm)♯w(B+εIn)=A♯wB. |
Thus, Ak♯wBk↓A♯wB as desire.
In particular, the congruent invariance implies the transformer inequality:
C⋉(A♯wB)⋉C⩽(C⋉A⋉C)♯w(C⋉B⋉C),C⩾0. |
This property together with the monotonicity, the above continuity, and the fixed-point property yield that the mean ♯w is a Kubo-Ando mean when w∈[0,1]. The results in this section include those for the MGM with weight 1/2 in [30].
In this section, we apply our theory to solve certain matrix equations concerning weighted MGMs for positive definite matrices.
Corollary 4.1. Let A∈Pm and B,X∈Pn with A≺kB. Let t∈R−{0}. Then the mean equation
A♯tX=B | (4.1) |
is uniquely solvable with an explicit solution X=A♯1/tB. Moreover, the solution varies continuously on the given matrices A and B. In particular, the geometric mean problem A♯1/2X=B has a unique solution X=A♯2B=B⋉A−1⋉B.
Proof. The exponential law in Theorem 3.2 implies that
A♯tX=B=A♯1B=A♯t(A♯1/tB). |
Then the left cancellability implies that Eq (4.1) has a unique solution X=A♯1/tB. The continuity of the solution follows from the explicit formula (3.1) and the continuity of the semi-tensor operation.
Remark 4.2. We can investigate the Eq (4.1) when A∈Pm and B∈PSn. In this case, the weight t must be positive. This equation is uniquely solvable with an explicit solution X=A♯1/tB∈PSn. For simplicity in this section, we consider only the case when all given matrices are positive definite.
Corollary 4.3. Let A∈Pm and B,X∈Pn with A≺kB. Let r,s,t∈R.
(1). If s≠0 and t≠1, then the mean equation
(A♯sX)♯tA=B | (4.2) |
is uniquely solvable with an explicit solution X=A♯1s(1−t)B.
(2). If s+t≠st, then the equation
(A♯sX)♯tX=B | (4.3) |
is uniquely solvable with an explicit solution X=A♯λB, where λ=1/(s+t−st).
(3). If s(1−t)≠1, then the mean equation
(A♯sX)♯tB=X | (4.4) |
is uniquely solvable with an explicit solution X=A♯λB, where λ=t/(st−s+1).
(4). If s≠t, then the equation
A♯sX=B♯tX | (4.5) |
is uniquely solvable with an explicit solution X=A♯λB, where λ=(1−t)/(s−t).
(5). If s−rs+rt≠1, then the equation
(A♯sX)♯r(B♯tX)=X | (4.6) |
is uniquely solvable with an explicit solution X=A♯λB, where λ=(rt−r)/(s−rs+rt−1).
(6). If t≠0, then the mean equation
(A♯tX)♯r(B♯tX)=X | (4.7) |
is uniquely solvable with an explicit solution X=A♯rB.
Proof. For the 1st assertion, using Theorem 3.2, we have
A♯s(1−t)X=A♯1−t(A♯sX)=(A♯sX)♯tA=B. |
By Corollary 4.1, we get the desire solution.
For the 2nd item, applying Theorem 3.2, we get
A♯s+t−stX=X♯(1−s)(1−t)A=X♯1−t(X♯1−sA)=(A♯sX)♯tX=B. |
Using Corollary 4.1, we obtain that X=A♯1s+t−stB.
For the 3rd item, the trivial case s=t=0 yields X=A♯0B. Assume that s,t≠0. From B♯1−t(A♯sX)=X, we get by Theorem 3.2 and Corollary 4.1 that A♯sX=B♯11−tX=X♯tt−1B. Consider
B=X♯t−1t(A♯sX)=X♯t−1t(X♯1−sA)=X♯(t−1)(1−s)tA=A♯1λX, |
where λ=t/(st−s+1). Now, we can deduce the desire solution from Corollary 4.1.
For the 4th item, the trivial case s=0 yields that the equation B♯tX=A⊗Ik has a unique solution X=B♯1tA=A♯t−1tB. Now, consider the case s≠0. Using Corollary 4.1, we have
X=A♯1s(B♯tX)=(B♯tX)♯s−1sA. |
Applying (4.4), we obtain that X=B♯(s−1)/(s−t)A=A♯λB, where λ=(1−t)/(s−t).
For the 5th item, the case r=0 yields that the equation A♯sX=X has a unique solution X=A♯0B. Now, consider the case r≠0. Using Theorem 3.2 and Corollary 4.1, we get
B♯tX=(A♯sX)♯1rX=X♯r−1r(X♯1−sA)=X♯(r−1)(1−s)rA=A♯rs−s+1rX. |
Applying the equation (4.5), we obtain that X=A♯λB, where λ=rt/(s−rs+rt).
For the 6th item, setting s=t in (4.6) yields the desire result.
Remark 4.4. Note that the cases t=0 in Eqs (4.1)–(4.5) all reduce to Eq (4.1). A particular case of (4.3) when s=t=1/2 reads that the mean equation
(A♯X)♯X=B | (4.8) |
has a unique solution X=A♯4/3B. In Eq (4.4), when s=t=1/2, the mean equation
(A♯X)♯B=X |
has a unique solution X=A♯2/3B. If r=s=t=1/2, then Eqs (4.6) or (4.7) implies that the geometric mean X=A♯B is a unique solution of the equation
(A♯X)♯(B♯X)=X. | (4.9) |
Equtions (4.8) and (4.9) were studied in [13] in the framework of dyadic symmetric sets.
Recall that a matrix word in two letters A,B∈Mn is an expression of the form
W(A,B)=Ar1Bs1Ar2Bs2⋯ArpBspArp+1 |
in which the exponents ri,si∈R−{0} for all i=1,2,…,p and rp+1∈R. A matrix word is said to be symmetric if it is identical to its reversal. A famous symmetric matrix-word equation is the Riccati equation
XAX=B, | (5.1) |
here A,B,X are positive definite matrices of the same dimension. Indeed, in control engineering, an optimal regulator problem for a linear dynamical system reduces to an algebraic Riccati equation (under the controllability and the observability conditions) X∗A−1X−R∗X−X∗R=B, where A,B are positive definite, and R is an arbitrary square matrix. A simple case R=0 yields the Riccati equation X∗A−1X=B.
In our context, the Riccati equation (5.1) can be written as A−1♯2X=B or X♯−1A−1=B. The case t=2 in Corollary 4.1 reads:
Corollary 5.1. Let A∈Pm and B,X∈Pn with A≺kB. Then the Riccati equation X⋉A⋉X=B has a unique positive solution X=A−1♯1/2B.
More generally, consider the following symmetric word equation in two positive definite letters A,B∈Pn with respect to the usual products:
B=XAXAX⋯AXAX((p+1)-terms of X, and p-terms of A)=X(AX)p, |
here p∈N. We now investigate such equations with respect to semi-tensor products.
Corollary 5.2. Let A∈Pm and B,X∈Pn with A≺kB.
(1). Let p∈N. Then the symmetric word equation
X(A⋉X)p=B | (5.2) |
is uniquely solvable with an explicit solution X=A−1♯1p+1B.
(2). Let r∈R−{−1}. Then the symmetric word equation
X(X⋉A⋉X)rX=B | (5.3) |
is uniquely solvable with an explicit solution
X=(A−1♯1r+1B)1/2. |
Proof. First, since p∈N, we can observe the following:
X(A⋉X)p=X1/2(X1/2⋉A⋉X1/2)pX1/2. |
It follows from the results in Section 3 that
X(A⋉X)p=X1/2(X−1/2⋉A−1⋉X−1/2)−pX1/2=X♯−pA−1=A−1♯p+1X. |
According to Corollary 4.1, the equation A−1♯p+1X=B has a unique solution X=A−1♯1p+1B.
To solve Eq (5.3), we observe the following:
X(X⋉A⋉X)rX=X(X−1⋉A−1⋉X−1)−rX=X2♯−rA−1=A−1♯r+1X2. |
According to Corollary 4.1, the equation A−1♯r+1X2=B has a unique solution X2=A−1♯1r+1B. Hence, we get the desire formula of X.
We extend the notion of the classical weighted MGM to that of positive definite matrices of arbitrary dimensions, so that the usual matrix products are generalized to the semi-tensor products. When the weights are arbitrary real numbers, the weighted MGMs posses not only nice properties as in the classical case, e.g. the congruent invariance and the self duality, but also affine change of parameters, exponential law, and left/right cancellability. Moreover, when the weights belong to the unit interval, the weighted MGM has remarkable properties, namely, monotonicity and continuity from above (according to the famous Löwner-Heinz inequality). When the matrix A or B is not assumed to be invertible, we can define A♯tB where the weight t is restricted to [0,∞) or (−∞,1]. Then we apply a continuity argument to extend the weighted MGM to positive semidefinite matrices, here the weights belong to the unit interval. It turns out that this matrix mean posses rich algebraic, order, and analytic properties, such as, monotonicity, continuity from above, congruent invariance, permutation invariance, affine change of parameters, and exponential law. Furthermore, we investigate certain equations concerning weighted MGMs of positive definite matrices. Due to the cancellability and another properties of the weighted MGM, such equations are always uniquely solvable with solutions expressed in terms of weighted MGMs. The notion of MGMs can be applied to solve certain symmetric word equations in two positive definite letters. A particular interest of the word equations in the field of control engineering is the Riccati equation in a general form involving semi-tensor products. Our results include the classical weighted MGMs of matrices as special case.
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
This research is supported by postdoctoral fellowship of School of Science, King Mongkut's Institute of Technology Ladkrabang. The authors thank anonymous referees for their comments and suggestions on the manuscript.
The authors declare there is no conflicts of interest.
[1] | R. Bhatia, Positive Definite Matrices, New Jersey: Princeton University Press, 2007. |
[2] |
J. Lawson, Y. Lim, The geometric mean, matrices, metrics, and more, J. Amer. Math. Soc., 108 (2001), 797–812. https://doi.org/10.2307/2695553 doi: 10.2307/2695553
![]() |
[3] |
F. Hiai, Matrix analysis: matrix monotone functions, matrix means, and majorization, Interdiscip. Inf. Sci., 16 (2010), 139–248. https://doi.org/10.4036/iis.2010.139 doi: 10.4036/iis.2010.139
![]() |
[4] |
F. Kubo, T. Ando, Means of positive linear operators, Math. Ann., 246 (1980), 205–224. https://doi.org/10.1007/BF01371042 doi: 10.1007/BF01371042
![]() |
[5] |
T. Ando, Concavity of certain maps on positive definite matrices and applications to Hadamard products, Linear Alg. Appl., 26 (1979), 203–241. https://doi.org/10.1016/0024-3795(79)90179-4 doi: 10.1016/0024-3795(79)90179-4
![]() |
[6] |
A. Ploymukda, P. Chansangiam, Concavity and convexity of several maps involving Tracy-Singh products, Khatri-Rao products, and operator-monotone functions of positive operators, Sci. Asia, 45 (2019), 194–201. https://doi.org/10.2306/scienceasia1513-1874.2019.45.194 doi: 10.2306/scienceasia1513-1874.2019.45.194
![]() |
[7] |
P. Chansangiam, Cancellability and regularity of operator connections with applications to nonlinear operator equations involving means, J. Ineq. Appl., 2015 (2015), 411. https://doi.org/10.1186/s13660-015-0934-7 doi: 10.1186/s13660-015-0934-7
![]() |
[8] |
Y. Lim, Factorizations and geometric means of positive definite matrices, Linear Algebra Appl., 437 (2012), 2159–2172. https://doi.org/10.1016/j.laa.2012.05.039 doi: 10.1016/j.laa.2012.05.039
![]() |
[9] |
A. Ploymukda, P. Chansangiam, Geometric means and Tracy-Singh products for positive operators, Commun. Math. Appl., 9 (2018), 475–488. https://doi.org/10.26713/cma.v9i4.547 doi: 10.26713/cma.v9i4.547
![]() |
[10] | A. Ploymukda, P. Chansangiam, Weighted Lim's geometric mean of positive invertible operators on a Hilbert space, J. Comput. Anal. Appl., 29 (2020), 390–400. |
[11] |
Y. Lim, Geometric means on symmetric cones, Arch. der Math., 75 (2000), 39–45. https://doi.org/10.1007/s000130050471 doi: 10.1007/s000130050471
![]() |
[12] |
J. Lawson, Y. Lim, Symmetric sets with midpoints and algebraically equivalent theories, Result. Math., 46 (2004), 37–56. https://doi.org/10.1007/BF03322869 doi: 10.1007/BF03322869
![]() |
[13] | J. Lawson, Y. Lim, Geometric means and reflection quasigroups, Quasigroups Relat. Syst., 14 (2006), 43–59. |
[14] |
J. Lawson, Y. Lim, Symmetric space with convex metrics, Forum Math., 19 (2007), 571–602. https://doi.org/10.1515/FORUM.2007.023 doi: 10.1515/FORUM.2007.023
![]() |
[15] |
J. Lawson, Y. Lim, Solving symmetric matrix word equations via symmetric space machinery, Linear Algebra Appl., 414 (2006), 560–569. https://doi.org/10.1016/j.laa.2005.10.035 doi: 10.1016/j.laa.2005.10.035
![]() |
[16] | P. Chansangiam, Weighted means and weighted mean equations in lineated symmetric spaces, Quasigroups Relat. Syst., 26 (2018), 197–210. |
[17] |
D. Cheng, Semi-tensor product of matrices and its application to Morgen's problem, Sci. China Ser. F, 44 (2001), 195–212. https://doi.org/10.1007/BF02714570 doi: 10.1007/BF02714570
![]() |
[18] |
D. Cheng, H. Qi, A. Xue, A survey on semi-tensor product of matrices, J. Syst. Sci. Complexity, 20 (2007), 304–322. https://doi.org/10.1007/s11424-007-9027-0 doi: 10.1007/s11424-007-9027-0
![]() |
[19] | D. Cheng, H. Qi, Q. Li, Analysis and control of boolean networks: a semi-tensor product approach, London: Springer-Verlag, 2011. |
[20] |
G. Zhao, H. Li, P. Duan, F. E. Alsaadi, Survey on applications of semi-tensor product method in networked evolutionary games, J. Appl. Anal. Comput., 10 (2020), 32–54. https://doi.org/ 10.11948/20180201 doi: 10.11948/20180201
![]() |
[21] |
Y. Yan, D. Cheng, J. Feng, H. Li, J. Yue, Survey on applications of algebraic state space theory of logical systems to finite state machines, Sci. China Inf. Sci., 66 (2023), 111201. https://doi.org/10.1007/s11432-022-3538-4 doi: 10.1007/s11432-022-3538-4
![]() |
[22] |
Y. Yan, J. Yue, Z. Chen, Algebraic method of simplifying Boolean networks using semi‐tensor product of Matrices, Asian J. Control, 21 (2019), 2569–2577. https://doi.org/10.1002/asjc.2125 doi: 10.1002/asjc.2125
![]() |
[23] |
H. Ji, Y. Li, X. Ding, J. Lu, Stability analysis of Boolean networks with Markov jump disturbances and their application in apoptosis networks, Electronic Res. Arch., 30 (2022), 3422–3434. https://doi.org/10.3934/era.2022174 doi: 10.3934/era.2022174
![]() |
[24] |
D. Cheng, Y. Dong, Semi-tensor product of matrices and its some applications to physics, Meth. Appl. Anal., 10 (2003), 565–588. https://dx.doi.org/10.4310/MAA.2003.v10.n4.a5 doi: 10.4310/MAA.2003.v10.n4.a5
![]() |
[25] |
H. Li, G. Zhao, M. Meng, J. Feng, A survey on applications of semi-tensor product method in engineering, Sci. China Inf. Sci., 61 (2018), 010202. https://doi.org/10.1007/s11432-017-9238-1 doi: 10.1007/s11432-017-9238-1
![]() |
[26] |
Z. Ji, J. Li, X. Zhou, F. Duan, T. Li, On solutions of matrix equation AXB=C under semi-tensor product, Linear Multilinear Algebra, 69 (2019), 1–29. https://doi.org/10.1080/03081087.2019.1650881 doi: 10.1080/03081087.2019.1650881
![]() |
[27] |
P. Chansangiam, S. V. Sabau, Sylvester matrix equation under the semi-tensor product of matrices, An. Ştiinţ. Univ. Al. I. Cuza Iaşi. Mat. (N.S.), 68 (2022), 263–278. https://doi.org/10.47743/anstim.2022.00020 doi: 10.47743/anstim.2022.00020
![]() |
[28] |
J. Jaiprasert, P. Chansangiam, Solving the Sylvester-transpose matrix equation under the semi-tensor product, Symmetry, 14 (2022), 1094. https://doi.org/10.3390/sym14061094 doi: 10.3390/sym14061094
![]() |
[29] |
J. Wang, J. E. Feng, H. L. Huang, Solvability of the matrix equation AX2=B with semi-tensor product, Electronic Res. Arch., 29 (2021), 2249–2267. https://doi.org/10.3934/era.2020114 doi: 10.3934/era.2020114
![]() |
[30] |
P. Chansangiam, A. Ploymukda, Riccati equation and metric geometric means of positive semidefinite matrices involving semi-tensor products, AIMS Math., 8 (2023), 23519–23533. https://doi.org/10.3934/math.20231195 doi: 10.3934/math.20231195
![]() |
1. | Arnon Ploymukda, Kanjanaporn Tansri, Pattrawut Chansangiam, Weighted spectral geometric means and matrix equations of positive definite matrices involving semi-tensor products, 2024, 9, 2473-6988, 11452, 10.3934/math.2024562 |