In this paper, we study the nonlinear Choquard equation
−Δu+V(x)u=(∑y≠xy∈ZN|u(y)|p|x−y|N−α)|u|p−2u
on lattice graph ZN. Under some suitable assumptions, we prove the existence of a ground state solution of the equation on the graph when the function V is periodic or confining. Moreover, when the potential function V(x)=λa(x)+1 is confining, we obtain the asymptotic properties of the solution uλ which converges to a solution of a corresponding Dirichlet problem as λ→∞.
Citation: Jun Wang, Yanni Zhu, Kun Wang. Existence and asymptotical behavior of the ground state solution for the Choquard equation on lattice graphs[J]. Electronic Research Archive, 2023, 31(2): 812-839. doi: 10.3934/era.2023041
[1] | Alexander Pigazzini, Cenap Özel, Saeid Jafari, Richard Pincak, Andrew DeBenedictis . A family of special case sequential warped-product manifolds. Journal of Geometric Mechanics, 2023, 15(1): 116-127. doi: 10.3934/jgm.2023006 |
[2] | Valentin Duruisseaux, Melvin Leok . Time-adaptive Lagrangian variational integrators for accelerated optimization. Journal of Geometric Mechanics, 2023, 15(1): 224-255. doi: 10.3934/jgm.2023010 |
[3] | Jacob R. Goodman . Local minimizers for variational obstacle avoidance on Riemannian manifolds. Journal of Geometric Mechanics, 2023, 15(1): 59-72. doi: 10.3934/jgm.2023003 |
[4] | Xavier Rivas, Daniel Torres . Lagrangian–Hamiltonian formalism for cocontact systems. Journal of Geometric Mechanics, 2023, 15(1): 1-26. doi: 10.3934/jgm.2023001 |
[5] | Maulik Bhatt, Amit K. Sanyal, Srikant Sukumar . Asymptotically stable optimal multi-rate rigid body attitude estimation based on lagrange-d'alembert principle. Journal of Geometric Mechanics, 2023, 15(1): 73-97. doi: 10.3934/jgm.2023004 |
[6] | Jordi Gaset, Arnau Mas . A variational derivation of the field equations of an action-dependent Einstein-Hilbert Lagrangian. Journal of Geometric Mechanics, 2023, 15(1): 357-374. doi: 10.3934/jgm.2023014 |
[7] | Álvaro Rodríguez Abella, Melvin Leok . Discrete Dirac reduction of implicit Lagrangian systems with abelian symmetry groups. Journal of Geometric Mechanics, 2023, 15(1): 319-356. doi: 10.3934/jgm.2023013 |
[8] | William Clark, Anthony Bloch . Existence of invariant volumes in nonholonomic systems subject to nonlinear constraints. Journal of Geometric Mechanics, 2023, 15(1): 256-286. doi: 10.3934/jgm.2023011 |
[9] | Robert I McLachlan, Christian Offen . Backward error analysis for conjugate symplectic methods. Journal of Geometric Mechanics, 2023, 15(1): 98-115. doi: 10.3934/jgm.2023005 |
[10] | Francesco Bonechi, Jian Qiu, Marco Tarlini . Generalised Kähler structure on CP2 and elliptic functions. Journal of Geometric Mechanics, 2023, 15(1): 188-223. doi: 10.3934/jgm.2023009 |
In this paper, we study the nonlinear Choquard equation
−Δu+V(x)u=(∑y≠xy∈ZN|u(y)|p|x−y|N−α)|u|p−2u
on lattice graph ZN. Under some suitable assumptions, we prove the existence of a ground state solution of the equation on the graph when the function V is periodic or confining. Moreover, when the potential function V(x)=λa(x)+1 is confining, we obtain the asymptotic properties of the solution uλ which converges to a solution of a corresponding Dirichlet problem as λ→∞.
Numerical computations on the real (compact) Stiefel manifold viewed as the embedded submanifold Stn,k={X∈Rn×k∣X⊤X=Ik} of Rn×k arise in many branches of applied mathematics like numerical linear algebra and, moreover, in the engineering context, as well. Beside interpolation problems [1], we mention the following examples which are closely linked to optimization. For instance, the symmetric eigenvalue problem can be formulated as an optimization problem on the Stiefel manifold [2]. Moreover, one encounters optimization problems on Stn,k in connection with machine learning [3], multivariate data analysis [4] and computer vision [5,6]. These problems can be tackled by Riemannian optimization methods, see e.g. [2,7,8,9]. An essential part of their design is the choice of an appropriated Riemannian metric [7, Chap. 1]. The Euclidean metric, see e.g. [2], and the so-called canonical metric, see e.g. [10], are well-known, common choices for the Stiefel manifold. For these two metrics, explicit formulas for Riemannian gradients and Riemannian Hessians of smooth functions are known. Such formulas are desirable for the application of several Riemannian optimization methods.
However, there is no reason to restrict to one of these two metrics. In principle, the performance of a Riemannian optimization method could be improved by choosing an alternative metric adapted to the particular function under consideration. For example, the dependence of the speed of convergence of a Riemannian optimization method on the Riemannian metric is investigated in [11] on "Riemannian preconditioning". Moreover, a family of metrics on the generalized Stiefel manifold is introduced in [11] which differs from the family of metrics on Stn,k discussed here.
In this paper, we investigate a 2k-parameter family of pseudo-Riemannian metrics on Stn,k from an extrinsic point of view. This family does not coincide with the family of metrics considered in [12]. Nevertheless, it contains the Euclidean metric and the so-called canonical metric. In addition, the whole one-parameter family which has been recently introduced in [13] is included. An emphasize is put on deriving explicit formulas for gradients and Hessians suitable for applying them in connection with Riemannian optimization methods. In particular, specific results of the conference paper [14] are reproduced as special cases.
Next we give an overview of this text which is kept as self-contained as possible. We start with endowing Rn×k with a family of covariant 2-tensors depending on 2k parameters, which are invariant under the O(n)-left action on Rn×k by matrix multiplication from the left. For suitable choices of these parameters, the corresponding 2-tensor induces a pseudo-Riemannian metric on an open subset U of Rn×k such that Stn,k⊆U becomes a pseudo-Riemannian submanifold of U. Hence it makes sense to consider the normal bundle of Stn,k and the orthogonal projections onto the tangent spaces of Stn,k which can be described by explicit formulas.
In order to put this extrinsic approach into context to existing works on families of metrics on the Stiefel manifold we also consider Stn,k, equipped with our family, as a pseudo-Riemannian reductive homogeneous SO(n)-space. This point of view shows that, for the Riemannian case, the family of metrics which is discussed in this text, is partially contained in the family considered in the work [15] on Einstein metrics. Nevertheless, at least to our best knowledge, the family of metrics on Stn,k considered in this paper has never been treated before from an extrinsic point of view.
After this short detour, we come back to the extrinsic approach. We derive an explicit expression for the spray S:TStn,k→T(TStn,k) associated with the metric. To this end, we exploit a well-known fact, see e.g. [16, Sec. 7.5] for the Riemannian case. The metric spray of a pseudo-Riemannian manifold coincides with the Lagrangian vector field on its tangent bundle associated with the kinetic energy defined by means of the pseudo-Riemannian metric. This allows for computing the metric spray on the tangent bundle TU, where U⊆Rn×k is the open set of which Stn,k is a pseudo-Riemannian submanifold. Eventually, by using a result from [16, Sec. 8.4] on constrained Lagrangian systems, combined with the explicit expression for the orthogonal projections, the metric spray on TStn,k is computed. As a by-product, the geodesic equation is obtained as an explicit second order matrix valued ordinary differential equation (ODE).
Next we derive expressions for pseudo-Riemannian gradients and pseudo-Riemannian Hessians of smooth functions on Stn,k involving only "ordinary" matrix operations. Using the formula for the orthogonal projection onto tangent spaces, we derive an explicit formula for pseudo-Riemannian gradients. Moreover, since we have an expression for the geodesic equation as explicit second order matrix valued ODE, we obtain an explicit formula for pseudo-Riemannian Hessians, too. The expression for the pseudo-Riemannian gradient is valid for all metrics in the 2k-parameter family, while, for the pseudo-Riemannian Hessian, we restrict ourself to a subfamily depending on (k+1)-parameters in order to obtain formulas which are not too complicated. This (k+1)-parameter subfamily still contains the Euclidean metric and the canonical metric as well as the one-parameter family from [13].
Finally, a formula for the second fundamental form of Stn,k considered as pseudo-Riemannian submanifold of an open U⊆Rn×k is derived. We give a concrete expression for the second fundamental form with respect to the metrics in the (k+1)-parameter subfamily. By means of the Gauß formula, an explicit matrix-type formula for the Levi-Civita covariant derivative is obtained.
Throughout this text, except for Section 3.4, we view the real (compact) Stiefel manifold Stn,k as an embedded submanifold of the real (n×k)-matrices Rn×k which is given by
Stn,k={X∈Rn×k∣X⊤X=Ik}⊆Rn×k,1≤k≤n. | (2.1) |
We point out that Stn,k is a proper subset of Rn×k although the inclusion is denoted by Stn,k⊆Rn×k. In the sequel, we often denote proper inclusions by "⊆". The symbol "⊂" is only used if we want to emphasize that an inclusion is not an equality. The tangent bundle of Stn,k is denoted by TStn,k which is considered as a submanifold of TRn×k≅Rn×k×Rn×k. More generally, for a manifold M, we denote by TM and T∗M its tangent and cotangent bundle, respectively. In the sequel, if not indicated other-wise, we identify Rn×k with its dual space (Rn×k)∗ via the linear isomorphism
Rn×k→(Rn×k)∗,V↦tr(V⊤(⋅))=(W↦tr(V⊤W)) | (2.2) |
induced by the Frobenius scalar product. The following characterization of the tangent space of Stn,k at X∈Stn,k considered as subspace of Rn×k is used frequently
TXStn,k={V∈Rn×k∣X⊤V=−V⊤X}⊆Rn×k. | (2.3) |
We write
O(n)=Stn,n={R∈Rn×n∣R⊤R=RR⊤=In} | (2.4) |
for the orthogonal group and
SO(n)={R∈Rn×k∣R⊤R=RR⊤=In and det(R)=1} | (2.5) |
for the special orthogonal group. Their Lie algebras coincide and are denoted by
so(n)={ξ∈Rn×n∣ξ⊤=−ξ}. | (2.6) |
Moreover, we write
skew:Rn×n→so(n)⊆Rn×n, A↦12(A−A⊤) | (2.7) |
for the projection onto so(n) whose kernel is given by the set of symmetric matrices Rn×nsym. The O(n)-left action on Rn×k by matrix multiplication from the left is denoted by
Ψ:O(n)×Rn×k→Rn×k,(R,X)↦RX. | (2.8) |
By restricting the second argument of Ψ one obtains the O(n)-action
O(n)×Stn,k→Stn,k,(R,X)↦RX | (2.9) |
on Stn,k from the left which we denote by Ψ, as well. It is well-known that this O(n)-action on Stn,k is transitive. For fixed R∈O(n) we denote the diffeomorphisms induced by the actions from (2.8) and (2.9)
Rn×k∋X↦RX∈Rn×k and Stn,k∋X↦RX∈Stn,k | (2.10) |
both by ΨR.
If U⊆Rn×k is some subset, we write
ιU:U→Rn×k | (2.11) |
for the canonical inclusion of U into Rn×k. Moreover, the canonical inclusion of Stn,k into Rn×k is often denoted by
ι:Stn,k→Rn×k | (2.12) |
for short.
Next let pr:F→M be a vector bundle over a manifold M with dual bundle F∗. The smooth sections of F are denoted by Γ∞(F). Moreover, we denote by F⊗ℓ, Sℓ(F) and Λℓ(F) the ℓ-th tensor power, the ℓ-th symmetrized tensor power and the ℓ-th antisymmetrized tensor power of F, respectively. In addition, we write End(F)≅F∗⊗F for the endormorphism bundle of F. The vertical bundle of F is denoted by Ver(F)⊆TF.
Let f:M→N be a smooth map between manifolds and let α∈Γ∞((T∗N)⊗ℓ) be a covariant tensor field on N. The pullback of α by f is denoted by f∗α. If α is a differential form, i.e. α∈Γ∞(Λℓ(T∗M)), the exterior derivative of α is denoted by d α. The tangent map of f is denoted by Tf:TM→TN. If f is a map between (open subsets of) finite dimensional R-vector spaces, we write Df(X)V for the derivative of f at X evaluated at V. Sometimes, the tangent map of a smooth map f between arbitrary manifolds at the point X evaluated at a tangent vector V is denoted by Df(X)V, as well.
Next let M⊆Rn×k be a submanifold. A vector field V:M→TM⊆Rn×k×Rn×k is often implicitly identified with the map M→Rn×k defined by its second component which we denote by V, as well, i.e. the "foot point" X∈M is suppressed in our notation. If S∈Γ∞(T(TM)) is a vector field on TM, we view it as a map S:TM→T(TM)⊆(Rn×k)4 usually not suppressing the "foot point" (X,V)∈TM.
For a smooth function F:Rn×k→R we write ∇F(X) for the gradient of F at X∈Rn×k with respect to the Frobenius scalar product, i.e. the unique matrix ∇F(X)∈Rn×k with
d F|X(V)=tr((∇F(X))⊤V) | (2.13) |
for all V∈Rn×k. Furthermore Eij∈Rn×k denotes the matrix whose entries fulfill (Eij)fℓ=δifδjℓ for all f∈{1,…,n} and ℓ∈{1,…,k} with δif and δjℓ being Kronecker deltas.
Finally, following the convention in [17, Chap. 2], a scalar product is a non-degenerated symmetric bilinear form. Moreover, an inner product is a positive definite symmetric bilinear form.
We start with investigating a 2k-parameter family of symmetric covariant 2- tensors on Rn×k. For certain choices of these parameters, it defines a pseudo-Riemannian metric on an open subset U⊆Rn×k such that Stn,k⊆U becomes a pseudo-Riemannian submanifold of U.
We introduce a 2k-parameter family of symmetric covariant 2-tensors on Rn×k.
Lemma 3.1. Let D=diag(D11,…,Dkk)∈Rk×k and E=diag(E11,…,Ekk)∈Rk×k be both diagonal. Then the point-wise definition
⟨V,W⟩D,EX=tr(V⊤WD)+tr(V⊤XX⊤WE) | (3.1) |
with X∈Rn×k and V,W∈TXRn×k≅Rn×k yields a smooth covariant 2-tensor ⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗Rn×k)) which is invariant under the O(n)-action Ψ defined in (2.8).
Proof. Obviously, (3.1) defines a smooth covariant 2-tensor. Let R∈O(n). Then Ψ∗R⟨⋅,⋅⟩D,E(⋅)=⟨⋅,⋅⟩D,E(⋅) holds due to
⟨DΨR(X)V,DΨR(X)W⟩D,EΨR(X)=⟨RV,RW⟩D,ERX=⟨V,W⟩D,EX |
for X∈Rn×k and V,W∈TXRn×k≅Rn×k showing the Ψ-invariance of ⟨⋅,⋅⟩D,E(⋅).
Remark 3.2. Observe that the diagonal entry Eii∈R of the diagonal matrix E=diag(E11,…,Ekk)∈Rk×k shall not be confused with the matrix Eii∈Rn×k introduced at the end of Section 2. In the sequel, it should be clear by the context how the symbol Eii has to be understood.
Remark 3.3. Let E=0. Then ⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗Rn×k)) becomes independent of X∈Rn×k. Hence we may identify ⟨⋅,⋅⟩D,0(⋅) with the symmetric bilinear form
⟨⋅,⋅⟩D:Rn×k×Rn×k→R,(V,W)↦⟨V,W⟩D=tr(V⊤WD). | (3.2) |
If we want to emphasize that ⟨⋅,⋅⟩D is a symmetric bilinear form on Rn×k, we denote it by ⟨⋅,⋅⟩DRn×k.
Remark 3.4. The pull-back ι∗⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗Stn,k)) of ⟨⋅,⋅⟩D,E(⋅) with ι:Stn,k→Rn×k simplifies for the following values of k:
1. For k=n one has Stn,n=O(n). Thus for X∈O(n) and V,W∈TXRn×k≅Rn×k one obtains
⟨V,W⟩D,EX=tr(V⊤W(D+E))=⟨V,W⟩D+E | (3.3) |
due to X⊤X=XX⊤=In, i.e. ι∗⟨⋅,⋅⟩D,E(⋅)=⟨⋅,⋅⟩D+E holds.
2. For k=1 one has Stn,1=Sn−1⊆Rn. Using X⊤V=0 for all X∈Sn−1 and V∈TXSn−1 yields
⟨V,W⟩D,EX=⟨V,W⟩D, | (3.4) |
i.e. ι∗⟨⋅,⋅⟩D,E(⋅)=⟨⋅,⋅⟩D holds.
Remark 3.5. The pull-back ι∗⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗Stn,k)) yields well-known metrics on Stn,k for certain choices of D and E:
1. For D=Ik and E=0 one obtains the Euclidean metric, see e.g. [10], [18, Sec. 23.5] or [2]
2. Setting D=Ik and E=−12Ik yields the canonical metric, see e.g. [10] or [18, Sec. 23.5]
3. For D=2Ik and E=νIk with ν=−2α+1α+1 and α∈R∖{−1} the metric ⟨⋅,⋅⟩D,E(⋅) reproduces a one-parameter family which has been introduced in [13], see in particular [13, Eq. (55)].
In order to investigate ⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗Rn×k)) and its pull-back to Stn,k we first list some properties of ⟨⋅,⋅⟩D.
Lemma 3.6. Let D=diag(D11,…,Dkk)∈Rk×k be diagonal. The following assertions are fulfilled:
1. The symmetric bilinear form ⟨⋅,⋅⟩D:Rn×k×Rn×k→R is a scalar product iff D is invertible.
2. The bilinear form ⟨⋅,⋅⟩D:Rn×k×Rn×k→R is an inner product iff Dii>0 holds for all i∈{1,…,k}.
3. Assume that D is invertible. Then ⟨⋅,⋅⟩D:Rk×k×Rk×k→R induces a scalar product on so(k) iff
Dii+Djj≠0 | (3.5) |
holds for all i,j∈{1,…,k}. This condition is always satisfied for k=1.
4. Let k≥2. Then ⟨⋅,⋅⟩D|so(k)×so(k):so(k)×so(k)→R defines an inner product on so(n) iff
Dii+Djj>0 | (3.6) |
holds for all 1≤i<j≤k. For k=1, this bilinear form defines always an inner product.
Proof. Let Eij∈Rn×k denote the matrix whose entries fulfill (Eij)fℓ=δifδjℓ. Clearly, the set
B={Eij∣i∈{1,…,n}andj∈{1,…,k}} |
defines a basis of Rn×k. Thus it suffices to show that for all Eij∈B the associated linear forms
Rn×k→R,V↦⟨Eij,V⟩D | (3.7) |
are non-zero iff D is invertible. We have
⟨Eij,V⟩D=tr(E⊤ijVD)=VijDjj | (3.8) |
with V=(Vij)∈Rn×k. Equation (3.8) implies that D is invertible iff the linear forms in (3.7) are non-vanishing for all i∈{1,…,n} and j∈{1,…,k} showing Claim 1.
Next we prove Claim 2. Let 0≠V=(Vij)∈Rn×k. Then ⟨V,V⟩D>0 holds iff Dii>0 for i∈{1,…,k} due to
⟨V,V⟩D=tr(V⊤VD)=k∑i=1n∑j=1V2jiDii. |
We now prove Claim 3. For k=1 the assertion is trivial due to dim(so(1))=0. For k≥2 the set {Eij−Eji∣1≤i<j≤k} is a basis of so(k). Thus ⟨⋅,⋅⟩D induces a scalar product on so(k) iff the linear forms
so(k)→R,A↦⟨Eij−Eji,A⟩D |
are non-vanishing for all 1≤i<j≤k. Writing A=(Aij)=(−Aji)∈so(k) we compute
⟨Eij−Eji,A⟩D=⟨Eij,A⟩D−⟨Eji,A⟩D=AijDjj−AjiDii=Aij(Djj+Dii) |
showing that ⟨⋅,⋅⟩D defines a scalar product on so(k) iff
Dii+Djj≠0,i,j∈{1,…,k} |
holds. Here we exploited that Dii+Dii≠0 is automatically fulfilled because D is invertible.
It remains to prove Claim 4. The case k=1 is trivial due to so(1)={0}. Thus assume k≥2. Let A=(Aij)∈so(k). Exploiting Aij=−Aji we calculate
⟨A,A⟩D=12tr(A⊤AD)+12tr(A⊤AD)=12k∑i,j=1A2ij(Dii+Djj). | (3.9) |
Using Aii=0 we conclude that ⟨A,A⟩D>0 holds for all 0≠A∈so(k) iff Dii+Djj>0 is fulfilled for all 1≤i<j≤k.
The next lemma shows that ⟨⋅,⋅⟩D,E(⋅) induces a pseudo-Riemannian metric on the Stiefel manifold for certain choices of D and E.
Lemma 3.7. Let D=diag(D11.…,Dkk)∈Rk×k and E=diag(E11,…,Ekk)∈Rk×k be both diagonal and let X∈Stn,k. Then the following assertions are fulfilled:
1. Let 1≤k<n. The bilinear form
⟨⋅,⋅⟩D,EX:TXRn×k×TXRn×k≅Rn×k×Rn×k→R | (3.10) |
is a scalar product iff D and D+E are both invertible. For k=n the bilinear form in (3.10) defines a scalar product iff D+E is invertible.
2. Assume that (3.10) defines a scalar product. Then the pull-back ι∗⟨⋅,⋅⟩D,E(⋅) to Stn,k defines a pseudo-Riemannian metric on Stn,k, i.e.
⟨⋅,⋅⟩D,EX:TXStn,k×TXStn,k→R | (3.11) |
is a scalar product on TXStn,k, iff the condition
Dii+Eii+Djj+Ejj≠0,i,j∈{1,…,k} | (3.12) |
holds.
3. Assume that (3.10) defines a scalar product. For 2≤k≤n−1 the symmetric covariant 2-tensor ι∗⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗Stn,k)) is a Riemannian metric on Stn,k, i.e.
⟨⋅,⋅⟩D,EX:TXStn,k×TXStn,k→R | (3.13) |
is an inner product on TXStn,k, iff the conditions Dii>0 for all i∈{1,…,k} and
Dii+Eii+Djj+Ejj>0,1≤i<j≤k | (3.14) |
are fulfilled. For k=1 one obtains a Riemannian metric iff D11>0 holds. For k=n the tensor ι∗⟨⋅,⋅⟩D,E(⋅) defines a Riemannian metric iff Dii+Eii+Djj+Ejj>0 holds for all 1≤i<j≤n.
Proof. Since the O(n)-left action Ψ on Rn×k defined in (2.8) is isometric with respect to ⟨⋅,⋅⟩D,E(⋅) by Lemma 3.1 and, moreover, Ψ restricts to a transitive action on Stn,k it suffices to prove the claims for a single point X0∈Stn,k.
We first consider the case k=n. Then ⟨⋅,⋅⟩D,EX=⟨⋅,⋅⟩D+E holds for all X∈Stn,n=O(n) by Remark 3.4, Claim 1. Hence ⟨⋅,⋅⟩D,EX is non-degenerated iff D+E is invertible according to Lemma 3.6, Claim 1. Next we consider the case 1≤k<n. We choose X0=In,k, where
In,k=[Ik0]∈Stn,k, |
and write
V=[V1V2]∈Rn×kandW=[W1W2]∈Rn×k |
with V1,W1∈Rk×k and V2,W2∈R(n−k)×k. By this notation and identifying TXRn×k≅Rn×k we calculate
⟨V,W⟩D,EIn,k=tr([V1V2]⊤[W1W2]D)+tr([V1V2]⊤[Ik000][W1W2]E)=tr(V⊤1W1(D+E))+tr(V⊤2W2D). | (3.15) |
By (3.15) and Lemma 3.6, Claim 1, the bilinear form ⟨⋅,⋅⟩D,EIn,k defines a scalar product on TXRn×k iff D and D+E are both invertible.
Next we assume that D and D+E are choosen such that ⟨⋅,⋅⟩D,EX defines a scalar product on TXRn×k for each X∈Stn,k. We now prove Claim 2 for 1≤k≤n−1. To this end, it is sufficient to show that
⟨⋅,⋅⟩D,EIn,k:TIn,kStn,k×TIn,kStn,k→R | (3.16) |
is a scalar product iff (3.12) holds. The tangent space TIn,kStn,k is given by
TIn,kStn,k={[V1V2]|V1∈so(k) and V2∈R(n−k)×k}⊆TIn,kRn×k≅Rn×k, | (3.17) |
see e.g. [10, Sec. 2.2.1]. Thus we may write V,W∈TXStn,k as
V=[V1V2]∈Rn×k andW=[W1W2]∈Rn×k |
with V1,W1∈so(k) and V2,W2∈R(n−k)×k. We now obtain
ι∗⟨V,W⟩D,EIn,k=tr(V⊤1W1(D+E))+tr(V⊤2W2D) | (3.18) |
analogously to (3.15). Clearly, Equation (3.18) defines a scalar product on TIn,kStn,k iff
so(k)×so(k)→R,(V1,W1)↦tr(V⊤1W1(D+E)) |
yields a scalar product on so(k) and
R(n−k)×k×R(n−k)×k→R,(V2,W2)↦tr(V⊤2W2D) |
defines a scalar product on R(n−k)×k. By applying Lemma 3.6, Claim 3 we obtain the desired result. Next we consider the case k=n. By exploiting the O(n)-invariance of ⟨⋅,⋅⟩D,E(⋅) and TInStn,n=so(n) as well as ⟨⋅,⋅⟩D,E(⋅)=⟨⋅,⋅⟩D+E for k=n, Claim 2 follows by Lemma 3.6, Claim 3.
It remains to prove Claim 3. We first consider the case 2≤k≤n−1. Since the bilinear form on TIn,kStn,k induced by ⟨⋅,⋅⟩D,E(⋅) is given by (3.18), the desired result is a consequence of Lemma 3.6, Claim 2 and Lemma 3.6, Claim 4. For k=1, we observe that ι∗⟨⋅,⋅⟩D,E(⋅) is independent of E due to X⊤V=0 for all X∈Stn,1 and V∈TXStn,1, see also Remark 3.4, Claim 2. Hence (3.18) implies that ⟨⋅,⋅⟩D,E(⋅) is positive definite iff
R(n−k)×k×R(n−k)×k→R,(V2,W2)↦tr(V⊤2W2D) |
is positive definite. The desired result follows by Lemma 3.6, Claim 2. For k=n, the assertion holds due to ⟨⋅,⋅⟩D,EX=⟨⋅,⋅⟩D+E for all X∈Stn,n=O(n) by Lemma 3.6, Claim 3.
The next lemma generalizing [14, Lem. 2] shows that there is an open neigbourhood U⊆Rn×k of Stn,k such that Stn,k⊆(U,ι∗U⟨⋅,⋅⟩D,E(⋅)) is a pseudo-Riemannian submanifold. This fact is crucial for the following discussion.
Lemma 3.8. Let D,E∈Rk×k be both diagonal such that for each X∈Stn,k
⟨⋅,⋅⟩D,EX:TXRn×k×TXRn×k→R | (3.19) |
defines a scalar product on TXRn×k≅Rn×k which induces a scalar product on TXStn,k⊆TXRn×k. Then there exists an open neighbourhood U⊆Rn×k of Stn,k such that ι∗U⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗U)) is a pseudo-Riemannian metric on U and (Stn,k,ι∗⟨⋅,⋅⟩D,E(⋅)) is a pseudo-Riemannian submanifold of (U,ι∗U⟨⋅,⋅⟩D,E(⋅)).
Proof. We identify ⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗Rn×k)) with the continuous map
φ:Rn×k→S2((Rn×k)∗),X↦⟨⋅,⋅⟩D,EX=((V,W)↦⟨V,W⟩D,EX). |
The bilinear form φ(X)=⟨⋅,⋅⟩D,EX∈S2((Rn×k)∗) is a scalar product for all X∈Stn,k by assumption. Hence, by the continuity of φ, there is an on open neighbourhood UX of X in Rn×k such that φ(˜X)∈S2((Rn×k)∗) is non-degnerated for all ˜X∈UX. We set
U=⋃X∈Stn,kUX. |
Then U⊆Rn×k is open as a union of open sets and fulfills Stn,k⊆U by definition. Moreover, φ(˜X) is non-dengenerated for all ˜X∈U by construction. Hence ι∗U⟨⋅,⋅⟩D,E(⋅) defines a pseudo-Riemannian metric on U such that Stn,k⊆(U,ι∗U⟨⋅,⋅⟩D,E(⋅)) is a pseudo-Riemannian submanifold.
Obviously, the inclusion Stn,k⊆U from Lemma 3.8 is always proper since Stn,k is closed in Rn×k while U is open in Rn×k.
Notation 3.9. From now on, unless indicated otherwise, pull-backs of ⟨⋅,⋅⟩D,E(⋅) to submanifolds of Rn×k are suppressed in the notation.
In the case k=n, the 2k-parameter family of covariant 2-tensors ⟨⋅,⋅⟩D,E(⋅) is actually a k-parameter family by Remark 3.4, Claim 1. Indeed, ⟨⋅,⋅⟩D,E(⋅) depends only on D+E. Hence one may ask if there exits always such an over-parameterization.
Lemma 3.10. Let D=diag(D11,…,Dkk)∈Rk×k be some diagonal matrix. Then the following assertions are fulfilled:
1. The bilinear form ⟨⋅,⋅⟩D:Rn×k×Rn×k→R vanishes identically iff D=0 holds.
2. The restriction ⟨⋅,⋅⟩D|so(k)×so(k):so(k)×so(k)→R of ⟨⋅,⋅⟩D:Rk×k×Rk×k→R fulfills the following assertions:
(a) For k=1 one has ⟨⋅,⋅⟩D|so(k)×so(k)=0 for all D∈R1×1≅R.
(b) For k=2 one has ⟨⋅,⋅⟩D|so(k)×so(k)=0 iff D11+D22=0 holds.
(c) For k≥3 one has ⟨⋅,⋅⟩D|so(k)×so(k)=0 iff D=0 holds.
Proof. Let Eij∈Rn×k the matrix whose entries fulfill (Eij)fℓ=δifδjℓ. Then
⟨Eij,V⟩D=VijDjj,i∈{1,…,n}, j∈{1,…,k}, | (3.20) |
where V=(Vij)∈Rn×k. Since ⟨⋅,⋅⟩D=0 holds iff the linear forms ⟨Eij,⋅⟩D:Rn×k→R vanishes for all 1≤i≤n and 1≤j≤k, the first claim follows by (3.20).
Next, we consider ⟨⋅,⋅⟩D|so(k)×so(k):so(k)×so(k)→R. Clearly, it vanishes for k=1 for all D∈R1×1 due to so(1)={0}.
We now assume k≥2. Then ⟨⋅,⋅⟩D|so(k)×so(k):so(k)×so(k)→R vanishes iff the linear forms
⟨Eij−Eji,⋅⟩D:so(k)→R | (3.21) |
vanish for all 1≤i<j≤k. Writing A=(Aij)=(−Aji)∈so(k) we obtain
⟨Eij−Eji,A⟩D=AijDii−AjiDjj=Aij(Dii+Djj). |
Thus the linear forms (3.21) are zero iff Dii+Djj=0 holds for all 1≤i<j≤k. For k=2 this is equivalent to D11+D22=0. It remains to consider the case k≥3. The conditions Dii+Djj=0 for all 1≤i<j≤k include the conditions
D11+Dii=0⟺D11=−Dii for all2≤i≤k | (3.22) |
and
D(k−1)(k−1)+Dkk=0. | (3.23) |
In particular D11=−Dk−1 and D11=−Dkk holds. Plugging these identities into (3.23) yields
−D11−D11=−2D11=0⟺D11=0. |
Hence (3.22) implies Dii=0 for all 2≤i≤k. Therefore ⟨⋅,⋅⟩D|so(k)×so(k)=0 iff D=0 as desired.
The next lemma justifies calling ⟨⋅,⋅⟩D,E(⋅) a 2k-parameter family provided that 3≤k≤n−1 holds.
Lemma 3.11. Let
Rk×kdiag={diag(D11,…,Dkk)∣D11,…,Dkk∈R}⊆Rk×k |
denote the k-dimensional real vector space of (k×k)-diagonal matrices. Moreover, define
ψ:Rk×kdiag×Rk×kdiag→Γ∞(S2(T∗Stn,k)),(D,E)↦⟨⋅,⋅⟩D,E(⋅). | (3.24) |
Then ψ is a linear map which fulfills the following assertions depending on k and n:
1. For k=1=n, one has dim(im(ψ))=0 and ker(ψ)=R×R.
2. For k=1 and n>1 one has dim(im(ψ))=1 and ker(ψ)={(0,E)∣E∈R}⊆R×R.
3. For k=2=n one has dim(im(ψ))=1 and
ker(ψ)={((D11,D22),(E11,−D11−D22−E11))∣D11,D22,E11∈R}⊆R2×2diag×R2×2diag. |
4. For 2<k<n one has dim(im(ψ))=2k and ker(ψ)={0}⊆Rk×kdiag×Rk×kdiag.
5. For k=n>2 one has dim(im(ψ))=k and ker(ψ)={(D,−D)∣D∈Rk×kdiag}⊆Rk×kdiag×Rk×kdiag.
Proof. Clearly, the map ψ is linear. Next we define the linear map
˜ψ:Rk×kdiag×Rk×kdiag→S2(T∗In,kStn,k),(D,E)↦⟨⋅,⋅⟩D,EIn,k. |
Obviously, for each (D,E)∈Rk×kdiag×Rk×kdiag one has (ψ(D,E))(In,k)=⟨⋅,⋅⟩D,EIn,k=˜ψ(D,E). Since ⟨⋅,⋅⟩D,E(⋅) is invariant under the transitive O(n)-action Ψ on Stn,k according to Lemma 3.1, this yields
(D,E)∈ker(ψ)⟺(D,E)∈ker(˜ψ). | (3.25) |
Moreover, the equivalence
(D,E)∈ker(˜ψ)⟺(⟨V,W⟩D,EIn,k=0 for all V,W∈TIn,kStn,k) | (3.26) |
is clearly fulfilled. We again write
V=[V1V2]∈TIn,kStn,kandW=[W1W2]∈TIn,kStn,k |
with V1,W1∈so(k) and V2,W2∈R(n−k)×k. By this notation and the description of ker(˜ψ) from (3.26), we study each case separately:
1. Obviously, for k=1=n the claim ker(˜ψ)=R×R is correct due to TI1St1,1={0} implying dim(S2(T∗I1St1,1))=0.
2. For k=1 and n>1 we have
(˜ψ(D,E))(V,W)=tr(V⊤1W1(D+E))+tr(V⊤2W2D)=⟨V1,W1⟩D+E|so(1)×so(1)+⟨V2,W2⟩DR(n−1)×1. | (3.27) |
Clearly, Equation (3.27) vanishes iff D=0 holds independent of the value of D+E by Lemma 3.10. Hence the kernel of ψ is given by ker(˜ψ)={(0,E)∣E∈R}
3. For k=2=n we have
(˜ψ(D,E))(V,W)=⟨V,W⟩D+E|so(2)×so(2). |
Lemma 3.10 yields ˜ψ(D,E)=0 iff (D+E)11+(D+E)22=0 is fulfilled. Therefore we obtain
ker(˜ψ)={((D11,D22),(E11,−D11−D22−E11))∣D11,D22,E11∈R}. |
4. We now consider the case 3≤k≤n−1. Then one has
(˜ψ(D,E))(V,W)=tr(V⊤1W1(D+E))+tr(V⊤2W2D)=⟨V1,W1⟩D+E|so(k)×so(k)+⟨V2,W2⟩DR(n−k)×k. |
By Lemma 3.10, we have ˜ψ(D,E)=0 iff D=0 and D+E=0 holds. Therefore the kernel of ψ is given by ker(˜ψ)={(D,E)∈Rk×kdiag×Rk×kdiag∣D=0=E}={0}.
5. It remains to consider the case k=n≥3. We obtain
(˜ψ(D,E))(V,W)=tr(V⊤W(D+E))=⟨V,W⟩D+E|so(k)×so(k). |
for all V,W∈TInStn,n=so(n). Thus ˜ψ(D,E)=0 holds iff D+E=0 is fulfilled by Lemma 3.10. Hence the kernel of ˜ψ is given by ker(˜ψ)={(D,−D)∣D∈Rk×kdiag}.
The equality ker(ψ)=ker(˜ψ) is satisfied according to (3.25). Moreover, we have
dim(im(ψ))=dim(Rk×kdiag×Rk×kdiag)−dim(ker(ψ))=2k−dim(ker(˜ψ)) |
as desired.
Remark 3.12. Lemma 3.7, Claim 3 shows that the set of all parameters
{(D,E)∈Rk×kdiag×Rk×kdiag∣⟨⋅,⋅⟩D,E(⋅) defines a pseudo-Riemannian metric on Stn,k} |
contains the non-empty subset {(D,E)∈Rk×kdiag×Rk×kdiag∣Dii>0 and Eii>0 for all i∈{1,…,k}} which is open in Rk×kdiag×Rk×kdiag. Moreover, the linear map ψ:Rk×kdiag×Rk×kdiag→Γ∞(S2(T∗Stn,k)) is injective for 2<k<n according to Lemma 3.11. This point of view justifies calling ⟨⋅,⋅⟩D,E(⋅) a 2k-parameter family at least for 2<k<n. For other choices of k and n one has rather a (dim(im(ψ)))-parameter family of metrics. However, ignoring this over parameterization, we call them 2k-parameter family, nevertheless.
The Stiefel manifold Stn,k endowed with ⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗Stn,k)) can be viewed as a pseudo-Riemannian submanifold of (U,⟨⋅,⋅⟩D,E(⋅)) with some suitable open U⊆Rn×k by Lemma 3.8. Consequently, for any given point X∈Stn,k, we may consider the orthogonal projection
PX:TXRn×k→TXStn,k⊆Rn×k, |
where TXRn×k≅Rn×k is endowed with the scalar product ⟨⋅,⋅⟩D,EX. Moreover, it makes sense to consider the normal space NXStn,k=(TXStn,k)⊥⊆Rn×k with respect to ⟨⋅,⋅⟩D,EX:TXRn×k×TXRn×k→R.
Notation 3.13. From now on, unless indicated otherwise, we always assume that D,E∈Rk×k are both diagonal matrices such that ⟨⋅,⋅⟩D,EX defines a scalar product on Rn×k for each X∈Stn,k and ⟨⋅,⋅⟩D,E(⋅) induces a pseudo-Riemannian metric on Stn,k. In particular, we may assume that D and D+E are both invertible. In view of Lemma 3.7, Claim 1 this assumption is of no restriction. For the case k=n, we replace D by D+E and E by 0, if necessary.
Lemma 3.14. Let D=diag(D11,…,Dkk)∈Rk×k be invertible such that Dii+Djj≠0 holds for all i,j∈{1,…,k}. Then the following assertions are fulfilled:
1. The orthogonal complement of so(k) in Rk×k with respect to the scalar product ⟨⋅,⋅⟩D is given by
so(k)⊥D={A∈Rk×k∣AD=(AD)⊤}={ΛD−1∣Λ∈Rk×ksym}⊆Rk×k. | (3.28) |
Moreover, so(k)⊕so(k)⊥D=Rk×k holds.
2. The orthogonal projection
πD:Rk×k→so(k)⊆Rk×k,A↦πD(A) | (3.29) |
onto so(k) with respect ⟨⋅,⋅⟩D is entry-wise given by
πD(A)ij=1Dii+Djj(AD−DA⊤)ij=1Dii+Djj(AijDjj−AjiDii),i,j∈{1,…,k}. | (3.30) |
Proof. We first determine so(k)⊥D. To this end, we calculate
so(k)⊥D={A∈Rk×k∣⟨A,B⟩D=0 for all B∈so(n)}={A∈Rk×k∣tr((AD)⊤B)=0 for all B∈so(n)}={A∈Rk×k∣AD=(AD)⊤∈Rk×ksym is symmetric }. |
Let Λ∈Rk×ksym. Then (ΛD−1)D=Λ=Λ⊤=D(ΛD−1)⊤ showing {ΛD−1∣Λ∈Rk×ksym}⊆so(k)⊥D. The equality so(k)⊥D={ΛD−1∣Λ∈Rk×ksym} follows by counting dimensions. By Lemma 3.6, Claim 3 the assumptions on D ensure that ⟨⋅,⋅⟩D induces a scalar product on so(k). Hence so(k)⊕so(k)⊥D=Rk×k holds, see e.g. [17, Chap. 2, Lem. 23].
It remains to prove Claim 2. To this end, we show im(πD)=so(k) and ker(πD)=so(k)⊥D as well as πD|so(k)=idso(k). We first prove im(πD)⊆so(n). Let A=(Aij)∈Rk×k. We compute
((πD(A))⊤)ij=πD(A)ji=1Djj+Dii(AjiDii−AijDjj)=−1Dii+Djj(AijDjj−AjiDii)=−πD(A)ij. |
for i,j∈{1,…,k} showing im(πD)⊆so(k). Moreover, for A∈so(k), i.e. Aij=−Aji, we have
πD(A)ij=1Dii+Djj(AijDjj−(−Aij)Dii)=1Dii+DjjAij(Djj+Dii)=Aij. |
This yields πD(A)=A for all A∈so(k), i.e. πD|so(k)=idso(k). Moreover, the inclusion im(πD)⊆so(k) is in fact an equality. Next let A∈so(k)⊥D. Then AD=DA⊤ holds according to Claim 1 implying
πD(A)ij=1Dii+Djj(AD−DA⊤)ij=0. |
Thus πD|so⊥D=0 follows.
The formula for πD can be rewritten in terms of the so-called Hadamard or Schur product. For matrices A,B∈Rk×k, it is entry-wise defined by
(A⊚B)ij=AijBij,i,j∈{1,…,k}. | (3.31) |
Remark 3.15. Let μ∈Rk×k be defined entry-wise by
μij=1Dii+Djj,i,j∈{1,…,k}. | (3.32) |
Then the projection πD:Rn×k→so(k) from Lemma 3.14 can be rewritten as
πD(A)=μ⊚(AD−DA⊤),A∈Rk×k. | (3.33) |
Corollary 3.16. Let 0≠β∈R and define D=βIk. Then, for each A∈Rk×k the map πD from Lemma 3.14 simplifies to
πβIk(A)=12(A−A⊤)=skew(A). | (3.34) |
Proof. The desired result follows by a straightforward calculation exploiting Dii=β≠0 for all i∈{1,…,k}.
We determine the normal spaces of Stn,k with respect ⟨⋅,⋅⟩D,E(⋅) generalizing [19, Chap. 1, Lem. 3.15] and [14, Lem. 3].
Lemma 3.17. The normal space NXStn,k=(TXStn,k)⊥⊆TXRn×k≅Rn×k at X∈Stn,k with respect to ⟨⋅,⋅⟩D,EX is given by
NXStn,k={XΛ(D+E)−1∈Rn×k∣Λ=Λ⊤∈Rk×ksym}. | (3.35) |
Proof. Clearly, the set {XΛ(D+E)−1∈Rn×k∣Λ=Λ⊤∈Rk×ksym} is a linear subspace of Rn×k of dimension (k2+k)/2 being the image of the injective linear map
Rk×ksym→Rn×k,Λ↦XΛ(D+E)−1. |
Moreover, every matrix V=XΛ(D+E)−1 with Λ∈Rk×ksym is orthogonal to the tangent space TXStn,k. Indeed, we have for W∈TXStn,k
⟨V,W⟩D,EX=tr((XΛ(D+E)−1)⊤WD)+tr((XΛ(D+E)−1)⊤XX⊤WE)=tr(Λ⊤(X⊤W))=0 |
due to Λ=Λ⊤ and X⊤W=−W⊤X. Therefore {XΛ(D+E)−1∈Rn×k∣Λ=Λ⊤∈Rk×ksym}⊆NXStn,k follows. By counting dimensions, this inclusion is in fact an equality.
Theorem 3.18. Let X∈Stn,k. The orthogonal projection of TXRn×k≅Rn×k onto TXStn,k⊆Rn×k with respect to ⟨⋅,⋅⟩D,EX is given by
PX:Rn×k→TXStn,k⊆Rn×k,V↦PX(V)=V−XX⊤V+XπD+E(X⊤V). | (3.36) |
Proof. We first show im(PX)=TXStn,k. Let X∈Stn,k and V∈Rn×k. One calculates
X⊤(PX(V))=X⊤(V−XX⊤V+XπD+E(X⊤V))=X⊤V−X⊤V+πD+E(X⊤V)=πD+E(X⊤V). |
Moreover, using im(πD+E)=so(n), we obtain
(PX(V))⊤X=(V−XX⊤V+XπD+E(X⊤V))⊤X=V⊤X−V⊤X+(πD+E(X⊤V))⊤=−πD+E(X⊤V). |
Hence X⊤(PX(V))=πD+E(X⊤V)=−(PX(V))⊤X follows, i.e. im(PX)⊆TXStn,k as desired.
We now assume V∈TXStn,k. By using X⊤V=−V⊤X and πD|so(n)=idso(n), we calculate
PX(V)=V−XX⊤V+XπD(X⊤V)=V−XX⊤V+X(X⊤V)=V |
proving PX|TXStn,k=idTXStn,k and implying that im(PX)⊆TXStn,k is indeed an equality.
It remains to show ker(PX)=(TXStn,k)⊥. Let V∈NXStn,k. We may write V=XΛ(D+E)−1 with some suitable symmetric matrix Λ∈Rk×ksym by exploiting Lemma 3.17. Consequently, we have
PX(V)=PX(XΛ(D+E)−1)=XΛ(D+E)−1−XX⊤(XΛ(D+E)−1)+XπD+E(X⊤XΛ(D+E)−1)=XπD+E(Λ(D+E)−1)=0, |
by using Lemma 3.14, Claim 1 which shows πD+E(Λ(D+E)−1)=0.
Theorem 3.18 reproduces several results known in the literature.
Remark 3.19. Let X∈Stn,k. We obtain the following special cases for PX:Rn×k→TXStn,k by using Corollary 3.16:
1. For D=Ik and E=0 we get the formula
PX(V)=V−XX⊤V+Xskew(X⊤V)=(In−12XX⊤)V−12XV⊤X | (3.37) |
that can be found for example in [2, Ex. 3.6.2] or [10, Eq. (2.4)]
2. More generally, for D=2Ik and E=νIn with ν∈R∖{−2} one obtains
PX(V)=V−XX⊤V+Xskew(X⊤V)=V−12XX⊤V−12XV⊤X | (3.38) |
reproducing the orthogonal projection from [14, Prop. 2].
Next we determine an orthonormal basis of (TIn,kStn,k,⟨⋅,⋅⟩D,E(⋅)) which allows for computing the signature of ⟨⋅,⋅⟩D,E(⋅), as well.
Remark 3.20. We define the subsets B1,B2⊆Rn×k such that B=B1∪B2 is an orthonormal basis of (TIn,kStn,k,⟨⋅,⋅⟩D,EIn,k). To this end, let Eij∈Rn×k denote the matrix whose entries fulfill (Eij)fℓ=δifδjℓ as usual. We set B1=∅ for k=1 and define
B1={1√|sij|(Eij−Eji)|sij=Dii+Eii+Djj+Ejj, 1≤i<j≤k},2≤k≤n. | (3.39) |
Moreover, we set
B2={1√|Djj|Eij|k+1≤i≤n,1≤j≤k},1≤k<n. | (3.40) |
and B2=∅ for k=n. A straightforward calculation shows ⟨V,W⟩D,EIn,k=0 for all V,W∈B with V≠W. Moreover, for V=W∈B one obtains
⟨1√|sij|(Eij−Eji),1√|sij|(Eij−Eji)⟩D,EIn,k=sij|sij|=±1,1≤i<j≤k | (3.41) |
and
⟨1√|Djj|Eij,1√|Djj|Eij⟩D,EIn,k=Djj|Djj|=±1,k+1≤i≤n, 1≤j≤k. | (3.42) |
Hence B is in fact an orthonormal basis. Thus we may compute the signature of ⟨⋅,⋅⟩D,E(⋅). The number of negative signs associated with ⟨⋅,⋅⟩D,E(⋅), named index in [17, Chap. 2, Def. 18], is given by
s=♯{(i,j)∣1≤i<j≤k and sij<0}+(n−k)⋅♯{j∣1≤j≤k and Djj<0}, | (3.43) |
where ♯S denotes the number of elements in the finite set S.
Before we continue with the extrinsic approach, we briefly discuss the metric ⟨⋅,⋅⟩D,E(⋅) on Stn,k viewed as a pseudo-Riemannian reductive homogeneous SO(n)-space. This point of view allows for relating ⟨⋅,⋅⟩D,E(⋅) to the metrics investigated in [15]. For general properties of reductive homogeneous space we refer to [17, Chapter 11] as well as [18, Section 23.4].
Throughout this subsection, we assume 1≤k≤n−1 and n≥3. Then the Killing form on SO(n) given by
⟨ξ,η⟩=(n−2)tr(ξη),ξ,η∈so(n) |
is negative definite, see e.g. [18, Sec. 21.6]. In addition, Stn,k is diffeomorphic to the reductive homogeneous space SO(n)/SO(n−k), where SO(n−k) is realized as a closed subgroup of SO(n) via
SO(n−k)≅{[Ik00R]|R∈SO(n−k)}⊆SO(n) |
and a reductive split is given by so(n)=h⊕m, where
h={[000ξ22]|ξ22∈so(n−k)} and m={[ξ11−ξ⊤21ξ210]|ξ11∈so(k), ξ21∈R(n−k)×k}, |
see e.g. [18, Sec. 23.5]. In particular, since SO(n)×Stn,k∋(R,X)↦RX∈Stn,k is a transitive SO(n)-left action whose stabilizer subgroup of In,k coincides with SO(n−k)⊆SO(n), the map
pr:SO(n)→Stn,k≅SO(n)/SO(n−k),R↦RIn,k | (3.44) |
is a surjective submersion which induces a SO(n)-equivariant diffeomorphism
ˇpr:SO(n)/SO(n−k)→Stn,k,R⋅SO(n−k)↦RIn,k. | (3.45) |
Here R⋅SO(n−k)∈SO(n)/SO(n−k) denotes the coset defined by R∈SO(n). We refer to [20, Thm. 6.4] and [21, Thm. 21.18] for more details on diffeomorphisms associated with transitive actions.
In the sequel, we construct a scalar product
⟨⋅,⋅⟩red(D,E):so(n)×so(n)→R |
on so(n) which induces a left-invariant metric on SO(n) such that (3.44) becomes a pseudo-Riemannian submersion. In addition, equipping SO(n)/SO(n−k) with this submersion metric turns (3.45) into a SO(n)-equivariant isometry to (Stn,k,⟨⋅,⋅⟩D,E(⋅)).
Throughout this section we denote by D,E∈Rk×k diagonal matrices such that D and D+E are both invertible, see also Notation 3.13.
Lemma 3.21. Let Eij∈Rn×n be the matrix whose entries fulfill (Eij)fℓ=δifδjℓ and let F=D+E∈Rk×k. Then
A:so(n)→so(n),ξ=[ξ11−ξ⊤21ξ21ξ22]↦A(ξ)=[skew(ξ11(D+E))−12Dξ⊤2112ξ21Dξ22] | (3.46) |
is linear, where ξ11∈so(k), ξ22∈so(n−k) and ξ21∈R(n−k)×k. Moreover, evaluating A at the basis {(Eij−Eji)∣1≤i<j≤k} of so(n) yields
A(Eij−Eji)={Fii+Fjj2(Eij−Eji) if 1≤i<j≤k,12Djj(Eij−Eji) if k+1≤i≤n, 1≤j≤k,Eij−Eji if k+1≤i<j≤n. | (3.47) |
In particular, A:so(n)→so(n) as well as its restriction A|m:m→m are linear isomorphisms.
Proof. Clearly, A is linear. We show (3.47) by using the definition of A in (3.46). First we consider the case 1≤i<j≤k. Then Eij−Eji is mapped by A to
A(Eij−Eji)=[skew((ˆEij−ˆEji)F)000]=[skew(ˆEijFjj−ˆEjiFii)000]=Fii+Fjj2(Eij−Eji), |
with ˆEij∈Rk×k defined by (ˆEij)fℓ=δifδjℓ. Next assume k+1≤i≤n and 1≤j≤k. One obtains
A(Eij−Eji)=12Djj(Eij−Eji). |
The equality A(Eij−Eji)=Eij−Eji for k+1≤i<j≤n is obvious.
Lemma 3.22. Define
⟨⋅,⋅⟩red(D,E):so(n)×so(n)→R,(ξ,η)↦⟨ξ,η⟩red(D,E)=tr(ξ⊤A(η)), | (3.48) |
where A:so(n)→so(n) is the linear map from Lemma 3.21. Then the following assertions are fulfilled:
1. ⟨⋅,⋅⟩red(D,E) is a scalar product on so(n).
2. The restriction of ⟨⋅,⋅⟩red(D,E) to m defines a scalar product ⟨⋅,⋅⟩red(D,E):m×m→R on m.
3. Writing
ξ=[ξ11−ξ⊤21ξ21ξ22]∈so(n)andη=[η11−η⊤21η21η22]∈so(n) | (3.49) |
with ξ11,η11∈so(k) and ξ21,η21∈R(n−k)×k yields
⟨ξ,η⟩red(D,E)=tr(ξ⊤11skew(η11(D+E)))+tr(ξ⊤21η21D)+tr(ξ⊤22η22). | (3.50) |
4. ⟨⋅,⋅⟩red(D,E) is Ad(SO(k))-invariant.
5. Declaring Tepr:TeSO(n)→Tpr(e)(SO(n)/SO(n−k)) as an isometry defines a SO(n)-invariant pseudo-Riemannian metric on SO(n)/SO(n−k) such that pr:SO(n)→SO(n)/SO(n−k) is a pseudo-Riemannian submersion, where SO(n) is equipped with the left-invariant metric defined by ⟨⋅,⋅⟩red(D,E).
Proof. Obviously, ⟨⋅,⋅⟩red(D,E) is a bilinear form. Using the notation introduced in (3.49) one calculates
⟨ξ,η⟩red(D,E)=tr([ξ11−ξ⊤21ξ21ξ22]⊤[skew(η11(D+E))−12Dη⊤2112η21Dη22])=tr(ξ⊤11(skew(η11(D+E)))+tr(ξ⊤21η21D)+tr(ξ⊤22η22)=⟨η,ξ⟩red(D,E). | (3.51) |
Hence ⟨⋅,⋅⟩red(D,E) is symmetric. Claim 3 follows by (3.51), as well. Moreover, ⟨⋅,⋅⟩red(D,E) is a scalar product since A:so(n)→so(n) is a linear isomorphism by Lemma 3.21 showing Claim 1. Claim 2 follows since A|m:m→m is an isomorphism, too.
In order to show the Ad(SO(n−k))-invariance we calculate
Adg(ξ)=[Ik00R][ξ11−ξ21ξ21ξ22][Ik00R]⊤=[ξ11−(Rξ21)⊤Rξ21Rξ22R⊤] |
for ξ=[x11−ξ⊤21ξ21ξ22]∈so(n) and g=[Ik00R]∈SO(n−k)⊆SO(n) implying
⟨Adg(ξ),Adg(η)⟩red(D,E)=tr(ξ⊤11(skew(η11(D+E))))+tr(((Rξ21)⊤Rη21D)+tr((Rξ22R⊤)⊤Rη22R⊤)=⟨ξ,η⟩red(D,E) |
as desired.
It remains to prove Claim 5. By (3.50) the vector spaces m⊆so(n) and h⊆so(n) are orthogonal complements with respect to ⟨⋅,⋅⟩red(D,E). Moreover, by exploiting the Ad(SO(n−k))-invariance of ⟨⋅,⋅⟩red(D,E), this claim follows by [18, Prop. 23.23] which extends to the pseudo-Riemannian setting because its proof only relies on the non-degeneracy of the metric.
After this preparation, we are in the position to show that ⟨⋅,⋅⟩red(D,E) has indeed the desired property. To this end, the tangent map of (3.44) at In∈SO(n) is determined as
TInpr:so(n)→TIn,kStn,k,ξ↦ξIn,k. | (3.52) |
Proposition 3.23. Let SO(n)/SO(n−k) be equipped with the pseudo-Riemannian metric constructed by means of the scalar product ⟨⋅,⋅⟩red(D,E):m×m→R and let Stn,k be endowed with the metric ⟨⋅,⋅⟩D,E(⋅).
1. The restriction of (3.52) to m, i.e. the linear map
TInpr|m:m→TIn,kStn,k,ξ↦ξIn,k | (3.53) |
is an isometry, where TIn,kStn,k is equipped with the scalar product ⟨⋅,⋅⟩D,EIn,k.
2. The SO(n)-equivariant diffeomorphism (3.45) is an isometry.
Proof. We write ξ,η∈m as
ξ=[ξ11−ξ⊤21ξ210] andη=[η11−η⊤21η210] |
with ξ11,η11∈so(k) as well as ξ21,η21∈R(n−k)×k and compute
⟨TInprξ,TInprη⟩D,Epr(In)=⟨[ξ11ξ21],[η11η21]⟩D,EIn,k=tr(ξ⊤11(skew(η11(D+E))))+tr(ξ⊤21η21D)=⟨ξ,η⟩red(D,E), |
where the last equality holds by Lemma 3.22, Claim 3. It remains to show Claim 2. Since the metric on SO(n)/SO(n−k) induced by ⟨⋅,⋅⟩red(D,E) and the metric ⟨⋅,⋅⟩D,E(⋅) on Stn,k are both SO(n)-invariant, the map ˇpr:SO(n)/SO(n−k)→Stn,k is an isometry by Claim 1 due to its SO(n)-equivariance.
Proposition 3.23 allows for relating the metric ⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗Stn,k)) to the metrics on Stn,k defined in [15, Eq. (3.2)]. In order to compare these metrics we introduce some notation following [15]. We choose k1,…,ks∈N with
k1+⋯+ks=k and ki≥2 for all i∈{1,…,s} |
and write
D=diag(˜D11Ik1,…,˜DssIks) and E=diag(˜E11Ik1,…,˜EssIks), | (3.54) |
where ˜Dii,˜Eii∈R. Using the notation from [15] with p=m and t=1 we rewrite ⟨⋅,⋅⟩red(D,E) as
⟨⋅,⋅⟩red(D,E)=s∑i=1˜Dii+˜Eii+˜Dii+˜Eii2(n−2)⟨⋅,⋅⟩pi+∑1≤i<j≤s˜Dii+˜Eii+˜Djj+˜Ejj2(n−2)⟨⋅,⋅⟩pij+s∑i=1˜Dii2(n−2)⟨⋅,⋅⟩pi,s+1. |
Here ⟨⋅,⋅⟩pij=(n−2)tr((⋅)⊤(⋅))|pij×pij denotes the Killing form on so(n) scaled by −1 restricted to pij. Hence ⟨⋅,⋅⟩red(D,E) coincides with the inner product defined in [15, Eq. (3.2)], where xi=xii and
xij={˜Dii+˜Eii+˜Djj+˜Ejj2(n−2) if 1≤i≤j≤s˜Dii2(n−2) if j=s+1 and 1≤i≤s, |
provided that D and E are defined as in (3.54) as well as
˜Dii>0,i∈{1,…,s} and ˜Dii+˜Eii+˜Djj+˜Ejj>0,i,j∈{1,…,s} |
holds. This can be seen by observing that for ξ∈m=p the unique decomposition of ξ into sums of ξij∈pij can be rewritten in terms of block matrices as
![]() |
Finally, we point out that the Einstein metrics discussed in [15, Sec. 6] yield the following equations for D and E
x=˜Dii+˜Eiin−2 for 1≤i≤sy=˜Dii+˜Eii+˜Djj+˜Ejj2(n−2) for 1≤i<j≤s,z=˜Dii2(n−2) for 1≤i≤s, |
where x,y,z denote the parameters of the metric from [15, Eq. (6.2)]. Thus
D=2(n−2)zIk⟹D+E=2z(n−2)Ik+E=(n−2)xIk⟹E=(n−2)(x−2z)Ik |
and therefore y=(n−2)(2z+(x−2z)+2z+(x−2z))2(n−2)=x holds for x,z∈R. In particular, the metrics on Stn,k defined by ⟨⋅,⋅⟩D,E(⋅) contain only the two SO(n)×SO(k)-invariant Einstein metrics from [15], the so-called Jensen metrics. However, they do not contain the "new" Einstein metrics from that paper.
Remark 3.24. Although the "new" Einstein metrics form [15] are not contained in the family of metrics on Stn,k defined by ⟨⋅,⋅⟩D,E(⋅), we are not able to rule out that the family ⟨⋅,⋅⟩D,E(⋅) includes Einstein metrics different from the Jensen metrics. However, searching for Einstein metrics in ⟨⋅,⋅⟩D,E(⋅) is out of the scope of this text.
The goal of this section is to derive an explicit expression for the spray S∈Γ∞(T(TStn,k)) associated with the metric ⟨⋅,⋅⟩D,E(⋅). An expression for S yields an expression for the geodesic equation with respect to ⟨⋅,⋅⟩D,E(⋅) as an explicit second order ODE, as well.
First we recall the definition of a metric spray, also known as spray associated with a metric, from [22, Chap. 8, §4] whose existence and uniqueness is proven in [22, Chap. 8, Thm. 4.2]. For general properties of sprays we refer to [22, Chap. 4, §3-4]. Moreover, a discussion of the relation of sprays to torsion-free covariant derivatives can be found in [22, Chap. 8 §2].
Definition 4.1. Let (M,⟨⋅,⋅⟩) be a pseudo-Riemannian manifold. The metric spray S∈Γ∞(T(TM)) is the unique spray which is associated with the Levi-Civita covariant derivative defined by the pseudo-Riemannian metric ⟨⋅,⋅⟩.
An expression of a metric spray in local coordinates is given in (4.2) below. Next we discuss the relation of metric sprays to Lagrangian mechanics.
Let (M,⟨⋅,⋅⟩) be pseudo-Riemannian and let ω0∈Γ∞(Λ2T∗(T∗M)) denote the canonical symplectic form on T∗M. It is given by
ω0=−d θ0 |
with θ0∈Γ∞(T∗(T∗M)) being the canonical 1-form on T∗M. We refer to [16, Sec. 6.2] for the definition of ω0 and θ0. Consider the Lagrange function
L:TM→R,vx↦L(vx)=12⟨vx,vx⟩x. |
Let FL:TM→T∗M denote the fiber derivative of L defined by
((FL)(vx))(wx)=d d tL(vx+twx)|t=0,x∈M,vx,wx∈TxM, |
see e.g. [16, Eq. (7.2.1)]. The pullback
ωL=(FL)∗ω0 |
is a closed 2-from on TM, the so-called Lagrangian 2-form, see [16, Sec. 7.2]. In addition, ωL is non-degenerated, i.e. symplectic, since FL:TM→T∗M is a diffeomorphism due to
FL:TM→T∗M,vx↦FL(vx)=⟨vx,⋅⟩ | (4.1) |
by [16, Eq. (7.5.3)]. Moreover, the energy
EL:TM→R,vx↦((FL)(vx))(vx)−L(vx) |
associated with L fulfills EL=L, see e.g. [16, Sec. 7.3]. Let XEL∈Γ∞(T(TM)) denote the Lagrangian vector field and write iXELωL for the insertion of XEL into the first argument of ωL as usual. Then XEL is uniquely determined by
iXELωL=d EL⟺ωL(XEL,V)=d EL(V) for all V∈Γ∞(T(TM)). |
according to [16, Sec. 7.3]. Moreover, the Lagrangian vector field XEL coincides with the spray associated with the metric ⟨⋅,⋅⟩, see e.g. [16, Sec. 7.5]. It is exactly the so-called canonical spray from [22, Chap. 7, §7] which coincides with the metric spray, see [22, Chap. 8, Thm. 4.2]. Finally, we mention a local expression for sprays, see e.g. [22, Chap. 8, §4]. A metric spray S:TM→T(TM) can be represented in a chart (TU,(x,v)) of TM induced by a chart (U,x) of M by
S(x,v)=(x,v,v,−Γx(v,v)). | (4.2) |
Here Γx denotes the quadratic map defined by (Γx(v,v))k=Γkij(x)vivj using Einstein summation convention, where Γkij are the Christoffel symbols of the Levi-Civita covariant derivative with respect to the chart (U,x). In order to apply these general results to our particular situation, we introduce some notation.
Notation 4.2. Throughout this section U⊆Rn×k denotes an open subset of Rn×k with the property from Lemma 3.8. Moreover, we denote by ˜L the Lagrange function
˜L:TU→R,(X,V)↦˜L(X,V)=12⟨V,V⟩D,EX, | (4.3) |
where we identify TU≅U×Rn×k as usual.
We use the formula for ω0∈Γ∞(Λ2T∗(T∗U)) on T∗U given in the next remark.
Remark 4.3. The canonical symplectic form ω0∈Γ∞(Λ2T∗(T∗U)) on T∗U is given by
ω0|(X,V)((X,V,Y,Z),(X,V,˜Y,˜Z))=tr(Y⊤˜Z)−tr(˜Y⊤Z), | (4.4) |
for (X,V,Y,Z),(X,V,˜Y,˜Z)∈T(T∗U) identifying T(T∗U)≅U×(Rn×k)∗×Rn×k×(Rn×k)∗ as well as Rn×k≅(Rn×k)∗ via V↦tr(V⊤(⋅)). Indeed, Equation (4.4) follows by the local formula for the canonical symplectic form ω0 on T∗U, see e.g. [16, Sec. 6.2], applied to the gobal chart (U,idU)=(U,Xij).
We now calculate the Lagrangian 2-from ω˜L=(F˜L)∗ω0. To this end, we first determine the fiber derivative F˜L:TU→T∗U and its tangent map.
Lemma 4.4. For (X,V)∈TU the fiber derivative F˜L:TU→T∗U of ˜L is given by
F˜L(X,V)=(X,tr((VD+XX⊤VE)⊤(⋅))). | (4.5) |
Proof. Let (X,V),(X,W)∈TU. We have (F˜L(X,V))(X,W)=⟨V,W⟩D,EX by the Definition of ˜L and (4.1). Using the definition of ⟨⋅,⋅⟩D,E(⋅) and exploiting properties of the trace we obtain
(F˜L(X,V))(X,W)=tr(V⊤WD)+tr(V⊤XX⊤WE)=tr((VD+XX⊤VE)⊤W) |
as desired.
Lemma 4.5. The tangent map T(F˜L):T(TU)→T(T∗U) is given by
(T(F˜L))(X,V,Y,Z)=(F˜L(X,V),Y,tr((ZD+YX⊤VE+XY⊤VE+XX⊤ZE)⊤(⋅))) |
for (X,V,Y,Z)∈T(TU)≅U×(Rn×k)3, where we identify T(T∗U)≅U×(Rn×k)∗×Rn×k×(Rn×k)∗.
Proof. Let (X,V,Y,Z)∈T(TU). The smooth curve γ:(−ϵ,ϵ)∋t↦(X+tY,V+tZ)∈TU, for ϵ>0 sufficiently small, fulfills γ(0)=(X,V) with ˙γ(0)=(Y,Z). Then
d d tF˜L(γ(t))|t=0=(Y,tr((ZD+YX⊤VE+XY⊤VE+XX⊤ZE)⊤(⋅))). |
This yields the desired result.
Lemma 4.6. The Lagrangian 2-form ω˜L=(F˜L)∗ω0∈Γ∞(Λ2T∗(TU)) is given by
ω˜L|(X,V)((X,V,Y,Z),(X,V,˜Y,˜Z)=tr(Y⊤(˜ZD+˜YX⊤VE+X˜Y⊤VE+XX⊤˜ZE))−tr(˜Y⊤(ZD+YX⊤VE+XY⊤VE+XX⊤ZE)) | (4.6) |
with (X,V)∈TU≅U×Rn×k and (X,V,Y,Z),(X,V,˜Y,˜Z)∈T(X,V)TU.
Proof. Using the formula for ω0∈Γ∞(Λ2T∗(T∗U)) from Remark 4.3, a straightforward calculation shows that ω˜L=(F˜L)∗ω0 is given by (4.6). To this end, the formulas from Lemma 4.4 and Lemma 4.5 are plugged into the definition of the pull-back (F˜L)∗ω0.
Next the spray ˜S∈Γ∞(T(TU)) associated with ⟨⋅,⋅⟩D,E(⋅) is calculated exploiting ˜S=XE˜L, where XE˜L is the Lagrangian vector field. A closed form expression for ˜S(X,V) is obtained for all (X,V)∈Stn,k×Rn×k⊆TU.
Lemma 4.7. For (X,V)∈TU and (X,V,Y,Z)∈T(X,V)TU one has
d E˜L|(X,V)(X,V,Y,Z)=tr(V⊤ZD)+tr(Z⊤XX⊤VE)+tr(V⊤YX⊤VE). | (4.7) |
Proof. Let (X,V),(Y,Z)∈TU. We calculate
d d tE˜L(X+tY,V+tZ)|t=0=12(tr(Z⊤VD+V⊤ZD)+tr(Z⊤XX⊤VE+V⊤YX⊤VE+V⊤XY⊤VE+V⊤XX⊤ZE)). |
Using properties of the trace yields the desired result.
Next we consider a linear matrix equation of a certain form. We need to solve this equation for computing the metric spray on TU, see Proposition 4.9. Moreover, one encounters this equation in the proof of Proposition 5.2 on pseudo-Riemannian gradients below.
Lemma 4.8. Let D,E∈Rk×kdiag such that D and D+E are both invertible and let W∈Rn×k. Moreover, let U⊆Rn×k be open with the property from Lemma 3.8. Then for X∈U the linear equation
˜ΓD+XX⊤˜ΓE=W | (4.8) |
has a unique solution in terms of ˜Γ. Moreover, for X∈Stn,k, it is explicitly given by
˜Γ=(W−XX⊤W(D+E)−1E)D−1. | (4.9) |
Proof. For each X∈U the linear map ϕ:Rn×k∋˜Γ↦˜ΓD+XX⊤˜ΓE∈Rn×k is an isomorphism since the bilinear form
Rn×k×Rn×k→R,(Y,Z)↦tr(V⊤ϕ(W))=⟨V,W⟩D,EX |
is non-degenerated by assumption. Hence (4.8) admits a unique solution. Now assume X∈Stn,k. We briefly explain how (4.9) can be derived. By exploiting X⊤X=Ik, Equation (4.8) implies
X⊤W=X⊤˜ΓD+X⊤˜ΓE=X⊤˜Γ(D+E)⟺X⊤˜Γ=X⊤W(D+E)−1. |
Plugging X⊤˜Γ=X⊤W(D+E)−1 into (4.8) yields
˜ΓD+X(X⊤W(D+E)−1)E=W⟺˜Γ=(W−X(X⊤W(D+E)−1)E)D−1. |
A straightforward calculation shows that ˜Γ is indeed a solution of (4.8).
Proposition 4.9. The spray ˜S∈Γ∞(T(TU)) associated with the metric ⟨⋅,⋅⟩D,E(⋅) is given by
˜S(X,V)=(X,V,V,−˜Γ)=(X,V,V,−˜ΓX(V,V)) | (4.10) |
for all (X,V)∈TU≅U×Rn×k. Here ˜Γ=˜ΓX(V,V)∈Rn×k depending on (X,V)∈TU is the unique solution of the linear equation
˜ΓD+XX⊤˜ΓE=VX⊤VE+XV⊤VE−VEV⊤X | (4.11) |
in terms of ˜Γ with fixed (X,V)∈TU. Moreover, for (X,V)∈Stn,k×Rn×k one has
˜ΓX(V,V)=(VX⊤VE+XV⊤VE−VEV⊤X)D−1+(XX⊤VEV⊤X−X(X⊤V)2E−XV⊤VE)(D+E)−1ED−1. | (4.12) |
Proof. Using ˜S=XE˜L we compute ˜S via solving iXE˜Lω˜L=d E˜L for XE˜L, i.e. ˜S=XE˜L fulfills
ω˜L(XE˜L(X,V),(X,V,Y,Z))=d E˜L|(X,V)(X,V,Y,Z). | (4.13) |
for all (X,V,Y,Z)∈T(TU). Since ω˜L is non-degenerated, XE˜L is uniquely determined by (4.13). The local form of a metric spray, see (4.2), motivates the Ansatz
XE˜L(X,V)=(X,V,V,−˜ΓX(V,V))=(X,V,V,−˜Γ) |
with ˜Γ=˜ΓX(V,V)∈Rn×k depending on (X,V)∈TU. Inserting XE˜L into ω˜L from Lemma 4.6 yields the 1-form
(iXE˜Lω˜L)|(X,V)(X,V,Y,Z)=ω˜L|(X,V)(XE˜L(X,V),(X,V,Y,Z))=tr(V⊤(ZD+YX⊤VE+XY⊤VE+XX⊤ZE)−tr(Y⊤(−˜ΓD+VX⊤VE+XV⊤VE−XX⊤˜ΓE)) | (4.14) |
with (X,V)∈TU and (X,V,Y,Z)∈T(TU). Using (4.14) and the formula for d E˜L from Lemma 4.7, the equation iXE˜Lω˜L=d E˜L becomes
tr(V⊤ZD)+tr(Z⊤XX⊤VE)+tr(V⊤YX⊤VE)=tr(V⊤(ZD+YX⊤VE+XY⊤VE+XX⊤ZE)−tr(Y⊤(−˜ΓD+VX⊤VE+XV⊤VE−XX⊤˜ΓE)) | (4.15) |
for all (X,V,Y,Z)∈TU. Clearly, Equation (4.15) is equivalent to
tr(Y⊤(VEV⊤X))=tr(Y⊤(−˜ΓD+VX⊤VE+XV⊤VE−XX⊤˜ΓE)) |
for all Y∈Rn×k. This can be equivalently rewritten as
˜ΓD+XX⊤˜ΓE=VX⊤VE+XV⊤VE−VEV⊤X | (4.16) |
showing the first claim.
We now assume X∈Stn,k. Writing W=VX⊤VE+XV⊤VE−VEV⊤X and invoking Lemma 4.8 in order to solve (4.16) for ˜Γ yields
˜Γ=WD−1−XX⊤W(D+E)−1ED−1=(VX⊤VE+XV⊤VE−VEV⊤X)D−1+(XX⊤VEV⊤X−X(X⊤V)2E−XV⊤VE)(D+E)−1ED−1 |
as desired.
Remark 4.10. Obviously, for E=0, Proposition 4.9 implies ˜ΓX(V,V)=0 for all (X,V)∈TU.
Proposition 4.9 admits a relatively simple expression for ˜S∈Γ∞(T(TU)) evaluated at (X,V)∈TStn,k for a subfamily of ⟨⋅,⋅⟩D,E(⋅). Since this subfamily will be discussed several times below, it deserves its own notation.
Notation 4.11. We write ⟨⋅,⋅⟩D,ν(⋅) for the covariant 2-tensor ⟨⋅,⋅⟩D,E(⋅) which is obtained by specifying E=νIk with ν∈R, i.e.
⟨V,W⟩D,νX=tr(V⊤WD)+νtr(V⊤XX⊤W),X∈Rn×k and V,W∈TXRn×k≅Rn×k. |
Unless indicated otherwise, pull-backs of ⟨⋅,⋅⟩D,ν(⋅)∈Γ∞(S2(T∗Rn×k)) to submanifolds of Rn×k are omitted in the notation. Moreover, we assume that D and ν are chosen such that Stn,k⊆(U,⟨⋅,⋅⟩D,ν(⋅)) is a pseudo-Riemannian submanifold. In particular, we assume that D and D+νIk are both invertible.
Corollary 4.12. The spray ˜S∈Γ∞(T(TU)) on TU associated with ⟨⋅,⋅⟩D,ν(⋅)∈Γ∞(S2(T∗U)) evaluated at (X,V)∈TStn,k is given by
˜S(X,V)=(X,V,V,−˜ΓX(V,V)), | (4.17) |
where
˜ΓX(V,V)=(2νVX⊤V+νXV⊤V(D(D+νIk)−1)−2ν2X(X⊤V)2(D+νIk)−1)D−1. | (4.18) |
Proof. Let (X,V)∈TStn,k and write ˜Γ=˜ΓX(V,V) for short. Plugging E=νIk into Formula (4.12) from Proposition 4.9 and using X⊤V=−V⊤X we obtain
˜Γ=(VX⊤VE+XV⊤VE−VEV⊤X)D−1+(XX⊤VEV⊤X−X(X⊤V)2E−XV⊤VE)(D+E)−1ED−1=ν(VX⊤V+XV⊤V−VV⊤X)D−1+ν2(XX⊤VV⊤X−X(X⊤V)2−XV⊤V)(D+νIk)−1D−1=ν(VX⊤V+XV⊤V+VX⊤V)D−1+ν2(−X(X⊤V)2−X(X⊤V)2−XV⊤V)(D+νIk)−1D−1=(2νVX⊤V+XV⊤V(νIk)−2ν2X(X⊤V)2(D+νIk)−1−XV⊤V(ν2(D+νIk)−1))D−1=(2νVX⊤V+XV⊤V(νIk−ν2(D+νIk)−1)−2ν2X(X⊤V)2(D+νIk)−1)D−1=(2νVX⊤V+νXV⊤V(D(D+νIk)−1)−2ν2X(X⊤V)2(D+νIk)−1)D−1, |
where the last equality holds due to
(νIk−ν2(D+νIk)−1)ii=ν−ν2Dii+ν=ν(Dii+ν)−ν2Dii+ν=ν(D(D+νIk)−1)ii |
for i∈{1,…,k}.
We now determine the spray S∈Γ∞(T(TStn,k)) associated with the metric ⟨⋅,⋅⟩D,E(⋅). To this end, a result from [16, Prop. 8.4.1] is exploited which is stated for Riemannian manifolds. The proof works for pseudo-Riemannian manifolds, as well, since it only exploits the non-degeneracy of the metric. We reformulate it in the following proposition.
Proposition 4.13. Let M⊆˜M be a pseudo-Riemannian submanifold of a pseudo-Riemannian manifold (˜M,⟨⋅,⋅⟩) and let ˜S∈Γ∞(T(T˜M)) denote the metric spray on T˜M. Then the spray S∈Γ∞(T(TM)) on TM associated with the induced pseudo-Riemannian metric is given by
S=TP∘˜S|TM:TM→T(TM), | (4.19) |
where P:T˜M|M→TM denotes the vector bundle morphism that is defined fiber-wise by the orthogonal projections Px:Tx˜M→TxM⊆Tx˜M with respect to ⟨⋅,⋅⟩, where x∈M.
Lemma 4.14. The tangent map TP:T(Stn,k×Rn×k)→T(TStn,k) of
P:Stn,k×Rn×k→TStn,k,(X,V)↦(X,PX(V)), | (4.20) |
where PX(V)=V−XX⊤V+XπD+E(X⊤V) is the orthogonal projection from Theorem 3.18, is given by
TP(X,V,Y,Z)=(X,V,Y,Z−XY⊤V−XX⊤Z+XπD+E(Y⊤V+X⊤Z))) | (4.21) |
for all (X,V,Y,Z)∈T(Stn,k×Rn×k)≅TStn,k×(Rn×k)2.
Proof. By exploiting πD+E(X⊤V)=X⊤V due to X⊤V=−V⊤X∈so(k) for (X,V)∈TStn,k one calculates
T(X,V)P(Y,Z)=(Y,Z−YX⊤V−XY⊤V−XX⊤Z+YπD+E(X⊤V)+XπD+E(Y⊤V+X⊤Z))=(Y,Z−XY⊤V−XX⊤Z+XπD+E(Y⊤V+X⊤Z)), |
where (X,V,Y,Z)∈T(Stn,k×Rn×k).
Theorem 4.15. The spray S∈Γ∞(T(TStn,k)) associated with ⟨⋅,⋅⟩D,E(⋅) is given by
S(X,V)=(X,V,V,−˜ΓX(V,V)−XV⊤V+XX⊤˜ΓX(V,V)+XπD+E(V⊤V−X⊤˜ΓX(V,V))) | (4.22) |
for all (X,V)∈TStn,k. Here ˜ΓX(V,V)∈Rn×k depending on (X,V)∈TStn,k is given by
˜ΓX(V,V)=(VX⊤VE+XV⊤VE−VEV⊤X)D−1+(XX⊤VEV⊤X−X(X⊤V)2E−XV⊤VE)(D+E)−1ED−1. | (4.23) |
Proof. One can view Stn,k equipped with ⟨⋅,⋅⟩D,E(⋅) as a pseudo-Riemannian submanifold of (U,⟨⋅,⋅⟩D,E(⋅)) according to Lemma 3.8. Let ˜S∈Γ∞(T(TU)) be the metric spray on TU determined in Proposition 4.9. Then S=TP∘˜S|TStn,k holds by Proposition 4.13. Using Lemma 4.14 yields
S(X,V)=TP∘˜S|TStn,k(X,V)=TP(X,V,V,−˜ΓX(V,V))=(X,V,V,−˜ΓX(V,V)−XV⊤V+XX⊤˜ΓX(V,V)+XπD+E(V⊤V−X⊤˜ΓX(V,V))) |
for all (X,V)∈TStn,k as desired.
Remark 4.16. We often denote the spray on TStn,k associated with ⟨⋅,⋅⟩D,E(⋅) from Theorem 4.15 by
S(X,V)=(X,V,V,−Γ)=(X,V,V,−ΓX(V,V)), |
i.e. we write −Γ or −ΓX(V,V) for the fourth component of S. For (X,V)∈TStn,k it is given by
−ΓX(V,V)=−˜ΓX(V,V)−XV⊤V+XX⊤˜ΓX(V,V)+XπD+E(V⊤V−X⊤˜ΓX(V,V)) | (4.24) |
according to Theorem 4.15, where ˜ΓX(V,V) is determined by (4.23). Obviously, Equation (4.24) yields a well-defined expression for all X∈Rn×k and V∈Rn×k which is quadratic in V. Hence, by polarization, (4.24) can be viewed as the definition of the smooth map
Γ:U→S2((Rn×k)∗)⊗Rn×k,X↦((V,W)↦ΓX(V,W)). | (4.25) |
Clearly, Equation (4.25) yields a smooth extension of the fourth component of the metric spray S∈Γ∞(T(TStn,k)). This extension is used in Proposition 6.5 and Proposition 6.8 below.
Corollary 4.17. The spray S∈Γ∞(T(TStn,k)) associated with the metric ⟨⋅,⋅⟩D,E(⋅) from Theorem 4.15 has the following properties:
1. The spray S∈Γ∞(T(TStn,k)) is complete.
2. The maximal integral curve R∋t↦ΦSt((X0,V0))=(X(t),V(t))∈TStn,k of S through the point (X0,V0)∈TStn,k at t=0 fulfills the explicit non-linear first order ODE
˙X=V˙V=−˜ΓX(V,V)−XV⊤V+XX⊤˜ΓX(V,V)+XπD+E(V⊤V−X⊤˜ΓX(V,V)), | (4.26) |
with initial condition (X(0),V(0))=(X0,V0)∈TStn,k writing X=X(t) and V=V(t) for short.
3. Let pr:TStn,k→Stn,k be the canonical projection. The curve R∋t↦pr∘ΦSt(X0,V0)=X(t)∈Stn,k is a geodesic with respect to ⟨⋅,⋅⟩D,E(⋅) through the point X(0)=X0∈Stn,k with initial velocity ˙X(0)=V0∈TX0Stn,k.
4. The geodesic equation on Stn,k with respect to ⟨⋅,⋅⟩D,E(⋅) is given by the non-linear explicit second order ODE
¨X=−˜ΓX(˙X,˙X)−X˙X⊤˙X+XX⊤˜ΓX(˙X,˙X)+XπD+E(˙X⊤˙X−X⊤˜ΓX(˙X,˙X)) | (4.27) |
with initial conditions X(0)=X0∈Stn,k and ˙X(0)=˙X0∈TX0Stn,k.
Proof. We first show that S is complete. The transitive O(n)-action Ψ acts on (Stn,k,⟨⋅,⋅⟩D,E(⋅)) by isometries according to Lemma 3.1, i.e. (Stn,k,⟨⋅,⋅⟩D,E(⋅)) is a compact pseudo-Riemannian homogeneous manifold. Hence completeness follows by [23].
The other statements are well-known consequences of general properties of sprays associated with a metric, see e.g. [16, Sec. 7.5], combined with the explicit formula for S∈Γ∞(T(TStn,k)) from Theorem 4.15.
The formula for the metric spray S from Theorem 4.15 admits a simplification for ⟨⋅,⋅⟩D,ν(⋅).
Corollary 4.18. For (Stn,k,⟨⋅,⋅⟩D,ν(⋅)) the metric spray is given by S(X,V)=(X,V,V,−ΓX(V,V)) with
−ΓX(V,V)=2νVV⊤XD−1+2νX(X⊤V)2D−1−XV⊤V+XπD+νIk(V⊤V) | (4.28) |
for (X,V)∈TStn,k. Moreover, the geodesic equation reads
¨X=2ν˙X˙X⊤XD−1+2νX(X⊤˙X)2D−1−X˙X⊤˙X+XπD+νIk(˙X⊤˙X). | (4.29) |
Proof. Let (X,V)∈TStn,k. Using the formula for ˜ΓX(V,V) from Corollary 4.12 we calculate
X⊤˜ΓX(V,V)=X⊤(2νVX⊤V+νXV⊤V(D(D+νIk)−1)−2ν2X(X⊤V)2(D+νIk)−1)D−1=2νX⊤VX⊤VD−1+νV⊤V((D+νIk)−1D)D−1−2ν2(X⊤V)2(D+νIk)−1D−1=2ν(X⊤V)2D−1−2ν2(X⊤V)2(D+νIk)−1D−1+νV⊤V((D+νIk)−1)=(X⊤V)2(2νD−1−2ν2(D+νIk)−1D−1)+νV⊤V((D+νIk)−1)=2ν(X⊤V)2(D+νIk)−1+νV⊤V(D+νIk)−1=ν(V⊤V+2(X⊤V)2)(D+νIk)−1, |
where the identity
(2νD−1−2ν2(D+νIk)−1D−1)ii=2(νDii+ν2)−2ν2(Dii+ν)Dii=2νDii+ν=2ν((D+νIk)−1)ii |
is used. This yields
XX⊤˜ΓX(V,V)=νX(V⊤V+2(X⊤V)2)(D+νIk)−1. |
Moreover, using the symmetry of ν(V⊤V+2(X⊤V)2)∈Rk×ksym we obtain by Lemma 3.14, Claim 1
πD+νIk(X⊤˜ΓX(V,V))=πD+νIk(ν(V⊤V+2(X⊤V)2)(D+νIk)−1)=0. |
Therefore ΓX(V,V) can be obtained by Theorem 4.15 via calculating
−ΓX(V,V)=−˜ΓX(V,V)−XV⊤V+XX⊤˜ΓX(V,V)+XπD+νIk(V⊤V−X⊤˜ΓX(V,v))=(−2νVX⊤VD−1−νXV⊤V(D+νIk)−1+2ν2X(X⊤V)2(D+νIk)−1D−1)−XV⊤V+(νXV⊤V(D+νIk)−1+2νX(X⊤V)2(D+νIk)−1)+XπD+νIk(V⊤V)=2νVV⊤XD−1+2X(X⊤V)2(D+νIk)−1(ν2D−1+νIk)−XV⊤V+XπD+νIk(V⊤V)=2νVV⊤XD−1+2νX(X⊤V)2D−1−XV⊤V+XπD+νIk(V⊤V), |
where the last equality follows due to
((D+νIk)−1(ν2D−1+νIk))ii=(ν2/Dii)+νDii+ν=ν(ν+Dii)Dii(ν+Dii)=ν(D−1)ii. |
This yields the desired result.
Remark 4.19. Corollary 4.18 generalizes the geodesic equation from [13]. Indeed, setting D=2Ik and ν=−2α+1α+1 with α∈R∖{−1} yields
−ΓX(V,V)=νVV⊤X+νX(X⊤V)2−XV⊤V | (4.30) |
due to π(2+ν)Ik(V⊤V)=skew(V⊤V)=0 in accordance with [13, Eq. (65)].
Remark 4.20. We are not aware of an explicit solution of the geodesic equation for general diagonal matrices D and E. To our best knowledge, an explicit solution is only known for the special case D=2Ik and E=νIk, see [13]. Nevertheless, one could exploit that (TStn,k,ω(Tι)∗˜L,(Tι)∗˜L) defines a Hamiltonian system whose Hamiltonian vector field is given by the metric spray S∈Γ∞(T(TStn,k)). This point of view would allow to study the geodesic equation using the theory of integrable systems. However, investigating these aspects in detail is out of the scope of this paper. In this context, we only refer to [24], where geodesic flows on the cotangent bundle T∗Stn,k and their integrability are studied.
We now determine pseudo-Riemannian gradients and pseudo-Riemannian Hessians of smooth functions on Stn,k. Specific results from [14] are generalized, where similar ideas were used to obtain the gradients and Hessians of smooth function on Stn,k with respect to the one-parameter family of metrics from [13]. Moreover, similar formulas for gradients and Hessians on Stn,k with respect to a family of metrics corresponding to ⟨⋅,⋅⟩D,E(⋅), where D=α0Ik and E=(α1−α0)Ik with α0,α1∈R, i.e. a scaled version of the metrics introduced in [13], are independently obtained in [25].
Notation 5.1. From now on, unless indicated otherwise, we denote by U⊆Rn×k an open subset with the property from Lemma 3.8.
We first determine the gradient of a smooth function on f:Stn,k→R with respect to the metric ⟨⋅,⋅⟩D,E(⋅)∈Γ∞(S2(T∗Stn,k)). Let ♯D,E:T∗XStn,k→TXStn,k denote the sharp map associated with ⟨⋅,⋅⟩D,E(⋅), i.e. the inverse of the flat map ♭:TXStn,k∋V↦⟨V,⋅⟩D,EX∈T∗XStn,k. Then gradf∈Γ∞(TStn,k) is the unique vector field that fulfills
⟨gradf(X),V⟩D,EX=d f|X(V)⟺gradf(X)=(d f|X(⋅))♯D,E | (5.1) |
for all X∈Stn,k and V∈TXStn,k, see e.g. [26, Sec. 8.1] for the Riemannian case, which clearly extends to the pseudo-Riemannian case.
Proposition 5.2. Let f:Stn,k→R be smooth with some smooth extension F:U→R. Then the gradient of f at X∈Stn,k with respect to ⟨⋅,⋅⟩D,E(⋅) is given by
gradf(X)=∇F(X)D−1−XX⊤∇F(X)D−1+XπD+E(X⊤∇F(X)(D−1−(D+E)−1ED−1)). | (5.2) |
Proof. We first compute the gradient of F:U→R with respect to ⟨⋅,⋅⟩D,E(⋅). Let X∈U. Then gradF(X)∈Rn×k fulfills
⟨gradF(X),V⟩D,EX=d F|X(V)=tr((∇F(X))⊤V) | (5.3) |
for all V∈TXRn×k≅Rn×k. Using the definition of ⟨⋅,⋅⟩D,E(⋅), Equation (5.3) can be rewritten as
tr(V⊤(gradF(X)D+XX⊤gradF(X)E))=tr(V⊤∇F(X)). |
Since ⟨⋅,⋅⟩D,E(⋅) is non-degenerated, this is equivalent to the linear equation
gradF(X)D+XX⊤gradF(X)E=∇F(X) | (5.4) |
in terms of gradF(X). Now assume X∈Stn,k. Then the unique solution of (5.4) is given by
gradF(X)=∇F(X)D−1−XX⊤∇F(X)(D+E)−1ED−1 |
according to Lemma 4.8. Next, we use the well-known formula gradf(X)=PX(gradF(X)), where PX:Rn×k→TXStn,k is determined in Theorem 3.18. One calculates
gradf(X)=PX(∇F(X)D−1−XX⊤∇F(X)(D+E)−1ED−1)=(∇F(X)D−1−XX⊤∇F(X)D−1+XπD+E(X⊤∇F(X)D−1))−(XX⊤∇F(X)(D+E)−1ED−1−XX⊤(XX⊤∇F(X)(D+E)−1ED−1)+XπD+E(X⊤XX⊤∇F(X)(D+E)−1ED−1))=∇F(X)D−1−XX⊤∇F(X)D−1+XπD+E(X⊤∇F(X)(D−1−(D+E)−1ED−1)) |
for X∈Stn,k as desired.
Next we specialize the formula for the gradient to the subfamily ⟨⋅,⋅⟩D,ν(⋅).
Proposition 5.3. Let f:Stn,k→R be smooth with some smooth extension F:U→R. Then the gradient of f with respect to ⟨⋅,⋅⟩D,ν(⋅) is given by
gradf(X)=∇F(X)D−1−XX⊤∇F(X)D−1+XπD+νIk(X⊤∇F(X)(D+νIk)−1) | (5.5) |
for all X∈Stn,k.
Proof. Using Proposition 5.2 we obtain for X∈TXStn,k
gradf(X)=∇F(X)D−1−XX⊤∇F(X)D−1+XπD+νIk(X⊤∇F(X)(D−1−ν(D+νIk)−1D−1))=∇F(X)D−1−XX⊤∇F(X)D−1+XπD+νIk(X⊤∇F(X)(D+νIk)−1), |
where the identity
(D−1−ν(D+νIk)−1D−1)ii=1Dii−ν(Dii+ν)Dii=Dii+ν−ν(Dii+ν)Dii=((D+νIk)−1)ii |
is used to obtain the last equality.
Corollary 5.4. Let α∈T∗XStn,k be given by
α=tr(V⊤(⋅)):TXStn,k→R,W↦tr(V⊤W)∈R, | (5.6) |
where V∈Rn×k is some matrix. The sharp map ♯D,E:T∗XStn,k→TXStn,k associated with ⟨⋅,⋅⟩D,E(⋅) applied to α is given by
α♯D,E=(tr(V⊤(⋅)))♯D,νIk=VD−1−XX⊤VD−1+XπD+E(X⊤V(D−1−(D+E)−1ED−1)). | (5.7) |
Specializing E=νIk yields the sharp map with respect to ⟨⋅,⋅⟩D,ν(⋅) applied to α, namely
α♯D,νIk=(tr(V⊤(⋅)))♯D,νIk=VD−1−XX⊤VD−1+XπD+νIk(X⊤V(D+νIk)−1). | (5.8) |
Proof. Consider the smooth function F:Rn×k∋X↦tr(V⊤X)∈R and set f=F|Stn,k:Stn,k→R. Then d F|X(W)=tr(V⊤W) and thus ∇F(X)=V follows. Applying Proposition 5.2 and Proposition 5.3, respectively, yields the desired result because of (5.1).
Corollary 5.5. Proposition 5.3 reproduces some results known from the literature as special cases:
1. For D=Ik and ν=0 one has
gradf(X)=∇F(X)−12XX⊤∇F(X)−12X(∇F(X))⊤X. | (5.9) |
This coincides with the gradient with respect to the Euclidean metric, see e.g. [2].
2. For D=Ik and ν=−12, one has
gradf(X)=∇f(X)−X(∇F(X))⊤X | (5.10) |
reproducing the formula for the gradient from [10, Eq. (2.53)].
3. For D=2Ik and −2≠ν∈R the gradient of f simplifies to
gradf(X)=12(∇f(X)−ν+12+νXX⊤∇F(X)−12+νX(∇F(X))⊤X) | (5.11) |
reproducing the expression for the gradient from [14, Thm. 1].
Proof. These formulas follow by straightforward calculations by plugging the particular choices for D and ν into the expression for gradf from Proposition 5.3.
Next we determine the pseudo-Riemannian Hessian of a smooth function f:Stn,k→R. Here we only consider the subfamily ⟨⋅,⋅⟩D,ν(⋅) in order to obtain formulas which are not too complicated.
Lemma 5.6. Let X∈Stn,k, V∈TXStn,k and let f:Stn,k→R be smooth with some smooth extension F:U→R. The Hessian of f with respect ⟨⋅,⋅⟩D,ν(⋅) considered as quadratic form is given by
Hess(f)|X(V,V)=D2F(X)(V,V)+DF(X)(2νVV⊤XD−1+2νX(X⊤V)2D−1−XV⊤V+XπD+νIk(V⊤V)), | (5.12) |
where X∈Stn,k and V,W∈TXStn,k.
Proof. The geodesic γ:R→Stn,k through γ(0)=X∈Stn,k with initial velocity ˙γ(0)=V∈TXStn,k fulfills the explicit second order ODE
¨γ(t)=2ν˙γ(t)˙γ(t)⊤γ(t)D−1+2νγ(t)(γ(t)⊤˙γ(t))2D−1−γ(t)˙γ(t)⊤˙γ(t)+γ(t)πD+νIk(˙γ(t)⊤˙γ(t)) | (5.13) |
according to Corollary 4.18. Evaluating (5.13) at t=0 yields
¨γ(0)=2νVV⊤XD−1+2νX(X⊤V)2D−1−XV⊤V+XπD+νIk(V⊤V) | (5.14) |
due to the initial conditions γ(0)=X and ˙γ(0)=V. The Hessian of f considered as quadratic form can be determined as
Hess(f)|X(V,V)=d 2d t2(f∘γ)(t)|t=0, | (5.15) |
see e.g. [26, Prop. 8.3] for the Riemannian case, which clearly extends to pseudo-Riemannian manifolds. Using f=F|Stn,k, Formula (5.15) yields
Hess(f)|X(V,V)=D2F(X)(˙γ(0),˙γ(0))+DF(X)¨γ(0) | (5.16) |
by the chain rule. Plugging (5.14) into (5.16) yields the desired result.
Theorem 5.7. Let X∈Stn,k and V,W∈TXStn,k. Moreover, define ˜D=D+νIk. The Hessian of a smooth function f:Stn,k→R with smooth extension F:U→R with respect ⟨⋅,⋅⟩D,ν(⋅) is given by
Hess(f)|X(V,W)=tr((D(∇F)(X)V)⊤W)+νtr((XD−1(∇F(X))⊤V+∇F(X)D−1X⊤V)⊤W)+νtr((XV⊤XX⊤∇F(X)D−1+XX⊤∇F(X)D−1V⊤X)⊤W)−12tr((VX⊤∇F(X)+V(∇F(X))⊤X)⊤W)+12tr((Vπ˜D(X⊤∇F(X)˜D−1)˜D−V˜Dπ˜D(X⊤∇F(X)˜D−1))⊤W). | (5.17) |
Proof. Let (X,V),(X,W)∈TStn,k. We obtain for the Hessian of f as symmetric 2-tensor
Hess(f)|X(V,W)=tr((D(∇F)(X)V)⊤W)+νtr((∇F(X))⊤(VW⊤+WV⊤)XD−1)+νtr((∇F(X))⊤X(X⊤VX⊤W+X⊤WX⊤V)D−1)−12tr((∇F(X))⊤X(V⊤W+W⊤V))+12tr((∇F(X))⊤XπD+νIk(V⊤W+W⊤V)) | (5.18) |
by applying polarization to the quadratic form obtained in Lemma 5.6 and using the identities
DF(X)V=tr((∇F(X))⊤V) andD2F(X)(V,W)=tr((D(∇F)(X)V)⊤W). |
Next, we set ˜D=D+νIk which is invertible according to Notation 3.13. Since the orthogonal projection π˜D:Rk×k→so(k)⊆Rk×k is self-adjoint with respect to the scalar product
⟨⋅,⋅⟩˜D:Rk×k×Rk×k→R,(V,W)↦tr(V⊤W˜D) |
on Rk×k, we calculate
tr((∇F(X))⊤Xπ˜D(V⊤W+W⊤V))=tr((X⊤∇F(X)˜D−1)⊤π˜D(V⊤W+W⊤V)˜D)=⟨X⊤∇F(X)˜D−1,π˜D(V⊤W+W⊤V)⟩˜D=⟨π˜D(X⊤∇F(X)˜D−1),V⊤W+W⊤V⟩˜D=tr((Vπ˜D(X⊤∇F(X)˜D−1)˜D−V˜Dπ˜D(X⊤∇F(X)˜D−1))⊤W) | (5.19) |
by exploiting im(π˜D)=so(k). The desired result follows by rewriting (5.18) using well-known properties of the trace and applying (5.19) to the last summand of (5.18).
Corollary 5.8. 5.7: Let D=2Ik and −2≠ν∈R. Then the Hessian of the smooth function f:Stn,k→R with respect to ⟨⋅,⋅⟩D,ν(⋅) reads
Hess(f)|X(V,W)=tr((D(∇F)(X)V)⊤W)+ν2tr((X(∇F(X))⊤V+∇F(X)X⊤V)⊤W)+ν2tr((XV⊤XX⊤∇F(X)+XX⊤∇F(X)V⊤X)⊤W)−12tr((VX⊤∇F(X)+V(∇F(X))⊤X)⊤W) | (5.20) |
with X∈Stn,k and V,W∈TXStn,k reproducing the formula from [14, Thm. 2].
Proof. We set D=2Ik in Theorem 5.7. Obviously, ˜D=(2+ν)Ik holds. Hence
π˜D(X⊤∇F(X)˜D−1)˜D=π˜D(X⊤∇F(X))=˜Dπ˜D(X⊤∇F(X)˜D−1) |
is fulfilled by the linearity of π˜D:Rn×k→so(k)⊆Rn×k. Thus the last summand of (5.17) vanishes.
Theorem 5.7 yields an expression for the Hessian of f:Stn,k→R as covariant 2-tensor. However, for applications, see e.g. [2, Chap. 6], an expression for the Hessian of f viewed as section of End(TStn,k) is desirable. Thus, following [14, Re. 6], we state the next remark and the next corollary.
Remark 5.9. In [2, Eq. (6.3)] the Hessian of a smooth function f:M→R on a Riemannian manifold (M,⟨⋅,⋅⟩) endowed with a covariant derivative ∇ is defined as
~Hess(f)|x(vx)=∇vxgradf|x |
for x∈M and vx∈TxM. In particular, ~Hess(f)∈Γ∞(End(TM)) holds. If ∇ is chosen as the Levi-Civita covariant derivative ∇LC, then ~Hess(f) is related to Hess(f)∈Γ∞(S2(T∗M)) via
⟨~Hess(f)|x(v),w⟩=⟨∇LCvxgradf|x,wx⟩=Hess(f)|x(vx,wx), | (5.21) |
where x∈M and vx,wx∈TxM, see e.g. [26, Prop. 8.1] for a proof for the Riemannian case. Clearly, Equation (5.21) holds in the pseudo-Riemannian case, too. We rewrite (5.21) equivalently as
⟨~Hess(f)|x(vx),⋅⟩=Hess(f)|x(vx,⋅). | (5.22) |
Applying the sharp map ♯:T∗xM→TxM associated with ⟨⋅,⋅⟩ on both sides of (5.22) yields
~Hess(f)|x(vx)=(Hess(f)|x(vx,⋅))♯. | (5.23) |
Corollary 5.10. Let f:Stn,k→R be smooth with some smooth extension F:U→R. The Hessian of f with respect to ⟨⋅,⋅⟩D,ν(⋅) considered as a section of End(TStn,k) is given by
~Hess(f)|X(V)=LD,νX(D(∇F)(X)V+ν(XD−1(∇F(X))⊤V+∇F(X)D−1X⊤V)+ν(XV⊤XX⊤∇F(X)D−1+XX⊤∇F(X)D−1V⊤X)−12(VX⊤∇F(X)+V(∇F(X))⊤X)+12(Vπ˜D(X⊤∇F(X)˜D−1)˜D−V˜Dπ˜D(X⊤∇F(X)˜D−1))) |
for X∈Stn,k and V∈TXStn,k, where ˜D=D+νIk and LD,νX:Rn×k→TXStn,k⊆Rn×k is the linear map given by
LD,νX(V)=VD−1−XX⊤VD−1+XπD+νIk(X⊤V(D+νIk)−1). |
Proof. We have already obtained Hess(f) in Theorem 5.7 in such a form that the formula for the sharp map from Corollary 5.4 can be applied to Hess(f)|X(V,⋅)∈T∗XStn,k. Now Remark 5.9 yields the desired result.
In this section, we compute the second fundamental form of Stn,k considered as pseudo-Riemannian submanifold of (U,⟨⋅,⋅⟩D,E(⋅)). Moreover, an expression for the Levi-Civita covariant derivative on Stn,k is derived. We first recall Notation 5.1. Unless indicated otherwise, we denote by U⊆Rn×k an open neighbourhood of Stn,k with the property from Lemma 3.8.
We consider the Levi-Civita covariant derivative ~∇LC on U with respect to ⟨⋅,⋅⟩D,E(⋅). Recall Proposition 4.9. For (X,V)∈TU the spray ˜S∈Γ∞(T(TU)) associated with the metric ⟨⋅,⋅⟩D,E(⋅) on U is given by
˜S(X,V)=(X,V,V,−˜ΓX(V,V)) | (6.1) |
where ˜ΓX(V,V) is the unique solution of the linear equation (4.11). We now discuss how ˜ΓX(V,V)∈Rn×k is related to the Christoffel symbols of the Levi-Civita covariant derivative ~∇LC on (U,⟨⋅,⋅⟩D,E(⋅)). To this end, we view idU:U∋X↦X∈U as the global chart (U,idU)=(U,Xij) of U and identify the coordinate vector fields ∂∂Xij with the constant functions U∋X↦Eij∈Rn×k. Then (6.1) is a coordinate expression for the metric spray ˜S with respect to the global chart (TU,(Xij,Vij)) induced by the chart (U,Xij), see also Proposition 4.9. Thus the local form of metric sprays, see (4.2), implies that the entry (˜ΓX(V,V))ij fulfills
(˜ΓX(V,V))ij=n∑a,c=1k∑b,d=1˜Γ(i,j)(a,b),(c,d)|XVabVcd, | (6.2) |
where V=(Vij)∈Rn×k and the functions ˜Γ(i,j)(a,b),(c,d):U∋X↦˜Γ(i,j)(a,b),(c,d)|X∈R denote the Christoffel symbols of ~∇LC with respect to (U,(Xij)). Hence ~∇LC can be expressed with respect to the global chart (U,Xij) as
~∇LC˜V˜W|X=D˜W(X)˜V|X+˜ΓX(˜V|X,˜W|X), | (6.3) |
for vector fields ˜V,˜W∈Γ∞(TU) and X∈U, see e.g. [27, Chap. 4]. A similar "matrix notation" for Christoffel symbols has already appeared in [10, Sec. 2.2.3], where, in addition, it is mentioned that (for fixed X∈U) the symmetric bilinear map Rn×k×Rn×k∋(V,W)↦˜ΓX(V,W)∈Rn×k can be obtained from the quadratic map Rn×k∋V↦˜ΓX(V,V)∈Rn×k by polarization. Hence the Christoffel symbols on U can be identified with the smooth map
˜Γ:U→S2((Rn×k)∗)⊗Rn×k,X↦((V,W)↦˜ΓX(V,W)). | (6.4) |
The "Christoffel symbols" from [10, Sec. 2.2.3] will be discussed in Remark 6.11 below.
Next we give an expression for the Levi-Civita covariant derivative ∇LC on Stn,k with respect ⟨⋅,⋅⟩D,E(⋅). We refer to Proposition 6.8 as well as Corollary 6.9 below for an alternative formula for ∇LC.
Proposition 6.1. Let V,W∈Γ∞(TStn,k). The Levi-Civita covariant derivative on (Stn,k,⟨⋅,⋅⟩D,E(⋅)) is given by
∇LCVW|X=PX(D˜W(X)V|X+˜ΓX(V|X,W|X)) | (6.5) |
for all X∈Stn,k, where ˜V∈Γ∞(TU) is a smooth extensions of V. Here ˜Γ is defined by (6.4). Moreover, PX:Rn×k→TXStn,k is the orthogonal projection with respect to ⟨⋅,⋅⟩D,E(⋅) from Theorem 3.18.
Proof. Since Stn,k is a pseudo-Riemannian submanifold of (U,⟨⋅,⋅⟩D,E(⋅)), the result follows by (6.3) due to
∇LCVW|X=PX(~∇LCV˜W|X), |
see e.g. [17, Chap. 4, Lem. 3].
We now consider the second fundamental form, also called shape operator, of Stn,k⊆(U,⟨⋅,⋅⟩D,E(⋅)). We refer to [17, Chap. 4] for general properties of pseudo-Riemanian submanifolds and the second fundamental form, see also [27, Chap. 8] for the Riemannian case. Using these references, we briefly introduce the notation which is used in the sequel subsections.
Let M be a pseudo-Riemannian submanifold of a pseudo-Riemannian manifold (˜M,⟨⋅,⋅⟩). The corresponding Levi-Civita covariant derivatives on M and ˜M are denoted by ∇LC and ~∇LC, respectively. Moreover, let NM→M be the normal bundle of M and let II∈Γ∞((S2(T∗M))⊗NM) be the second fundamental form of M, see e.g. [17, Chap. 4, Lem. 4], defined by
II(V,W)|x=P⊥x(~∇LC˜V˜W|x),x∈M,V,W∈Γ∞(TM), | (6.6) |
where ˜V,˜W∈Γ∞(T˜M) are smooth extensions of V,W∈Γ∞(TM), respectively, and P⊥x:Tx˜M→NxM denotes the orthogonal projection onto the normal space NxM=(TxM)⊥. The Levi-Civita covariant derivative on M fulfills
∇LCVW=~∇LCV˜W−II(V,W) | (6.7) |
for all V,W∈Γ∞(TM) by [17, Chap. 4]. Here ˜W is again some smooth extension of W to ˜M. The identity (6.7) is named Gauß formula in [27, Thm. 8.2], which includes a proof for the Riemannian case, as well.
Lemma 6.2. Define ∇:Γ∞(T˜M)×Γ∞(T˜M)→Γ∞(T˜M) by
∇˜V˜W=~∇LC˜V˜W−~II(˜V,˜W),˜V,˜W∈Γ∞(T˜M), | (6.8) |
where ~II∈Γ∞((S2T∗˜M)⊗T˜M) denotes a smooth extension of the second fundamental form II on M to ˜M. Then ∇ is a covariant derivative on ˜M whose restriction to M coincides with ∇LC, i.e.
∇LCVW|x=∇˜V˜W|x | (6.9) |
holds for all x∈M and V,W∈Γ∞(TM) with smooth extensions ˜V,˜W∈Γ∞(T˜M). Moreover, the Christoffel symbols of ∇ with respect to the local chart (U,x) of ˜M are given by
Γkij=˜Γkij−~IIkij. | (6.10) |
Here ~IIkij is defined by ~II(∂∂xi,∂∂xj)=~IIkij∂∂xk using Einstein summation convention and ˜Γkij denote the Christoffel symbols of ~∇LC with respect to the chart (U,x).
Proof. Obviously, the definition of ∇ yields a covariant derivative on M. Moreover, the Gauß formula (6.7) implies
for all and all with smooth extensions , respectively.
It remains to show the formula for the Christoffel symbols. Let be a local chart of . Using Einstein summation convention one obtains
showing the desired result.
Remark 6.3. The definition of the covariant derivative on in Lemma 6.2 depends on the choice of the smooth extension of . Nevertheless, Equation (6.9) is independent of the extension of .
Reformulating [16, Cor. 8.4.2] yields the next lemma which allows for computing the second fundamental form of .
Lemma 6.4. Let be a pseudo-Riemannian submanifold of . Moreover, we denote by and the metric sprays on and , respectively. Then
(6.11) |
holds for all and , where
is the vertical lift and is the second fundamental form of .
Proof. This is a direct consequence of [16, Cor. 8.4.2] as well as the definition recalled in (6.6).
Lemma 6.4 applied to yields an expression for the second fundamental form.
Proposition 6.5. Consider as pseudo-Riemannian submanifold. Then the following assertions are fulfilled:
1. The second fundamental form of is given by
(6.12) |
for all and , where and denote the symmetric bilinear maps associated with the quadratic maps defined by the sprays and , respectively.
2. A smooth extension of is given
(6.13) |
for all and , Here we view as in Remark 4.16, i.e. as the smooth map defined in (4.25).
Proof. Lemma 6.4 applied to implies that
(6.14) |
holds for all . The vertical lift for fixed is the linear isomorphism
according to its local expression, see e.g. [20, Sec. 8.12]. Thus
follows by (6.14). Since the quadratic map determines uniquely the associated symmetric billinear map, Claim 1 is shown. Now Claim 2 is obvious.
The second fundamental from can be simplified for all metrics in the subfamily .
Corollary 6.6. The second fundamental form of is given by
(6.15) |
for all and .
Proof. Let and . We first compute the quadratic map associated with . Using Corollary 4.12 and Corollary 4.18, Proposition 6.5 implies
(6.16) |
Here we exploited
as well as
The desired result follows by polarization.
Corollary 6.7. For and the second fundamental form is given by
(6.17) |
for and .
Proof. Plugging into the formula from Corollary 6.6 the claim follows by a straightforward calculation by exploiting .
Next we derive an alternative expression for the Levi-Civita covariant derivative on .
Proposition 6.8. The covariant derivative on from Lemma 6.2 fulfills for and
(6.18) |
where denotes the smooth map defined in (4.25). If are smooth extensions of , respectively, then
(6.19) |
is satisfied for all .
Proof. Using Lemma 6.2 and Proposition 6.5 we compute
for and showing (6.18). If are smooth extensions of , respectively, we obtain
for all by Lemma 6.2 proving (6.19).
Proposition 6.8 yields a more explicit formula for the subfamily .
Corollary 6.9. Let and let be smooth extension of . The Levi-Civita covariant derivative on with respect to the metric is given by
(6.20) |
for , where
(6.21) |
writing and for short.
Proof. The quadratic map is determined in Corollary 4.18. The associated symmetric bilinear map can be obtained by polarization. Now Proposition 6.8 yields the desired result.
Corollary 6.9 yields an expression for the covariant derivative with respect to the family of metrics introduced in [13].
Corollary 6.10. Using the notation from Corollary 6.9 one obtains for on with and
(6.22) |
Proof. Plugging into the formula from Corollary 6.9 yields the desired result by exploiting for all .
By setting and for , Corollary 6.9 reproduces [25, Eq. (5.4)], where this expression has been obtained independently. Formulas for of a similar form as in Proposition 6.8 or Corollary 6.9 have already appeared in the literature in [10,25], see also [28, Sec. 4]. In the next remark we relate the summand in these formulas to the Christoffel symbols of the covariant derivative on the open .
Remark 6.11. Consider the smooth map from (4.25) in Remark 4.16. The Christoffel symbols of the covariant derivative on with respect to corresponds to the entries of the matrix by Proposition 6.8. More precisely, we again identify the coordinate vector field with the map . Then the -entry of is given by a formula similar to (6.2), namely
(6.23) |
where are the Christoffel symbols of with respect to the global chart , see Lemma 6.2 and Proposition 6.8. We point out that the map in (4.25) corresponds to the Christoffel symbols of the covariant derivative on but it cannot correspond to the Christoffel symbols of on due to . Nevertheless, if is applied to vector fields which are tangent to evaluated at points , it yields the same result as by Proposition 6.8.
A similar expression for "Christoffel symbols" has already appeared in [10] for the so-called canonical metric as well as for the Euclidean metric, however, without relating them to the Christoffel symbols of the covariant derivative on . Indeed, by exploiting Corollary 6.9, for and we obtain
reproducing in [10, Sec. 2.2.3]. Analogously, setting and in Corollary 6.9 yields
for and . This expression coincides with [10, Eq. (2.49)].
We investigated a multi-parameter family of metrics on an open such that becomes a pseudo-Riemannian submanifold. The corresponding geodesic equation for as explicit matrix-valued second order ODE was derived by computing the metric spray on . In principle, this approach to determine the geodesic equation is not limited to . It seems to be applicable to a pseudo-Riemannian submanifold of an open subset of a vector space as soon as the metric spray on the open subset and the tangent map of the orthogonal projection are known. Beside the geodesic equation, several other quantities related to the geometry of the pseudo-Riemannian submanifold were determined in terms of explicit matrix-type formulas. In particular, the expressions for pseudo-Riemannian gradients and pseudo-Riemannian Hessians could pave the way for designing new optimization methods on . Moreover, we reproduced several well-known results from the literature putting them into a new perspective.
This work has been supported by the German Federal Ministry of Education and Research (BMBF-Projekt 05M20WWA: Verbundprojekt 05M2020 - DyCA).
The author declares there is no conflict of interest.
[1] |
E. H. Lieb, Existence and uniqueness of the minimizing solution of Choquard's nonlinear equation, Stud. Appl. Math., 57 (1977), 93–105. https://doi.org/10.1002/sapm197757293 doi: 10.1002/sapm197757293
![]() |
[2] |
P. L. Lions, Some remarks on Hartree equation, Nonlinear Anal. Theory Methods Appl., 5 (1981), 1245–1256. https://doi.org/10.1016/0362-546X(81)90016-X doi: 10.1016/0362-546X(81)90016-X
![]() |
[3] |
V. Moroz, J. Van Schaftingen, Groundstates of nonlinear Choquard equations: existence, qualitative properties and decay asymptotics, J. Funct. Anal., 265 (2013), 153–184. https://doi.org/10.1016/j.jfa.2013.04.007 doi: 10.1016/j.jfa.2013.04.007
![]() |
[4] |
V. Moroz, J. Van Schaftingen, Groundstates of nonlinear Choquard equations: Hardy-Littlewood-Sobolev critical exponent, Commun. Contemp. Math., 17 (2015), 1550005. https://doi.org/10.1142/S0219199715500054 doi: 10.1142/S0219199715500054
![]() |
[5] |
V. Moroz, J. Van Schaftingen, Existence of groundstates for a class of nonlinear Choquard equations, Trans. Am. Math. Soc., 367 (2015), 6557–6579. https://doi.org/10.1090/S0002-9947-2014-06289-2 doi: 10.1090/S0002-9947-2014-06289-2
![]() |
[6] |
V. Moroz, J. Van Schaftingen, A guide to the Choquard equation, J. Fixed Point Theory Appl., 19 (2017), 773–813. https://doi.org/10.1007/s11784-016-0373-1 doi: 10.1007/s11784-016-0373-1
![]() |
[7] | S. I. Pekar, Untersuchung über die Elektronentheorie der Kristalle, Akademie Verlag, Berlin, 1954. https://doi.org/10.1515/9783112649305 |
[8] |
R. Penrose, On gravity's role in quantum state reduction, Gen. Relativ. Gravitation, 28 (1996), 581–600. https://doi.org/10.1007/BF02105068 doi: 10.1007/BF02105068
![]() |
[9] |
A. Elgart, B. Schlein, Mean field dynamics of Boson stars, Commun. Pure Appl. Math., 60 (2007), 500–545. https://doi.org/10.1002/cpa.20134 doi: 10.1002/cpa.20134
![]() |
[10] |
N. Ackermann, On a periodic Schrödinger equation with nonlocal superlinear part, Math. Z., 248 (2004), 423–443. https://doi.org/10.1007/s00209-004-0663-y doi: 10.1007/s00209-004-0663-y
![]() |
[11] |
C. O. Alves, A. B. Nóbrega, M. B. Yang, Multi-bump solutions for Choquard equation with deepening potential well, Calc. Var. Partial Differ. Equations, 55 (2016). https://doi.org/10.1007/s00526-016-0984-9 doi: 10.1007/s00526-016-0984-9
![]() |
[12] |
V. Moroz, J. Van Schaftingen, Semi-classical states for the Choquard equation, Calc. Var. Partial Differ. Equations, 52 (2015), 199–235. https://doi.org/10.1007/s00526-014-0709-x doi: 10.1007/s00526-014-0709-x
![]() |
[13] |
L. Guo, T. X. Hu, S. J. Peng, W. Shuai, Existence and uniqueness of solutions for Choquard equation involving Hardy-Littlewood-Sobolev critical exponent, Calc. Var. Partial Differ. Equations, 58 (2019), 128. https://doi.org/10.1007/s00526-019-1585-1 doi: 10.1007/s00526-019-1585-1
![]() |
[14] |
P. L. Lions, Solutions of Hartree-Fock equations for Coulomb systems, Commun. Math. Phys., 109 (1987), 33–97. https://doi.org/10.1007/BF01205672 doi: 10.1007/BF01205672
![]() |
[15] |
J. Wang, Existence of normalized solutions for the coupled Hartree-Fock type system, Math. Nachr., 294 (2021), 1987–2020. https://doi.org/10.1002/mana.201900230 doi: 10.1002/mana.201900230
![]() |
[16] |
J. Wang, Q. P. Geng, M. C. Zhu, Existence of the normalized solutions to the nonlocal elliptic system with partial confinement, Discrete Contin. Dyn. Syst., 39 (2019), 2187–2201. https://doi.org/10.3934/dcds.2019092 doi: 10.3934/dcds.2019092
![]() |
[17] |
J. K. Xia, X. Zhang, Saddle solutions for the critical Choquard equation, Calc. Var. Partial Differ. Equations, 60 (2021), 53. https://doi.org/10.1007/s00526-021-01919-5 doi: 10.1007/s00526-021-01919-5
![]() |
[18] |
I. M. Moroz, R. Penrose, P. Tod, Spherically-symmetric solutions of the Schrödinger-Newton equations, Classical Quantum Gravity, 15 (1998), 2733–2742. https://doi.org/10.1088/0264-9381/15/9/019 doi: 10.1088/0264-9381/15/9/019
![]() |
[19] |
A. Grigor'yan, Y. Lin, Y. Y. Yang, Yamabe type equations on graphs, J. Differ. Equations, 261 (2016), 4924–4943. https://doi.org/10.1016/j.jde.2016.07.011 doi: 10.1016/j.jde.2016.07.011
![]() |
[20] |
A. Grigor'yan, Y. Lin, Y. Y. Yang, Kazdan-Warner equation on graph, Calc. Var. Partial Differ. Equations, 55 (2016), 92. https://doi.org/10.1007/s00526-016-1042-3 doi: 10.1007/s00526-016-1042-3
![]() |
[21] |
A. Grigor'yan, Y. Lin, Y. Y. Yang, Existence of positive solutions to some nonlinear equations on locally finite graphs, Sci. China Math., 60 (2017), 1311–1324. https://doi.org/10.1007/s11425-016-0422-y doi: 10.1007/s11425-016-0422-y
![]() |
[22] |
X. Han, M. Shao, L. Zhao, Existence and convergence of solutions for nonlinear biharmonic equations on graphs, J. Differ. Equations, 268 (2020), 3936–3961. https://doi.org/10.1016/j.jde.2019.10.007 doi: 10.1016/j.jde.2019.10.007
![]() |
[23] |
H. Y. Huang, J. Wang, W. Yang, Mean field equation and relativistic Abelian Chern-Simons model on finite graphs, J. Funct. Anal., 281 (2021), 109218. https://doi.org/10.1016/j.jfa.2021.109218 doi: 10.1016/j.jfa.2021.109218
![]() |
[24] | B. Hua, W. Xu, Existence of ground state solutions to some Nonlinear Schrödinger equations on lattice graphs, preprint, arXiv: 2108.00711. |
[25] |
Y. Lin, Y. T. Wu, On-diagonal lower estimate of heat kernels on graphs, J. Math. Anal. Appl., 456 (2017), 1040–1048. https://doi.org/10.1016/j.jmaa.2017.07.028 doi: 10.1016/j.jmaa.2017.07.028
![]() |
[26] |
Y. Lin, Y. T. Wu, The existence and nonexistence of global solutions for a semilinear heat equation on graphs, Calc. Var. Partial Differ. Equations, 56 (2017), 102. https://doi.org/10.1007/s00526-017-1204-y doi: 10.1007/s00526-017-1204-y
![]() |
[27] |
N. Zhang, L. Zhao, Convergence of ground state solutions for nonlinear Schrödinger equations on graphs, Sci. China Math., 61 (2018), 1481–1494. https://doi.org/10.1007/s11425-017-9254-7 doi: 10.1007/s11425-017-9254-7
![]() |
[28] | A. Szulkin, T. Weth, The method of Nehari manifold, 2010. Available from: https://staff.math.su.se/andrzejs/publications/Nehari.pdf |
[29] |
H. Ge, Kazdan-Warner equation on graph in the negative case, J. Math. Anal. Appl., 453 (2017), 1022–1027. https://doi.org/10.1016/j.jmaa.2017.04.052 doi: 10.1016/j.jmaa.2017.04.052
![]() |
[30] |
A. Huang, Y. Lin, S. Yau, Existence of solutions to mean field equations on graphs, Commun. Math. Phys., 377 (2020), 613–621. https://doi.org/10.1007/s00220-020-03708-1 doi: 10.1007/s00220-020-03708-1
![]() |
[31] |
M. Keller, M. Schwarz, The Kazdan-Warner equation on canonically compactifiable graphs, Calc. Var. Partial Differ. Equations, 57 (2018), 70. https://doi.org/10.1007/s00526-018-1329-7 doi: 10.1007/s00526-018-1329-7
![]() |
[32] |
L. Sun, J. Zhu, Global existence and convergence of a flow to Kazdan-Warner equation with non-negative prescribed function, Calc. Var. Partial Differ. Equations, 60 (2021), 42. https://doi.org/10.1007/s00526-020-01873-8 doi: 10.1007/s00526-020-01873-8
![]() |
[33] |
Y. Lin, Y. T. Wu, Blow-up problems for nonlinear parabolic equations on locally finite graphs, Acta Math. Sci., 38 (2018), 843–856. https://doi.org/10.1016/S0252-9602(18)30788-4 doi: 10.1016/S0252-9602(18)30788-4
![]() |
[34] |
Y. Lin, Y. Y. Yang, A heat flow for the mean field equation on a finite graph, Calc. Var. Partial Differ. Equations, 60 (2021), 206. https://doi.org/10.1007/s00526-021-02086-3 doi: 10.1007/s00526-021-02086-3
![]() |
[35] |
P. L. Lions, The concentration-compactness principle in the calculus of variations, The locally compact case, Part Ⅰ, Ann. Inst. Henri Poincare C, 1 (1984), 109–145. https://doi.org/10.1016/S0294-1449(16)30428-0 doi: 10.1016/S0294-1449(16)30428-0
![]() |
[36] |
P. L. Lions, The concentration-compactness principle in the calculus of variations, The locally compact case, Part Ⅱ, Ann. Inst. Henri Poincare C, 1 (1984), 223–283. https://doi.org/10.1016/S0294-1449(16)30422-X doi: 10.1016/S0294-1449(16)30422-X
![]() |
[37] |
T. Bartsch, Z. Q. Wang, Multiple positive solutions for a nonlinear Schrödinger equation, Z. Angew. Math. Phys., 51 (2000), 366–384. https://doi.org/10.1007/PL00001511 doi: 10.1007/PL00001511
![]() |
[38] |
B. Hua, R. Li, The existence of extremal functions for discrete Sobolev inequalities on lattice graphs, J. Differ. Equations, 305 (2021), 224–241. https://doi.org/10.1016/j.jde.2021.10.016 doi: 10.1016/j.jde.2021.10.016
![]() |
[39] | M. Willem, Minimax Theorems, Birkhäuser, Boston, MA, 1996. https://doi.org/10.1007/978-1-4612-4146-1 |
[40] |
H. Brézis, E. Lieb, A relation between pointwise convergence of functions and convergence of functionals, Proc. Am. Math. Soc., 88 (1983), 486–490. https://doi.org/10.2307/2044999 doi: 10.2307/2044999
![]() |
[41] |
G. G. Huang, C. M. Li, X. M. Yin, Existence of the maximizing pair for the discrete Hardy-Littlewood-Sobolev inequality, Discrete Contin. Dyn. Syst., 35 (2015), 935–942. https://doi.org/10.3934/dcds.2015.35.935 doi: 10.3934/dcds.2015.35.935
![]() |
[42] | K. C. Chang, Methods in Nonlinear Analysis, Springer-Verlag, Berlin, 2005. Available from: https://link.springer.com/book/10.1007/3-540-29232-2. |
1. | Markus Schlarb, Knut Hüper, Fátima Silva Leite, A universal approach to interpolation on reductive homogeneous spaces, 2025, 02, 2972-4589, 1, 10.1142/S2972458925500030 |