1.
Introduction
Control theory usually deals with dynamical systems whose current output depends not only on the current value of the input, but also on its past values. Such dynamical systems are often affected by an unexpected input called a disturbance, and failing to suppress its effect adequately on the output despite the use of feedback control means poor control performance. Thus, it is important in such systems to evaluate how the input affects the output in the worst case. For stable continuous-time linear time-invariant (LTI) systems, the Lq/Lp induced norm is a quantitative measure of the worst effect of the input w∈Lp[0,∞) on the output z, viewed as an element in Lq[0,∞), where Lp(I) denotes the Lebesgue space of vector-valued functions on the interval I endowed with the associated Lp norm. Similarly, the Lq/Lp Hankel norm is a quantitative measure of the worst effect of the past input w∈Lp(−∞,0) on the future output z, regarded as an element in Lq[0,∞). These two norms play an important role in evaluating the properties and performance of dynamical systems and control systems; thus, characterization of these norms and relevant issues have been studied intensively [1,2,3,4,5,6,7,8].
Because most control systems nowadays employ digital controllers in their implementation, sampled-data systems constitute a class of important dynamical systems; such a control system with a discrete-time controller for a continuous-time plant is called a sampled-data system, especially when the primary attention is paid to the intersample behavior of the continuous-time signals in the plant, rather than their discrete-time behavior that is viewed on the sampling instants associated with the sampling period h that is inherent to the digital controller (even though the case of non-periodic sampling has also been studied extensively [9]). The viewpoint on this intersample behavior in the periodic sampling case leads us to viewing sampled-data systems as h-periodic systems from the input/output relation viewed in continuous time, even when the continuous-time plant is LTI and the discrete-time controller is also LTI (see Section 2, above (2.3), for the precise meaning of this h-periodicity). This aspect has inspired many important and interesting studies on sampled-data systems, such as dealing with the topics on the H2 norms that are suitably defined for stable sampled-data systems [10,11,12,13,14], as well as the induced norm from L2 to L2 [15,16,17,18,19], the induced norm from L2 to L∞ [14,20] and the induced norm from L∞ to L∞ [21,22,23,24].
These studies on the induced norms of stable sampled-data systems with periodic sampling have recently been extended to studies on the Hankel norm from L2 to L∞ [25,26], the Hankel norm from L2 to L2 [27] and the Hankel norm from L∞ to L∞ [28], through a novel viewpoint focused on amending the somewhat insufficient arguments in the pioneering study [29]. More precisely speaking, the h-periodic nature of sampled-data systems was shown to give rise, in the Hankel norm analysis, to introducing the novel notions of what are called the associated quasi L∞/L2, L2/L2 and L∞/L∞ Hankel operators/norms at each Θ in the sampling interval [0,h). Then, the supremum of the quasi L2/L2 Hankel norms over the sampling interval [0,h) is defined as the L2/L2 Hankel norm, and this is similar for the L∞/L2 Hankel norm and the L∞/L∞ Hankel norm.
Among these operator-theoretic studies, those on the L∞/L2 induced norm and the L∞/L2 Hankel norm for sampled-data systems have shown that they actually coincide with each other [25, Corollary 3.6], [26]. On the other hand, it is known that these two norms coincide with each other also for continuous-time LTI systems [2, Corollary], [3, Theorem 2], in which case they also equal their H2 norm [8], as far as single-output systems are concerned [4, Corollary 3.1 and Remark 3.3]. In this sense, the L∞/L2 induced and Hankel norms for sampled-data systems can be regarded as an important quantity that has a close connection with the H2 norm of sampled-data systems introduced in [11,12], but with a slightly different viewpoint taken for dealing with the intersample behavior. Indeed, such an aspect has been discussed extensively in [14].
With the above situation in mind, the present paper is concerned with alternatively studying the induced norm from L1 to L2 and the Hankel norm from L1 to L2 for stable sampled-data systems for the following reason. For continuous-time LTI systems, these two norms coincide with each other [2, Corollary], [3, Theorem 2] and equal their H2 norm as far as single-input systems are concerned [4, Corollary 3.1 and Remark 3.3]. In this sense, the L2/L1 induced and Hankel norms could also be regarded as an alternative quantity that has a close connection with the H2 norm [11,12] for sampled-data systems. Hence, the L2/L1 induced and Hankel norms for sampled-data systems could be regarded as dealing with the intersample behavior of sampled-data systems through yet another viewpoint, and thus would deserve an independent study in addition to the existing studies for the L∞/L2 case.
This paper is organized as follows. The definitions of the L2/L1 induced norm and the L2/L1 Hankel norm are given for stable sampled-data systems in Section 2. For the operator-theoretic treatment of sampled-data systems for these norms, we employ the lifting treatment [17,18,30], which allows us to deal with the continuous-time input-output relation of sampled-data systems in a discrete-time fashion. This treatment, together with the associated operator-based discrete-time representation of sampled-data systems, is also briefly reviewed in this section. Section 3 gives the characterization of the L2/L1 induced norm of stable sampled-data systems through the use of a matrix-valued function denoted by H(φ),φ∈[0,h), which further admits the numerical computation of this norm as well. Section 4 first gives the characterization of the norm of the quasi L2/L1 Hankel operator, i.e., the quasi L2/L1 Hankel norm at Θ∈[0,h), and this result is further extended to the characterization of the L2/L1 Hankel norm for stable sampled-data systems. These arguments lead directly to the numerical computation methods for the quasi L2/L1 Hankel norm for each Θ∈[0,h), as well as the L2/L1 Hankel norm. In particular, H(φ) also plays a key role for the latter, through the use of which we can establish the fact that the L2/L1 induced norm (for the analysis of which the matrix-valued function H(φ) was first introduced) and the L2/L1 Hankel norms also coincide with each other for sampled-data systems. Furthermore, this section tackles the problem of whether a critical instant exists in the L2/L1 analysis of sampled-data systems, where a critical instant is defined as the maximum-attaining point Θ∈[0,h)—if one exists—at which the associated quasi L2/L1 Hankel norm equals the L2/L2 Hankel norm, i.e., the supremum of the quasi L2/L2 Hankel norms over the sampling interval [0,h). It is also discussed how H(φ) could be used to detect a critical instant, and some relevant issues are further studied. Section 5 then discusses the relationship between the L2/L1 induced/Hankel norm and the H2 norms for sampled-data systems, paying attention to the fact that there are different definitions for the H2 norm of sampled-data systems in the literature (for example, in addition to the aforementioned one given in [11,12], other definitions have been given in [10,14]). In particular, it is shown that the L2/L1 viewpoint could lead to introducing further different definitions for the H2 norm, and that the L2/L1 induced norm has a closer relationship to all these H2 norms than the L∞/L2 induced norm in sampled-data systems. Section 6 gives numerical examples illustrating the arguments developed in this paper and confirm that a critical instant can not only be zero, but also nonzero, depending on the given sampled-data systems. Finally, concluding remarks and future topics are stated in Section 7.
The notation in this paper is as follows. We use Rn and Rn×m to mean the set of n-dimensional real vectors and that of n×m real matrices, respectively. The 2-norm and 1-norm of a finite-dimensional vector are denoted by |⋅|2 and |⋅|1, respectively, i.e., |v|2:=(vTv)1/2 and |v|1:=∑mi=1|vi| for v∈Rm. We use ‖⋅‖L1,p(I) and ‖⋅‖L2(I) to mean the L1,p and L2 norms
respectively, for real-vector-valued functions w and z on the interval I such that the associated right-hand side is well-defined. To simplify the notation and facilitate the arguments, L1(I) is used to mean either or both of L1,1(I) or L1,2(I). For example, we would mean the associate statement with L1(I) replaced by L1,p(I) with p=1, as well as the statement with p=2 at the same time. Otherwise, the statement would refer to some viewpoint that simultaneously applies to both L1,1(I) and L1,2(I). The usage and distinction would be clear from the context. In addition to those in (1.1), this paper uses many other relevant norm symbols, among which most important ones are summarized in Table 1 to facilitate the understanding of the arguments in this paper. Finally, sq(X) with a matrix X is a shorthand notation for XTX, and μ1(⋅) and μ2(⋅) denote the maximum diagonal entry and maximum eigenvalue of a positive semidefinite matrix, respectively.
2.
The L2/L1 induced norm, the L2/L1 Hankel norm and the lifting treatment of sampled-data systems
Consider the stable sampled-data system ΣSD shown in Figure 1, where P denotes the continuous-time generalized plant, and Ψ, H and S denote the discrete-time controller, the zero-order hold and the ideal sampler, respectively, operating with the sampling period h. We suppose that P and Ψ are LTI and are respectively described by
where x(t)∈Rn, w(t)∈Rnw, u(t)∈Rnu, z(t)∈Rnz, y(t)∈Rny, ψk∈RnΨ, yk=y(kh), u(t)=uk (kh≤t<(k+1)h); note that the time instants at which the sampler takes its actions are assumed to be the integer multiples of the sampling period h, and the zero-order hold is assumed to operate in synchrony with the sampler. Throughout the paper, we are interested in how the effect of w∈L1 is suppressed in the response z∈L2 in the sampled-data system ΣSD, and, for the quantitative analysis developed in this paper, this section defines the L2/L1 induced norm and the L2/L1 Hankel norm for ΣSD.
We first define the L2/L1 induced norm of ΣSD as a quantitative measure for the worst effect of the input w∈L1,p[0,∞) affecting the output z that is viewed as an element in L2[0,∞). With the operator T denoting the associated input/output mapping from w to z in ΣSD, its L2/L1,p (or, simply, L2/L1) induced norm is defined by
which is also denoted by ‖ΣSD‖2/1 for simplicity. This definition follows exactly the same line as that in the case of continuous-time LTI systems, for which it is known that the L2/L1 induced norm coincides with the H2 norm [8] in the single-input case [4, Remark 3.3]. Since the H2 norm is a very important measure in feedback control, so is the L2/L1 induced norm for continuous-time LTI systems for the same reason. This paper is focused on developing generalized arguments to cover sampled-data systems in the context of the L2/L1 induced norm (as well as the L2/L1 Hankel norm).
Next, we define the L2/L1 Hankel norm in sampled-data systems along the same line as the preceding studies [25,26,27,28] on the L∞/L2, L2/L2 and L∞/L∞ Hankel norms of sampled-data systems. In sampled-data systems, the input/output behavior between w and z is h-periodic because of the h-periodic actions of the sampler S, the zero-order hold H and discrete-time controller Ψ. What is meant precisely is that the mapping T from w to z (under the treatment of the initial condition given shortly) satisfies that TSτ=SτT for the shift operator Sτ by the delay τ>0 (i.e., (Sτf)(t)=f(t−τ)) when τ is an integer multiple of the sampling period h. Thus, it matters quite significantly, in the studies of the Hankel norms of sampled-data systems, when to take the time instant separating past and future (even though a similar issue with respect to when to take the initial time that has arisen in the studies of induced norms of sampled-data systems does not make any difference after all). For this reason, we introduce Θ∈[0,h) and consider separating past and future at Θ, taking into account the success in such treatment in [27] to amend the treatment of considering only Θ=0 in the pioneering study [29] in the L2/L2 setting, as well as the success, e.g., in [25,26] in the L∞/L2 setting. We then consider the mapping from the past input w∈L1,p(−∞,Θ) to the future output z that is viewed as an element in L2[Θ,∞), assuming that the state at t=−∞ is at the origin. This mapping is denoted by H[Θ]2/(1,p), and we define it as the quasi L2/L1,p (or, simply, L2/L1) Hankel operator at Θ. In addition, we refer to the norm of H[Θ]2/(1,p), defined as
as the quasi L2/L1,p (or L2/L1) Hankel norm at Θ. Finally, we define the L2/L1,p (or L2/L1) Hankel norm of the sampled-data system ΣSD by
by taking the standpoint that the Hankel norm should be defined as a quantitative measure for the worst effect of the past input w on the future output. If the right-hand side of (2.4) is attained as the maximum over Θ∈[0,h), each maximum-attaining point Θ is basically called a critical instant; however, due to some subtle issues and the motivation suggested by some of the main results up to the early part of Section 4, a more precise definition of this term is deferred to the last subsection of Section 4 (i.e., after some of the main results are presented explicitly).
The present paper is devoted to characterizing the L2/L1 induced norm and the L2/L1 Hankel norm for the sampled-data system ΣSD, as well as further clarifying their mutual relationship and their relationship to the several types of H2 norms of the sampled-data system ΣSD that are discussed in the existing literature [10,11,12,13,14]. To facilitate these arguments, we employ the lifted representation of sampled-data systems [17,18,30]; the remaining part of this section is devoted to its brief description.
Given the function f(t), its lifting under the sampling period h is defined by the sequence of
in k. By applying this lifting treatment to the input w and the output z, we obtain the lifted representation of the input-output relation of the sampled-data system ΣSD given by
with ξk:=[xTk ψTk]T (xk=x(kh)). Basically, this representation follows immediately as a result of the expression for the solution x(t) for (2.1) under the given w and u. More precisely, the matrix A and the operators B, C, D are respectively given by
with
The matrix A is Schur-stable. This is simply because the standing assumption on the stability of ΣSD actually refers to the Schur stability assumption of this matrix, precisely speaking. Hence, A has all of its eigenvalues in the open unit disc.
To facilitate the notation in the following arguments, ‖ˆwk‖L1,p[0,h) is briefly denoted by ‖ˆwk‖(1,p), or, more simply, by ‖ˆwk‖1, and ‖ˆzk‖L2[0,h) is simply denoted by ‖ˆzk‖2.
3.
Characterizing the L2/L1 induced norm of sampled-data systems
This section tackles the L2/L1 induced norm of sampled-data systems by clarifying the "worst input" associated with the L2/L1 induced norm of ΣSD. The arguments eventually lead to an explicit computational method of the L2/L1 induced norm.
The L2/L1 induced norm is associated with the assumption that x(0)=0 and ψ0=0 (i.e., ξ0=0), and the associated input/output relation of ΣSD is described by its lifted representation in (2.6) as follows:
By defining ˆw:=[ˆwT0 ˆwT1 ˆwT2 ⋯]T, ˆz:=[ˆzT0 ˆzT1 ˆzT2 ⋯]T and
we are immediately led to the norm-preserving properties of lifting, i.e., ‖ˆz‖2,0+=‖z‖L2[0,∞), as well as ‖ˆw‖1,0+=‖w‖L1[0,∞), where the latter holds for the underlying p=1 and p=2. Applying the triangle inequality and the properties of the relevant norms to (3.1) leads to
where F is the first column of the operator matrix in (3.1) and we use the shorthand notation ‖F‖ to mean sup‖⋅‖1≤1‖F(⋅)‖2,0+ (see Appendix A for more rigorous arguments). This implies that the L2/L1 induced norm ‖ΣSD‖2/1 does not exceed ‖F‖.
On the other hand, it is not hard to see that
by considering the case in which ˆwk=0 (k≥1). Since the rightmost side of (3.4) is nothing but ‖F‖, this, together with (3.3), immediately leads to
To realize further and more explicit characterization of ‖ΣSD‖ and thus the above norm ‖F‖, we next employ the fast-lifting treatment [31] by taking an N∈N. The idea is to introduce h′:=h/N to divide the interval [0,h) into N subintervals, and, given the function f(t) on this interval, its fast-lifting representation is defined by the collection of
We apply fast-lifting to ˆw0 and consider the portion of F that acts on the resulting ˆw0,m, which we denote by Fm (i.e., for f′ defined on [0,h′), the definition of Fm precisely implies that Fmf′=Ff, with f on [0,h) the fast-lifting representation of which satisfies that fm′=δmm′f′ (m′=1,…,N) with the Kronecker delta δmm′, yielding Fˆw0=∑Nm=1Fmˆw0,m). It then follows from ‖Fˆw0‖2,0+≤∑Nm=1‖Fm‖⋅‖ˆw0,m‖L1[0,h′) that ‖F‖ satisfies
as a result of the triangle inequality and the properties of the relevant norms (in a similar fashion to the derivation of (3.3)), where the shorthand notation ‖Fm‖ means that sup‖f′‖L1[0,h′)≤1‖Fmf′‖2,0+ (m=1,…,N). Since the right-hand side of the above inequality implies consideration of the confined situation in which the input ˆw0 of F satisfies that ˆw0,m=0 (m≠m⋆), where m⋆ denotes argmaxm‖Fm‖, it readily follows that ‖F‖≥maxm=1,…,N‖Fm‖. Since the preceding arguments are true regardless of N, this inequality, together with (3.7), leads to
Hence, increasing N leads to the fact that we can associate the value of the induced norm with the situation in which the input is concentrated on an infinitesimally small interval around the worst timing in the interval [0,h).
The remaining part of the arguments for characterizing ‖ΣSD‖2/1 is carried out separately for p=1 and p=2.
The case of p=1. Define F(j)m as the portion of Fm (i.e., its jth column) that acts on the jth entry of the input so that Fmˆw0,m=∑nwj=1F(j)mˆw(j)0,m, where we further define ˆw(j)0,m as the jth entry of the associated input ˆw0,m. Applying once again the triangle inequality ‖Fmˆw0,m‖≤∑nwj=1‖F(j)m‖⋅‖ˆw(j)0,m‖L1[0,h′) and the properties of the relevant norms to the treatment of ‖Fm‖ leads to
in quite a similar fashion to (3.3), but entirely due to a key property of L1,1[0,h′), i.e., ∑nwj=1‖(⋅)(j)‖L1,1[0,h′)=‖⋅‖L1,1[0,h′) (because ∑nwj=1‖(⋅)(j)‖L1,2[0,h′)≠‖⋅‖L1,2[0,h′), we cannot have parallel arguments for p=2 in this context, which is why the arguments for p=1 and p=2 are developed separately). Since the right-hand side of the above inequality (3.9) implies consideration of the confined situation in which the input ˆw0,m of Fm satisfies that ˆw(j)0,m=0 (j≠j⋆), where j⋆ denotes argmaxj‖F(j)m‖, it is immediate that we also obtain that ‖Fm‖≥maxj=1,…,nw‖F(j)m‖. This, together with (3.9), leads to
by which we can associate the value of the induced norm with the situation in which the input is concentrated on the worst single (the j⋆th) input (as well as at the worst timing in [0,h) in the sense that corresponds to the interpretation below (3.8)).
To develop more explicit characterization for the L2/L1,1 induced norm of ΣSD on the basis of the above arguments, we consider the L2 norm of the output z for the situation in which ˆw0 is zero, except for its jth entry, and is applied only on the interval [φ,φ+ϵ) (φ∈[0,h)), with ϵ>0 that is small enough, instead of directly dealing with an infinitesimally small interval mentioned below (3.8). Even though we skip the details for the moment, this situation with the L1,1 norm of the input being 1 essentially corresponds to that with a unit impulse that is applied to the jth entry at t=φ (see Appendix B for more detailed arguments). It is easy to see, e.g., from the arguments in [14]*, that the corresponding output z is described by ˆz0(θ)=Dθ(φ)ej and ˆzk(θ)=CθAkBh(φ)ej for k≥1, where ej denotes the jth column of the identity matrix in Rnw×nw, and
*In fact, this is once again simply an immediate consequence of the expression for the the solution x(t) of (2.1) for given w and u.
with 1(⋅) denoting the unit step function. Hence, the square of the L2 norm of the corresponding output for such a situation is described by
where sq(X):=XTX, and H(j)(φ) denotes the jth diagonal entry of the positive semidefinite matrix H(φ) that is defined as
Recalling that we have to take the worst timing φ, as well as the worst entry j of the input w, we are readily led to one of the main results on the induced norm given as Theorem 3.1 below, where the matrix-valued function H(φ) can be regarded as reflecting the periodically time-varying nature of ΣSD. Note that the second term of H(φ) in ({3.15}) can readily be obtained by computing ∑∞k=0∫h0sq(CθAk)dθ, which (does converge by the Schur stability of A), in turn, can be obtained as the solution Xh to the discrete-time Lyapunov equation ATXhA−Xh+∫h0sq(Cθ)dθ=0, where the integrals on the right-hand side and the second term of H(φ) in (3.15) can also be computed through the well-known technique [32, Equation (2.2)]. These arguments are quite standard; thus the details are omitted.
Theorem 1. The L2/L1,1 induced norm of the sampled-data system ΣSD is given by
where μ1(⋅) denotes the maximum diagonal entry.
The case of p=2. As noted in the arguments for p=1, we cannot have (3.9) for the case of p=2. Hence, we have to develop different arguments for computing ‖F‖ when p=2, but the basic idea is the same as that in the case of p=1 in the sense that we only have to consider the situation in which the input is concentrated on an infinitesimally small interval around the worst timing φ in [0,h). Since (3.9) does not hold for p=2, the only difference is that we are led (eventually in an equivalent fashion) to considering the output z for the unit impulse applied at t=φ for all entries of w. More precisely, we take v∈Rnw such that |v|2=1 and w(t)=vδ(t−φ). Then, the corresponding output in the lifted form is given by ˆz0(θ)=Dθ(φ)v and ˆzk(θ)=CθAkBh(φ)v for k≥1. By the construction of H(φ), it is easy to see that the square of the L2 norm of the corresponding output z for this situation is described by vTH(φ)v, and its maximum over |v|2=1 is given by the maximum eigenvalue of H(φ) by the well-known Rayleigh quotient property. Hence, we are led to another main result on the induced norm, as follows.
Theorem 2. The L2/L1,2 induced norm of the sampled-data system ΣSD is given by
where μ2(⋅) denotes the maximum eigenvalue.
Remark 1. Since |v|1≥|v|2 for each v∈Rnw and thus ‖w‖L1,1[0,∞)≤1 is more restrictive than ‖w‖L1,2[0,∞)≤1, it is obvious that
(with the equality being true, obviously, when nw=1). This inequality can also be confirmed by the properties of μp(⋅), p=1,2, i.e., μ2(X)≥μ1(X) for each positive semidefinite matrix X. Theorems 1 and 2, by the way, clearly show that Proposition 2 of [33] that is stated without proof is completely wrong.
Remark 2. The arguments developed in Appendix B for p=1, justifying the treatment through the unit impulse response, are based on the situation in which the unit impulse is applied to a single entry of w. Nevertheless, the arguments therein are still valid for the case of p=2, in which an impulse is applied to all entries of w through the unit vector v (i.e., |v|2=1). To see this, consider ΣSD with B1 replaced by B1V, where V is an orthogonal matrix whose first column equals v. The L2/L1,2 induced norm remains unchanged by the introduction of such V, while applying the unit impulse at t=φ to all entries of w of the original ΣSD through v is exactly the same as applying the unit impulse at t=φ to the first (and thus single) entry of w to this modified ΣSD.
4.
Characterizing the L2/L1 Hankel norm of sampled-data systems
This section tackles the L2/L1 Hankel norm of the sampled-data system ΣSD and gives its characterization in such a way that the norm can readily be computed explicitly. In particular, it is shown that it actually coincides with the L2/L1 induced norm, as in the case of the continuous-time LTI case [2, Corollary], [3, Theorem 2]. In addition, after a rigorous definition is given for a critical instant, we further investigate the problem of whether or not a critical instant always exists in the L2/L1 Hankel norm analysis. Toward these directions, we start with the arguments on the quasi L2/L1 Hankel norm at Θ∈[0,h).
4.1. Characterizing the quasi L2/L1 Hankel norm at Θ
The quasi L2/L1 Hankel operator at Θ is associated with the assumption that x(−∞)=0, ψ−∞=0 (i.e., ξ−∞=0) and w(t)≡0 (t≥Θ). To facilitate the consideration of the relation between the past input in L1(−∞,Θ) and the future output in L2[Θ,∞) under this assumption, we introduce the operator ΦΘ:L2[0,h)→L2[0,h), given by
Then, the relation between the past input and future output of ΣSD is described by (2.6) as
(where the operator matrix on the right-hand side has the Toeplitz structure if ΦΘ is deleted). By defining ˆwΘ−:=[ˆwT0 ˆwT−1 ⋯]T, ˆzΘ+:=[(ΦΘˆz0)T ˆzT1 ⋯]T and
and noting the assumption that w(t)≡0 (t≥Θ), we once again have the norm-preserving properties of lifting, i.e., ‖ˆzΘ+‖2,0+=‖z‖L2[Θ,∞), as well as ‖ˆwΘ−‖1,0+=‖w‖L1[−∞,Θ), where the latter holds for p=1 and p=2. In addition, with the columns of the operator matrix in (4.2), we define Fk (k=0,1,2,…) as
Then, applying the triangle inequality and the properties of the relevant norms to (4.2) (in a similar fashion to the derivation of (3.3)) leads to
where F−k denotes the portion of Fk that acts on the input on the interval [0,Θ), while F+k denotes the portion of Fk that acts on the input on the interval [Θ,h) (the precise definitions follow essentially the same technique used in defining Fm from F in Section 3); also, for a vector-valued function on [0,h), its restrictions onto the subintervals [0,Θ) and [Θ,h) are denoted by (⋅)− and (⋅)+, respectively. Furthermore, ‖F−k‖ and ‖F+k‖ are defined accordingly in the obvious fashion. Here, we readily see that Fk corresponds to F0 with the first k entries removed and with (the norm-contracting operator) ΦΘ applied on the resulting first entry. This immediately leads to ‖F−0‖≥‖F−k‖ (k≥1) and ‖F+1‖≥‖F+k‖ (k≥2); thus, it is easy to see from (4.5) that
because
where the inequality follows by hypothesis.
On the other hand, we readily have that ‖z‖L2[Θ,∞)≥max(‖F−0‖,‖F+1‖) when ‖w‖L1(−∞,Θ)≤1 by essentially the same arguments as those leading to (3.4). This, together with (4.6), implies that
To realize further and more explicit characterization of the above quantity (i.e., the quasi L2/L1 Hankel norm at Θ), we apply the fast-lifting treatment as in the case of the L2/L1 induced norm studied in the preceding section, but with a slightly modified fashion. That is, for the treatment of ‖F−0‖, the interval [0,Θ) is divided into N subintervals, and, for the treatment of ‖F+1‖, the interval [Θ,h) is divided into N subintervals. Following essentially the same arguments as those in the preceding section leads to the observation that the quasi L2/L1 Hankel norm at Θ can be associated with the situation in which the past input is concentrated on an infinitesimally small interval around the worst timing in the interval [0,Θ) or [Θ−h,0). This is essentially the same as the situation in which, given Θ∈[0,h), a unit impulse is applied at the worst timing Θ−ϕ (∈[Θ−h,Θ)) for some ϕ∈(0,h]; also, the quasi L2/L1 Hankel norm at Θ corresponds to the L2 norm of the corresponding worst output z that is evaluated over the interval after Θ, i.e., [Θ,∞). Now, let
for the underlying Θ∈[0,h). That is†, instead of representing the worst timing relative to Θ through ϕ, we introduce φ to represent the worst timing relative to the sampling instant 0, taking into account the use of the lifting treatment. Then, φ∈[0,h) is such that it is the worst timing for the unit impulse in the aforementioned sense when φ∈[0,Θ), and φ−h is the worst timing when φ∈[Θ,h). Hence, similar to the arguments about (3.14) in the preceding section (where the response z was considered in the lifted form by using (2.6) for the impulse input w applied at φ∈[0,h)), we see that, if φ∈[0,Θ), then the corresponding output ˆz0(θ) can be represented by Dθ(φ) and ˆzk(θ) can be represented by CθAk−1Bh(φ) for k≥1. Similarly, if φ∈[Θ,h) (for which the worst timing φ−h is negative but no smaller than −h, so that Dθ(φ) becomes irrelevant and only CθAkBh(φ) plays a role in representing the output z after t=Θ≥0), then the corresponding output ˆzk can be represented by CθAkBh(φ) for k≥0 as an immediate result of applying (2.6). Through the above observations, we are led to introducing the positive semidefinite matrices
†We remark that, in the treatment throughout this paper, the timing φ=0 (or Θ−ϕ=0) corresponds to the instant immediately after the sampler takes its action, while φ=h−0 (or Θ−ϕ=−0) corresponds to the instant immediately before the sampler takes its action.
for φ∈[0,Θ) and [Θ,h), respectively, which can also be computed easily by using the solution of an appropriate discrete-time Lyapunov equation (see essentially the same remark before Theorem 1). The unit impulse mentioned above, by the way, actually corresponds to vδ(t−φ) (or vδ(t−(φ−h)) if φ∈[Θ,h)) with the delta function δ(t), where v=ej for some j=1,…,nw for p=1, while |v|2=1 for p=2. In addition, it is obvious from the construction of the above FΘ,0(φ) and FΘ,1(φ) (taking account of the aforementioned ˆzk) that ‖z‖2L2[Θ,∞)=‖ˆzΘ+‖22,0+ corresponds to vTFΘ,0(φ)v or vTFΘ,1(φ)v in such a situation with v; we further have to consider the worst v among those mentioned above. Since the quasi L2/L1 Hankel norm at Θ is also associated with the worst timing φ with respect to Θ, we are led to the following result on the quasi L2/L1 Hankel norm, where μp(⋅) (p=1,2) are as stated in Theorems 1 and 2, respectively.
Theorem 3. The quasi L2/L1 Hankel norm at Θ of the sampled-data system ΣSD is given by
4.2. Characterizing the L2/L1 Hankel norm
Since the quasi L2/L1 Hankel norm at Θ has been characterized by Theorem 3, we have, in principle, also successfully characterized the L2/L1 Hankel norm through the use of (2.4). Such characterization, however, obviously leads to a double supremum in Θ∈[0,h), as well as φ∈[0,h). This is not only inconvenient, it is actually redundant in a sense. Indeed, Theorem 4, given shortly, shows through the following lemma that the L2/L1 Hankel norm of ΣSD can actually be characterized by simply applying a single supremum in φ∈[0,h).
Lemma 1. For the positive semidefinite matrices X and Y of the same size, we have
We skip the proof of this lemma because (the proof is quite simple and) it is well known.
Theorem 4. The L2/L1 Hankel norm of the sampled-data system ΣSD is given by
Before proceeding to the proof, we note that FΘ,0(φ) as defined in (4.10) for φ∈[0,Θ), has a continuous extension for φ=Θ by the continuity of Bh(⋅) on [0,h) and that of Dθ(⋅) on [0,θ]; also, FΘ,1(φ) as defined in (4.11) for φ∈[Θ,h), has a continuous extension for φ=h. With this observation taken into account, the subsequent part of this paper takes the standpoint that we may refer to FΘ,0(Θ) and FΘ,1(h). Similarly, H(φ) as defined in (3.15) for φ∈[0,h), has a continuous extension for φ=h; thus, we further take the standpoint that we may refer to H(h), which is nothing but limφ→h−0H(φ). Under this standpoint, it readily follows from (3.15) and (4.10) that
since Dθ(φ)=0 for 0≤θ<φ. We further note from the direct comparisons with (3.15), (4.10) and (4.11) that‡
‡To see the equalities regarding H(h) in (4.19), note that Dθ(h)=0 for θ∈[0,h) by (3.13); thus, the first term of H(h) in (3.15) vanishes. Other relations follow quite immediately from (3.15), (4.10) and (4.11).
where the range for Θ is also extended in a similar fashion to [0,h] with respect to FΘ,0(φ) and FΘ,1(φ) in (4.18)–(4.20).
Proof of Theorem 4. It readily follows from (4.12) that
It further follows from the aforementioned arguments on continuous extension, together with (4.15), that
and this, together with (2.4), implies that
where the last equality follows since H(φ) is continuous at φ=0.
To show the converse inequality, we note (4.16). Then, applying Lemma 1 leads to
Similarly, by noting (4.17), we readily have
Hence, the converse inequality of (4.23) has been established by (4.12) together with (2.4), and this completes the proof. □
Furthermore, Theorem 4, together with Theorems 1 and 2, immediately leads to the following result.
Corollary 1. Let p=1 or p=2. The L2/L1,p Hankel norm coincides with the L2/L1,p induced norm in the sampled-data system ΣSD.
Just for reference, we note that parallel results for sampled-data systems have also been obtained between the L∞/L2 induced norm and Hankel norm [25, Corollary 3.6], [26], as well as between the L∞/L∞ induced norm and Hankel norm [28, Corollary 2] for the case in which the direct feedthrough matrix D11 from w to z in the continuous-time plant P is zero§.
§This matrix must be zero for the L2/L1 and L∞/L2 settings so that the associated induced norm is well-defined.
4.3. Critical instant and its existence
The remaining part of this section is devoted to the arguments on critical instants in the L2/L1 Hankel norm analysis, such as whether or not a critical instant always exists, together with the relevance of such arguments to the properties of the matrix-valued function H(φ). These aspects studied in this subsection are mutually related in quite a deep fashion; thus, some preliminary comments on the mutual relevance will be crucial before proceeding with the arguments in this subsection.
First, a rough (but not entirely rigorous) definition of a critical instant is, as stated after (2.4), an instant Θ=Θ⋆∈[0,h)—if one exists—such that the L2/L1 Hankel norm given by the left-hand side of (2.4) equals the quasi L2/L1 Hankel norm at Θ⋆, i.e., ‖H[Θ⋆]2/(1,p)‖. Despite the definition of the L2/L1 Hankel norm through the use of the supremum over Θ∈[0,h) in (2.4), on the other hand, we have established that it can actually be obtained through the treatment of the matrix-valued function H(φ) that does not involve Θ at all. This motivates us to tackle an interesting problem of whether (not merely the L2/L1 Hankel norm but as an addition) a "critical instant" can also be somehow detected only through the analysis of some properties of the matrix-valued function H(φ). To be helpful in such a direction of arguments, it turns out that it makes sense to employ the following rigorous definition of a critical instant by modifying the aforementioned rough definition.
Definition 1. In the L2/L1,p Hankel norm analysis, the instant Θ⋆∈(0,h) is called a critical instant if
while the instant Θ⋆=0 is called a critical instant if
Note that, underlying the above definition, in some sense, is a later-shown fact that ‖H[0]2/(1,p)‖≠‖H[+0]2/(1,p)‖, in general (see Remark 3 given later); however, there are also two main reasons why the case of the critical instant Θ⋆ at 0 is treated separately (as opposed to the L∞/L2 Hankel norm analysis [25,26] and the L2/L2 Hankel norm analysis [27]), as follows. First, when (4.26) holds, it roughly implies that the "worst input" in the L2/L1 Hankel norm analysis is close to an impulse applied at t=Θ⋆, but, if such Θ⋆ is 0, then this instant is a sampling instant; thus, an intuitive interpretation of applying an impulse at this specific instant becomes somewhat hard due to the possible two interpretations of (ⅰ) applying it just before the sampler takes its action, and (ⅱ) applying it just after that action. In this respect, the treatment in (4.27) corresponds to dealing with Θ⋆=0 only as the limit Θ⋆→+0; thus, it helps us to circumvent the aforementioned subtleties in intuitive understanding. The second reason is more mathematical, and it is crucial in leading us to more sophisticated overall arguments that take into account whether a critical instant can be detected only through the analysis of the matrix-valued function H(φ). By the end of this subsection, the rationale for the above definition would become much clearer.
With the above precise definition of a critical instant, we investigate the issue of whether a critical instant Θ=Θ⋆ always exists for the sampled-data system ΣSD, taking into account the relevant studies on the quasi L∞/L2 Hankel norm analysis [25,26] and the quasi L2/L2 Hankel norm analysis [27]. We first state the following result, where we say that μ1/2p(H(φ)) is maximum-attaining on [0,h) (for the underlying p=1 or p=2) if there exists a maximum-attaining point φ⋆∈[0,h) such that μ1/2p(H(φ⋆))=supφ∈[0,h)μ1/2p(H(φ)).
Theorem 5. Let p=1 or p=2, and suppose that μ1/2p(H(φ)) is maximum-attaining on [0,h). Then, for each maximum-attaining point φ⋆∈[0,h) of μ1/2p(H(φ)), the instant Θ=φ⋆ is a critical instant of ΣSD in the L2/L1,p Hankel norm analysis.
Proof. By the hypothesis of this theorem, together with Corollary 1 and Theorems 1 and 2, we have
On the other hand, it readily follows from (4.22) that
by the continuity of H(φ) at φ=0. Now, if φ⋆≠0, then let Θ=φ⋆ in (4.22) and apply (4.28). If φ⋆=0, on the other hand, then apply (4.28) to (4.29). Then, we have
which obviously implies that both sides actually coincide with each other (i.e., the inequality can, in fact, be replaced by an equality) in both inequalities; thus, Θ=φ⋆ is a critical instant. This completes the proof. □
Note that Theorem 5 leads us to be interested in whether or not H(h)=limφ→h−0H(φ) coincides with H(0). This is because, if they coincide with each other, then it is obvious that μ1/2p(H(φ)) is maximum-attaining on [0,h); thus, the existence of a critical instant is ensured by this theorem. However, the definition of H(φ) in (3.15) does not lead to such a general relation. To get around the difficulty, we can derive the following result.
Proposition 1. The following inequalities hold for p=1,2, where ‖H[h]2/(1,p)‖ is a shorthand notation for limΘ→h−0‖H[Θ]2/(1,p)‖.
Once this result is established, we immediately have the following result (Proposition 1 immediately leads to the first assertion; in particular, we are led, under the hypothesis of the following proposition, to ‖H[+0]2/(1,p)‖=‖ΣSD‖H,2/(1,p), which is nothing but the second assertion).
Proposition 2. Suppose that p=1 or p=2. If sup0≤φ<hμ1/2p(H(φ))=μ1/2p(H(h)), then ‖H[h]2/(1,p)‖=‖H[+0]2/(1,p)‖ and Θ=0 is a critical instant.
The above result leads to the following result, which covers the situation that is not handled by Theorem 5.
Theorem 6. Let p=1 or p=2, and suppose that μ1/2p(H(φ)) is not maximum-attaining on [0,h). Then, Θ=0 is a critical instant of ΣSD in the L2/L1,p Hankel norm analysis.
Proof. It is obvious that, if μ1/2p(H(φ)) is not maximum-attaining on [0,h), then sup0≤φ<hμ1/2p(H(φ))=μ1/2p(H(h)). Hence, the assertion follows from Proposition 2. □
Finally, the following result is an immediate consequence of Theorems 5 and 6.
Corollary 2. The sampled-data system ΣSD always has a critical instant in the L2/L1 Hankel norm analysis for p=1,2. In particular, at least one critical instant can be obtained through the analysis of H(φ) on [0,h).
The first assertion of the above result is similar to that for the quasi L2/L2 Hankel norm analysis [27, Theorem 3], but it is in sharp contrast to that for the quasi L∞/L2 Hankel norm analysis [25,26] for the sampled-data system ΣSD, where a counterexample is provided. For the second assertion, it should be quite important to note that a critical instant Θ=Θ⋆ can be detected from H(φ), whose definition does not involve Θ at all.
Proof of Proposition 1. The equalities in (4.31) and (4.32) are nothing but (4.14); thus, the second inequalities in (4.31) and (4.32) are obvious. With the arguments about continuous extension that are stated below Theorem 4, it follows from (4.12), together with (4.15), that
where the first equality follows from (4.12), and we used (4.19) in the second equality. On the other hand, the fact that FΘ,1(φ)→F0,1(φ) as Θ→+0 uniformly with respect to φ∈[0,h) was used in the second equality, and (4.19) was used again in the last equality. Similarly, it follows from (4.12) that
where the second equality holds since FΘ,0(φ)→Fh,0(φ) as Θ→h−0 uniformly with respect to φ∈[0,h), the third equality follows from (4.18) and the fourth equality follows from the continuous extension arguments. To see the fifth equality, we note that F0,1(h)=Fh,0(h)≥Fh,1(h) by (4.19) and (4.20), and the last equality follows again from (4.19). This completes the proof. □
Remark 3. If we directly consider ‖H[0]2/(1,p)‖ rather than ‖H[+0]2/(1,p)‖, we readily see from (4.12) that μ1/2p(F0,0(0)) (=μ1/2p(H(0))) should be treated as an empty object in (4.33). This implies that we have established
and, thus,
In particular, the first inequality of (4.34) would change into an equality if the left-hand side were ‖H[0]2/(1,p)‖, and this implies, by (4.35), that we have established
Supposing the standpoint that we are interested in ‖H[h]2/(1,p)‖, the above equality implies that it is redundant to also be interested in ‖H[0]2/(1,p)‖, and it would be more informative to be interested in ‖H[+0]2/(1,p)‖ instead.
We have the following result that is relevant to Proposition 1 and Remark 3; this corollary can also lead immediately to the first assertion of Corollary 2, but it does not seem helpful in the derivation of some of the preceding results, e.g., Theorem 6 and thus the second assertion of Corollary 2.
Corollary 3. The following inequality holds for p=1 and p=2.
Proof. The assertion follows immediately from (4.33) and (4.35). □ Furthermore, we have the following result.
Theorem 7. Let p=1 or p=2, and suppose that μ1/2p(H(φ)) is either (i) not maximum-attaining on [0,h), or (ii) maximum-attaining on [0,h) and (μ1/2p(H(h))=) limφ→h−0μ1/2p(H(φ)) equals the maximum. Then, the L2/L1,p Hankel norm ‖ΣSD‖H,2/(1,p) is given by ‖H[0]2/(1,p)‖ (=‖H[h]2/(1,p)‖). In other words, the right-hand side of (2.4) is attained as the maximum at Θ=0.
Proof. It follows from (4.37) that
where the continuous extension arguments and (4.19) are used. On the other hand, the hypothesis (i) implies that μ1/2p(H(h))=sup0≤φ<hμ1/2p(H(φ)); thus,
by (4.14), while the hypothesis (ⅱ) also implies (4.42) immediately, because the maximum must equal the quasi L2/L1,p Hankel norm by (4.14). It then follows from (4.41) and (4.42), together with (2.4), that the inequality in (4.41) must, in fact, be an equality. This, together with (4.39) and (4.42), completes the proof. □
Note that the above theorem is closely related to Theorem 6, but that the theorem was relevant to the limit Θ→+0 by virtue of Definition 1 for a critical instant (at 0), while the above theorem is associated with the direct treatment of Θ=0; the latter implies that the statement of Theorem 6 would remain valid even if the definition of a critical instant were modified in such a way that only (4.26) was used also for the case of Θ⋆=0. Similarly, under such an "alternative" definition for a critical instant, it readily follows from Theorem 7 (under the hypothesis (ⅱ)) that the statement of Theorem 5 would remain valid if the slight additional requirement of μ1/2p(H(h))=μ1/2p(H(0)) were added in the hypothesis for the case of φ⋆=0. More specifically, we have the following result also covering the case in which the condition about this additional requirement does not hold.
Theorem 8. Let p=1 or p=2, and suppose that μ1/2p(H(φ)) attains the maximum over φ∈[0,h) at φ=0. If μ1/2p(H(h))=μ1/2p(H(0)), then ‖H[Θ]2/(1,p)‖ attains the maximum over Θ∈[0,h) at Θ=0. Otherwise, i.e., if μ1/2p(H(h))<μ1/2p(H(0)), then the following properties (I) and (II) hold:
(I) (Not only for Θ0=0, but) for every Θ0∈[0,h), the quasi L2/L1,p Hankel norm ‖H[Θ]2/(1,p)‖ attains the maximum over Θ∈[0,h) at Θ=Θ0, provided that the following condition is satisfied for the subsystem, denoted by P11, of the generalized plant P restricted to the input w and the output z:
(i-1) When p=1: There exists j⋆ such that P(j⋆)11=0 and μp(H(0)) equals the j⋆th diagonal entry of H(0), where P(j)11 denotes the jth column of P11 associated with the jth entry of w(t);
(i-2) When p=2: There exists v⋆ such that P11v⋆=0, |v⋆|2=1 and μp(H(0)) equals (v⋆)TH(0)v⋆.
(II) The quasi L2/L1,p Hankel norm at 0, i.e., ‖H[0]2/(1,p)‖, is less than the L2/L1,p Hankel norm ‖ΣSD‖H,2/(1,p) of ΣSD, provided that the following conditions are satisfied for P11:
(ii-1) When p=1, there exists no j such that P(j)11=0;
(ii-2) When p=2, there exists no v≠0 such that P11v=0.
Proof. We first prove the first assertion. If μ1/2p(H(h))=μ1/2p(H(0)), then the hypothesis of this theorem implies that μ1/2p(H(h)) equals the maximum of μ1/2p(H(φ)) over φ∈[0,h). Hence, the assertion follows from Theorem 7 (under its hypothesis (ⅱ)).
We next prove part (I) of the second assertion. Note that, if μ1/2p(H(h))≠μ1/2p(H(0)), then μ1/2p(H(h))>μ1/2p(H(0)) can never be the case given the hypothesis that μ1/2p(H(φ)) attains the maximum over φ∈[0,h) at φ=0; thus, the opposite inequality holds (as asserted): μ1/2p(H(h))<μ1/2p(H(0)). Comparing the definition of H(φ) in (3.15) and that of F0,1(φ) in (4.11), we see that
We first consider the case of p=1. By the hypothesis (ⅰ-1) on P11 for p=1, taking the j⋆th diagonal entry of both sides of (4.43) with φ=0 leads to
(a1) μ1/2p(H(0))=F(j⋆)0,1(0)1/2
by the definition of j⋆, where F(j)Θ,i(φ) denotes the jth diagonal entry of FΘ,i(φ) for i=0,1. Similarly, since the j⋆th diagonal entry of the first term of FΘ,0(φ) in (4.10) vanishes under the hypothesis on P11, it readily follows that
(b1) F(j⋆)Θ,0(φ) is independent of Θ and equals F(j⋆)0,1(φ) for every φ∈[0,h)
by (4.11). Hence, for Θ>0, we have
where the first equality follows from (b1), the second equality follows from (a1) and the last equality follows from the hypothesis of the theorem and (4.14). For Θ=0, on the other hand, we also have
in a similar fashion. The above observations, together with (4.12), imply that ‖H[Θ]2/(1,p)‖≥‖ΣSD‖H,2/(1,p) for each Θ∈[0,h), which, in turn, implies that ‖H[Θ]2/(1,p)‖=‖ΣSD‖H,2/(1,p) for each Θ∈[0,h) by (2.4). This completes the proof of the first assertion for p=1.
We next prove part (Ⅰ) of the second assertion for p=2, following similar steps to those for the case of p=1. It follows from (4.43) that
(a2) μ1/2p(H(0))=(v⋆)TF0,1(0)v⋆
by the hypothesis (ⅰ-2) on P11 for p=2 and the associated definition of v⋆; the comparison of (4.10) and (4.11) leads to the following:
(b2) (v⋆)TFΘ,0(φ)v⋆ is independent of Θ and equals (v⋆)TF0,1(φ)v⋆ for every φ∈[0,h)
Taking (v⋆)TFΘ,0(φ)v⋆ and (v⋆)TFΘ,1(φ)v⋆ instead of F(j⋆)Θ,0(φ)1/2 and F(j⋆)Θ,1(φ)1/2, respectively, in the above arguments for p=1, we can readily complete the proof.
We next prove part (Ⅱ) of the second assertion. Noting that ‖H[0]2/(1,p)‖ cannot exceed ‖ΣSD‖H,2/(1,p) according to (2.4), suppose that ‖H[0]2/(1,p)‖=‖ΣSD‖H,2/(1,p). By showing that we are then led to contradiction, the proof will be completed.
Assuming the above equality means that we should have ‖H[0]2/(1,p)‖=‖H[+0]2/(1,p)‖, because of (4.38), together with (2.4). Hence, it follows from (4.37) that
This, together with (4.17) with Θ set to 0, implies that
where the last equality follows from the hypothesis of this theorem. Thus, the inequalities must, in fact, be equalities, i.e.,
We first consider the case of p=1. By the condition (ⅱ-1), it follows from (4.43) that the jth diagonal entry of H(φ) satisfies that H(j)(φ)>F(j)0,1(φ) for each j=1,…,nw and every φ∈[0,h); thus,
Even though [0,h) is not a compact set, we can show that taking the supremum over [0,h) on both sides of the above inequality leads to
by taking account of the present hypothesis that μ1/2p(H(φ)) attains the maximum over φ∈[0,h) at φ and μ1/2p(H(h))<μ1/2p(H(0)). To see this, we take h0∈[0,h) such that μ1/2p(H(φ))≤μ1/2p(H(0))−d/2 whenever φ∈[h0,h), where d:=μ1/2p(H(0))−μ1/2p(H(h))>0. This, together with (4.49), leads to μ1/2p(H(0))>μ1/2p(H(0))−d/2≥suph0≤φ<hμ1/2p(F0,1(φ)), while sup0≤φ≤h0μ1/2p(H(φ))>sup0≤φ≤h0μ1/2p(F0,1(φ)) immediately follows from (4.49). Hence, we have that sup0≤φ<hμ1/2p(F0,1(φ))<max(μ1/2p(H(0)),sup0≤φ≤h0μ1/2p(H(φ)))≤max(μ1/2p(H(0)),sup0≤φ<hμ1/2p(H(φ)))=sup0≤φ<hμ1/2p(H(φ)), where the last equality follows from the hypothesis of this theorem. We have thus established (4.50) as claimed, but it obviously contradicts (4.48). This completes the proof for the case of p=1.
We finally prove part (Ⅱ) of the second assertion for p=2. By the condition (ⅱ-2), it follows from (4.43) that vTH(φ)v>vTF0,1(φ)v for every v such that |v|2=1; thus, (4.49) also holds for p=2. Then, (4.50) follows also for p=2 by exactly the same arguments as those for p=1. Hence, the proof is completed by the contradiction between (4.48) and (4.50). □
Regarding the above theorem, neither of the conditions (ⅰ-1) or (ⅱ-1) holds if P(j)11=0 for some j=1,…,nw, but, for none of such j, the jth diagonal entry of H(0) is the maximum among all of the diagonal entries; a similar situation exists where neither of the conditions (ⅰ-2) or (ⅱ-2) holds. We have the following result in connection with such a situation. The proof is given in Appendix C.
Corollary 4. The assertion (II) of Theorem 8 remains valid even if the conditions (ii-1) and (ii-2) are replaced by the following conditions, respectively:
(ii-1) when p=1, the condition (i-1) fails and φ=0 is the only maximum-attaining point of μ1/2p(H(φ)) on [0,φ);
(ii-2) when p=2, the condition (i-2) fails and φ=0 is the only maximum-attaining point of μ1/2p(H(φ)) on [0,φ).
Since we can apply such further alternative arguments through the use of the present definition of a critical instant, i.e., Definition 1 with the separate treatment (4.27), it is regarded as reasonable and useful.
5.
Relationship among the L2/L1 induced/Hankel norm and H2 norms of ΣSD
We have characterized the L2/L1 induced norm, as well as the L2/L1 Hankel norm for the stable sampled-data system ΣSD in the preceding sections. More precisely, we have clarified their numerical computation methods for p=1 and p=2, and it has been shown that the L2/L1 induced/Hankel norms are related to (or can alternatively be interpreted as) evaluating the L2 norm of the output z for some unit impulse applied to the input w at the worst timing φ∈[0,h).
In the stable continuous-time LTI systems, on the other hand, the H2 norm plays an important role as a qualitative measure for evaluating the worst effect of the input to the output. Furthermore, it is well known that the time-domain interpretation of the H2 norm is also relevant to the evaluation of the L2 norm of the output for the unit impulse input (which may be assumed to be applied at t=0 due to the time-invariance). In particular, the H2 norm is known to coincide with the L2/L1 induced norm for the single-input case [4, Remark 3.3]. These observations naturally lead one to be interested in the relationship between the L2/L1 induced norm and the H2 norm also for the sampled-data system ΣSD, and this section is devoted to studying such a relationship, taking account the fact that has been established in the preceding sections that the L2/L1 Hankel norm coincides with the L2/L1 induced norm, both for p=1 and p=2, for the sampled-data system ΣSD (i.e., we will not directly mention the L2/L1 Hankel norm of ‖ΣSD‖2/1 in the following paragraphs, even though we do mention the quasi L2/L1 Hankel norms for reasons that will become clear later).
Before proceeding to such arguments, however, it should be first noted that there are several different time-domain definitions of the H2 norms for ΣSD because of the h-periodicity of the associated input-output mapping, depending on how to take this periodicity into account when defining the H2 norm of ΣSD. We begin by summarizing these definitions (just for reference, some comments on the frequency-domain treatment of sampled-data systems and their H2 norm are given in the remark at the end of this section).
The first definition was introduced in [10], where the unit impulse was assumed to be applied only at t=0 (or, more precisely, t=−0, i.e., just before the sampler takes its action at t=0) to the input, in spite of the h-periodicity of the input-output mapping. When ΣSD is a multi-input system, then such a unit impulse is applied independently to each entry of the input, and the square of the L2 norm of the associated response is summed over the independently handled unit impulse inputs. The square root of the sum was then defined as the H2 norm of ΣSD. In what follows, this definition of the H2 norm is denoted by ‖ΣSD‖[−0]H2 (which, as well as all of the H2 norms introduced below, has no distinction between p=1 and p=2), and it turns out that it can be represented as¶
¶Even though H is not an h-periodic function, the input-output behavior of the sampled-data system ΣSD is h-periodic. Hence, note that an impulse input applied at t=h−0 is essentially the same as that at t=−0.
with H(φ) given by (3.15), where tr(⋅) denotes the trace of a matrix.
The second definition was introduced in [11,12] by modifying the somewhat unnatural treatment in the above definition that considers that the unit impulse is applied only at (or just before) the sampling instant t=0. The modification was made by considering the situations in which the unit impulse is applied at t=φ∈[0,h) rather than only at t=0. The sum of the squares of the L2 norms is then considered for each φ∈[0,h), and the square root of the average of such sums was defined as the H2 norm of ΣSD, which we denote by ‖ΣSD‖[0,h)H2. In other words,
The third definition was introduced in [14] in a similar fashion to the above second definition, but with the supremum taken instead of the average over φ∈[0,h). The associated H2 norm is denoted by|| ‖ΣSD‖[φ⋆]H2, where [φ⋆] should be regarded as a "non-numeric" symbol, referring to the standpoint of considering the "worst φ" in the sampling interval [0,h). Namely,
||We remark, just in case, that the exact notation used in [14] was ‖ΣSD‖[τ⋆], but the present paper employs φ⋆ instead of τ⋆ so that the relevance of this third definition to the arguments of the present paper with the variable φ becomes much clearer.
We could also introduce other relevant quantities for ΣSD by replacing the square of the L2 norm for φ in the second and third H2 norm definitions with the square of the quasi L2/L1 Hankel norm at φ. This is because (ⅰ) ΣSD reduces to a continuous-time LTI system when Ψ=0 (in which case the quasi L2/L1 Hankel norm at φ is obviously independent of φ and is nothing but the L2/L1 Hankel norm itself), (ⅱ) the L2/L1 Hankel norm coincides with the L2/L1 induced norm for continuous-time LTI systems [2, Corollary], [3, Theorem 2] and (ⅲ) the L2/L1 induced norm coincides with the H2 norm for continuous-time single-input LTI systems [4, Remark 3.3]. Hence, we see that each of these two quantities does reduce to the standard H2 norm of continuous-time systems for the special case with Ψ=0 and a single input; thus, they could be meaningful also for a single-input sampled-data system. The above observation would suggest two other definitions of the H2 norm of ΣSD that are not confined to the single-input case once we introduce the following quantity instead of (4.12) (with Θ replaced by φ in accordance with the situation in the present arguments, which thus involves replacing the original φ in (4.12) with θ):
More precisely, further new definitions of the H2 norm for ΣSD could be introduced (through the viewpoint of the quasi L2/L1 Hankel operator at φ) by replacing with the above ftr(φ) the L2 norm for φ in the second and third definitions. The resulting definitions are denoted by ‖ΣSD‖quasi,[0,h)H2 and ‖ΣSD‖quasi,[φ⋆]H2, respectively, and are given as follows:
Regarding the relationship among these possible definitions of the H2 norm and the L2/L1 induced norm for the sampled-data system ΣSD, we have the following result.
Theorem 9. The L2/L1 induced norm and H2 norms of the sampled-data system ΣSD satisfy the following relations:
In particular, for ΣSD with a single input (i.e., when nw=1), the following relation holds:
Proof. The proof is mostly straightforward if we note the relationship between the average and the supremum, as well as that between μp(⋅) (p=1,2) and tr(⋅). Indeed, we are thus led to (5.7), (5.9) and (5.10), as well as ‖ΣSD‖quasi,[0,h)H2≤‖ΣSD‖quasi,[φ⋆]H2. Furthermore, the equality assertion in (5.8) follows from essentially the same arguments as the proof of Theorem 4 by the use of the same inequalities of Fφ,0(θ)≤H(θ) and Fφ,1(θ)≤H(θ) used therein. Similarly, it follows from (5.4) that
where the two equalities follow from essentially the same arguments as those in the earlier half of the proof of Theorem 4. Hence, we are led to ‖ΣSD‖quasi,[0,h)H2≥‖ΣSD‖[0,h)H2. Combining these results leads to (5.8), and this completes the proof. □
Remark 4. Some of the above inequalities have been derived already in [14], where the relationship between the second H2 norm and the L∞/L2 induced norm was discussed for the sampled-data system ΣSD by introducing
We then have
and thus the second H2 norm can be represented by the use of F(θ), while the L∞/L2 induced norm (which equals the L∞/L2 Hankel norm) is also represented by the use of F(θ) [14,25,26]. Even though such an observation plays a key role in studying the relationship between the second H2 norm and the L∞/L2 induced norm in [14], it is not easy to relate (the non-integrated) tr(F(θ)) and tr(H(φ)) directly. Hence, no definite relations exist between the L∞/L2 induced (Hankel) norm and the first H2 norm, as well as between the former and the third H2 norm (see Examples 1–3 in [14]). Since we can combine (5.8) or (5.9) with (5.7), we could say that the L2/L1 induced (Hankel) norm is more deeply related than the L∞/L2 induced (Hankel) norm to different definitions of the H2 norms for the sampled-data system ΣSD.
Remark 5. There are frequency-domain studies of sampled-data systems, such as [34,35], which are entirely free from the time-domain lifting technique; the H2 norm is also defined in such a context of studies as in [13]. Alternatively, the transfer operator can be introduced to sampled-data systems as in [30] by applying the lifting technique used in the present paper, through which the H2 norm can also be introduced. In fact, the second definition [12] corresponds to an equivalent time-domain interpretation of such a definition.
6.
Numerical examples
This section studies some numerical examples to demonstrate the theoretical results on the quasi L2/L1 Hankel norms and the L2/L1 Hankel norm, as well as to confirm the relations among the L2/L1 induced norm and H2 norms of sampled-data systems.
In the following numerical analysis, the supremum of a function over [0,h) is computed as the maximum over ΦM:={0,h′,⋯,Mh′} with h′:=h/M and M=200, which includes h. This is because each relevant function has a continuous extension to the closed interval [0,h] and the above maximum tends to the relevant supremum as M→∞. For the same reason, the H2 norm defined by (5.1) is computed as ‖ΣSD‖[−0]H2=tr1/2(H(h)). Similarly, the averages in (5.2) and (5.5) are computed approximately by using those for the M+1 values of φ in ΦM.
Example 1. Consider the stable two-input (nw=2) sampled-data system with h=0.02 and
Example 2. Consider the stable three-input (nw=3) sampled-data system with h=0.02 and
For these examples, different types of norms considered in this paper are computed as shown in Tables 2 and 3. We can confirm from these tables the relations in (5.7)–(5.9), as well as (3.18). In addition, the L2/L1 Hankel norm can be computed (not only by (4.14) as in Tables 2 and 3, but also) by applying (2.4) through the use of the quasi L2/L1 Hankel norms for Θ∈[0,h) that are shown in Figures 2 and 3 for Example 1 (to which Figure 4 is also relevant, as mentioned later) and in Figures 5 and 6 for Example 2**, and can be confirmed to coincide with the L2/L1 induced norm in both examples (see Corollary 1), as seen from Tables 2 and 3. Furthermore, we can confirm the inequality ‖H[Θ]2/(1,p)‖≥μ1/2p(H(Θ)) in (4.22) in all of these figures.
**More precisely, these figures (as well as the figure for Example 3) for the quasi L2/L1 Hankel norms are actually drawn in such a way that the value at Θ=0 actually corresponds to ‖H[+0]2/(1,p)‖ rather than ‖H[0]2/(1,p)‖; by directly taking the value of ‖H[0]2/(1,p)‖ instead, we have confirmed in all examples that (4.38) and (4.39) hold, but the curve of the quasi L2/L1 Hankel norms then becomes messy, and this is why the figures are drawn in the aforementioned fashion.
Furthermore, we can confirm from Figures 2, 3, 5 and 6 the inequality (4.22), i.e., ‖H[Θ]2/(1,p)‖≥μ1/2p(H(φ)) for φ=Θ. Nevertheless, taking the supremum for φ=Θ∈[0,h) on both sides yields the identical value, as seen from these figures, and this is consistent with (2.4) and (4.14) on the L2/L1 Hankel norm. In connection with this consistency, we note that (2.4) with ‖H[Θ]2/(1,p)‖ is more directly related to determining the existence (and the value, if one exists) of a critical instant, and we see from the relevant curves in Figures 2 and 3, as well as Figures 5 and 6, that ΣSD has a critical instant at Θ=0 in Example 1 and Example 2, both for p=1 and p=2 (we can confirm in Figures 5 and 6 that the quasi L2/L1 Hankel norm at Θ tends to that at Θ=0 as Θ→h−0). In particular, the observation associated with Example 1 confirms the assertion in Theorem 5, which tackles the existence of a critical instant through the less relevant quantity H(φ). The observation associated with Example 2, on the other hand, implies that a converse type of the assertion in Theorem 5 does not hold, in general. That is, a critical instant can exist even if μ1/2p(H(φ)) is not maximum-attaining on [0,h); this is asserted in a more precise manner in Theorem 6, and Figures 5 and 6 confirm that Θ=0 is indeed a critical instant for such a case, both for p=1 and p=2. To summarize, Examples 1 and 2 are consistent with Corollary 2. Furthermore, Figures 2, 3, 5 and 6 are consistent with Corollary 3.
As an additional remark, we note that, for each interval of Θ (=φ) in Figures 2, 3, 5 and 6 such that the strict version of the inequality (i.e., ‖H[Θ]2/(1,p)‖>μ1/2p(H(φ))) actually holds in (4.22), it has the meaning that, for each Θ in the interval, the worst timing at which a unit impulse is (virtually) applied in the sense of the quasi L2/L1 Hankel norm at Θ is not immediately before Θ; more precisely, for Θ in such an interval, the corresponding ϕ∈(0,h] introduced above (4.9) is given by ϕ>0 (rather than "ϕ=+0"). This can be seen by noting that Fφ,0(φ)=H(φ) on the right-hand side of (4.12).
On the other hand, we can confirm from Figures 4 and 7 (respectively for Examples 1 and 2, and relevant to the comparison between the L2/L1 induced norm and the H2 norm) the inequality f1/2tr(φ)≥tr1/2(H(φ)) given in (5.11). In spite of this, taking the supremum in φ over [0,h) on both sides leads to the identical value, as seen from these figures, which corresponds to the equality in (5.8) about two different types of H2 norm of ΣSD. For each interval of ϕ in Figures 4 and 7 such that the strict version of the inequality (i.e., f1/2tr(φ)>tr1/2(H(φ))) actually holds in (5.11), it has a similar meaning to what was mentioned above. That is, for each Θ in the interval, the worst timing before Θ at which a unit impulse is applied (in the sense of evaluating the square root of ∑nwi=1‖zi‖2L2[Θ,∞), with zi being the response for the unit impulse applied to the ith input) is not immediately before Θ. This can be seen by noting that Fφ,0(φ)=H(φ) on the right-hand side of (5.4).
We finally give an example in which a nonzero critical instant exists.
Example 3. Consider the stable single-input (nw=1) LTI sampled-data system with h=0.2 and
We can readily ascertain from Figure 8 that this sampled-data system has a nonzero critical instant, unlike the two preceding examples for which only Θ=0 is a critical instant. In addition, we see that both inequalities in (4.22) hold as equalities for every Θ∈[0,h) in this example.
7.
Discussions and conclusions
This paper studied the L2/L1 induced norm and the L2/L1 Hankel norm for stable sampled-data systems through the application of an operator-theoretic treatment of their input-output relation via the lifting technique [17,18,30]. These norms have been characterized through the use of the positive-semidefinite matrix-valued function H(φ) in (3.15) that is defined over the sampling interval [0,h) (in such a way that the numerical computation of these norms can readily be carried out), as well as shown to coincide with each other, as in the case of stable continuous-time LTI systems. Very importantly, it is the intrinsic h-periodic nature of the input-output relation of sampled-data systems that leads us to the treatment of such a matrix-valued function, and, in this respect, it would be quite helpful to note the following standpoint of this paper in contrast to the L∞/L2 viewpoint, which led to another matrix-valued function F(θ) in (5.12).
Namely, this study was motivated by the fundamental facts for stable continuous-time LTI systems, particularly that (i) the L∞/L2 induced norm and the L∞/L2 Hankel norm coincide with each other and, for the single-output case, further equal another important quantity of the H2 norm, (ii) the L2/L1 induced norm and the L2/L1 Hankel norm coincide with each other and, for the single-input case, also equal the H2 norm, together with the further fact for sampled-data systems that (iii) the L∞/L2 induced norm and the L∞/L2 Hankel norm again coincide with each other, and their relationship to one definition of the H2 norm for sampled-data systems (given in [11,12]) has been discussed in an earlier study [14]. More specifically, this paper was naturally motivated by these series of facts, and we were led to studying the L2/L1 induced and Hankel norms as well for stable sampled-data systems. In other words, as a result of applying different treatments through another alternative viewpoint of tackling the h-periodic nature of the input-output relation of sampled-data systems from the L2/L1 viewpoint, we were instead led to H(φ), rather than F(θ).
Regarding the motivation and standpoint mentioned above, we have indeed arrived at clarifying new aspects on the H2 norms for sampled-data systems and their relevant norms, as discussed in Section 5 (in particular, Remark 4), and whose brief summary is as follows. In the study of the L∞/L2 induced/Hankel norms for sampled-data systems [14,25,26], their relationship has been clarified only with respect to the specific type of H2 norm of sampled-data systems given in [11,12]. However, it is important to note that there are other possible alternatives for the definition of the H2 norm for sampled-data systems. In this regard, the arguments of this paper have indeed suggested the introduction of further alternative possible definitions of the H2 norm through the L2/L1 viewpoint, as well as showed that some explicit relations can be established among the L2/L1 induced/Hankel norm and different definitions of the H2 norms, that is, if the L2/L1 viewpoint is taken. In this sense, it was shown that the L2/L1 induced norm has a closer relationship to all these H2 norms than the L∞/L2 induced norm in sampled-data systems.
Regarding the main issue of the L2/L1 Hankel norm analysis studied in this paper, the quasi L2/L1 Hankel norm at each Θ∈[0,h) was also characterized, and a relevant problem on the existence of a critical instant was also tackled. In particular, it was shown that a critical instant Θ always exists in the L2/L1 Hankel norm analysis, as in the L2/L2 Hankel norm analysis [27], but unlike a seemingly more related problem of the L∞/L2 Hankel norm analysis [25,26]. It was also shown that H(φ) can give some information on the locations of critical instants Θ. However, it remains open whether all of the critical instants can be detected only through the analysis of H(φ), whose definition does not involve Θ at all. In the L∞/L2 Hankel norm analysis, positive answers to a parallel question in terms of F(θ) have been derived in [26] under mild assumptions, and investigating whether a similar result can be derived in the L2/L1 setting will be a future topic.
Finally, we remark that, by following the same line as previous studies [36,37] on the analysis and synthesis of sampled-data systems under the specific H2 norm given in [14], we can tackle the problem of controller synthesis stabilizing the closed-loop system and minimizing the L2/L1 induced/Hankel norm for sampled-data systems. The details will be reported independently as a future topic.
Appendix
A. Rigorous arguments on (3.3)
This appendix is devoted to a more rigorous derivation of (3.3).
Since ∑Kk=0‖ˆzk‖2≤∑K′k=0‖ˆzk‖2 whenever K≤K′, it is obvious that
Since this is true for each nonnegative integer K, letting K→∞ leads to (3.3).
Remark 6. The above arguments implicitly assume that F is a bounded operator, but the independent arguments after (3.5) indeed establish that this is indeed the case.
B. Supplementary arguments on the impulse response treatment
This appendix is devoted to showing how dealing with L1 rigorously in our problem setting eventually leads, as a sort of limit, to an interpretation that is equivalent to the impulse response treatment suggested in Section 3.
Recall that the arguments therein were confined to the situation in which the input is concentrated on a single entry of w, as well as on an infinitesimally small interval around the worst timing φ in the interval [0,h). Thus, we assume that only the jth entry of w is nonzero, and this input, denoted by w(j), is possibly nonzero only on the interval [φ,φ+ϵ) for a given φ∈[0,h) and a sufficiently small ϵ>0. The corresponding response of z can be described by Bˆw0 (or by CAkBˆw0, more precisely speaking) and Dˆw0 by (2.6), and thus more explicitly by ∫φ+ϵφB(j)h(τ)ˆw(j)0(τ)dτ and ∫φ+ϵφD(j)θ(τ)ˆw(j)0(τ)dτ, because of (2.8), (2.10), (3.11) and (3.13), where B(j)h(⋅) and D(j)θ(⋅) denote the jth columns of Bh(⋅) and Dθ(⋅), respectively.
The subsequent arguments follow three steps. In the first step, we give the motivation and rationale for the approximations in which B(j)h(τ) and D(j)θ(τ) are equivalently regarded as taking the constant values B(j)h(φ) and D(j)θ(φ), respectively, or, more precisely,
and
(and state why such an approach could roughly be interpreted as an impulse response treatment). These two approximations are handled separately in Steps 1a and 1b. In the second step, we discuss how the treatment of B(j)h(τ) and D(j)θ(τ) through the use of B(j)h(φ) and D(j)θ(φ), respectively, leads to approximation errors in the framework of rigorous treatment of L1. The two cases for B(j)h(τ) and D(j)θ(τ) are handled separately in Steps 2a and 2b. Finally, the third step completes the arguments by combining the preceding arguments.
Step 1a. Regarding ∫φ+ϵφB(j)h(τ)ˆw(j)0(τ)dτ, we readily see that
By the continuity of Bh(⋅) on [0,h), we have
Roughly speaking, this implies that we can have the aforementioned approximation (7.2) for a sufficiently small ϵ>0, wherein B(j)h(τ), τ∈[φ,φ+ϵ) are replaced by a single quantity B(j)h(φ). Furthermore, in the sense that we only deal with B(j)h(φ), this approximation can be interpreted as considering the situation in which ˆw(j)0 is the unit impulse applied at t=φ.
Step 1b. On the other hand, we also have
For θ<φ (≤τ), both sides of the above inequality are zero by the definition (3.13) of Dθ(τ), while, for θ>φ, it follows from the continuity of Dθ(τ) on the sufficiently small interval [φ,φ′) that
for each θ∈[0,h) and every φ∈[0,h). This implies, roughly speaking, that we can have the aforementioned approximation (7.3) for a sufficiently small ϵ>0 (except at θ=φ, or as long as we consider integrals with respect to θ as in the computation of the L2 norm of z), and, in the sense that we only deal with D(j)θ(φ), this approximation can be interpreted as considering the same situation as above, i.e., ˆw(j)0 is the unit impulse applied at t=φ.
More precisely, our arguments in L1 proceed as follows on the basis of the above approximation ideas.
Step 2a. Regarding (7.4), it follows readily from (3.11) that there exists a positive function ηB,φ(ϵ) such that
and ηB,φ(ϵ)→0 uniformly with respect to φ∈[0,h) as ϵ→+0. Hence, we have
so that, if (ˆw(j)0(τ)=0 for τ∉[φ,φ+ϵ) and) ∫φ+ϵφ|ˆw(j)0(τ)|dτ≤1, then the triangle inequality leads to
for some αB,ϵ,φ(⋅) such that |αB,ϵ,φ(ˆw(j)0)|≤1. Essentially the same arguments lead to
for some αB,ϵ,φ,θ,k(⋅) such that |αB,ϵ,φ,θ,k(ˆw(j)0)|≤1 if ∫φ+ϵφ|ˆw(j)0(τ)|dτ≤1, and ηB,φ,θ,k(ϵ)→0 uniformly with respect to φ∈[0,h) and θ∈[0,h) as ϵ→+0. Hence, both ∫h0ηB,φ,θ,k(ϵ)dθ and ∫h0ηB,φ,θ,k(ϵ)2dθ tend to 0 uniformly with respect to φ∈[0,h) as ϵ→+0.
Step 2b. Regarding (7.6), on the other hand, we first note that, if τ≤θ, then θ≥φ; thus, it readily follows from (3.13) that there exists a positive function ηD,φ,θ(ϵ) such that
where the left-hand side is taken for τ within the intersection of [φ,φ+ϵ) and τ≤θ (i.e., the interval [φ,θ] if θ∈[φ,φ+ϵ), and the interval [φ,φ+ϵ) if θ≥φ+ϵ); also note that ηD,φ,θ(ϵ)→0 uniformly with respect to φ∈[0,h) and θ∈[0,h) as ϵ→+0. If τ>θ, on the other hand, we have that D(j)θ(τ)=0; thus, it readily follows again from (3.13) that there exists a positive constant ηD′,φ,θ such that
where the left-hand side is taken for τ within the intersection of [φ,φ+ϵ) and τ>θ (i.e., the interval (θ,φ+ϵ), assuming that θ∈[φ,φ+ϵ)) and ηD′,φ,θ is bounded with respect to φ∈[0,h) and θ∈[0,h).
Combining the above arguments for τ∈[φ,θ] and τ∈(θ,φ+ϵ) for θ∈[φ,φ+ϵ), and noting that (10.11) holds also for θ≥φ+ϵ, it follows that
satisfies that ηD",φ(ϵ)≤maxm=1,2max(ϵ(ηD,φ,θ(ϵ)+ηD′,φ,θ)m,hηD,φ,θ(ϵ)m); thus, ηD",φ(ϵ)→0 uniformly with respect to φ as ϵ→+0. Furthermore, it follows from the triangle inequality (together with (7.12) and (7.13) and essentially the same technique for the derivation of (7.10)) that, if ∫φ+ϵφ|ˆw(j)0(τ)|dτ≤1, then
for some αD,ϵ,φ,θ(⋅) such that |αD,ϵ,φ,θ(ˆw(j)0)|≤1 whenever ∫φ+ϵφ|ˆw(j)0(τ)|dτ≤1. Here, the aforementioned uniform convergence property of ηD",φ(ϵ) about (7.14) implies that ˉηD,φ,θ(ϵ) is such a function that both ∫hφˉηD,φ,θ(ϵ)dθ and ∫hφˉηD,φ,θ(ϵ)2dθ tend to 0 uniformly with respect to φ∈[0,h) as ϵ→+0. For the case of (7.15), we have thus arrived at a situation that is parallel to what has been established in Step 2a for the case of (7.11).
Step 3. We are in the final position to complete the justification of the treatment developed in Section 3. The square of the L2 norm of the output z corresponding to ˆw(j)0 can be computed by using (7.11) and (7.15), where the maximum of |∫φ+ϵφˆw(j)0(τ)dτ| under the assumption that ∫φ+ϵφ|ˆw(j)0(τ)|dτ≤1 is (attained and) obviously 1. Furthermore, recall that the L2/L1 induced norm ‖ΣSD‖2/1 was associated with the situation with an infinitesimally small ε. Thus, by considering ˆw(j)0 such that ∫φ+ϵφ|ˆw(j)0(τ)|dτ=1 and |∫φ+ϵφˆw(j)0(τ)dτ|=1 for each ϵ, and by taking the limit ϵ→+0 with the uniform convergence stated at the end of Steps 2a and 2b taken into account, it follows that the square of the aforementioned L2 norm tends to
since D(j)θ(φ)=0 for θ<φ according to (3.13), where the notation on the left-hand side is used because the right-hand side equals the jth diagonal entry of the matrix H(φ) defined in (3.15). Furthermore, it is easy to see that this value cannot be exceeded through the use of ˆw(j)0 such that |∫φ+ϵφˆw(j)0(τ)dτ|<1. Since the L2/L1 induced norm was further associated with the worst timing φ∈[0,h), together with the worst j among j=1,…,nw, we have justified Theorem 1 through no use of an actual treatment of directly applying an impulse input.
C. Proof of Corollary 4
This appendix is devoted to the proof of Corollary 4. We first consider the case of p=1. The proof proceeds in three steps.
Step 11. Suppose that the condition (ⅰ-1) fails, i.e., suppose that either of the following conditions holds:
(a-1) there exists no j=1,…,nw such that P(j)11=0;
(b-1) (a-1) is not true, and the jth diagonal entry of H(0) satisfies that H(j)(0)≠μp(H(0)) whenever P(j)11=0.
For the case (a-1), the assertion (Ⅱ) of Theorem 8 implies that
Hence, the proof for p=1 will be completed by showing that the same inequality follows also for the case (b-1), which can be rephrased equivalently as
(b-1)' (a-1) is not true, and (H(j)(0))1/2<μ1/2p(H(0))=‖ΣSD‖H,2/(1,p) whenever P(j)11=0
because H(j)(0)≤μp(H(0)) according to the definition of μp(⋅) for p=1, and μ1/2p(H(0))=‖ΣSD‖H,2/(1,p) by the hypothesis of Theorem 8 (which obviously is included implicitly in the hypothesis of this corollary) and (4.14). Now, let j be as required in (b-1)', i.e., P(j)11=0 and (H(j)(0))1/2<μ1/2p(H(0))=‖ΣSD‖H,2/(1,p), and we show that
under the hypothesis of this corollary (which includes the hypothesis about the assertion (Ⅱ) of Theorem 8). To this end, suppose the contrary to (7.18), which, obviously, yields sup0≤φ<h(H(j)(φ))1/2=‖ΣSD‖H,2/(1,p) by (4.14). Since ‖ΣSD‖H,2/(1,p)=μ1/2p(H(0))>μ1/2p(H(h)) by the hypothesis, we have that μ1/2p(H(0))>(H(j)(h))1/2; thus, assuming that sup0≤φ<h(H(j)(φ))1/2=‖ΣSD‖H,2/(1,p) leads to sup0≤φ<h(H(j)(φ))1/2=‖ΣSD‖H,2/(1,p)>(H(j)(h))1/2. This obviously implies the existence of φ0∈[0,h) such that ‖ΣSD‖H,2/(1,p)=(H(j)(φ0))1/2, but such φ0 cannot be zero according to (b-1)', i.e., φ0∈(0,h). On the other hand, since (H(j)(φ0))1/2≤μ1/2p(H(φ0)), we have that ‖ΣSD‖H,2/(1,p)≤μ1/2p(H(φ0)) given the fact that ‖ΣSD‖H,2/(1,p)=(H(j)(φ0))1/2 mentioned just above; however, the strict inequality cannot hold because of (4.14), which implies that φ0∈(0,h) is a maximum-attaining point of μ1/2p(H(φ)) (and further implies that ‖ΣSD‖H,2/(1,p)=μ1/2p(H(φ0))). However, this contradicts the hypothesis of this corollary; thus, we have established (7.18).
Step 21. The inequality (7.18) implies that the existence of the jth input channel of ΣSD has no direct contribution to the value of ‖ΣSD‖H,2/(1,p), and this fact, in turn, implies that the existence of the jth input channel has no influence on whether or not ‖ΣSD‖H,2/(1,p) is attained as ‖H[0]2/(1,p)‖. From this observation, let us consider the sampled-data system ΣSD with its jth input channel removed for each j such that P(j)11=0 and (H(j)(0))1/2<μ1/2p(H(0)). The set of all such j is denoted by J, and the resulting sampled-data system is denoted by ΣSD⟨J⟩, where it should be noted that J can never coincide with the entire set {1,…,nw}; this is because (H(j)(0))1/2<μ1/2p(H(0)) for all j=1,…,nw obviously contradicts the definition of μp(⋅) for p=1. Hence, ΣSD⟨J⟩ does make sense. Let the H(φ) corresponding to ΣSD⟨J⟩ be denoted by H⟨J⟩(φ). It is then obvious from the above arguments, together with (4.14), that
where the second equality is a direct consequence of the above arguments, together with (4.14), and the last equality follows from the hypothesis of Theorem 8. Furthermore, the inequality in (b-1)' implies that "removing the jth input channel for j∈J does not affect μ1/2p(H(0)), " which, precisely speaking, means that μ1/2p(H(0))=μ1/2p(H⟨J⟩(0)), because H⟨J⟩(φ) is nothing but the submatrix of H(φ) that is obtained by removing its jth row and column for all j∈J. Hence, it follows from (7.19) that
(i.e., μ1/2p(H⟨J⟩(φ)) attains the maximum over φ∈[0,h) at φ=0) and, also,
where the strict inequality follows from the hypothesis about the assertion (Ⅱ) of Theorem 8.
Step 31. Now, (7.20) and (7.21) imply that ΣSD⟨J⟩ satisfies the hypothesis corresponding to the assertion (Ⅱ) of Theorem 8; thus, we can apply it to ΣSD⟨J⟩. Noting that the present proof was initiated under the hypothesis of this corollary that (i-1) fails for the original sampled-data system ΣSD (which means either (a-1) or (b-1)), and further noting that the construction of ΣSD⟨J⟩ rules out the condition (b-1), we see that the condition (ⅱ-1) (corresponding to the condition (a-1)) must be satisfied for ΣSD⟨J⟩; thus, we are led to ‖H[0]2/(1,p)⟨J⟩‖<‖ΣSD⟨J⟩‖H,2/(1,p), where the right-hand side denotes the quasi L2/L1,p Hankel norm at 0 for ΣSD⟨J⟩. This immediately implies that
by (7.19). We finally derive (7.17) from the above inequality by supposing the contrary to (7.17), which is obviously ‖H[0]2/(1,p)‖=‖ΣSD‖H,2/(1,p) by (2.4), and showing that we are then led to contradiction. To this end, we first note from (4.12) that ‖H[0]2/(1,p)‖=sup0≤φ<hμ1/2p(F0,1(φ)) and ‖H[0]2/(1,p)⟨J⟩‖=sup0≤φ<hμ1/2p(F0,1⟨J⟩(φ)), where F0,1⟨J⟩(φ) is the submatrix of F0,1(φ) that is obtained by removing its jth row and column for all j∈J. Hence, if ‖H[0]2/(1,p)‖=‖ΣSD‖H,2/(1,p) in spite of (7.22), then there must exist some j∈J such that ‖H[0]2/(1,p)‖=sup0≤φ<h(F(j)0,1(φ))1/2. By (4.17), this implies that ‖H[0]2/(1,p)‖≤sup0≤φ<h(H(j)(φ))1/2; thus, ‖ΣSD‖H,2/(1,p)≤sup0≤φ<h(H(j)(φ))1/2 by the assumption ‖H[0]2/(1,p)‖=‖ΣSD‖H,2/(1,p), but this contradicts (7.18). This completes the proof for p=1.
We next give the proof for the case of p=2. Even though it follows some similar arguments, it does not necessarily follow parallel steps to the arguments for p=1 to avoid some intrinsic issues relevant to the case of p=2, e.g., dealing with infinitely many v in connection with the treatment of μp(H(0)) through the use of vTH(0)v.
Step 12. Let us denote the maximum eigenvalue of H(0) by ¯λ. We first note that v with |v|2=1 satisfies that vTH(0)v=μp(H(0)) (=¯λ) if and only if v belongs to the eigenspace W¯λ of H(0) that corresponds to the eigenvalue ¯λ. In other words, v satisfies that vTH(0)v=¯λ|v|22 if and only if v∈W¯λ, which implies that the set of v such that vTH(0)v=¯λ|v|22 forms a linear space, which is nothing but W¯λ. On the other hand, the set of v such that P11v=0 is also a linear space, which we denote by W0. The hypothesis of this corollary that (ⅰ-2) fails for the sampled-data system ΣSD means that W¯λ∩W0={0}. Let vi (i=1,…,r¯λ) be orthonormal vectors spanning W¯λ, and let us take an orthogonal matrix V=[¯V V_] such that the columns of ¯V are given by these vectors. We further consider the sampled-data system ΣSD with B1 replaced by B1V, which means that P11 is replaced by P11V (and P21 is replaced by P21V). The resulting sampled-data system is denoted by ΣSDV. Noting that the orthogonal matrix V does not affect the quasi L2/L1,p Hankel norm and the L2/L1,p Hankel norm for the case of p=2, it follows that it suffices to justify the assertion of this corollary when the given sampled-data system is actually the modified sampled-data system ΣSDV. Note that v′ satisfies that (P11V)v′=0 for the modified sampled-data system ΣSDV if and only if Vv′∈W0, while H(φ) corresponding to ΣSDV, denoted by HV(φ), satisfies that v′HV(0)v′=μp(HV(0))|v′|22 (=¯λ|v′|22) if and only if Vv′∈W¯λ, because HV(φ) is obviously given by VTH(φ)V, i.e., because v′HV(0)v′=(Vv′)TH(0)(Vv′). These observations imply the natural consequence that the condition (ⅰ-2) of Theorem 8 also fails in the modified sampled-data system ΣSDV, because W¯λ∩W0={0}, as mentioned above.
Here, we may assume, without loss of generality, that r¯λ<nw. This is because, otherwise, W¯λ equals Rnw, i.e., every v such that |v|2=1 satisfies that vTH(0)v=μp(H(0)) according to the definition of W¯λ, which in turn implies that the failing of the condition (ⅰ-2) in the original sampled-data system ΣSD immediately implies that it satisfies the condition (ⅱ-2), leading to the conclusion that ‖H[0]2/(1,p)‖<‖ΣSD‖H,2/(1,p) by the assertion (Ⅱ) of Theorem 8; hence, the proof of this corollary is completed in such a case.
With the above arguments in mind, let J:={r¯λ+1,…,nw} and consider the modified sampled-data system ΣSDV with the jth input channel removed for j∈J (which is nothing but ΣSD¯V). Furthermore, this is also nothing but ΣSD⟨J⟩ in the notation introduced in the arguments for p=1, provided that the modified sampled-data system ΣSDV is identified with the original sampled-data system ΣSD (for the aforementioned reason that the orthogonal matrix V does not affect the quasi L2/L1,p Hankel norm and the L2/L1,p Hankel norm for the case of p=2), and we indeed do so in the following arguments. It is then obvious that H(φ) corresponding to ΣSD⟨J⟩, denoted by H⟨J⟩(φ), is given by ¯VTH(φ)¯V, with H(φ) being the one that is defined for the original sampled-data system ΣSD.
Before dealing with ΣSD⟨J⟩, however, we first establish
by following the arguments similar to those used in the case of p=1 for the derivation of (7.18). To this end, suppose the contrary to the above inequality for some v∉W¯λ with |v|2=1. Then, by using this v and replacing H(j)(φ) with vTH(φ)v (including the case in which φ is actually h or φ0) in the arguments for p=1, and further noting that μ1/2p(H(h))≥(vTH(h)v)1/2 because |v|2=1, we can readily see that ‖ΣSD‖H,2/(1,p)=(vTH(φ0)v)1/2 for some φ0∈[0,h). However, since ‖ΣSD‖H,2/(1,p)=μ1/2p(H(0)) by the hypothesis of Theorem 8, such φ0 cannot be zero because v∉W¯λ. Hence, we have that φ0>0, but this obviously contradicts the hypothesis of this corollary. Thus, we have established (7.23).
Step 22. The inequality (7.23) implies that, in an equivalent impulse response interpretation for the quasi L2/L1 Hankel norm and the L2/L1 Hankel norm, "applying an impulse input from the direction that does not belong to W¯λ does not contribute to the values of these norms", which, more precisely, implies (the second equality of) (7.19) for the case of p=2. More formally, we see that the maximum of sup0≤φ<h(vTH(φ)v)1/2 over v with |v|2=1 is attained at v∈W¯λ because sup0≤φ<hμ1/2p(H(φ))=μ1/2p(H(0)) by the hypothesis of Theorem 8; this, in turn, implies that sup0≤φ<hμ1/2p(H⟨J⟩(φ))=sup0≤φ<hμ1/2p(H(φ)) because H⟨J⟩(φ)=¯VTH(φ)¯V and Im¯V=W¯λ, where Im¯V denotes the image of ¯V. Thus, we are led to
where the last equality follows again from the hypothesis. Furthermore, (7.21) follows for the same reason (i.e., H⟨J⟩(0)=¯VTH(0)¯V). In particular, the equality in (7.21), together with (7.24), leads to (7.20).
Step 32. As in the case of p=1, it is important to note that (7.20) and (7.21) established in the above step imply that the hypothesis about the assertion (Ⅱ) of Theorem 8 is satisfied for ΣSD⟨J⟩. It is also important to note that, since the condition (ⅰ-2) fails for the original ΣSD by the hypothesis of this corollary, it suffices to consider only ΣSD⟨J⟩ for the original sampled-data system ΣSD satisfying the following condition:
(b-2) for each v∈W¯λ (or, equivalently, (vTH(0)v)1/2=μ1/2p(H(0))=‖ΣSD‖H,2/(1,p)) with |v|2=1, the condition P11v≠0 is satisfied.
Since H(φ) corresponding to ΣSD⟨J⟩ is given by ¯VTH(φ)¯V and Im¯V=W¯λ, and since P11 corresponding to ΣSD⟨J⟩ is given by P11¯V, with P11 being the one that is defined for the original sampled-data system ΣSD, we readily see, under the above condition (b-2), that the condition (ⅰ-2) fails, but the condition (ⅱ-2) does hold when Theorem 8 is applied to ΣSD⟨J⟩. Hence, we are led to ‖H[0]2/(1,p)⟨J⟩‖<‖ΣSD⟨J⟩‖H,2/(1,p), and this, in turn, together with (7.24), implies (7.22) for the case of p=2, i.e., ‖H[0]2/(1,p)⟨J⟩‖<‖ΣSD‖H,2/(1,p), as in the case of p=1.
We finally establish (7.17), i.e., ‖H[0]2/(1,p)‖<‖ΣSD‖H,2/(1,p), also for p=1, by supposing its contrary, which, obviously, is ‖H[0]2/(1,p)‖=‖ΣSD‖H,2/(1,p), and by showing that we are then led to contradiction. To this end, we first note from (4.12) that ‖H[0]2/(1,p)‖=sup0≤φ<hμ1/2p(F0,1(φ)); thus, ‖H[0]2/(1,p)⟨J⟩‖=sup0≤φ<hμ1/2p(F0,1⟨J⟩(φ)), where F0,1⟨J⟩(φ)=¯VTF0,1(φ)¯V. Hence, if ‖H[0]2/(1,p)‖=‖ΣSD‖H,2/(1,p) in spite of (7.22), i.e., ‖H[0]2/(1,p)⟨J⟩‖<‖ΣSD‖H,2/(1,p), then it implies that
Since Im¯V=W¯λ, the above inequality implies that the maximum of sup0≤φ<h(vTF0,1(φ)v)1/2 over v with |v|2=1 is attained at v∉W¯λ, and such v further satisfies that ‖H[0]2/(1,p)‖=sup0≤φ<h(vTF0,1(φ)v)1/2 by the above equality. By (4.17), this implies that ‖H[0]2/(1,p)‖≤sup0≤φ<h(vTH(φ)v)1/2; thus, ‖ΣSD‖H,2/(1,p)≤sup0≤φ<h(vTH(φ)v)1/2 by the assumption that ‖H[0]2/(1,p)‖=‖ΣSD‖H,2/(1,p). However, since v∉W¯λ and |v|2=1, this contradicts (7.23); hence, the proof for p=2 is completed.
Use of AI tools declaration
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
Acknowledgments
This work was supported in part by JSPS KAKENHI Grant Number 22K04153.
Conflict of interest
The authors declare no conflict of interest that may influence the publication of this paper.