Loading [MathJax]/jax/output/SVG/jax.js
Research article

Adaptive learning nonsynchronous control of nonlinear hidden Markov jump systems with limited mode information


  • Received: 14 May 2023 Revised: 14 September 2023 Accepted: 25 September 2023 Published: 19 October 2023
  • In this paper, an adaptive neural network learning based nonsynchronous control method is developed for hidden Markov jump systems with unmodeled nonlinear dynamics. In particular, the system modes are not directly accessible and the limited mode information can be partly estimated by the nonsynchronous controller. More precisely, the mode information with partly accessible transition rates is utilized based on the transition probability matrix. Moreover, the unmodeled nonlinear dynamics are more general in practical applications. Based on the designed mode-dependent controllers with mode observation, sufficient conditions are first exploited by means of the Lyapunov method, such that the desired control performance could be ensured in the mean-square sense. Then, the nonsynchronous mode-dependent controllers are further determined in terms of convex optimization. In the end, our proposed control strategy is applied to a robotic manipulator with varying loads to validate the feasibility with simulation results.

    Citation: Chao Ma, Hang Gao, Wei Wu. Adaptive learning nonsynchronous control of nonlinear hidden Markov jump systems with limited mode information[J]. Electronic Research Archive, 2023, 31(11): 6746-6762. doi: 10.3934/era.2023340

    Related Papers:

    [1] Yang Song, Beiyan Yang, Jimin Wang . Stability analysis and security control of nonlinear singular semi-Markov jump systems. Electronic Research Archive, 2025, 33(1): 1-25. doi: 10.3934/era.2025001
    [2] Hankang Ji, Yuanyuan Li, Xueying Ding, Jianquan Lu . Stability analysis of Boolean networks with Markov jump disturbances and their application in apoptosis networks. Electronic Research Archive, 2022, 30(9): 3422-3434. doi: 10.3934/era.2022174
    [3] Qianqian Zhang, Mingye Mu, Heyuan Ji, Qiushi Wang, Xingyu Wang . An adaptive type-2 fuzzy sliding mode tracking controller for a robotic manipulator. Electronic Research Archive, 2023, 31(7): 3791-3813. doi: 10.3934/era.2023193
    [4] Xinling Li, Xueli Qin, Zhiwei Wan, Weipeng Tai . Chaos synchronization of stochastic time-delay Lur'e systems: An asynchronous and adaptive event-triggered control approach. Electronic Research Archive, 2023, 31(9): 5589-5608. doi: 10.3934/era.2023284
    [5] Dandan Zuo, Wansheng Wang, Lulu Zhang, Jing Han, Ling Chen . Non-fragile sampled-data control for synchronizing Markov jump Lur'e systems with time-variant delay. Electronic Research Archive, 2024, 32(7): 4632-4658. doi: 10.3934/era.2024211
    [6] Lingling Zhang . Vibration analysis and multi-state feedback control of maglev vehicle-guideway coupling system. Electronic Research Archive, 2022, 30(10): 3887-3901. doi: 10.3934/era.2022198
    [7] Lichao Feng, Dongxue Li, Chunyan Zhang, Yanmei Yang . Note on control for hybrid stochastic systems by intermittent feedback rooted in discrete observations of state and mode with delays. Electronic Research Archive, 2024, 32(1): 17-40. doi: 10.3934/era.2024002
    [8] Yueying Liu, Mengping Sun, Zhen Wang, Xiangyun Lin, Cuihua Zhang . Nash equilibrium strategies for non-zero-sum differential games of SDEs with time-varying coefficient and infinite Markov jumps. Electronic Research Archive, 2025, 33(4): 2525-2542. doi: 10.3934/era.2025112
    [9] Ramalingam Sakthivel, Palanisamy Selvaraj, Oh-Min Kwon, Seong-Gon Choi, Rathinasamy Sakthivel . Robust memory control design for semi-Markovian jump systems with cyber attacks. Electronic Research Archive, 2023, 31(12): 7496-7510. doi: 10.3934/era.2023378
    [10] Hui Jiang, Di Wu, Xing Wei, Wenhao Jiang, Xiongbo Qing . Discriminator-free adversarial domain adaptation with information balance. Electronic Research Archive, 2025, 33(1): 210-230. doi: 10.3934/era.2025011
  • In this paper, an adaptive neural network learning based nonsynchronous control method is developed for hidden Markov jump systems with unmodeled nonlinear dynamics. In particular, the system modes are not directly accessible and the limited mode information can be partly estimated by the nonsynchronous controller. More precisely, the mode information with partly accessible transition rates is utilized based on the transition probability matrix. Moreover, the unmodeled nonlinear dynamics are more general in practical applications. Based on the designed mode-dependent controllers with mode observation, sufficient conditions are first exploited by means of the Lyapunov method, such that the desired control performance could be ensured in the mean-square sense. Then, the nonsynchronous mode-dependent controllers are further determined in terms of convex optimization. In the end, our proposed control strategy is applied to a robotic manipulator with varying loads to validate the feasibility with simulation results.



    During these past years, a significant research trend is to investigate the intelligence-based strategies instead of conventional control approaches for control theory and engineering in many areas. For instance, much success has been achieved in robotic systems [1,2], power systems [3,4], traffic systems [5] and so on [6,7]. One of the primary purposes of intelligent control methods is to deal with uncertain or unknown nonlinear complex dynamics. Among the various methods, artificial neural network (NN) has emerged as an effective tool owing to its capability of approximating any continuous nonlinearities [8]. As a consequence, significant research results working on NN-based control approaches have been reported in the literature [9,10,11,12,13,14,15]. These remarkable investigations have proven that the integration of the controller with NN can well improve the dynamical control performance of complex nonlinear systems [16,17]. So far, there are two general and main learning strategies for the NN: the gradient-descent learning [18,19] and the adaptive learning methods [20,21]. Especially, the adaptive learning strategy has the main advantages of online characteristics and guaranteed accuracy, while the convergence of NN can be effectively ensured during its learning processes. Meanwhile, it is noticed that the NN approximation method would still lead to certain approximation error to some extent, such that only uniformly boundedness can be achieved for most closed-loop control systems. In order to overcome this problem, some notable methods have been developed for compensating the NN approximation errors, which include sliding mode control [22,23], H control [24] and so on.

    On another active research front, as a significant sort of hybrid systems, Markov jump systems (MJSs) have been extensively investigated in many practical scenarios, which is mainly due to their notable features in describing jumping system structures or parameters [25,26,27,28]. Generally speaking, the system mode jumping for subsystems of the MJSs is conducted according to the transition probability rates. As is well known, the transition probabilities of jumping modes are supposed to be prior known conditions for the MJSs [29,30]. Nevertheless, these assumptions are always difficult to be satisfied in most realistic applications, especially for certain limited or vulnerable networked control environments [31,32]. In this circumstance, some initial research interests have been paid to the hidden Markov jump systems (HMJSs), which implies that only limited mode information can be acquired for analysis and synthesis procedures [33,34]. Obviously, it is more challenging for the mode-dependent designs that depend upon system mode information, although mode-dependent controllers can lead to less conservatism. Hence, a recently developed mode-observation based control strategy is reported to release the limitation on mismatched modes between the true system modes and the observed system modes [35,36]. The key idea is to utilize the estimated system modes instead of the true system modes, such that a conditional probability based strategy can be developed. More precisely, a nonsynchronous feature would be encountered for the mode-dependent controllers under this context. As a result, the nonsynchronous controllers for HMJSs are reasonable and important. Moreover, it should be pointed out that the mode transition probabilities could also be further restricted with partly unknown cases. However, to the authors' best knowledge, there are few existing results on the nonsynchronous control issues of HMJSs with limited mode information, let alone that with unknown nonlinear dynamics.

    Motivated by the aforementioned insight, this paper presents the nonsynchronous control methodology of HMJS with unknown nonlinearities and external disturbances. In comparison with most reported works, our paper contributes to the literature with two main aspects and novelties. Firstly, a NN-based control strategy associated with the adaptive online learning law is introduced to handle the unmodeled nonlinear dynamics and external disturbances of HMJS. Especially, the mode-dependent nonlinearities in HMJS are taken into account for more applicability due to the mode jumping. Secondly, a novel nonsynchronous mode-dependent controller is applied to cope with the influences of limited mode information in HMJS by considering that the mode information between the HMJS and the corresponding controller could be different. Furthermore, the limited mode information is considered as the partly accessible transition rates, which are more general cases for HMJS in practical applications. Finally, the optimal H performance index is employed with the developed nonsynchronous controller for optimizing the NN approximation error and external disturbance, such that a desired control performance can be achieved.

    The rest of the article is organized as follows: in Section 2, some necessary preliminaries of HMJS are given and the nonsynchronous controller along with the online learning NN are introduced. Section 3 constructs the mode-dependent Lyapunov function and presents sufficient conditions for guaranteeing a desired control performance. Based on these derived conditions, the nonsynchronous controller and NN learning law design procedures are presented. In Section 4, an illustrative example of a robotic manipulator is provided to demonstrate the usefulness of the established theoretical findings. Finally, Section 5 draws the conclusions of this paper with further research perspectives.

    Notation: Rn stands for n-Euclidean space. Matrix P0 means that P is positive definite, tr(P) represents the trace of P and * represents the ellipsis parts in symmetric block matrices. (O,F,P) corresponds a probability space. E denotes mathematical expectation. All matrices are set to have compatible dimensions.

    Consider the following nonlinear HMJS:

    ˙x(t)=A(σt)x(t)+f(σt,x(t))+B(σt)u(t)+d(t), (2.1)

    where x(t)=[x1(t),x2(t),,xn(t)]TRn denotes the system state, f(σt,x(t)):R×RnRn represents an unknown nonlinear function, u(t)Rm stands for the control input and d(t)Rl is an unknown disturbance; σtS={1,,N} corresponds the continuous-time discrete-state Markov process in (O,F,P), whose transition probability matrix is defined by Π=(πij)N×N with

    Pr(σt+Δ=j|σt=i)={πijΔ+o(Δ), ij,1+πiiΔ+o(Δ), i=j, (2.2)

    and πii=Nj=1, ijπij. Consequently, A(σt) and B(σt) are known system matrices with fixed system mode σt. Furthermore, the infinitesimal operator L for function V(t,i) can be defined as follows:

    LV(t,i)=limΔ0+1Δ{E{V(t+Δ,i)|t}V(t,i)}. (2.3)

    In addition, the transition probability rates are further assumed to be partly accessible to describe limited mode information, i.e.,

    [π11????π2N?πN2πNN],

    such that we can denote Ri=RikRiuk, where Ri represents the ith row of Π and

    {Rik={j:πij is known},Riuk={j:πij is unknown}. (2.4)

    Without loss of generality, denote Ri as Ri={πk1,πk2,,πkL} with 0LN [37].

    Remark 1. It is worth mentioning that partly accessible transition rates are more general cases for MJSs with all known transition rates, especially for HMJSs. When Riuk=, all accessible transition rates can be derived accordingly.

    In order to deal with the unknown nonlinearity and disturbance, a dynamical NN Wφ(x(t)) is adopted to reconstruct approximation. Then, f(σt,x(t))+d(t) can be described by the following:

    f(σt,x(t))+d(t)=W(σt)φ(x(t))+e(t), (2.5)

    where e(t) implies the approximate error, W(σt)=argminW(σt,t)ΩW{supxΩxe(t)} denotes the learned NN weights for the optimized minimization value of e(t).

    Remark 2. It is noteworthy that the NNs have the ability to approximate unknown nonlinearities with high accuracy when the corresponding weights are well chosen. As a result, if the adaptive learning laws can be carefully designed, the desired nonlinearities reconstruction can be obtained accordingly.

    As a result, the approximation of f(σt,x(t))+d(t) can be described as follows:

    ˆf(σt,x(t))+ˆd(t)=ˆW(σt,t)φ(x(t)), (2.6)

    where ˆW(σt,t) represents the estimation of W(σt).

    Subsequently, the following nonsynchronous mode-dependent controller with NN is designed based on observed system mode information

    u(t)=K(rt)x(t)ˆW(σt,t)φ(x(t)), (2.7)

    where K(rt) denotes controller gains to be determined and rtT={1,,M} denotes the stochastic process with the following conditional probability:

    Pr{rt=ρ|σt=i}=λiρ,Mρ=1λiρ=1.

    Remark 3. It is noted that the controller only utilizes the observed mode information of true system modes with conditional probability. As a result, the nonsynchronous feature should be considered during the controller design. Meanwhile, the integration of dynamical NN in the controller design can further improve the applicability in certain practical applications.

    By applying the above controller, the closed-loop dynamics can be represented as follows:

    ˙x(t)=A(σt)x(t)+B(σt)K(rt)x(t)+˜W(σt,t)φ(x(t))+e(t), (2.8)

    where ˜W(σt,t)=W(σt)ˆW(σt,t).

    For simplicity, denote σt=i and rt=ρ, respectively. Then, we can rewrite system (2.8) by the following:

    ˙x(t)=A(i)x(t)+B(i)K(ρ)x(t)+˜W(i,t)φ(x(t))+e(t). (2.9)

    Before proceeding further, the following definitions are introduced to deal with the NN approximation error:

    Definition 1. System (2.8) is said to achieve stochastic stability in the mean-square sense if it holds that

    0E{x(t)2}dt<,

    for any initial conditions.

    Definition 2. System (2.8) is said to achieve H performance in the mean-square sense if there exists a constant γ>0 such that

    0E{xT(t)x(t)}dt<γ20E{eT(t)e(t)}dt, (2.10)

    for zero initial conditions.

    To this end, an important lemma is also employed for deriving the main results.

    Lemma 1. [38] Given real matrices A, B, C, X, W1, W2 with appropriate dimensions, if there exists a matrix P0 it satisfies that

    [PAT+AP+XBPCTW1W2W3]<0, (2.11)

    then there exist a matrix Z0 and a positive scalar μ>0 such that

    [ZZTZAT+P0ZCTZμ1P+XB00W1W20W30μP]. (2.12)

    This section derives sufficient conditions for the resulting closed-loop HMJS and designs the nonsynchronous mode-dependent controller along with the NN updating laws by proven details.

    Theorem 1. System (2.8) can achieve the mean-square H performance with given controller gain K(ρ), ρT and parameter γ>0, if there exist mode-dependent matrices P(i)0 and QT(i)=Q(i), iS, such that the following linear matrix inequalities hold, where

    Θ(i)=[2P(i)A(i)+2Mρ=1λiρP(i)B(i)K(ρ)+NjRikπij(P(j)Q(i))P(i)Iγ2I0I]0, (3.1)
    P(j)Q(i)0,jRiuk,ij, (3.2)
    P(j)Q(i)0,jRiuk,i=j. (3.3)

    Based on the above condition, the online NN learning law can be designed by

    ˙ˆW(i,t)=ΩP(i)x(t)φT(x(t)), (3.4)

    where Ω0 is a constant matrix.

    Proof 1. For given mode i, select the mode-dependent Lyapunov function candidate as follows:

    V(t,i)=xT(t)P(i)x(t)+tr(˜WT(i,t))Ω1(˜W(i,t)). (3.5)

    Taking the infinitesimal operator along with evolution of V(t,i) yields

    LV(t,i)=E{˙xT(t)P(i)x(t)+xT(t)P(i)˙x(t)+Nj=1πijxT(t)P(j)x(t)}+2tr(˙˜WT(i,t)Ω1˜W(i,t))=2xT(t)P(i)A(i)x(t)+2xT(t)P(i)˜W(i,t)φ(x(t))+2Mρ=1λiρxT(t)P(i)B(i)K(ρ)x(t)+Nj=1πijxT(t)P(j)x(t)+2xT(t)P(i)e(t)+2tr(˙˜WT(i,t)Ω1˜W(i,t)). (3.6)

    Subsequently, it can be verified that

    2xT(t)P(i)e(t)1γ2xT(t)P(i)P(i)x(t)+γ2e(t)e(t). (3.7)

    Moreover, based on the trace property of matrices, it holds that

    xT(t)P(i)˜W(i,t)φ(x(t))=tr{φ(x(t))xT(t)P(i)˜W(i,t)). (3.8)

    Hence, it can be obtained by the online learning law that

    LV(t,i)2xT(t)P(i)A(i)x(t)+2xT(t)P(i)˜W(i,t)φ(x(t))+2Mρ=1λiρxT(t)P(i)B(i)K(ρ)x(t)+Nj=1πijxT(t)P(j)x(t)+1γ2xT(t)P(i)P(i)x(t)+γ2e(t)e(t)+2tr(˙˜WT(i,t))Ω1(˜W(i,t))=2xT(t)P(i)A(i)x(t)+Nj=1πijxT(t)P(j)x(t)+2Mρ=1λiρxT(t)P(i)B(i)K(ρ)x(t)+1γ2xT(t)P(i)P(i)x(t)+γ2e(t)e(t), (3.9)

    which implies that when

    2xT(t)P(i)A(i)x(t)+Nj=1πijxT(t)P(j)x(t)+xT(t)x(t)+2Mρ=1λiρxT(t)P(i)B(i)K(ρ)x(t)+1γ2xT(t)P(i)P(i)x(t)<0 (3.10)

    holds, then it can lead to

    E{V(,σ)V(0,σ0)}<E{0xT(t)x(t)dt+γ20eT(t)e(t)dt}. (3.11)

    Therefore, the desired H performance can be satisfied according to Definition 2.

    On the other hand, there must exist a mode-dependent matrix Q(i)0 such that

    Nj=1πijQ(i)=0, (3.12)

    and this means that

    Nj=1πijxT(t)Q(i)x(t)=0. (3.13)

    As a result, it can be further deduced that

    2xT(t)P(i)A(i)x(t)+Nj=1πijxT(t)(P(j)Q(i))x(t)+xT(t)x(t)+2Mρ=1λiρxT(t)P(i)B(i)K(ρ)x(t)+1γ2xT(t)P(i)P(i)x(t)=xT(t)(2P(i)A(i)+I+2Mρ=1λiρP(i)B(i)K(ρ)+1γ2P(i)P(i)+NjRikπijxT(t)(P(j)Q(i))+NjRiukπijxT(t)(P(j)Q(i)))x(t). (3.14)

    By invoking the fact that

    Nj=1πij=0,πii<0, (3.15)

    it can be verified that the established conditions in Theorem 1 can ensure the H performance and the proof is therefore completed.

    Remark 4. It is worth mentioning that the online NN learning law in Theorem 1 is developed according to the Lyapunov function with a desired H performance. As such, with the desired nonlinearity approximation accuracy, the effect of unknown nonlinear function and external disturbance can be well solved.

    Remark 5. Since the transition rates of the jumping process play a significant role for the controller design of HMJSs, the partly accessible modes are considered for more generalization ability. Moreover, another mode-dependent weighting matrix is adopted to deal with the transition rate relations, which can bring more mode-dependent controller design flexibility.

    Remark 6. Theorem 1 gives sufficient conditions in terms of linear matrix inequalities, which can be conveniently solved with feasible solutions. With the help of matrix manipulation technique, Theorem 2 will calculate the mode-dependent controller gains.

    Theorem 2. System (2.8) can achieve the mean-square H performance with parameter γ>0, if there exist mode-dependent matrices ˜P(i)0, ˜QT(i)=˜Q(i), iS, ˜K(ρ), ρT and matrix Z0, such that the following linear matrix inequalities hold, where

    ˆΘ(i)=[ˆΘ1(i)ˆΘ2(i)ˆΘ3(i)]0,iRik, (3.16)
    ˆΘ1(i)=[ZZTA(i)TZ+Mρ=1λiρB(i)˜K(ρ)0ZZμ1˜P(i)+πii˜P(i)NjRikπij˜Q(i)I00γ2I00I0μ˜P(i)],ˆΘ2(i)=[0000πik1˜P(i)πiki1˜P(i)πiki+1˜P(i)πikL˜P(i)000000000],ˆΘ3(i)=[˜P(k1)000000˜P(ki1)00˜P(ki+1)00˜P(kL)],ˇΘ(i)=[ˇΘ1(i)ˇΘ2(i)ˇΘ3(i)]0,iRiuk, (3.17)
    ˇΘ1(i)=[ZZTA(i)TZ+Mρ=1λiρB(i)˜K(ρ)0ZZμ1˜P(i)NjRikπij˜Q(i)I00γ2I00I0μ˜P(i)], (3.18)
    ˇΘ2(i)=[00πik1˜P(i)πikL˜P(i)00000],ˇΘ3(i)=[˜P(k1)00˜P(kL)],[˜Q(i)˜P(i)˜P(j)]0,jRiuk,ij, (3.19)
    ˜P(i)˜Q(i)0,jRiuk,i=j. (3.20)

    Based on the above conditions, the nonsynchronous mode-dependent controller gains can be calculated by

    K(ρ)=˜K(ρ)Z1, (3.21)

    and the online NN learning law is designed by

    ˙ˆW(i,t)=ΩP(i)x(t)φT(x(t)), (3.22)

    where Ω0 is a constant matrix.

    Proof 2. Denote

    ˜P(i)=P1(i), (3.23)
    ˜Q(i)=˜P(i)Q(i)˜P(i), (3.24)
    ˜K(ρ)=K(ρ)Z, (3.25)

    and perform congruent transformation to Θ(i)<0 with ˜P(i). Then, by applying Lemma 1, the rest of proof can be directly obtained by Theorem 1.

    Based on the above results, the following controller design algorithm can be presented:

    Algorithm 1. Require: Input: A(i), B(i), and γ, Output: K(ρ)

    Ensure:

    1: Given system parameters A(i), B(i), and γ

    2: Solving the linear matrix inequalities in Theorem 2

    3: Setting the learning law with ˙ˆW(i,t)=ΩP(i)x(t)φT(x(t))

    4: Obtaining the matrices with ˜K(ρ) and Z

    5: return K(ρ)=˜K(ρ)Z1

    Remark 7. The above established linear matrix inequalities are strict convex optimization conditions, which can be effectively solved by Matlab LMI toolbox once the system parameters are given. It can be found that the feasibility and complexity is related to the limited mode information, such that the computational efficiency should be considered by selecting appropriate models.

    Remark 8. The optimized minimization value of γ can be further obtained by solving the convex optimization problem:

    min γ,s.t.linearmatrixinequalitiesconditionsinTheorem2. (3.26)

    This section performs the validation simulation via a robotic manipulator example, where different loads modeled by hidden Markov chain are considered.

    Choose the following single-link robotic manipulator model as depicted in Figure 1 [39]:

    {˙ϑ(t)=θ(t),J(i)˙θ(t)=D(i)ϑ(t)gM(i)Lsin(ϑ(t))+u(t), (4.1)
    Figure 1.  Illustration of single-link robotic manipulator.

    where ϑ(t) and θ(t) denote the angular position and velocity, respectively. J(i) represents the moment of inertia, D(i) stands for the coefficient of viscous friction, M(i) denotes the mass of payload, g denotes the gravity acceleration and L denotes the manipulator length.

    In practical applications of robotic manipulators, the manipulator parameters are always kept varying, especially for the cases in unstructured working environments. One typical example is a transportation manipulator performing carrying up and laying down tasks. Since the loads are often different, the actual robotic manipulator parameters of inertias and mass centers are jumping according to the different loads. Under this context, it is reasonable and necessary to model the robotic manipulator as MJS or HMJS.

    Defining x(t)=[ϑ(t),θ(t)]T and considering the disturbance d(t), it can be derived that

    ˙x(t)=A(i)x(t)+f(i,x(t))+B(i)u(t)+d(t)

    where

    A(i)=[010D(i)/J(i)],f(i,x(t))=[0gM(i)Lsin(ϑ(t))/J(i)],B(i)=[01/J(i)].

    In the simulation, it is assumed that there are three operation modes for the robotic manipulator, and the corresponding parameters are set as

    M(1)=2kg,M(2)=4kg,M(3)=1kgJ(1)=4kg/m2,J(2)=8kg/m2,J(3)=2kg/m2D(1)=2Nm/rad/sec,D(2)=2Nm/rad/sec,D(3)=2Nm/rad/sec,L=0.5m,

    such that the state-space system parameters can be obtained by

    A(1)=[0100.5],A(2)=[0100.25],A(3)=[0101],B(1)=[00.25],B(2)=[00.125],B(3)=[00.5],

    and the nonlinear functions f(i,x(t)) and d(t) are assumed to be unknown.

    Consequently, it is supposed that σt=i has three jumping modes and its corresponding transition probability matrix is given by

    [0.80.20.6??0.7??0.5].

    Furthermore, the conditional probability for mode observation is set as

    [0.40.30.30.20.30.50.10.50.4],

    such that a nonsynchronous mode information can be utilized in the controller.

    Based on these established results in Theorem 2 with Ω=I, γ=2, μ=1, the controller gains are obtained as follows:

    K(1)=[3.64132.0110],K(2)=[3.00331.3353],K(3)=[3.15392.7305].

    Setting initial conditions as x(t)=[5,2]T and the parameters of NN as

    φ(x(t))=[11+exp{x1(t)}11+exp{x2(t)}],W(i,0)=[0000],

    Figure 2 depicts the system jumping modes and controller observed jumping modes. The closed-loop system trajectories are shown in Figure 3 and the NN learning errors can been seen in Figure 4, where the approximation errors can converge well. Therefore, the simulation results can support our developed theoretical results. In addition, Figure 5 illustrates the advantages of using the NN-based control method for unknown nonlinear dynamics, where a better control performance can be acquired.

    Figure 2.  System modes and nonsynchronous controller modes.
    Figure 3.  State trajectories of system state x(t).
    Figure 4.  State trajectories of NN learning approximate errors.
    Figure 5.  State trajectories of system state x(t) with and without the NN-based control.

    Furthermore, in order to illustrate the effect of limited mode information on optimal H performance index, Table 1 is presented.

    Table 1.  Comparison of H performance for different limited modes.
    Limited modes [0.80.20.6??0.7??0.5] [0.80.20.60.20.90.7??0.5] [0.80.20.60.20.90.70.10.40.5]
    γ 0.0006 0.0004 0.0001

     | Show Table
    DownLoad: CSV

    Based on the above optimization results, one can observe that the minimized H performance γ can be obtained by linear matrix inequality conditions in Theorem 2. On the other hand, a prescribed H performance can also be set in the controller design processes. Moreover, it can be found that a better optimal H control performance can be obtained by more mode information, which implies a tradeoff during the mode-dependent design procedure with limited modes.

    This paper addresses the adaptive NN learning control issue of nonlinear HMJSs with limited transition probability information. More precisely, a nonsynchronous mode-dependent controller is designed and its NN adaptive learning law is developed accordingly. In order to deal with the approximation error for unknown nonlinearities and disturbances, the H control performance is employed. By constructing a stochastic Lyapunov function, sufficient criteria are established and the controller gains are determined by convex optimization. The simulation results of a robotic manipulator verify that our developed control scheme is effective for HMJSs. Our future work would consider the hidden semi-Markov jump systems, whose transition probabilities are more applicable for certain practical implementations. In addition, as the mode information would be time-varying and inaccessible in the hidden semi-Markov jump systems, it is more desirable to develop the nonsynchronous mode-dependent control strategies.

    The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

    This work was supported by the National Natural Science Foundation of China under Grant 61703038, 61627808, 62173326, the Fundamental Research Funds for the Central Universities under Grant FRF-GF-20-11B, CAS Project for Young Scientists in Basic Research YSBR-041, and Major program of the National Natural Science Foundation of China T2293720/T2293724.

    The authors declare no potential conflict of interest.

    Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.



    [1] H. Mo, G. Farid, Nonlinear and adaptive intelligent control techniques for quadrotor UAV–A survey, Asian J. Control, 21 (2019), 989–1008. https://doi.org/10.1002/asjc.1758 doi: 10.1002/asjc.1758
    [2] S. Patel, A. Sarabakha, D. Kircali, E. Kayacan, An intelligent hybrid artificial neural network-based approach for control of aerial robots, J. Intell. Rob. Syst., 97 (2020), 387–398. https://doi.org/10.1007/s10846-019-01031-z doi: 10.1007/s10846-019-01031-z
    [3] H. J. Kaleybar, H. M. Kojabadi, S. S. Fazel, F. Foiadelli, An intelligent control method for capacity reduction of power flow controller in electrical railway grids, Electr. Power Syst. Res., 165 (2018), 157–166. https://doi.org/10.1016/j.epsr.2018.09.005 doi: 10.1016/j.epsr.2018.09.005
    [4] R. K. Pandey, D. K. Gupta, Intelligent multi-area power control: Dynamic knowledge domain inference concept, IEEE Trans. Power Syst., 32 (2017), 4310–4318. https://doi.org/10.1109/TPWRS.2017.2661799 doi: 10.1109/TPWRS.2017.2661799
    [5] J. Jin, X. Ma, I. Kosonen, An intelligent control system for traffic lights with simulation-based evaluation, Control Eng. Prac., 58 (2017), 24–33. https://doi.org/10.1016/j.conengprac.2016.09.009 doi: 10.1016/j.conengprac.2016.09.009
    [6] G. Yang, D. Tong, Q. Chen, W. Zhou, Fixed-time synchronization and energy consumption for kuramoto-oscillator networks with multilayer distributed control, IEEE Trans. Circuits Syst. Ⅱ Exp. Briefs, 70 (2022), 1555–1559. https://doi.org/10.1109/TCSII.2022.3221477 doi: 10.1109/TCSII.2022.3221477
    [7] D. Tong, B. Ma, Q. Chen, Y. Wei, P. Shi, Finite-time synchronization and energy consumption prediction for multilayer fractional-order networks, IEEE Trans. Circuits Syst. Ⅱ Exp. Briefs, 70 (2023), 2176–2180. https://doi.org/10.1109/TCSII.2022.3233420 doi: 10.1109/TCSII.2022.3233420
    [8] R. R. Selmic, F. L. Lewis, Neural-network approximation of piecewise continuous functions: Application to friction compensation, IEEE Trans. Neural Networks, 13 (2002), 745–751. https://doi.org/10.1109/ISIC.1997.626458 https://doi.org/10.1109/TNN.2002.1000141 doi: 10.1109/ISIC.1997.626458
    [9] W. He, Y. Dong, Adaptive fuzzy neural network control for a constrained robot using impedance learning, IEEE Trans. Neural Networks Learn. Syst., 29 (2017), 1174–1186. https://doi.org/10.1109/TNNLS.2017.2665581 doi: 10.1109/TNNLS.2017.2665581
    [10] Y. J. Liu, Q. Zeng, S. Tong, C. L. P. Chen, L. Liu, Adaptive neural network control for active suspension systems with time-varying vertical displacement and speed constraints, IEEE Trans. Ind. Electr., 66 (2019), 9458–9466. https://doi.org/10.1109/TIE.2019.2893847 doi: 10.1109/TIE.2019.2893847
    [11] S. Li, H. Wang, M. U. Rafique, A novel recurrent neural network for manipulator control with improved noise tolerance, IEEE Trans. Neural Networks Learn. Syst., 29 (2017), 1908–1918. https://doi.org/10.1109/TNNLS.2017.2672989 doi: 10.1109/TNNLS.2017.2672989
    [12] W. Su, B. Niu, H. Wang, W. Qi, Adaptive neural network asymptotic tracking control for a class of stochastic nonlinear systems with unknown control gains and full state constraints, Int. J. Adapt. Control Signal Process., 2021 (2021). https://doi.org/10.1002/acs.3304 doi: 10.1002/acs.3304
    [13] M. Shahvali, M. B. Naghibi-Sistani, J. Askari, Adaptive fault compensation control for nonlinear uncertain fractional-order systems: Static and dynamic event generator approaches, J. Franklin Inst., 358 (2021), 6074–6100. https://doi.org/10.1016/j.jfranklin.2021.05.033 doi: 10.1016/j.jfranklin.2021.05.033
    [14] M. Shahvali, M. B. Naghibi-Sistani, J. Askari, Dynamic event-triggered control for a class of nonlinear fractional-order systems, IEEE Trans. Circuits Syst. Ⅱ Exp. Briefs, 69 (2021), 2131–2135. https://doi.org/10.1109/TCSII.2021.3128561 doi: 10.1109/TCSII.2021.3128561
    [15] M. Shahvali, J. Askari, Adaptive neural dynamic surface control of MIMO stochastic nonlinear systems with unknown control directions, Int. J. Adapt. Control Signal Process., 31 (2017), 97–121. https://doi.org/10.1002/acs.2686 doi: 10.1002/acs.2686
    [16] S. He, H. Fang, M. Zhang, F. Liu, Z. Ding, Adaptive optimal control for a class of nonlinear systems: The online policy iteration approach, IEEE Trans. Neural Networks Learn. Syst., 31 (2019), 549–558. https://doi.org/10.1109/TNNLS.2019.2905715 doi: 10.1109/TNNLS.2019.2905715
    [17] H. Fang, M. Zhang, S. He, X. Luan, F. Liu, Z. Ding, Solving the zero-sum control problem for tidal turbine system: An online reinforcement learning approach, IEEE Trans. Cybern., 2022 (2022). https://doi.org/10.1109/TCYB.2022.3186886 doi: 10.1109/TCYB.2022.3186886
    [18] W. Bai, Q. Zhou, T. Li, H. Li, Adaptive reinforcement learning neural network control for uncertain nonlinear system with input saturation, IEEE Trans. Cybern., 50 (2019), 3433–3443. https://doi.org/10.1109/TCYB.2019.2921057 doi: 10.1109/TCYB.2019.2921057
    [19] L. Wang, Y. Yang, R. Min, S. Chakradhar, Accelerating deep neural network training with inconsistent stochastic gradient descent, Neural Networks, 93 (2017), 219–229. https://doi.org/10.1016/j.neunet.2017.06.003 doi: 10.1016/j.neunet.2017.06.003
    [20] J. R. Noriega, H. Wang, A direct adaptive neural-network control for unknown nonlinear systems and its application, IEEE Trans. Neural Networks, 9 (1998), 27–34. https://doi.org/10.1109/ACC.1995.532743 doi: 10.1109/ACC.1995.532743
    [21] M. Chen, S. S. Ge, B. V. E. How, Robust adaptive neural network control for a class of uncertain MIMO nonlinear systems with input nonlinearities, IEEE Trans. Neural Networks, 21 (2010), 796–812. https://doi.org/10.1109/TNN.2010.2042611 doi: 10.1109/TNN.2010.2042611
    [22] H. Razmi, S. Afshinfar, Neural network-based adaptive sliding mode control design for position and attitude control of a quadrotor UAV, Aerosp. Sci. Technol., 91 (2019), 12–27. https://doi.org/10.1016/j.ast.2019.04.055 doi: 10.1016/j.ast.2019.04.055
    [23] Q. Zhang, M. Mu, H. Ji, Q. Wang, X. Wang, An adaptive type-2 fuzzy sliding mode tracking controller for a robotic manipulator, Electr. Res. Arch., 31 (2023), 3791–3813. https://doi.org/10.3934/era.2023193 doi: 10.3934/era.2023193
    [24] Y. Zuo, Y. Wang, X. Liu, S. X. Yang, L. Huang, X. Wu, et al., Neural network robust H tracking control strategy for robot manipulators, Appl. Math. Modell., 34 (2010), 1823–1838. https://doi.org/10.1016/j.apm.2009.09.026 doi: 10.1016/j.apm.2009.09.026
    [25] E. K. Boukas, Z. K. Liu, P. Shi, Delay-dependent stability and output feedback stabilisation of markov jump system with time-delay, IEEE Proc. Control Theory Appl., 149 (2002), 379–386. https://doi.org/10.1049/ip-cta:20020442 doi: 10.1049/ip-cta:20020442
    [26] Z. G. Wu, P. Shi, Z. Shu, H. Su, R. Lu, Passivity-based asynchronous control for Markov jump systems, IEEE Trans. Autom. Control, 62 (2016), 2020–2025. https://doi.org/10.1109/TAC.2016.2593742 doi: 10.1109/TAC.2016.2593742
    [27] H. Wan, X. Luan, V. Stojanovic, F. Liu, Self-triggered finite-time control for discrete-time Markov jump systems, Inf. Sci., 634 (2023), 101–121. https://doi.org/10.1016/j.ins.2023.03.070 doi: 10.1016/j.ins.2023.03.070
    [28] P. Cheng, S. He, X. Luan, F. Liu, Finite-region asynchronous H control for 2D Markov jump systems, Automatica, 129 (2021), 109590. https://doi.org/10.1016/j.automatica.2021.109590 doi: 10.1016/j.automatica.2021.109590
    [29] P. Cheng, H. Wang, V. Stojanovic, F. Liu, S. He, K. Shi, Dissipativity-based finite-time asynchronous output feedback control for wind turbine system via a hidden Markov model, Int. J. Syst. Sci., 53 (2022), 3177–3189. https://doi.org/10.1080/00207721.2022.2076171 doi: 10.1080/00207721.2022.2076171
    [30] P. He, J. Wen, V. Stojanovic, F. Liu, X. Luan, Finite-time control of discrete-time semi-Markov jump linear systems: A self-triggered MPC approach, J. Franklin Inst., 359 (2022), 6939–6957. https://doi.org/10.1016/j.jfranklin.2022.06.043 doi: 10.1016/j.jfranklin.2022.06.043
    [31] A. Y. Lu, G. H. Yang, Input-to-state stabilizing control for cyber-physical systems with multiple transmission channels under denial of service, IEEE Trans. Autom. Control, 63 (2017), 1813–1820. https://doi.org/10.1109/TAC.2017.2751999 doi: 10.1109/TAC.2017.2751999
    [32] A. Y. Lu, G. H. Yang, Switched projected gradient descent algorithms for secure state estimation under sparse sensor attacks, Automatica, 103 (2019), 503–514. https://doi.org/10.1016/j.automatica.2019.02.016 doi: 10.1016/j.automatica.2019.02.016
    [33] S. Dong, Z. G. Wu, Y. J. Pan, H. Su, Y. Liu, Hidden-markov-model-based asynchronous filter design of nonlinear Markov jump systems in continuous-time domain, IEEE Trans. Cybern., 49 (2018), 2294–2304. https://doi.org/10.1109/TCYB.2018.2824799 doi: 10.1109/TCYB.2018.2824799
    [34] F. Li, S. Xu, H. Shen, Q. Ma, Passivity-based control for hidden markov jump systems with singular perturbations and partially unknown probabilities, IEEE Trans. Autom. Control, 65 (2019), 3701–3706. https://doi.org/10.1109/TAC.2019.2953461 doi: 10.1109/TAC.2019.2953461
    [35] J. Cheng, C. K. Ahn, H. R. Karimi, J. Cao, W. Qi, An event-based asynchronous approach to Markov jump systems with hidden mode detections and missing measurements, IEEE Trans. Syst. Man Cybern. Syst., 49 (2018), 1749–1758. https://doi.org/10.1109/TSMC.2018.2866906 doi: 10.1109/TSMC.2018.2866906
    [36] S. Dong, C. L. P. Chen, M. Fang, Z. G. Wu, Dissipativity-based asynchronous fuzzy sliding mode control for T–S fuzzy hidden Markov jump systems, IEEE Trans. Cybern., 50 (2019), 4020–4030. https://doi.org/10.1109/TCYB.2019.2919299 doi: 10.1109/TCYB.2019.2919299
    [37] Y. Zhang, Y. He, M. Wu, J. Zhang, Stabilization for Markovian jump systems with partial information on transition probability based on free-connection weighting matrices, Automatica, 47 (2011), 79–84. https://doi.org/10.1016/j.automatica.2010.09.009 doi: 10.1016/j.automatica.2010.09.009
    [38] Z. Xu, H. Su, P. Shi, Z. G. Wu, Asynchronous h control of semi-markov jump linear systems, Appl. Math. Comput., 349 (2019), 270–280. https://doi.org/10.1016/j.amc.2018.12.010 doi: 10.1016/j.amc.2018.12.010
    [39] C. Ma, P. Shi, X. Zhao, Q. Zeng, Consensus of euler-lagrange systems networked by sampled-data information with probabilistic time delays, IEEE Trans. Cybern., 45 (2014), 1126–1133. https://doi.org/10.1109/TCYB.2014.2345735 doi: 10.1109/TCYB.2014.2345735
  • Reader Comments
  • © 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(1527) PDF downloads(43) Cited by(0)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog