This paper addresses the attitude tracking problem of quadrotor unmanned aerial vehicles (UAVs) by introducing a novel predefined-time stable adaptive value iteration (PTS-AVI) control scheme. Unlike conventional value iteration-based adaptive dynamic programming (Ⅵ-ADP) methods—where the running cost is independent of the value function—the proposed cost function explicitly incorporates the value function $ V $, thereby ensuring predefined-time stability (PTS) throughout both the training and deployment phases. This functional dependency was systematically addressed via an auxiliary time-scale partial differential equation (PDE) formulated in the $ s $-domain. To circumvent the need to solve the complex Hamilton–Jacobi–Bellman (HJB) equation directly, a parameter update method was employed to approximate its solution, yielding an optimal control policy that satisfies predefined-time stability criteria. In addition, a high-precision predefined-time disturbance observer was designed to estimate and compensate for unknown disturbances. Both theoretical analysis and simulation results confirmed that the proposed control scheme guarantees state convergence to equilibrium within a user-specified time, regardless of initial conditions. This work differs from existing studies by integrating predefined-time stability requirements into the Ⅵ-ADP formulation and coupling it with a predefined-time disturbance observer for attitude tracking.
Citation: Zhixin Feng, Renming Yang. Attitude tracking control of quadrotor unmanned aerial vehicle based on an adaptive predefined time value iteration approach[J]. AIMS Mathematics, 2026, 11(3): 6297-6328. doi: 10.3934/math.2026260
This paper addresses the attitude tracking problem of quadrotor unmanned aerial vehicles (UAVs) by introducing a novel predefined-time stable adaptive value iteration (PTS-AVI) control scheme. Unlike conventional value iteration-based adaptive dynamic programming (Ⅵ-ADP) methods—where the running cost is independent of the value function—the proposed cost function explicitly incorporates the value function $ V $, thereby ensuring predefined-time stability (PTS) throughout both the training and deployment phases. This functional dependency was systematically addressed via an auxiliary time-scale partial differential equation (PDE) formulated in the $ s $-domain. To circumvent the need to solve the complex Hamilton–Jacobi–Bellman (HJB) equation directly, a parameter update method was employed to approximate its solution, yielding an optimal control policy that satisfies predefined-time stability criteria. In addition, a high-precision predefined-time disturbance observer was designed to estimate and compensate for unknown disturbances. Both theoretical analysis and simulation results confirmed that the proposed control scheme guarantees state convergence to equilibrium within a user-specified time, regardless of initial conditions. This work differs from existing studies by integrating predefined-time stability requirements into the Ⅵ-ADP formulation and coupling it with a predefined-time disturbance observer for attitude tracking.
| [1] |
B. Tian, J. Cui, H. Lu, L. Liu, Q. Zong, Attitude control of UAVs based on event-triggered supertwisting algorithm, IEEE T. Ind. Inform., 17 (2021), 1029–1038. https://doi.org/10.1109/TII.2020.2981367 doi: 10.1109/TII.2020.2981367
|
| [2] |
X. Zhang, Y. Wang, G. Zhu, X. Chen, Z. Li, C. Wang, et al., Compound adaptive fuzzy quantized control for quadrotor and its experimental verification, IEEE T. Cybernetics, 51 (2021), 1121–1133. https://doi.org/10.1109/TCYB.2020.2987811 doi: 10.1109/TCYB.2020.2987811
|
| [3] |
C. Li, Y. Wang, X. Yang, Adaptive fuzzy control of a quadrotor using disturbance observer, Aerosp. Sci. Technol., 128 (2022), 107784. https://doi.org/10.1016/j.ast.2022.107784 doi: 10.1016/j.ast.2022.107784
|
| [4] |
Q. Chen, Y. Ye, Z. Hu, J. Na, S. Wang, Finite-time approximation-free attitude control of quadrotors: Theory and experiments, IEEE T. Aero. Elec. Sys., 57 (2021), 1780–1792. https://doi.org/10.1109/TAES.2021.3050647 doi: 10.1109/TAES.2021.3050647
|
| [5] |
W. Zhu, L. Wang, B. Tian, Integral sliding mode control method for quadrotor unmanned aerial vehicle based on finite-time observer, J. Chin. Inertial Technol., 31 (2023), 1244–1253. https://doi.org/10.13695/j.cnki.12-1222/o3.2023.12.011 doi: 10.13695/j.cnki.12-1222/o3.2023.12.011
|
| [6] |
K. Liu, P. Yang, R. Wang, L. Jiao, T. Li, J. Zhang, Observer-based adaptive fuzzy finite-time attitude control for quadrotor UAVs, IEEE T. Aero. Elec. Sys., 59 (2023), 8637–8654. https://doi.org/10.1109/TAES.2023.3308552 doi: 10.1109/TAES.2023.3308552
|
| [7] |
K A. Alattas, M T. Vu, O. Mofid, F F. EI-Sousy, A. Fekih, S. Mobayen, Barrier function-based nonsingular finite-time tracker for quadrotor UAVs subject to uncertainties and input constraints, Mathematics, 10 (2022), 1659. https://doi.org/10.3390/math10101659 doi: 10.3390/math10101659
|
| [8] |
G. Cui, W. Yang, J. Yu, Z. Li, C. Tao, Fixed-time prescribed performance adaptive trajectory tracking control for a QUAV, IEEE T. Circuits II, 69 (2022), 494–498. https://doi.org/10.1109/TCSII.2021.3084240 doi: 10.1109/TCSII.2021.3084240
|
| [9] |
B. Gao, Y. J. Liu, L. Liu, Fixed-time neural control of a quadrotor UAV with input and attitude constraints, IEEE/CAA J. Automatic, 10 (2023), 281–283. https://doi.org/10.1109/JAS.2023.123045 doi: 10.1109/JAS.2023.123045
|
| [10] |
H. Jiang, Q. Ma, J. Guo, Fuzzy-based fixed-time attitude control of quadrotor unmanned aerial vehicle with full-state constraints: Theory and experiments, IEEE T. Fuzzy Syst., 32 (2024), 1108–1115. https://doi.org/10.1109/TFUZZ.2023.3318572 doi: 10.1109/TFUZZ.2023.3318572
|
| [11] |
H. Li, S. Zhao, W. He, R. Lu, Adaptive finite-time tracking control of full state constrained nonlinear systems with dead-zone, Automatica, 100 (2019), 99–107. https://doi.org/10.1016/j.automatica.2018.10.030 doi: 10.1016/j.automatica.2018.10.030
|
| [12] |
Y. Liu, X. Liu, Y. Jing, H. Wang, X. Li, Annular domain finite-time connective control for large-scale systems with expanding construction, IEEE T. Syst. Man Cy. Syst., 51 (2021), 6159–6169. https://doi.org/10.1109/TSMC.2019.2960009 doi: 10.1109/TSMC.2019.2960009
|
| [13] |
J. Yu, P. Shi, L. Zhao, Finite-time command filtered backstepping control for a class of nonlinear systems, Automatica, 92 (2018), 173–180. https://doi.org/10.1016/j.automatica.2018.03.033 doi: 10.1016/j.automatica.2018.03.033
|
| [14] |
X. Jin, Adaptive fixed-time control for MIMO nonlinear systems with asymmetric output constraints using universal barrier functions, IEEE T. Automat. Contr., 64 (2019), 3046–3053. https://doi.org/10.1109/TAC.2018.2874877 doi: 10.1109/TAC.2018.2874877
|
| [15] |
D. Li, S S. Ge, T H. Lee, Fixed-time-synchronized consensus control of multiagent systems, IEEE T. Control Netw., 8 (2021), 89–98. https://doi.org/10.1109/TCNS.2020.3034523 doi: 10.1109/TCNS.2020.3034523
|
| [16] |
C. Hu, H. He, H. Jiang, Fixed/preassigned-time synchronization of complex networks via improving fixed-time stability, IEEE T. Cybernetics, 51 (2021), 2882–2892. https://doi.org/10.1109/TCYB.2020.2977934 doi: 10.1109/TCYB.2020.2977934
|
| [17] |
K. Liu, R. Wang, S. Zheng, S. Dong, G. Sun, Fixed-time disturbance observer-based robust fault-tolerant tracking control for uncertain quadrotor UAV subject to input delay, Nonlinear Dyn., 107 (2022), 2363–2390. https://doi.org/10.1007/s11071-021-07080-0 doi: 10.1007/s11071-021-07080-0
|
| [18] |
J. D. Sánchez-Torres, A. J. Muñoz-Vázquez, M. Defoort, E. Jiménez-Rodríguez, A. G. Loukianov, A class of predefined-time controllers for uncertain second-order systems, Eur. J. Control, 53 (2020), 52–58. https://doi.org/10.1016/j.ejcon.2019.10.003 doi: 10.1016/j.ejcon.2019.10.003
|
| [19] |
E. Jiménez-Rodríguez, A. J. Muñoz-Vázquez, J. D. Sánchez-Torres, A. G. Loukianov, A note on predefined-time stability, IFAC-PapersOnLine., 51 (2018), 520–525. https://doi.org/10.1016/j.ifacol.2018.07.332 doi: 10.1016/j.ifacol.2018.07.332
|
| [20] |
A. K. Pal, S. Kamal, S. K. Nagar, B. Bandyopadhyay, L. Fridman, Design of controllers with arbitrary convergence time, Automatica, 112 (2020), 108710. https://doi.org/10.1016/j.automatica.2019.108710 doi: 10.1016/j.automatica.2019.108710
|
| [21] |
J. Ni, L. Liu, Y. Tang, C. Liu, Predefined-time consensus tracking of second-order multiagent systems, IEEE T. Syst. Man Cyb. Syst., 51 (2019), 2550–2560. https://doi.org/10.1109/TSMC.2019.2916257 doi: 10.1109/TSMC.2019.2916257
|
| [22] |
P. Yang, S. Zhang, X. Yu, W. He, Reinforcement-Learning-Based Finite Time Fault Tolerant Control for a Manipulator With Actuator Faults, IEEE T. Cybernetics, 55 (2025), 2621–2632. https://doi.org/10.1109/TCYB.2025.3557681 doi: 10.1109/TCYB.2025.3557681
|
| [23] |
Y. Gu, J. Zhao, Z Y. Sun, X. Xie, Reinforcement learning-based optimized multi-agent finite-time optimal synchronisation control and its application to the harmonic oscillator, Nonlinear Dyn., 112 (2024), 13175–13188. https://doi.org/10.1007/s11071-024-09758-7 doi: 10.1007/s11071-024-09758-7
|
| [24] |
X. Bu, M. Lv, H. Lei, Discrete-time optimal control ensuring fixed-time prescribed performance for SSP, IEEE T. Aero. Elec. Sys., 61 (2025), 3398–3407. https://doi.org/10.1109/TAES.2024.3485012 doi: 10.1109/TAES.2024.3485012
|
| [25] |
H. Yue, J. Xia, J. Zhang, J. H. Park, X. Xie, Event-based adaptive fixed-time optimal control for saturated fault-tolerant nonlinear multiagent systems via reinforcement learning algorithm, Neural Netw., 183 (2025), 106952. https://doi.org/10.1016/j.neunet.2024.106952 doi: 10.1016/j.neunet.2024.106952
|
| [26] |
L. Jin, L. Su, X. Fei, K. Wang, H. Shen, Adaptive neural inverse optimal control for uncertain switched nonlinear systems: an improved predefined-time method, J. Control Decis., 2024 (2024), 1–15. https://doi.org/10.1080/23307706.2024.2400686 doi: 10.1080/23307706.2024.2400686
|
| [27] |
H. Shen, G. Cui, Z. Li, Q. Chang, Adaptive predefined-time optimal tracking control of flexible-joint robots, IEEE Internet Things, 12 (2025), 53132–53142. https://doi.org/10.1109/JIOT.2025.3616620 doi: 10.1109/JIOT.2025.3616620
|
| [28] |
T. Bian, Z P. Jiang, Reinforcement learning and adaptive optimal control for continuous-time nonlinear systems: A value iteration approach, IEEE T. Neur. Net. Lear., 33 (2022), 2781–2790. https://doi.org/10.1109/TNNLS.2020.3045087 doi: 10.1109/TNNLS.2020.3045087
|
| [29] |
K. Liu, Y. Yang, H. Lyu, W. Yang, Y. Zhang, Z. Tan, et al., Adaptive predefined-time disturbance observer-based fast nonsingular sliding mode control strategy for consumer quadrotor UAVs: Theory and experiments, IEEE T. Consum. Electr., 71 (2025), 10629–10641. https://doi.org/10.1109/TCE.2025.3615655 doi: 10.1109/TCE.2025.3615655
|
| [30] |
K. Liu, W. Yang, L. Jiao, Z. Yuan, C Y. Wen, Fast fixed-time distributed neural formation control-based disturbance observer for multiple quadrotor UAVs under unknown disturbances, IEEE T. Aero. Elec. Sys., 61 (2025), 13137–13155. https://doi.org/10.1109/TAES.2025.3578046 doi: 10.1109/TAES.2025.3578046
|
| [31] |
J. J. Xiong, Y. Chen, BFNN‐based parameter adaptive sliding mode control for an uncertain TQUAV with time‐varying mass, Int. J. Robust Nonlin., 35 (2025), 4658–4668. https://doi.org/10.1002/rnc.7932 doi: 10.1002/rnc.7932
|
| [32] |
K A. Alattas, M T. Vu, O. Mofid, F F M. El-Sousy, A K. Alanazi, J. Awrejcewicz, et al., Adaptive nonsingular terminal sliding mode control for performance improvement of perturbed nonlinear systems, Mathematics, 10 (2022), 1064. https://doi.org/10.3390/math10071064 doi: 10.3390/math10071064
|
| [33] |
A. Najafi, M T. Vu, S. Mobayen, J. H. Asad, A. Fekih, Adaptive barrier fast terminal sliding mode actuator fault tolerant control approach for quadrotor UAVs, Mathematics, 10 (2022), 3009. https://doi.org/10.3390/math10163009 doi: 10.3390/math10163009
|
| [34] |
S. Li, S. Wang, Y. Zhang, X. Wang, Y. Zhang, W. Wu, et al., Distributed bearing-based fault-tolerant formation control of fixed-wing UAV swarm with prescribed performance, Aerosp. Sci. Technol., 168 (2026), 110897. https://doi.org/10.1016/j.ast.2025.110897 doi: 10.1016/j.ast.2025.110897
|
| [35] |
Y. Fu, B. Wang, H. Zhao, M. Zhou, N. Li, Z. Gao, Adaptive safety attitude control of a hybrid VTOL UAV under transition flight subject to multiple faults and uncertainties, Aerosp. Sci. Technol., 163 (2025), 110284. https://doi.org/10.1016/j.ast.2025.110284 doi: 10.1016/j.ast.2025.110284
|
| [36] |
J. Gu, Y. Wang, A constrained reinforcement learning based approach for cooperative control of multi-UAV in dense obstacle environments, Sci. China Technol. Sci., 69 (2026), 1120601. https://doi.org/10.1007/s11431-025-3076-2 doi: 10.1007/s11431-025-3076-2
|
| [37] |
Y. Feng, Y. Zhou, H W. Ho, Performance-guaranteed quadrotor control with incremental adaptive dynamic programming and disturbance compensation, Eng. Appl. Artif. Intel., 163 (2026), 113127. https://doi.org/10.1016/j.engappai.2025.113127 doi: 10.1016/j.engappai.2025.113127
|
| [38] |
X. Song, P. Sun, S. Song, V. Stojanovic, Saturated-threshold event-triggered adaptive global prescribed performance control for nonlinear Markov jumping systems and application to a chemical reactor mode, Expert Syst. Appl., 249 (2024), 123490. https://doi.org/10.1016/j.eswa.2024.123490 doi: 10.1016/j.eswa.2024.123490
|
| [39] |
Z. Peng, X. Song, S. Song, I. Tejado, V. Stojanovic, Resilient consensus tracking control and vibration suppression of moving multiple helicopter flexible slung-load systems, J. Vib. Control, 2025 (2025), 10775463251321551. https://doi.org/10.1177/10775463251321551 doi: 10.1177/10775463251321551
|
| [40] | H. K. Khalil, J. W. Grizzle, Nonlinear systems, Upper Saddle River, NJ: Prentice hall, 2002. |
| [41] |
F L. Lewis, D. Vrabie, K. G. Vamvoudakis, Reinforcement learning and feedback control: Using natural decision methods to design optimal adaptive controllers, IEEE Control Syst. Mag., 32 (2012), 76–105. https://doi.org/10.1109/MCS.2012.2214134 doi: 10.1109/MCS.2012.2214134
|