
This paper proposes an adaptive incremental backstepping control method for stratospheric airship attitude control that combines time delay estimation and incremental backstepping control to enhance robustness under model uncertainties. By integrating incremental control and time delay estimation, a linear time-invariant system relating to the attitude angle tracking error is obtained, where the time-delay estimation error is treated as a disturbance to the system. Meanwhile, an adaptive technique is utilized to reduce the effects of noise and center-of-gravity variations on system robustness. In conclusion, the convergence property of all signals is meticulously examined by employing Lyapunov theory. The proposed scheme is subsequently validated for effectiveness through numerical simulations.
Citation: Yang Sun, Ming Zhu, Yifei Zhang, Tian Chen. Adaptive incremental backstepping control of stratospheric airships using time-delay estimation[J]. Electronic Research Archive, 2025, 33(5): 2925-2946. doi: 10.3934/era.2025128
[1] | Yong Xiong, Lin Pan, Min Xiao, Han Xiao . Motion control and path optimization of intelligent AUV using fuzzy adaptive PID and improved genetic algorithm. Mathematical Biosciences and Engineering, 2023, 20(5): 9208-9245. doi: 10.3934/mbe.2023404 |
[2] | Chengjun Wang, Xingyu Yao, Fan Ding, Zhipeng Yu . A trajectory planning method for a casting sorting robotic arm based on a nature-inspired Genghis Khan shark optimized algorithm. Mathematical Biosciences and Engineering, 2024, 21(2): 3364-3390. doi: 10.3934/mbe.2024149 |
[3] | Shuwen Zheng, Mingjun Zhang, Jing Zhang, Jitao Li . Lagrange tracking-based long-term drift trajectory prediction method for Autonomous Underwater Vehicle. Mathematical Biosciences and Engineering, 2023, 20(12): 21075-21097. doi: 10.3934/mbe.2023932 |
[4] | Xiaoyong Xiong, Haitao Min, Yuanbin Yu, Pengyu Wang . Application improvement of A* algorithm in intelligent vehicle trajectory planning. Mathematical Biosciences and Engineering, 2021, 18(1): 1-21. doi: 10.3934/mbe.2021001 |
[5] | Li-zhen Du, Shanfu Ke, Zhen Wang, Jing Tao, Lianqing Yu, Hongjun Li . Research on multi-load AGV path planning of weaving workshop based on time priority. Mathematical Biosciences and Engineering, 2019, 16(4): 2277-2292. doi: 10.3934/mbe.2019113 |
[6] | Chengxi Wu, Yuewei Dai, Liang Shan, Zhiyu Zhu, Zhengtian Wu . Data-driven trajectory tracking control for autonomous underwater vehicle based on iterative extended state observer. Mathematical Biosciences and Engineering, 2022, 19(3): 3036-3055. doi: 10.3934/mbe.2022140 |
[7] | Dacheng Yu, Mingjun Zhang, Xing Liu, Feng Yao . Fault feature extraction and fusion method for AUV with weak thruster fault based on variational mode decomposition and D-S evidence theory. Mathematical Biosciences and Engineering, 2022, 19(9): 9335-9356. doi: 10.3934/mbe.2022434 |
[8] | Zhenzhong Chu, Zhenhao Gu, Zhiqiang Li, Yunsai Chen, Mingjun Zhang . A fault diagnostic approach based on PSO-HMM for underwater thrusters. Mathematical Biosciences and Engineering, 2022, 19(12): 12617-12631. doi: 10.3934/mbe.2022589 |
[9] | Ping Li, Zhe Lin, Hong Shen, Zhaoqi Zhang, Xiaohua Mei . Optimized neural network based sliding mode control for quadrotors with disturbances. Mathematical Biosciences and Engineering, 2021, 18(2): 1774-1793. doi: 10.3934/mbe.2021092 |
[10] | Zhiqiang Wang, Jinzhu Peng, Shuai Ding . A Bio-inspired trajectory planning method for robotic manipulators based on improved bacteria foraging optimization algorithm and tau theory. Mathematical Biosciences and Engineering, 2022, 19(1): 643-662. doi: 10.3934/mbe.2022029 |
This paper proposes an adaptive incremental backstepping control method for stratospheric airship attitude control that combines time delay estimation and incremental backstepping control to enhance robustness under model uncertainties. By integrating incremental control and time delay estimation, a linear time-invariant system relating to the attitude angle tracking error is obtained, where the time-delay estimation error is treated as a disturbance to the system. Meanwhile, an adaptive technique is utilized to reduce the effects of noise and center-of-gravity variations on system robustness. In conclusion, the convergence property of all signals is meticulously examined by employing Lyapunov theory. The proposed scheme is subsequently validated for effectiveness through numerical simulations.
With the development and utilization of marine resources, the application scenarios and requirements of autonomous underwater vehicles (AUVs) are becoming increasingly extensive. AUVs play an essential role in marine environment observation [1,2], underwater target detection [3], seabed topographic mapping [4], and other fields. As an essential tool for developing marine resources, AUVs have the advantages of substantial autonomy and controllability [5], low operational risk, and economic advantages. Facing the complex underwater environment, an effective trajectory planning method is the key to ensuring the safety of AUVs.
The motion planning of an AUV is divided into path planning and trajectory planning. The main difference between these two categories is that there is only a series of way-points in the result obtained by path planning algorithms. However, information on time corresponding to the state is also included in the solutions obtained by trajectory planning algorithms [6]. In [7], Wang et al. proposed a novel simultaneous planning and control (SPC) method, which combines an improved artificial potential field (IAPF) and model predictive control (MPC) techniques. The IAPF is used for robust and efficient tracking in a short future. The MPC is implemented to generate actual control commands for high-precision tracking. In [8], for swarm intelligent algorithms, focusing on trajectory planning of carrier aircraft on deck, an optimal control problem was first established. Transformation of control variable was conducted to guarantee enough feasible solutions, and a segmented fitness function was set to treat the constraints discriminately and make the search efficient.
Ren et al. [9] changed the traditional A* algorithm into a dynamic measurement heuristic A* algorithm, which improved search efficiency. Liu et al. [10] optimized the algorithm's convergence speed and solution quality by updating the pheromone-based linear regression. Chu et al. [11] introduced an ocean current disturbance function in the deep reinforcement learning algorithm, which reduced the delay time of path planning. Trajectory planning theory has been widely used in the fields of aircraft [12], robotic arms [13] and automatic driving [14]. Liu and Zhang [15] designed an iterative strategy based on the initial values generator and improved the solution efficiency. In [16], the adaptive pseudospectral method was used to solve the optimal state and control variables of fuel consumption. In [17], the GPM was used to plan the overtaking trajectory on the curve and realized the tracking control of the trajectory.
The trajectory planning method can plan the speed and acceleration of the AUV to meet the requirements of smoothness and speed controllability. The essence of the trajectory optimization problem is the optimal control problem, which can be solved by indirect and direct methods [18]. The indirect method mainly converts the first-order necessary condition of the optimal control problem into a Hamiltonian boundary value problem, and it obtains an accurate analytical solution by solving the Hamiltonian boundary value problem. In [19], a symplectic pseudospectral method based on the dual variational principle and the quasilinearization method was proposed and was successfully applied to solve nonlinear optimal control problems with inequality constraints. Wang et al. [20] proposed an iterative framework to solve optimal control for nonlinear proportional state-delay systems. Due to the benefit of the successive convexification technique and the multi-interval pseudospectral method, initial guess on costate variables is avoided and converged solutions can be obtained with an exponential convergence rate. The disadvantage of the indirect method is that it requires the derivation of the Hamiltonian function for solution. In addition, there is no exact analytical solution for nonlinear complex control problems, so it is difficult to solve them using indirect methods.
The direct method to solve the trajectory planning problem is to transform it into a NLP problem by discretization [21]. Then, the optimal solution of the NLP problem is solved according to the KKT (Karush-Kuhn-Tucker) condition. The GPM constructs a sparse constrained Jacobian matrix through the placement of discrete points [22]. The GPM can obtain higher accuracy with fewer collocation points, and it has the advantage of fast solution speed, which is conducive to solving optimal control problems. In [23], the Legendre-Gauss method is improved, with higher computational efficiency and accuracy in solving optimal control problems. In [24], the application of GPM in the flight phase of aircraft and rockets was discussed. Ma et al. [25] proposed a path-planning method for unmanned surface vehicles (USVs) based on the rapid Gauss pseudospectral method (RGPM). This method can plan a globally time-optimal path in a complex marine environment, discretize the continuous-time optimal control path planning problem into a nonlinear programming problem, and solve and optimize the nonlinear programming problem to reduce computational costs. Simulation experiments verify the effectiveness of the path-planning method.
For small-scale dense problems, the SQP algorithm has the advantages of fast convergence speed and high accuracy when solving NLP problems. However, the SQP algorithm is sensitive to initial values. When calculating large-scale NLP problems, the dependence on initial values increases, and it is easy to fall into local optimization [26]. In order to solve the initial values dependency of the SQP algorithm, this paper proposes a method that combines the PSO algorithm and the GPM. The path pre-planning of the PSO algorithm and discretization of the GPM provide initial values reference for the SPQ algorithm to solve NLP problems.
The research in this paper includes the following aspects. First, a multi-constrained trajectory optimization model is built with the shortest sailing time as the optimization objective. The kinematic model, boundary constraints, and path constraints are established. The trajectory obtained can avoid collision with obstacles. Second, combined with the dynamic characteristics of the AUV during navigation, the mathematical model of the AUV trajectory planning is improved by introducing dynamic constraints. The navigation path obtained by the solution meets the actual requirements, and the change of control variables during the navigation process is more intuitive and traceable. Finally, the PSO is used to solve the path points quickly. The pre planned path is fitted, and the trajectory points are allocated to the LG points of the GPM, which are used as the initial values of the SQP algorithm and are iterated. The search cost is reduced, and the calculation speed is improved.
The main structure of this paper is as follows: Section 2 introduces the AUV trajectory planning problem, establishes the AUV motion model, and describes the AUV constraints and optimal control problems. Section 3 describes the principles and processes of the GPM and the PSO for trajectory planning. Section 4 compares three different strategies for initial value generation by simulation calculation and verifies the validity of the PSO-GPM in solving the trajectory planning problem. Section 5 summarizes the results of this study.
The trajectory planning problem of an AUV can be described as aiming at the shortest sailing time, searching for appropriate state variables values and control variables values so that the AUV can avoid obstacles from the beginning to the end. The problems to be solved include the following three points: (1) The process from starting position to the target position of the AUV can be realized. (2) The AUV can avoid all obstacles and navigate safely during the process. (3) The AUV can navigate with an optimum or approximately optimum trajectory.
The motion of the AUV is generally 6 degrees of freedom and can be decoupled into horizontal and vertical motion. Only the horizontal motion of the AUV is discussed in this paper. The mathematical model of the AUV mainly includes the kinematic model and the dynamic model. The kinematic model mainly describes the relationship between position, speed, acceleration, and time. The dynamic model considers the quality and force of the AUV and mainly analyses the motion characteristics of the robot after being stressed.
In practical applications, two directional thrusters control the movement of the AUV in three degrees of freedom: forward and backward, lateral displacement, and rotation. The distribution positions are shown in Figure 1(a).
Thrusters in the x and y directions are arranged at the tail of the AUV, respectively, responsible for the forward and lateral movement of the AUV as shown in Figure 1(b). Thruster 1 generates thrust Tu in the x direction, and Thruster 2 generates thrust Tv in the y direction. The torque formed by Tu and Tv changes the navigation direction of the AUV.
The kinematic model of the AUV in the horizontal plane of motion can be expressed as follows:
{˙px=vx⋅cosψ−vy⋅sinψ,˙py=vx⋅sinψ+vy⋅cosψ,˙ψ=rz, | (1) |
where px, py and ψ represent the position and the direction angle of an AUV in a fixed coordinate system; vx, vy and rz represent the velocity and the angular velocity of an AUV in a moving coordinate system.
The dynamic equations of an AUV in the horizontal motion plane are expressed as follows [27]:
{Mu˙vx=Mvvyrz−(Xu+X|u|u⋅|vx|)⋅vx+TuMv˙vy=−Muvxrz−(Yv+Y|v|v⋅|vy|)⋅vy+TvMr˙rz=(Mu−Mv)⋅vxvy−(Nr+N|r|r⋅|rz|)⋅rz+Tr | (2) |
{Mu=m−X˙uMv=m−Y˙v,Mr=Iz−N˙r | (3) |
where m is the mass of the AUV; Iz is the moment of inertia; (vx, vy, rz)T is the velocity vector; X(·), Y(·), and N(·) are the hydrodynamic coefficients; and T = (Tu, Tv, Tr)T is the system input consisting of thrust and thrust torque.
Path constraints are the constraints for the AUV to avoid obstacles and navigate in a feasible area. Path constraints must satisfy the requirement that no collisions occur during the voyage. Describing the constraints accurately between obstacles and the AUV is the key to obtaining the safe trajectory. The mathematical abstraction of the AUV and obstacles can be simplified to circles or rectangles. Considering the severe consequences of collisions between the AUV and obstacles, the outline of the AUV and obstacles should be accurately described when establishing path constraints. The kinematic model of the AUV is shown in Figure 1.
R(x, y) is the center of gravity of the AUV; O is the center of the circle for changing direction; l1 = 0.8 m and l2 = 0.2 m respectively represent the distance between R(x, y) and the front and rear edges of the AUV; d = 0.2 m is the width of the AUV; ψ is the heading angle of the AUV; A, B, C and D are the vertexes of the simplified contour of the AUV; the vertex coordinates of the AUV are represented as follows according to the geometric relationship:
{[Ax(t),Ay(t)]=[px(t)+l1cosψ(t)−d2sinψ(t),py(t)+l1sinψ(t)+d2cosψ(t)],[Bx(t),By(t)]=[px(t)+l1cosψ(t)+d2sinψ(t),py(t)+l1sinψ(t)−d2cosψ(t)],[Cx(t),Cy(t)]=[px(t)−l2cosψ(t)+d2sinψ(t),py(t)−l2sinψ(t)−d2cosψ(t)],[Dx(t),Dy(t)]=[px(t)−l2cosψ(t)−d2sinψ(t),py(t)−l2sinψ(t)+d2cosψ(t)]. | (4) |
The obstacle simulation method based on P-Norm criterion can express the position and shape information of obstacles in the form of functions. The form of path constraints is concise and unified, and the mathematical expression of obstacles can be flexibly adjusted to meet the formal requirements for constraint conditions in trajectory planning problems.
The P-criterion function can be expressed as
h(ox,oy)=(|ox−arx|p+|oy−bry|p)1/1pp−1, p=1,2,3,… | (5) |
where rx and ry represent the scale coefficients in the x and y directions; (a, b) represents the center position of the graph. When h(ox, oy) = 0, different p values represent different graphical boundaries.
In this paper, P = 2 and P = 6 are selected to represent the shapes of the obstacles with a circle and an approximate rectangle. The relationship between the trajectory points of the AUV and the position of the obstacles can be expressed as
hi[Ax(t),Ay(t)]=(|Ax(t)−airxi|pi+|Ay(t)−biryi|pi)1/1pipi−1, i=1,2,3,4, | (6) |
where i represents the i-th obstacle. The expression of points B, C and D is consistent with Eq 6. Any point where h(ox, oy) = 0 is located on the boundary of the obstacle. Accordingly, to achieve obstacle avoidance navigation of the AUV, it is necessary to ensure that the trajectory points of the AUV are located outside the obstacle, that is, to ensure that h(ox, oy) > 0.
In order to reach the target point safely and accurately, the AUV trajectory needs to satisfy the control and boundary constraints. The control constraints include the propeller's thrust and torque constraints, which can be expressed as follows:
Tumin⩽Tu⩽Tumax, Tvmin⩽Tv⩽Tvmax, Trmin⩽Tr⩽Trmax. | (7) |
State variable constraints include position, speed, and angle constraints and can be expressed as:
{pxmin⩽px⩽pxmax, pymin⩽py⩽pymax, ψmin⩽ψ⩽ψmax,vxmin⩽vx⩽vxmax, vymin⩽vy⩽vymax, rzmin⩽rz⩽rzmax. | (8) |
Boundary constraints include initial and terminal constraints, which correspond to the initial and terminal states of the AUV during navigation as follows:
{px(t0)=px0, py(t0)=py0, ψ(t0)=ψ0,vx(t0)=vx0, vy(t0)=vy0, rz(t0)=rz0. | (9) |
{px(tf)=pxf, py(tf)=pyf, ψ(tf)=ψf,vx(tf)=vxf, vy(tf)=vyf, rz(tf)=rzf. | (10) |
The trajectory optimization goal of the AUV is to find the trajectory with the optimal values of the control variables and satisfy the constraints by setting the objective function. The optimal control problem can be described as finding control variables u(t)∈Rm and system state variables x(t)∈Rn, minimizing performance indicators, and satisfying dynamic, path and boundary constraints.
J=Φ[x(t0),t0,x(tf),tf]+∫tft0L[x(t),u(t),t] dt | (11) |
s.t. {˙x=f[x(t),u(t),t] , t∈[t0,tf] ,C[x(t),u(t),t]⩽0 , t∈[t0,tf] ,ϕ[x(t0),t0,x(tf),tf]=0, | (12) |
where u represents the control variable of the system, and x represents the state variable; t0 and tf represent the initial and final time values, respectively, and can be set as free variables; Φ represents the Mayer form of the objective function; L represents the Lagrange form of the objective function; C represents the path constraint function; ϕ represents the boundary constraint function.
In the above formula, the function satisfies the following definition range and value range.
Φ:Rn×R×Rn×R→RL:Rn×Rm×R→Rf:Rn×Rm×R→Rnϕ:Rn×R×Rn×R→RqC:Rn×Rm×R→Rc | (13) |
GPM is a collocation method which discretizes state and control variables at a series of Legendre-Gauss (LG) points. Lagrange polynomials are used for global fitting to approximate the trajectories of state and control variables. Therefore, the optimal control problem in continuous space is transformed into a non-linear programming problem.
Since the collocation points of the GPM are all distributed in the region [-1, 1], a time variable τ ∈ [-1, 1] is introduced to discretize the time t ∈ [t0, tf] of the optimal control problem to [-1, 1]. The transformation form is as follows.
τ=2ttf−t0−tf+t0tf−t0 | (14) |
NLG points and initial time τ₀ = −1 are selected as collocation points. The state and control variables are approximated by Lagrange interpolation polynomial and discretized into the following equation.
x(τ)≈X(τ)=N∑i=0Li(τ)x(τi)u(τ)≈U(τ)=N∑i=1~Li(τ)U(τi) | (15) |
where the interpolation basis function Li(τ)(i=0,1,2,⋯,N) is defined as Li(τ)=N∏j=0,j≠iτ−τjτi−τj and ~Li(τ)(i=1,2,⋯,N) is defined as Li(τ)=N∏j=1,j≠iτ−τjτi−τj.
For the integration of the dynamic differential equation on the interval [-1, 1], the following equation can be obtained:
x(τf)−x(τ0)=tf−t02∫1−1f[x(τ),u(τ),τ] dτ | (16) |
The state variables in Eq (15) are differentiated as follows:
˙x(τk)≈˙X(τk)=N∑i=0˙Li(τk)X(τi)=N∑i=0DkiX(τi) | (17) |
The differential of the Lagrange polynomial at LG points can be determined by off-line calculation of differential approximation matrix D∈RN×(N+1).
In the discretization the terminal state is approximated by Gauss integration.
Xf=X0+tf−t02N∑k=1ωkf(X(τk),U(τk),τk,t0,tf) | (18) |
where τk(1,2,⋯,N) is the LG point, and ωk=∫1−1Li(τ)dτ is the Gauss weight.
The dynamic constraints of the optimal control problem are converted into algebraic constraints utilizing a differential approximation matrix, and the integral form of the state equation is obtained as follows:
N∑i=0DkiX(τi)−tf−t02f[X(τk),U(τk),τk,t0,tf]=0 | (19) |
Based on the above numerical approximation method, the continuous trajectory optimization problem is discretized and transformed into a non-linear programming problem, which can be solved by the SQP to obtain an approximate solution of the optimal control problem [28].
The selection of initial values is the key to realizing real-time or fast calculation. The quality of initial values directly affects the time of solving optimization problems. Initial values with poor quality will increase the calculation cost and result in the optimization problem not undergoing convergence. PSO, as a typical artificial intelligence algorithm, has the advantages of easy realization and fast convergence and is widely used to solve path planning problems [29]. Compared with the standard collocation method, the GPM can obtain higher solution accuracy with fewer nodes and has a faster convergence speed [30]. By combining the advantages of the two methods, the efficiency and accuracy of solving trajectory optimization problems are improved.
The PSO aims to search for the best solution by iteration from a group of stochastic solutions. During the iteration, the individual learns information from the individual and group extremes and constantly updates its speed vi and position xi according to Eqs (20) and (21).
vt+1i=ωvti+c1r1(pti−xti)+c2r2(ptg−xti) | (20) |
xt+1i=xti+vt+1i | (21) |
where ω is the inertia weight; c₁ and c₂ are the two learning factors, and r is a random number between (0, 1); xit is the position of the i-th particle at the t-th iteration; vit is the velocity of the i-th particle at the t-th iteration; pg is the Group optimal value.
When the PSO is used to solve path planning problems, the objective function is usually the shortest path. The penalty functions are set according to the constraints, and the equality and inequality constraints are weighted to convert the constrained optimization problems into unconstrained optimization problems. The fitness function of PSO includes the objective function and penalty function. If the constraints are met, the penalty function is set to 0. The particles are randomly placed in the feasible region to search, and the speed and position of the particles are changed by Eqs (20) and (21) to find the solution with the lowest fitness function. Finally, the results of path planning are obtained.
The PSO has a robust global searching ability and fast early convergence, and it is suitable for global path planning. However, it ignores the kinematic characteristics of the AUV when solving path planning problems, and the result only has position information. The trajectory planning method based on the GPM can calculate the optimum value of the state and control variables, including time information, by solving the dynamic constraints of the AUV. The information on speed and angular velocity can be directly used as the reference command of the control system. However, when solving NLP problems converted from the GPM, the SQP algorithm is sensitive to the initial values involved in the iterative calculation. The solution process depends on the selection of initial values. Combining the speed of the PSO with the accuracy of the GPM, the initial values can be optimized, and the computational efficiency of the algorithm can be improved. The PSO is used to pre-plan the path and converge the path around the optimal solution to provide initial values reference for the SQP algorithm. The pre-planned trajectory is fitted by the cubic spline interpolation so that the initial values are distributed at LG points. Fitted trajectory points are used as guessing values to improve the speed of the solution.
Set the navigation area of the AUV to a square two-dimensional plane of 25 m × 25 m, and set the start and end points of the AUV to point (0 m, 0 m) and point (25 m, 25 m). The objective function J = tf for AUV trajectory optimization is set to minimize navigation time. The state variables and control variables are discretized at 50 LG points. The boundary and constraint values of state variables are shown in Table 1, and control variable constraints are shown in Table 2.
State Variables | Starting Values | Terminal Values | Minimum Values | Maximum Values |
px/m | 0 | 25 | 0 | 30 |
py/m | 0 | 25 | 0 | 30 |
ψ/rad | π/4 | π/4 | −π | π |
vx(m/s) | 0 | 0 | −1 | 1 |
vy(m/s) | 0 | 0 | −1 | 1 |
rz(rad/s) | 0 | 0 | −π/6 | π/6 |
Control Variables | Tu | Tv | Tr |
[min, max] | [−53, 74]/N | [−53, 74]/N | [−53, 74]/(N∙m) |
The obstacle parameters in the navigation environment are shown in Table 3, which contains four obstacles. The relevant parameters in the dynamic constraints are shown in Table 4.
Name | Value | Name | Value | Name | Value |
X˙u | -30 kg | Xu | -70 kg/s | X|u|⋅u | -100 kg/m |
Y˙v | -80 kg | Yv | -100 kg/s | Y|v|⋅v | -200 kg/m |
N˙r | -30 kg∙m2 | Nr | -50 kg∙m/s | N|r|⋅r | -100 kg/m |
Iz | 50 kg∙m2 | m | 185 kg |
Obstacle | Center coordinates | Length/m | Width/m |
1 | (5 m, 5 m) | 2 | 2 |
2 | (8 m, 13 m) | 8 | 4 |
3 | (18 m, 12 m) | 4 | 8 |
4 | (20 m, 20 m) | 2 | 2 |
For the GPM without pre-planned path points, the initial values are selected by the cubic spline interpolation between the start point and the end point. The results of trajectory optimization obtained by simulation are shown in Figure 3.
As can be seen from Figure 3, the planned trajectory starts from the starting point (0, 0). The AUV adjusts its steering angle as it approaches obstacle 1. After bypassing obstacle 1, it navigates in a straight line to the vicinity of obstacle 2 and obstacle 3. To avoid collisions, the AUV adjusts its course, traversing between obstacle 3 and obstacle 4 and navigating to the destination (25, 25).
The navigation time of the AUV is 72.742 s, and the calculation time is 92.664 s. The trajectory tends to be smooth, and the AUV can avoid collision with obstacles during navigation. The optimal trajectory satisfying the path constraints is obtained. The chart below shows the control variables and speed curves during the navigation.
Figure 4(a) shows the variation curves of control variables Tu, Tv and Tr. It can be seen that Tu, Tv and Tr meet the control constraints during the whole navigation. When the AUV passes obstacle 1, the torque generated by Tu and Tv causes the AUV to turn, bypass obstacle 1, and continue sailing. During the navigation time interval of [35-60 s], due to the path constraints imposed by obstacles on the AUV, the control variable Tv changes its size and direction, ensuring that the AUV can safely navigate between obstacle 3 and obstacle 4. The trend of AUV speed change is similar to that of the control variable Tu.
For the GPM without pre-planned path points, the initial values are selected by the linear interpolation between the start point and the end point. The results of trajectory optimization obtained by simulation are shown in Figure 5.
As can be seen from the trajectory of the AUV in Figure 5, the trajectory meets the requirements for obstacle avoidance. However, during navigation, the AUV's heading angle deviates. When the AUV is not close to an obstacle, the heading angle has undergone significant deflection, which is not conducive to the safety of the AUV navigation. The navigation time of the AUV is 67.974 s, and the calculation time is 144.173 s. The chart below shows the control variables and speed curves during the navigation.
Combining the trajectory diagram in Figure 5, it can be concluded that the change in heading angle is related to the increase in the control variable values T. In Figure 6, the values of T in the x direction and the y direction are similar, and the torque generated by the two drives the AUV to change course. As a result, the AUV in Figure 5 significantly deviates from its heading when passing through obstacles.
In the same simulation environment, the PSO is introduced to pre-plan the navigation path of the AUV. The cubic spline interpolation is used to fit the pre-planned path points, and the obtained initial values are substituted for the problem of trajectory planning to solve. The results of trajectory optimization are shown in Figure 7.
The trajectory in Figure 7 satisfies the path and boundary constraints. During the voyage, there is no collision with obstacles. When the AUV passes obstacle 1, it adjusts the navigation direction to avoid collision with the obstacle. In the narrow area between obstacle 3 and obstacle 4, the AUV can also pass safely. The navigation time of the AUV is 63.507 s, and the simulation calculation time is 16.377 s. The calculation time of pre-planned path points using the PSO is 12.764 s, and the solution time after substituting pre-planned path points as initial values into the SQP is 3.613 s. After the PSO is used for pre-planning, the changes in control and state variables of the AUV are shown in Figure 8.
In Figure 8(a), Tu drives the AUV within the restraint range with the upper thrust limit. The control variables Tu, Tv and Tr meet the restraint requirements. During the navigation, the fluctuation of the control variable Tu is relatively small, whereas the fluctuation of Tv is relatively large. The torque generated by Tu and Tv causes the AUV's navigation direction to change.
In the above scenarios, the simulation environment is the same, and the shortest navigation time is the objective function of the trajectory optimization problem. The optimal control problem is transformed into a NLP problem by the GPM, and the optimal trajectory can be obtained quickly. The simulation results show that the optimal solution for trajectory optimization can be obtained by the GPM, and the results of trajectory planning satisfy the path constraints. At the same time, the AUV meets the requirements for obstacle avoidance during navigation.
Comparing the calculation times of three scenarios, it can be seen that when the PSO is used for pre-planning, a collision-free path can be quickly planned in 13 s. The pre-planned path points are fitted and assigned to the GPM as the initial values. The pre-planned trajectory is located near the optimal solution, which reduces the amount of global calculation and significantly improves the calculation speed. In Figure 9, it can be seen that the shortest sailing time is the PSO, and the shortest simulation calculation time is the PSO. After pre planning of the PSO algorithm, the SQP has significantly less time to solve trajectory optimization problems. Compared to the linear method and the spline method, the calculation efficiency by using the PSO pre-planning method is improved by 82.3 and 88.6%.
The thrust accumulation values reflect the demand for control during AUV navigation, which to some extent determines the AUV's navigation distance. In Figure 10, the thrust accumulation value of the PSO is the largest, and the thrust accumulation value of the spline is the smallest. Therefore, it is judged that the output of the PSO method control variables needs to be adjusted while the calculation time is optimized in order to achieve the goal of control optimization.
This paper takes AUV trajectory optimization as the research background and proposes the PSO method for path pre-planning to solve the initial value dependence problem of the SQP. The pre-planned path points are used as the initial values of the SQP. The combination of the PSO global search speed and the accuracy of the GPM can improve the optimum of trajectory planning. The simulation results show that the method of combining PSO and GPM reduces the calculation cost and can quickly calculate the convergent solution of the trajectory optimization problem which provides a reference for online real-time trajectory planning. The pre-planning of the PSO reduces the initial value dependency of the SQP algorithm in solving NLP problems, and it improves computational efficiency by 82.3 and 88.6% compared to the linear interpolation and the cubic spline interpolation.
The existing research results in this paper are limited to known feasible areas and the location of obstacles, and only representative obstacle models have been selected. The simulation results show that the solution results of the proposed method meet the requirements for safe navigation of the AUV. However, the selection method of LG points is relatively traditional, and the accuracy and local convergence of the solution needs to be improved. In [31], by replacing the common one-to-one collision-avoidance with the constructed within-SDC constraints, an optimal control problem whose scale is totally independent of the number of obstacles could be formulated. In [32], combined with the hp method based on the residual error of dynamic constraints, adjusting the number of collocation points and subintervals can improve convergence speed and computational efficiency. In future research, multiple obstacles and multiple types of obstacle models should be considered, and the accuracy of trajectory planning problems can be improved by refining the mesh.
This research was funded by the National Natural Science Foundation of China under Grant U2006228, 52171313, 51839004, and in part by the Key Laboratory Foundation of Underwater Robot Technology under Grant 6142215200305.
The authors declare there is no conflict of interest.
Nomenclatures | |
px | The position of AUV in the x direction |
py | The position of AUV in the y direction |
ψ | The direction angle of the AUV |
vx | The velocity of AUV in the x direction |
vy | The velocity of AUV in the y direction |
rz | The angular velocity of the AUV |
Tu | Thrust of propeller in the x direction |
Tv | Thrust of propeller in the y direction |
Tr | Torque generated by thrust |
m | Mass of the AUV |
Iz | The moment of inertia |
X˙u | Additional mass coefficient in the u direction |
Xu | Linear resistance coefficient in the u direction |
X|u|∙u | Secondary resistance coefficient in the u direction |
Y˙v | Additional mass coefficient in the v direction |
Yv | Linear resistance coefficient in the v direction |
Y|v|∙v | Secondary resistance coefficient in the v direction |
N˙r | Additional rotational inertia in r direction |
Nr | Linear resistance coefficient in the r direction |
N|r|∙r | Secondary resistance coefficient in the r direction |
R(x, y) | The center of gravity of the AUV |
l1 | The distance from the center of gravity to the head of the AUV |
l2 | The distance from the center of gravity to the tail of the AUV |
d | The width of AUV |
rx | The scale coefficients in the x direction |
ry | The scale coefficients in the y direction |
(a, b) | The center position of the graph |
x(t) | The state variable |
u(t) | The control variable |
Φ | The Mayer form of the objective function |
L | The Lagrange form of the objective function |
C | The path constraint function |
ϕ | The boundary constraint function |
τ | The time variable |
t0 | The Initial time |
tf | The terminal time |
Li(τ) | The interpolation basis function |
D | The differential approximation matrix |
ωk | The Gauss weight |
xit | The position of the i-th particle at the t-th iteration |
vit | The velocity of the i-th particle at the t-th iteration |
c1 | The learning factor |
c2 | The learning factor |
ω | The inertia weight |
r | A random number between (0, 1) |
pg | The Group optimal value |
[1] |
X. Yang, X. Yang, X. Deng, Horizontal trajectory control of stratospheric airships in wind field using Q-learning algorithm, Aerosp. Sci. Technol., 106 (2020), 106100. https://doi.org/10.1016/j.ast.2020.106100 doi: 10.1016/j.ast.2020.106100
![]() |
[2] |
M. Manikandan, R. S. Pant, A comparative study of conventional and tri-lobed stratospheric airships, Aeronaut. J., 125 (2021), 1434–1466. https://doi.org/10.1017/aer.2021.24 doi: 10.1017/aer.2021.24
![]() |
[3] |
J. H. Kim, S. J. Yoo, Distributed event-triggered adaptive formation tracking of networked uncertain stratospheric airships using neural networks, IEEE Access, 8 (2020), 49977–49988. https://doi.org/10.1109/ACCESS.2020.2979995 doi: 10.1109/ACCESS.2020.2979995
![]() |
[4] |
E. Price, M. J. Black, A. Ahmad, Viewpoint-driven formation control of airships for cooperative target tracking, IEEE Rob. Autom. Lett., 8 (2023), 3653–3660. https://doi.org/10.1109/LRA.2023.3264727 doi: 10.1109/LRA.2023.3264727
![]() |
[5] |
L. Chen, Q. Gao, Y. Deng, J. Liu, Moving-mass-based station keeping of stratospheric airships, Aeronaut. J., 125 (2021), 1231–1244. https://doi.org/10.1017/aer.2021.9 doi: 10.1017/aer.2021.9
![]() |
[6] |
Z. Zheng, Z. Guan, Y. Ma, B. Zhu, Constrained path-following control for an airship with uncertainties, Eng. Appl. Artif. Intell., 85 (2019), 295–306. https://doi.org/10.1016/j.engappai.2019.06.021 doi: 10.1016/j.engappai.2019.06.021
![]() |
[7] |
Y. Yang, A time-specified nonsingular terminal sliding mode control approach for trajectory tracking of robotic airships, Nonlinear Dyn., 92 (2018), 1359–1367. https://doi.org/10.1007/s11071-018-4131-3 doi: 10.1007/s11071-018-4131-3
![]() |
[8] |
Y. Wu, Q. Wang, D. Duan, W. Xie, Y. Wei, Neuroadaptive output-feedback trajectory tracking control for a stratospheric airship with prescribed performance, Aeronaut. J., 124 (2020), 1568–1591. https://doi.org/10.1017/aer.2020.54 doi: 10.1017/aer.2020.54
![]() |
[9] |
J. Yuan, M. Zhu, X. Guo, W. Lou, Finite-time trajectory tracking control for a stratospheric airship with full-state constraint and disturbances, J. Franklin Inst., 358 (2021), 1499–1528. https://doi.org/10.1016/j.jfranklin.2020.12.010 doi: 10.1016/j.jfranklin.2020.12.010
![]() |
[10] |
A. Saeed, Y. Liu, M. Z. Shah, L. Wang, Z. Zuo, Q. Wang, Higher order sliding mode based lateral guidance and control of finless airship, Aerosp. Sci. Technol., 113 (2021), 106670. https://doi.org/10.1016/j.ast.2021.106670 doi: 10.1016/j.ast.2021.106670
![]() |
[11] |
D. Cui, C. K. Ahn, Y. Sun, Z. Xiang, Mode-dependent state observer-based prescribed performance control of switched systems, IEEE Trans. Circuits Syst. II: Express Briefs, 71 (2024), 3810–3814. https://doi.org/10.1109/TCSII.2024.3370865 doi: 10.1109/TCSII.2024.3370865
![]() |
[12] |
X. Wang, E. van Kampen, Q. Chu, P. Lu, Stability analysis for incremental nonlinear dynamic inversion control, J. Guid. Control Dyn., 42 (2019), 1116–1129. https://doi.org/10.2514/1.G003791 doi: 10.2514/1.G003791
![]() |
[13] | P. Acquatella, E. van Kampen, Q. P. Chu, Incremental backstepping for robust nonlinear flight control, in EuroGNC 2013, 2nd CEAS Specialist Conference on Guidance, Navigation & Control, (2013), 1444–1463. |
[14] |
W. Li, X. Han, Y. Zhi, B. Wang, L. Liu, H. Fan, Adaptive finite-time incremental backstepping fault-tolerant control for flying-wing aircraft with state constraints, Aerosp. Sci. Technol., 147 (2024), 108968. https://doi.org/10.1016/j.ast.2024.108968 doi: 10.1016/j.ast.2024.108968
![]() |
[15] |
Y. Wang, F. Yan, J. Chen, F. Ju, B. Chen, A new adaptive time-delay control scheme for cable-driven manipulators, IEEE Trans. Ind. Inf., 15 (2019), 3469–3481. https://doi.org/10.1109/TII.2018.2876605 doi: 10.1109/TII.2018.2876605
![]() |
[16] |
G. R. Cho, P. H. Chang, S. H. Park, M. Jin, Robust tracking under nonlinear friction using time-delay control with internal model, IEEE Trans. Control Syst. Technol., 17 (2009), 1406–1414. https://doi.org/10.1109/TCST.2008.2007650 doi: 10.1109/TCST.2008.2007650
![]() |
[17] |
D. K. Han, P. Chang, Robust tracking of robot manipulator with nonlinear friction using time delay control with gradient estimator, J. Mech. Sci. Technol., 24 (2010), 1743–1752. https://doi.org/10.1007/s12206-010-0516-z doi: 10.1007/s12206-010-0516-z
![]() |
[18] |
H. Bae, M. Jin, J. Suh, J. Y. Lee, P. Chang, D. Ahn, Control of robot manipulators using time-delay estimation and fuzzy logic systems, J. Electr. Eng. Technol., 12 (2017), 1271–1279. https://doi.org/10.5370/JEET.2017.12.3.1271 doi: 10.5370/JEET.2017.12.3.1271
![]() |
[19] |
M. Jin, S. H. Kang, P. H. Chang, Robust compliant motion control of robot with nonlinear friction using time-delay estimation, IEEE Trans. Ind. Electron., 55 (2008), 258–269. https://doi.org/10.1109/TIE.2007.906132 doi: 10.1109/TIE.2007.906132
![]() |
[20] |
J. Ma, S. Yu, W. Hu, H. Wu, X. Li, Y. Zheng, et al., Finite-time robust flight control of logistic unmanned aerial vehicles using a time-delay estimation technique, Drones, 8 (2024), 58. https://doi.org/10.3390/drones8020058 doi: 10.3390/drones8020058
![]() |
[21] |
D. K. Schmidt, J. Stevens, J. Roney, Near-space station-keeping performance of a large high-altitude notional airship, J. Aircr., 44 (2007), 611–615. https://doi.org/10.2514/1.24863 doi: 10.2514/1.24863
![]() |
[22] |
Z. Zheng, Y. Zou, Adaptive integral LOS path following for an unmanned airship with uncertainties based on robust RBFNN backstepping, ISA Trans., 65 (2016), 210–219. https://doi.org/10.1016/j.isatra.2016.09.008 doi: 10.1016/j.isatra.2016.09.008
![]() |
[23] |
T. Chen, M. Zhu, Z. Zheng, Asymmetric error-constrained path-following control of a stratospheric airship with disturbances and actuator saturation, Mech. Syst. Signal Process., 119 (2019), 501–522. https://doi.org/10.1016/j.ymssp.2018.10.003 doi: 10.1016/j.ymssp.2018.10.003
![]() |
[24] |
Y. C. Wang, W. S. Chen, S. X. Zhang, J. W. Zhu, L. J. Cao, Command-filtered incremental backstepping controller for small unmanned aerial vehicles, J. Guid. Control Dyn., 41 (2018), 954–967. https://doi.org/10.2514/1.G003001 doi: 10.2514/1.G003001
![]() |
[25] |
J. Pomet, L. Praly, Adaptive nonlinear regulation: Estimation from the lyapunov equation, IEEE Trans. Autom. Control, 37 (1992), 729–740. https://doi.org/10.1109/9.256328 doi: 10.1109/9.256328
![]() |
[26] |
M. Jin, J. Lee, P. H. Chang, C. Choi, Practical nonsingular terminal sliding-mode control of robot manipulators for high-accuracy tracking control, IEEE Trans. Ind. Electron., 56 (2009), 3593–3601. https://doi.org/10.1109/TIE.2009.2024097 doi: 10.1109/TIE.2009.2024097
![]() |
[27] |
Y. Wang, Z. Zhang, C. Li, M. Buss, Adaptive incremental sliding mode control for a robot manipulator, Mechatronics, 82 (2022), 102717. https://doi.org/10.1016/j.mechatronics.2021.102717 doi: 10.1016/j.mechatronics.2021.102717
![]() |
1. | Lufei Zhang, Yuan Ge, Zhihong Guan, Gang Ye, Haoyu Feng, 2024, Path planning of underwater vehicle based on improved particle swarm algorithm, 979-8-3503-8778-0, 5584, 10.1109/CCDC62350.2024.10587945 | |
2. | Hui Jiao, Xiaofeng Zhang, Tao Xu, Wenxing Fu, 2025, Chapter 28, 978-981-96-3567-2, 295, 10.1007/978-981-96-3568-9_28 | |
3. | Cheng Qian, Xiaoyan Zhang, Xiaoxi Liang, Haoyu Cheng, 2025, Chapter 29, 978-981-96-3551-1, 301, 10.1007/978-981-96-3552-8_29 | |
4. | Xinwei Wang, Zhilong Deng, Haixu Li, Lei Wang, Junhong Jin, Xichao Su, Safe Dispatch Corridor: Toward Efficient Trajectory Planning for Carrier Aircraft Traction System on Flight Deck, 2025, 61, 0018-9251, 1997, 10.1109/TAES.2024.3468292 |
State Variables | Starting Values | Terminal Values | Minimum Values | Maximum Values |
px/m | 0 | 25 | 0 | 30 |
py/m | 0 | 25 | 0 | 30 |
ψ/rad | π/4 | π/4 | −π | π |
vx(m/s) | 0 | 0 | −1 | 1 |
vy(m/s) | 0 | 0 | −1 | 1 |
rz(rad/s) | 0 | 0 | −π/6 | π/6 |
Control Variables | Tu | Tv | Tr |
[min, max] | [−53, 74]/N | [−53, 74]/N | [−53, 74]/(N∙m) |
Name | Value | Name | Value | Name | Value |
X˙u | -30 kg | Xu | -70 kg/s | X|u|⋅u | -100 kg/m |
Y˙v | -80 kg | Yv | -100 kg/s | Y|v|⋅v | -200 kg/m |
N˙r | -30 kg∙m2 | Nr | -50 kg∙m/s | N|r|⋅r | -100 kg/m |
Iz | 50 kg∙m2 | m | 185 kg |
Obstacle | Center coordinates | Length/m | Width/m |
1 | (5 m, 5 m) | 2 | 2 |
2 | (8 m, 13 m) | 8 | 4 |
3 | (18 m, 12 m) | 4 | 8 |
4 | (20 m, 20 m) | 2 | 2 |
State Variables | Starting Values | Terminal Values | Minimum Values | Maximum Values |
px/m | 0 | 25 | 0 | 30 |
py/m | 0 | 25 | 0 | 30 |
ψ/rad | π/4 | π/4 | −π | π |
vx(m/s) | 0 | 0 | −1 | 1 |
vy(m/s) | 0 | 0 | −1 | 1 |
rz(rad/s) | 0 | 0 | −π/6 | π/6 |
Control Variables | Tu | Tv | Tr |
[min, max] | [−53, 74]/N | [−53, 74]/N | [−53, 74]/(N∙m) |
Name | Value | Name | Value | Name | Value |
X˙u | -30 kg | Xu | -70 kg/s | X|u|⋅u | -100 kg/m |
Y˙v | -80 kg | Yv | -100 kg/s | Y|v|⋅v | -200 kg/m |
N˙r | -30 kg∙m2 | Nr | -50 kg∙m/s | N|r|⋅r | -100 kg/m |
Iz | 50 kg∙m2 | m | 185 kg |
Obstacle | Center coordinates | Length/m | Width/m |
1 | (5 m, 5 m) | 2 | 2 |
2 | (8 m, 13 m) | 8 | 4 |
3 | (18 m, 12 m) | 4 | 8 |
4 | (20 m, 20 m) | 2 | 2 |