×

Learning-based collision avoidance and robust \(H_\infty\). (English) Zbl 1533.93160

Summary: This paper investigates the robust \(H_{\infty}\) optimal formation control for the position subsystem of quadrotor unmanned aerial vehicles (UAVs) subject to external disturbances and collision constraints. To prevent collision with both members of the formation and external obstacles, a collision avoidance potential function is constructed using relative position and velocity information. The basic bounded control input can ensure the stable flight and collision avoidance of a quadrotor UAV system. Based on the approximate dynamic programming (ADP) framework and two-player zero-sum differential game theory, the \(H_\infty\) optimal controller is designed to further enhance the control performance of the system. The optimal value function is approximated by a single layer neural network, which avoids solving complex nonlinear Hamilton-Jacobi-Isaac (HJI) equation. The stability of the closed loop system is proved. The effectiveness of the robust \(H_\infty\) optimal formation controller based on learning is validated through MATLAB simulation in two distinct scenarios.
© 2023 John Wiley & Sons Ltd.

MSC:

93B36 \(H^\infty\)-control
93C85 Automated systems (robots, etc.) in control theory
49N70 Differential games and control
91A10 Noncooperative games
91A05 2-person games

Software:

Matlab
Full Text: DOI

References:

[1] PakroohR, BohlooliA. A survey on unmanned aerial vehicles assisted internet of things: a service oriented classification. Wirel Personal Commun. 2021;119(2):1541‐1574.
[2] SaadiAA, SoukaneA, MeraihiY, GabisAB. UAV path planning using optimization approaches: a survey. Arch Comput Methods Eng. 2022;29(6):4233‐4284.
[3] GiordanD, AdamsMS, AicardiI, et al. The use of unmanned aerial vehicles (UAVs) for engineering geology applications. Bullet Eng Geol Environ. 2020;79(7):3437‐3481.
[4] TangJ, DuanHB, LaoSY. Swarm intelligence algorithms for multiple unmanned aerial vehicles collaboration: a comprehensive review. Artif Intell Rev. 2022;56(5):4295‐4327.
[5] AlsuhliG, FahimA, GadallahY. A survey on the role of UAVs in the communication process: a technological perspective. Comput Commun. 2022;194:86‐123.
[6] ChenYJ, LiangJC, MiaoZQ, WangYN. Distributed formation control of quadrotor UAVs based on rotation matrices without linear velocity feedback. Int J Control Autom Syst. 2021;19(10):3464‐3474.
[7] AnBH, WangB. Fully distributed prescribed performance formation control for UAVs with unknown maneuver of leader. Aerosp Sci Technol. 2022;130:107886.
[8] LiX, QiGY, ZhangLM. Time‐varying formation dynamics modeling and constrained trajectory optimization of multi‐quadrotor UAVs. Nonlinear Dyn. 2021;106(4):3265‐3284.
[9] YangM, BiWH, ZhangA, GaoF. A distributed task reassignment method in dynamic environment for multi‐UAV system. Appl Intell. 2022;52(5):1582‐1601.
[10] WangJ, WuYX, ChenYQ, JuS. Multi‐UAVs collaborative tracking of moving target with maximized visibility in urban environment. J Franklin Inst. 2022;359(11):5522‐5532. · Zbl 1492.93058
[11] YuZY, YuSZ, JiangHJ, HuC. Distributed consensus for multi‐agent systems via adaptive sliding mode control. Int J Robust Nonlinear Control. 2021;31(15):7125‐7151. · Zbl 1527.93416
[12] JiaQX, GuiYL, WuYH, ZhangCX. Disturbance observer‐based performance guaranteed fault‐tolerant control for multi‐spacecraft formation reconfiguration with collision avoidance. Aerosp Sci Technol. 2023;133:108099.
[13] SunXJ, WangGF, FanYS, MuDD. Collision avoidance control for unmanned surface vehicle with COLREGs compliance. Ocean Eng. 2023;267:113263.
[14] LiuW, QiuK, YangXF, et al. COLREGS‐based collision avoidance algorithm for unmanned surface vehicles using modified artificial potential fields. Phys Commun. 2023;57:101980.
[15] XuGT, LongT, WangZ, SunJL. Trust‐region filtered sequential convex programming for multi‐UAV trajectory planning and collision avoidance. ISA Trans. 2022;128:664‐676.
[16] ChenL, DuanHB. Collision‐free formation‐containment control for a group of UAVs with unknown disturbances. Aerosp Sci Technol. 2022;126:107618.
[17] DoKD. Bounded assignment formation control of second‐order dynamic agents. IEEE/Asme Trans Mechatron. 2014;19(2):477‐489.
[18] DoKD. Bounded and inverse optimal formation stabilization of second‐order agents. Automatica. 2021;123:109367. · Zbl 1461.93415
[19] GuoYH, ChenG, ZhaoT. Learning‐based collision‐free coordination for a team of uncertain quadrotor UAVs. Aerosp Sci Technol. 2021;119:107127.
[20] GuoYH, ChenG. Robust near‐optimal coordination in uncertain multiagent networks with motion constraints. IEEE Trans Cybern. 2021;53(5):2841‐2851.
[21] MohammadzamaniF, HashemiM, ShahgholianG. Adaptive control of nonlinear time‐delay systems in the presence of output constraints and actuators faults. Int J Control. 2023;96(3):541‐553. · Zbl 1519.93128
[22] MohammadzamaniF, HashemiM, ShahgholianG. Adaptive neural control of non‐linear fractional order multi‐agent systems in the presence of error constraints and input saturation. IET Control Theory Appl. 2022;16(13):1283‐1298.
[23] ShaoXL, LiuJ, CaoHL, et al. Robust dynamic surface trajectory tracking control for a quadrotor UAV via extended state observer. Int J Robust Nonlinear Control.2018;28(7):2700‐2719. · Zbl 1391.93156
[24] JanbaziV, HashemiM. Design of disturbance observer based on adaptive‐neural control for large‐scale time‐delay systems in the presence of actuator fault and unknown dead zone. Int J Adapt Control Signal Process. 2021;35(2):285‐309. · Zbl 07839199
[25] RekabiF, ShiraziFA, SadighMJ, SaadatM. Nonlinear
[( {H}_{\infty } \]\) measurement feedback control algorithm for quadrotor position tracking. J Franklin Inst. 2020;357(11):6777‐6804. · Zbl 1447.93234
[26] JiangHY, ZhouB, LiuGP. H_∞ optimal control of unknown linear systems by adaptive dynamic programming with applications to time‐delay systems. Int J Robust Nonlinear Control. 2021;31(12):5602‐5617. · Zbl 1525.93065
[27] WangZC, TianXH, MaiQQ, LiuHT. Fixed‐time composite robust
[( {H}_{\infty } \]\) tracking control of marine surface vessels based on the barrier Lyapunov function and an event‐triggered strategy. Ocean Eng. 2022;261:112113.
[28] VamvoudakisKG, LewisFL. Online solution of nonlinear two‐player zero‐sum games using synchronous policy iteration. Int J Robust Nonlinear Control. 2012;22(13):1460‐1483. · Zbl 1292.91011
[29] YangYL, MazouchiM, ModaresH. Hamiltonian‐driven adaptive dynamic programming for mixed
[( {H}_2/{H}_{\infty } \]\) performance using sum‐of‐squares. Int J Robust Nonlinear Control. 2021;31(6):1941‐1963. · Zbl 1526.93048
[30] QinCB, ZhangHG, WangYC, LuoYH. Neural network‐based online
[( {H}_{\infty } \]\) control for discrete‐time affine nonlinear system using adaptive dynamic programming. Neurocomputing. 2016;198:91‐99.
[31] ModaresH, LewisFL, JiangZP. H_∞ tracking control of completely unknown continuous‐time systems via off‐policy reinforcement learning. IEEE Trans Neural Netw Learn Syst. 2015;26(10):2550‐2562.
[32] WangD, LiuDR, MuCX, ZhangY. Neural network learning and robust stabilization of nonlinear systems with dynamic uncertainties. IEEE Trans Neural Netw Learn Syst. 2018;29(4):1342‐1351.
[33] LvYF, RenXM. Approximate Nash solutions for multiplayer mixed‐zero‐sum game with reinforcement learning. IEEE Trans Syst Man Cybern Syst. 2019;49(12):2739‐2750.
[34] MingZY, ZhangHG, LiangYL, SuHG. Nonzero‐sum differential games of continuous‐time nonlinear systems with uniformly ultimately
[( \varepsilon \]\)‐bounded by adaptive dynamic programming. Appl Math Comput. 2022;430:127248. · Zbl 1510.91050
[35] ZhangHG, QinCB, JiangB, LuoYH. Online adaptive policy learning algorithm for
[( {H}_{\infty } \]\) state feedback control of unknown affine nonlinear discrete‐time systems. IEEE Trans Cybern. 2014;44(12):2706‐2718.
[36] ZhongXN, HeHB, WangD, NiZ. Model‐free adaptive control for unknown nonlinear zero‐sum differential game. IEEE Trans Cybern. 2018;48(5):1633‐1646.
[37] WangHB, LiN, WangYL, SuB. Backstepping sliding mode trajectory tracking via extended state observer for quadrotors with wind disturbance. Int J Control Autom Syst. 2021;19(10):3273‐3284.
[38] LvYF, TianJY. Approximate‐dynamic‐programming
[( {H}_{\infty } \]\) controls for multi‐input nonlinear system. Control Theory Appl. 2021;38(10):1662‐1670.
[39] DouLQ, CaiSY. Event‐triggered‐based adaptive dynamic programming for distributed formation control of multi‐UAV. J Franklin Inst. 2022;359(8):3671‐3691. · Zbl 1489.93070
[40] LeeHI, ShinHS, TsourdosA. Concurrent learning adaptive control with directional forgetting. IEEE Trans Automat Contr. 2019;64(12):5164‐5170. · Zbl 1482.93312
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.