×

Event-triggered robust control for multi-player nonzero-sum games with input constraints and mismatched uncertainties. (English) Zbl 1532.93264

Summary: In this article, an event-triggered robust control (ETRC) method is investigated for multi-player nonzero-sum games of continuous-time input constrained nonlinear systems with mismatched uncertainties. By constructing an auxiliary system and designing an appropriate value function, the robust control problem of input constrained nonlinear systems is transformed into an optimal regulation problem. Then, a critic neural network (NN) is adopted to approximate the value function of each player for solving the event-triggered coupled Hamilton-Jacobi equation and obtaining control laws. Based on a designed event-triggering condition, control laws are updated when events occur only. Thus, both computational burden and communication bandwidth are reduced. We prove that the weight approximation errors of critic NNs and the closed-loop uncertain multi-player system states are all uniformly ultimately bounded thanks to the Lyapunov’s direct method. Finally, two examples are provided to demonstrate the effectiveness of the developed ETRC method.
{© 2022 John Wiley & Sons Ltd.}

MSC:

93C65 Discrete event control/observation systems
93B35 Sensitivity (robustness)
91A06 \(n\)-person games, \(n>2\)
93C40 Adaptive control/observation systems
49L20 Dynamic programming in optimal control and differential games
Full Text: DOI

References:

[1] JiangY, JiangZ. Robust adaptive dynamic programming and feedback stabilization of nonlinear systems. IEEE Trans Neural Netw Learn Syst. 2014;25(5):882‐893.
[2] LinF, BrandtRD, SunJ. Robust control of nonlinear systems: compensating for uncertainty. Int J Control. 1992;56(6):1453‐1459. · Zbl 0771.93018
[3] WangD, LiC, LiuD, MuC. Data‐based robust optimal control of continuous‐time affine nonlinear systems with matched uncertainties. Inf Sci. 2016;366:121‐133. · Zbl 1430.49030
[4] LiuD, XueS, ZhaoB, LuoB, WeiQ. Adaptive dynamic programming for control: a survey and recent advances. IEEE Trans Syst Man Cybern. 2021;51(1):142‐160.
[5] NaJ, LiG, WangB, HerrmannG, ZhanS. Robust optimal control of wave energy converters based on adaptive dynamic programming. IEEE Trans Sustain Energy. 2019;10(2):961‐970.
[6] WangD, HeH, LiuD. Adaptive critic nonlinear robust control: a survey. IEEE Trans Cybern. 2017;47(10):3429‐3451.
[7] ZhaoB, LuoF, LinH, LiuD. Particle swarm optimized neural networks based local tracking control scheme of unknown nonlinear interconnected systems. Neural Netw. 2021;134:54‐63.
[8] YangY, ZhuH, ZhangQ, ZhaoB, LiZ, WunschDC. Sparse online kernelized actor‐critic learning in reproducing kernel Hilbert space. Artif Intell Rev. 2022;55:23‐58.
[9] WangJ, YangC, XiaJ, WuZG, ShenH. Observer‐based sliding mode control for networked fuzzy singularly perturbed systems under weighted try‐once‐discard protocol. IEEE Trans Fuzzy Syst. 2022;30(6):1889‐1899.
[10] WangJ, XiaJ, ShenH, XingM, ParkJH. H∞ synchronization for fuzzy markov jump chaotic systems with piecewise‐constant transition probabilities subject to PDT switching rule. IEEE Trans Fuzzy Syst. 2021;29(10):3082‐3092.
[11] BertsekasDP. Dynamic Programming and Optimal Control. Athena Scientific; 2000.
[12] LiuD, WeiQ, WangD, YangX, LiH. Adaptive Dynamic Programming with Applications in Optimal Control. Springer; 2017. · Zbl 1390.93003
[13] LiuD, XuY, WeiQ, LiuX. Residential energy scheduling for variable weather solar energy based on adaptive dynamic programming. IEEE/CAA J Automat Sin. 2018;5(1):36‐46.
[14] WeiQ, LiuD, LiuY, SongR. Optimal constrained self‐learning battery sequential management in microgrid via adaptive dynamic programming. IEEE/CAA J Automat Sin. 2017;4(2):168‐176.
[15] ZhuY, ZhaoD. Comprehensive comparison of online ADP algorithms for continuous‐time optimal control. Artif Intell Rev. 2018;49(4):531‐547.
[16] ZhaoB, LiuD, LuoC. Reinforcement learning‐based optimal stabilization for unknown nonlinear systems subject to inputs with uncertain constraints. IEEE Trans Neural Netw Learn Syst. 2020;31(10):4330‐4340.
[17] YangY, KiumarsiB, ModaresH, XuC. Model‐free
[( \lambda \]\)‐policy iteration for discrete‐time linear quadratic regulation. IEEE Trans Neural Netw Learn Syst. 2021. doi:10.1109/TNNLS.2021.3098985
[18] WangF, ZhangH, LiuD. Adaptive dynamic programming: an introduction. IEEE Comput Intell Mag. 2009;4(2):39‐47.
[19] ZhaoB, LiuD, AlippiC. Sliding‐mode surface‐based approximate optimal control for uncertain nonlinear systems with asymptotically stable critic structure. IEEE Trans Cybern. 2021;51(6):2858‐2869.
[20] LiuD, WeiQ. Policy iteration adaptive dynamic programming algorithm for discrete‐time nonlinear systems. IEEE Trans Neural Netw Learn Syst. 2014;25(3):621‐634.
[21] LiuD, WeiQ. Finite‐approximation‐error‐based optimal control approach for discrete‐time nonlinear systems. IEEE Trans Cybern. 2013;43(2):779‐789.
[22] ModaresH, LewisFL, JiangZ. H∞ tracking control of completely unknown continuous‐time systems via off‐policy reinforcement learning. IEEE Trans Neural Netw Learn Syst. 2015;26(10):2550‐2562.
[23] MuC, SunC, WangD, SongA. Adaptive tracking control for a class of continuous‐time uncertain nonlinear systems using the approximate solution of HJB equation. Neurocomputing. 2017;260:432‐442.
[24] ZhaoB, LiuD, YangX, LiY. Observer‐critic structure‐based adaptive dynamic programming for decentralised tracking control of unknown large‐scale nonlinear systems. Int J Syst Sci. 2017;48(9):1978‐1989. · Zbl 1371.93032
[25] WangD. Intelligent critic control with robustness guarantee of disturbed nonlinear plants. IEEE Trans Cybern. 2020;50(6):2740‐2748.
[26] ZhangH, CuiL, ZhangX, LuoY. Data‐driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method. IEEE Trans Neural Netw. 2011;22(12):2226‐2236.
[27] LiuD, YangX, WangD, WeiQ. Reinforcement‐learning‐based robust controller design for continuous‐time uncertain nonlinear systems subject to input constraints. IEEE Trans Cybern. 2015;45(7):1372‐1385.
[28] LiuD, WangD, LiH. Decentralized stabilization for a class of continuous‐time nonlinear interconnected systems using online learning optimal control approach. IEEE Trans Neural Netw Learn Syst. 2014;25(2):418‐428.
[29] ZhaoB, WangD, ShiG, LiuD, LiY. Decentralized control for large‐scale nonlinear systems with unknown mismatched interconnections via policy iteration. IEEE Trans Syst Man Cybern Syst. 2018;48(10):1725‐1735.
[30] WangD, LiuD, LiH. Policy iteration algorithm for online design of robust control for a class of continuous‐time nonlinear systems. IEEE Trans Automat Sci Eng. 2014;11(2):627‐632.
[31] WangD, LiuD, ZhangY, LiH. Neural network robust tracking control with adaptive critic framework for uncertain nonlinear systems. Neural Netw. 2018;97:11‐18. · Zbl 1441.93066
[32] LinH, ZhaoB, LiuD, AlippiC. Data‐based fault tolerant control for affine nonlinear systems through particle swarm optimized neural networks. IEEE/CAA J Automat Sin. 2020;7(4):954‐964.
[33] LiuX, ZhaoB, LiuD. Fault tolerant tracking control for nonlinear systems with actuator failures through particle swarm optimization‐based adaptive dynamic programming. Appl Soft Comput. 2020;97:106766.
[34] MuC, WangK. Approximate‐optimal control algorithm for constrained zero‐sum differential games through event‐triggering mechanism. Nonlinear Dyn. 2019;95:2639‐2657. · Zbl 1437.91080
[35] ZhaoB, LiuD. Event‐triggered decentralized tracking control of modular reconfigurable robots through adaptive dynamic programming. IEEE Trans Ind Electron. 2020;67(4):3054‐3064.
[36] ZhangQ, ZhaoD, ZhuY. Event‐triggered
[( {H}_{\infty } \]\) control for continuous‐time nonlinear system via concurrent learning. IEEE Trans Syst Man Cybern Syst. 2017;47(7):1071‐1081.
[37] XueS, LuoB, LiuD. Event‐triggered adaptive dynamic programming for zero‐sum game of partially unknown continuous‐time nonlinear systems. IEEE Trans Syst Man Cybern. 2020;50(9):3189‐3199.
[38] YangX, HeH. Event‐triggered robust stabilization of nonlinear input‐constrained systems using single network adaptive critic designs. IEEE Trans Syst Man Cybern Syst. 2020;50(9):3145‐3157.
[39] YangX, HeH. Adaptive critic designs for event‐triggered robust control of nonlinear systems with unknown dynamics. IEEE Trans Cybern. 2019;49(6):2255‐2267.
[40] WangD, MuC, YangX, LiuD. Event‐based constrained robust control of affine systems incorporating an adaptive critic mechanism. IEEE Trans Syst Man Cybern Syst. 2017;47(7):1602‐1612.
[41] YangD, LiT, ZhangH, XieX. Event‐trigger‐based robust control for nonlinear constrained‐input systems using reinforcement learning method. Neurocomputing. 2019;340:158‐170.
[42] ZhangQ, ZhaoD, WangD. Event‐based robust control for uncertain nonlinear systems using adaptive dynamic programming. IEEE Trans Neural Netw Learn Syst. 2018;29(1):37‐50.
[43] JiangH, ZhangH. Iterative ADP learning algorithms for discrete‐time multi‐player games. Artif Intell Rev. 2018;50(1):75‐91.
[44] ZhangH, SuH, ZhangK, LuoY. Event‐triggered adaptive dynamic programming for non‐zero‐sum games of unknown nonlinear systems via generalized fuzzy hyperbolic models. IEEE Trans Fuzzy Syst. 2019;27(11):2202‐2214.
[45] YeM. Distributed Nash equilibrium seeking for games in systems with bounded control inputs. IEEE Trans Automat Contr. 2020;66(8):3833‐3839. · Zbl 1471.93129
[46] YeM, LiD, HanQL, DingL. Distributed Nash equilibrium seeking for general networked games with bounded disturbances. IEEE/CAA J Automat Sin. 2022. doi:10.1109/JAS.2022.105428
[47] LinF. An optimal control approach to robust control design. Int J Control. 2000;73(3):177‐186. · Zbl 1004.93040
[48] YangY, GaoW, ModaresH, XuCZ. Robust actor‐critic learning for continuous‐time nonlinear systems with unmodeled dynamics. IEEE Trans Fuzzy Syst. 2022;30(6):2101‐2112.
[49] ZhuY, ZhaoD, HeH, JiJ. Event‐triggered optimal control for partially unknown constrained‐input systems via adaptive dynamic programming. IEEE Trans Ind Electron. 2017;64(5):4101‐4109.
[50] YangX, HeH. Event‐driven
[( {H}_{\infty } \]\) constrained control using adaptive critic learning. IEEE Trans Cybern. 2021;51(10):4860‐4872.
[51] WangD, MuC, LiuD, MaH. On mixed data and event driven design for adaptive‐critic‐based nonlinear
[( {H}_{\infty } \]\) control. IEEE Trans Neural Netw Learn Syst. 2018;29(4):993‐1005.
[52] ZhangY, ZhaoB, LiuD, ZhangS. Event‐triggered optimal tracking control of multi‐player unknown nonlinear systems via adaptive critic designs. Int J Robust Nonlinear Control. 2022;32(1):29‐51. · Zbl 1527.93306
[53] ZhangY, ZhaoB, LiuD, ZhangS. Event‐triggered control of discrete‐time zero‐sum games via deterministic policy gradient adaptive dynamic programming. IEEE Trans Syst Man Cybern Syst. 2022;52(8):4823‐4835.
[54] LinM, ZhaoB, LiuD. Policy gradient adaptive critic designs for model‐free optimal tracking control with experience replay. IEEE Trans Syst Man Cybern Syst. 2022;55(6):3692‐3703.
[55] ZhaoB, ZhangY, LiuD. Adaptive dynamic programming‐based cooperative motion/force control for modular reconfigurable manipulators: a joint task assignment approach. IEEE Trans Neural Netw Learn Syst. 2022. doi:10.1109/TNNLS.2022.3171828
[56] BorgersDP, HeemelsWPMH. Event‐separation properties of event‐triggered control systems. IEEE Trans Automat Contr. 2014;59(10):2644‐2656. · Zbl 1360.93379
[57] HuA, ParkJH, HuM. Consensus of nonlinear multiagent systems with intermittent dynamic event‐triggered protocols. Nonlinear Dyn. 2021;104(2):1299‐1313.
[58] LewisFL, JagannathanS, YesildirekA. Neural Network Control of Robot Manipulators and Nonlinear Systems. Taylor & Francis; 1999.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.