×

Nonlinear discrete time optimal control based on fuzzy models. (English) Zbl 1352.49023

Summary: The approach of designing a discrete time optimal controller for a nonlinear system represented by a fuzzy model is presented in this paper. A fuzzy model with product inference engine, singleton fuzzifier, center average defuzzifier, and Gaussian membership functions is trained by the orthogonal least square (OLS) learning algorithm based on given input-output data pairs. An optimal control scheme is then formulated based on the fuzzy model. The numerical solution of the problem is achieved by use of a feasible-direction algorithm. To show the effectiveness of the proposed method, the simulation results of three nonlinear optimal control problems are presented. The results show that the performance of the proposed approach is quite similar to that of optimal control of the system represented by an explicit mathematical model, thus demonstrating the efficacy of the proposed scheme for optimal control of unknown nonlinear systems.

MSC:

49K21 Optimality conditions for problems involving relations other than differential equations
93C42 Fuzzy control/observation systems
Full Text: DOI

References:

[1] Kirk, Optimal Control Theory: An Introduction (2012)
[2] Ross, A review of pseudospectral optimal control: From theory to flight, Annual Reviews in Control 36 (2) pp 182– (2012) · doi:10.1016/j.arcontrol.2012.09.002
[3] Kappen, Optimal control as a graphical model inference problem, Machine Learning 87 (2) pp 159– (2012) · Zbl 1243.93133 · doi:10.1007/s10994-012-5278-7
[4] Lee, Fuzzy logic in control systems: Fuzzy logic controller. II, IEEE Transactions on Systems, Man and Cybernetics 20 (2) pp 419– (1990) · Zbl 0707.93037 · doi:10.1109/21.52552
[5] Kosko, Fuzzy systems as universal approximators, IEEE Transactions on Computers 43 (11) pp 1329– (1994) · Zbl 1057.68664 · doi:10.1109/12.324566
[6] Wang, Back-propagation fuzzy system as nonlinear dynamic system identifiers, IEEE International Conference on Fuzzy Systems pp 1409– (1992) · doi:10.1109/FUZZY.1992.258711
[7] Wang, A Course in Fuzzy Systems (1999)
[8] Chiu, Fuzzy model identification based on cluster estimation, Journal of Intelligent and Fuzzy Systems 2 (3) pp 267– (1994)
[9] Wang, Fuzzy basis functions, universal approximation, and orthogonal least-squares learning, IEEE Transactions on Neural Networks 3 (5) pp 807– (1992) · doi:10.1109/72.159070
[10] Lee, Construction of fuzzy systems using least-squares method and genetic algorithm, Fuzzy Sets and Systems 137 (3) pp 297– (2003) · Zbl 1035.93043 · doi:10.1016/S0165-0114(02)00344-5
[11] Chen, Orthogonal least squares learning algorithm for radial basis function networks, IEEE Transactions on Neural Networks 2 (2) pp 302– (1991) · doi:10.1109/72.80341
[12] Sussmann, 300 years of optimal control: From the brachystochrone to the maximum principle, IEEE Control Systems 17 (3) pp 32– (1997) · doi:10.1109/37.588098
[13] Pontryagin, The Mathematical Theory of Optimal Processes (1962) · Zbl 0112.05502
[14] Al-Tamimi, Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 38 (4) pp 943– (2008) · doi:10.1109/TSMCB.2008.926614
[15] Lewis, Reinforcement learning and adaptive dynamic programming for feedback control, IEEE Circuits and Systems Magazine 9 (3) pp 32– (2009) · doi:10.1109/MCAS.2009.933854
[16] Wang, Adaptive dynamic programming: An introduction, IEEE Computational Intelligence Magazine 4 (2) pp 39– (2009) · doi:10.1109/MCI.2009.932261
[17] Chen, Generalized Hamilton–Jacobi–Bellman formulation-based neural network control of affine nonlinear discrete-time systems, IEEE Transactions on Neural Networks 19 (1) pp 90– (2008) · doi:10.1109/TNN.2007.900227
[18] Al-Tamimi, Model-free Q-learning designs for linear discrete-time zero-sum gameswith application to H-infinity control, Automatica 43 (3) pp 473– (2007) · Zbl 1137.93321 · doi:10.1016/j.automatica.2006.09.019
[19] Dierks, Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence, Neural Networks 22 (5) pp 851– (2009) · Zbl 1338.49074 · doi:10.1016/j.neunet.2009.06.014
[20] Dierks, Online optimal control of nonlinear discrete-time systems using approximate dynamic programming, Journal of Control Theory and Applications 9 (3) pp 361– (2011) · doi:10.1007/s11768-011-0178-0
[21] Wei, Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions, Neurocomputing 72 (7) pp 1839– (2009) · doi:10.1016/j.neucom.2008.05.012
[22] Heydari, Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics, IEEE Transactions on Neural Networks and Learning Systems 24 (1) pp 145– (2013) · doi:10.1109/TNNLS.2012.2227339
[23] Jiang, Computational adaptive optimal control for continuous-time linear systems withcompletely unknown dynamics, Automatica 48 (10) pp 2699– (2012) · Zbl 1271.93088 · doi:10.1016/j.automatica.2012.06.096
[24] Kotsialos, Optimal coordinated and integrated motorway network traffic control, 14th International Symposium on Transportation and TrafficTheory pp 621– (1999)
[25] Carlson, Optimal motorway traffic flow control involving variable speed limits and ramp metering, Transportation Science 44 (2) pp 238– (2010) · doi:10.1287/trsc.1090.0314
[26] Lee, Fuzzy logic in control systems: Fuzzy logic controller. I, IEEE Transactions on Systems, Man and Cybernetics 20 (2) pp 404– (1990) · Zbl 0707.93036 · doi:10.1109/21.52551
[27] Bertsekas, Nonlinear Programming (1999)
[28] Sun, Optimization Theory and Methods: Nonlinear Programming (2006)
[29] Magni, Robust model predictive control for nonlinear discrete time systems, International Journal of Robust and Nonlinear Control 13 (3-4) pp 229– (2003) · Zbl 1049.93030 · doi:10.1002/rnc.815
[30] Jaddu, Direct solution of nonlinear optimal control problems using quasilinearization and Chebyshevpolynomials, Journal of the Franklin Institute 339 (4) pp 479– (2002) · Zbl 1010.93507 · doi:10.1016/S0016-0032(02)00028-5
[31] Hicks, Approximation methods for optimal control synthesis, The Canadian Journal of Chemical Engineering 49 (4) pp 522– (1971) · doi:10.1002/cjce.5450490416
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.