[1] |
Anahtarci, B.; Kariksiz, CD; Saldi, N., Q-learning in regularized mean-field games, Dyn Games Appl (2022) · Zbl 1451.91014 · doi:10.1007/s13235-022-00450-2 |
[2] |
Brown, PN; Seaton, JH; Marden, JR, Robust networked multiagent optimization: designing agents to repair their own utility functions, Dyn Games Appl (2022) · Zbl 1519.91044 · doi:10.1007/s13235-022-00469-5 |
[3] |
Ferguson, BL; Marden, JR, Robust utility design in distributed resource allocation problems with defective agents, Dyn Games Appl (2022) · Zbl 1519.91113 · doi:10.1007/s13235-022-00470-y |
[4] |
Graham T, Kleshnina M, Filar JA (2022) Where do mistakes lead? A survey of games with incompe47 tent players. Dyn Games Appl. doi:10.1007/s13235-022-00425-3 |
[5] |
Jiang, H.; Mazalov, VV; Gao, H., Opinion dynamics control in a social network with a communication structure, Dyn Games Appl (2021) · Zbl 1519.91198 · doi:10.1007/s13235-021-00406-y |
[6] |
Mao, W.; Başar, T., Provably efficient reinforcement learning in decentralized general-sum Markov games, Dyn Games Appl (2022) · Zbl 1519.91029 · doi:10.1007/s13235-021-00420-0 |
[7] |
Newton, CJ; Ganesh, A.; Reeve, HWJ, Asymptotic optimality for decentralised bandits, Dyn Games Appl (2022) · Zbl 1516.91031 · doi:10.1007/s13235-022-00451-1 |
[8] |
Phade SR Anantharam V (2021) Learning in games with cumulative prospect theoretic preferences. Dyn Games Appl. doi:10.1007/s13235-021-00398-9 · Zbl 1516.91017 |
[9] |
Ramirez, S.; van Brandenburg, LH; Bauso, D., Coordinated replenishment game and learning under time dependency and uncertainty of the parameters, Dyn Games Appl (2022) · Zbl 1525.91051 · doi:10.1007/s13235-022-00441-3 |
[10] |
Sorin, S., Continuous time learning algorithms in optimization and game theory, Dyn Games Appl (2022) · Zbl 1519.91027 · doi:10.1007/s13235-021-00423-x |
[11] |
Subramanian J (2021) Robustness and sample complexity of model-based MARL for general-sum Markov games. doi:10.1007/s13235-023-00490-2 · Zbl 1519.91030 |
[12] |
Tang, D.; Tavafoghi, H.; Subramanian, V., Dynamic games among teams with delayed intra-team information sharing, Dyn Games Appl (2022) · Zbl 1519.91031 · doi:10.1007/s13235-022-00424-4 |
[13] |
Trivedi, P.; Hemachandra, N., Multi-agent natural actor-critic reinforcement learning algorithms, Dyn Games Appl (2022) · Zbl 1519.91063 · doi:10.1007/s13235-022-00449-9 |
[14] |
uz Zaman, MA; Miehling, E.; Başar, T., Reinforcement learning for non-stationary discrete-time linear-quadratic mean-field games in multiple populations, Dyn Games Appl (2022) · Zbl 1519.91036 · doi:10.1007/s13235-022-00448-w |