×

Chaos in learning a simple two-person game. (English) Zbl 1015.91014

Summary: We investigate the problem of learning to play the game of rock-paper-scissors. Each player attempts to improve her/his average score by adjusting the frequency of the three possible responses, using reinforcement learning. For the zero sum game the learning process displays Hamiltonian chaos. Thus, the learning trajectory can be simple or complex, depending on initial conditions. We also investigate the non-zero sum case and show that it can give rise to chaotic transients. This is, to our knowledge, the first demonstration of Hamiltonian chaos in learning a basic two-person game, extending earlier findings of chaotic attractors in dissipative systems. As we argue here, chaos provides an important self-consistency condition for determining when players will learn to behave as though they were fully rational. That chaos can occur in learning a simple game indicates one should use caution in assuming real people will learn to play a game according to a Nash equilibrium strategy.

MSC:

91A26 Rationality and learning in game theory
91A05 2-person games
37N40 Dynamical systems in optimization and economics

References:

[1] ANN MATH STUDIES 5 pp 1– (1964)
[2] PHYSICA D147 pp 221– (2000)
[3] Foster, PNAS 98 (22) pp 12848– (2001) · doi:10.1073/pnas.211534898
[4] PNAS 90 pp 5091– (1992)
[5] Hofbauer, Journal of mathematical biology 34 (5-6) pp 675– (1996) · Zbl 0845.92016 · doi:10.1007/BF02409754
[6] PHYS LETT A 150 pp 262– (1990) · doi:10.1016/0375-9601(90)90092-3
[7] 77 pp 1– (1997) · Zbl 0892.90198 · doi:10.1006/jeth.1997.2319
[8] Biological cybernetics 40 pp 1– (1981) · Zbl 0465.92016 · doi:10.1007/BF00326675
[9] PROG THEOR PHYS 94 pp 163– (1995) · doi:10.1143/PTP.94.163
[10] Farmer, Physical Review Letters 59 (8) pp 845– (1987) · doi:10.1103/PhysRevLett.59.845
[11] 16 pp 76– (1979) · Zbl 0398.90120 · doi:10.2307/3213376
[12] Mathematical biosciences 40 pp 145– (1978) · Zbl 0395.90118 · doi:10.1016/0025-5564(78)90077-9
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.