×

Real-time sensory-motor integration of hippocampal place cell replay and prefrontal sequence learning in simulated and physical rat robots for novel path optimization. (English) Zbl 1448.92013

Summary: An open problem in the cognitive dimensions of navigation concerns how previous exploratory experience is reorganized in order to allow the creation of novel efficient navigation trajectories. This behavior is revealed in the “traveling salesrat problem” (TSP) when rats discover the shortest path linking baited food wells after a few exploratory traversals. We have recently published a model of navigation sequence learning, where sharp wave ripple replay of hippocampal place cells transmit “snippets” of the recent trajectories that the animal has explored to the prefrontal cortex (PFC) [the first author et al., “Reservoir computing model of prefrontal cortex creates novel combinations of previous navigation sequences from hippocampal place-cell replay with spatial reward propagation”, PLOS Comput. Biol. 15, No. 7, Article ID e1006624, 32 p. (2019; doi:10.1371/journal.pcbi.1006624)]. PFC is modeled as a recurrent reservoir network that is able to assemble these snippets into the efficient sequence (trajectory of spatial locations coded by place cell activation). The model of hippocampal replay generates a distribution of snippets as a function of their proximity to a reward, thus implementing a form of spatial credit assignment that solves the TSP task. The integrative PFC reservoir reconstructs the efficient TSP sequence based on exposure to this distribution of snippets that favors paths that are most proximal to rewards. While this demonstrates the theoretical feasibility of the PFC-HIPP interaction, the integration of such a dynamic system into a real-time sensory-motor system remains a challenge. In the current research, we test the hypothesis that the PFC reservoir model can operate in a real-time sensory-motor loop. Thus, the main goal of the paper is to validate the model in simulated and real robot scenarios. Place cell activation encoding the current position of the simulated and physical rat robot feeds the PFC reservoir which generates the successor place cell activation that represents the next step in the reproduced sequence in the readout. This is input to the robot, which advances to the coded location and then generates de novo the current place cell activation. This allows demonstration of the crucial role of embodiment. If the spatial code readout from PFC is played back directly into PFC, error can accumulate, and the system can diverge from desired trajectories. This required a spatial filter to decode the PFC code to a location and then recode a new place cell code for that location. In the robot, the place cell vector output of PFC is used to physically displace the robot and then generate a new place cell coded input to the PFC, replacing part of the software recoding procedure that was required otherwise. We demonstrate how this integrated sensory-motor system can learn simple navigation sequences and then, importantly, how it can synthesize novel efficient sequences based on prior experience, as previously demonstrated [loc. cit.]. This contributes to the understanding of hippocampal replay in novel navigation sequence formation and the important role of embodiment.

MSC:

92B20 Neural networks for/in biological studies, artificial life and related topics
92D50 Animal behavior
68T40 Artificial intelligence for robotics
Full Text: DOI

References:

[1] Ambrose, RE; Pfeiffer, BE; Foster, DJ, Reverse replay of hippocampal place cells is uniquely modulated by changing reward, Neuron, 91, 1124-1136 (2016)
[2] Andrychowicz M, Wolski F, Ray A, Schneider J, Fong R, et al (2017) Advances in neural information processing systems, pp 5048-5058
[3] Arleo, A.; Gerstner, W., Spatial cognition and neuro-mimetic navigation: a model of hippocampal place cell activity, Biol Cybern, 83, 287-299 (2000)
[4] Barrera, A.; Weitzenfeld, A., Biologically-inspired robot spatial cognition based on rat neurophysiological studies, Auton Robots, 25, 147-169 (2008)
[5] Barrera, A.; Cáceres, A.; Weitzenfeld, A.; Ramirez-Amaya, V., Comparative experimental studies on spatial memory and learning in rats and robots, J Intell Rob Syst, 63, 361-397 (2011)
[6] Barrera, A.; Tejera, G.; Llofriu, M.; Weitzenfeld, A., Learning spatial localization: from rat studies to computational models of the Hippocampus, Spat Cognit Comput, 15, 27-59 (2015)
[7] Bendor, D.; Wilson, MA, Biasing the content of hippocampal replay during sleep, Nat Neurosci, 15, 1439-1444 (2012)
[8] Brown, MA; Sharp, PE, Simulation of spatial learning in the Morris water maze by a neural network model of the hippocampal formation and nucleus accumbens, Hippocampus, 5, 171-188 (1995)
[9] Burgess, N.; Recce, M.; O’Keefe, J., A model of hippocampal function, Neural Netw, 7, 1065-1081 (1994) · Zbl 0825.92052
[10] Burgess, N.; Donnett, JG; Jeffery, KJ; O-keefe, J., Robotic and neuronal simulation of the hippocampus and rat navigation, Philos Trans R Soc Lond Ser B Biol Sci, 352, 1535-1543 (1997)
[11] Buzsáki, G., Two-stage model of memory trace formation: a role for “noisy” brain states, Neuroscience, 31, 551-570 (1989)
[12] Caluwaerts, K.; Staffa, M.; N’Guyen, S.; Grand, C.; Dollé, L., A biologically inspired meta-control navigation system for the psikharpax rat robot, Bioinspir Biomim, 7, 025009 (2012)
[13] Carr, MF; Jadhav, SP; Frank, LM, Hippocampal replay in the awake state: a potential substrate for memory consolidation and retrieval, Nat Neurosci, 14, 147-153 (2011)
[14] Cazé, R.; Khamassi, M.; Aubin, L.; Girard, B., Hippocampal replays under the scrutiny of reinforcement learning models, J Neurophysiol, 120, 2877-2896 (2018)
[15] Cazin, N.; Llofriu Alonso, M.; Scleidorovich Chiodi, P.; Pelc, T.; Harland, B., Reservoir computing model of prefrontal cortex creates novel combinations of previous navigation sequences from hippocampal place-cell replay with spatial reward propagation, PLoS Comput Biol, 15, e1006624 (2019)
[16] Davidson, TJ; Kloosterman, F.; Wilson, MA, Hippocampal replay of extended experience, Neuron, 63, 497-507 (2009)
[17] de Jong, LW; Gereke, B.; Martin, GM; Fellous, J-M, The traveling salesrat: insights into the dynamics of efficient spatial navigation in the rodent, J Neural Eng, 8, 065010 (2011)
[18] De Lavilléon, G.; Lacroix, MM; Rondi-Reig, L.; Benchenane, K., Explicit memory creation during sleep demonstrates a causal role of place cells in navigation, Nat Neurosci, 18, 493 (2015)
[19] Diba, K.; Buzsaki, G., Forward and reverse hippocampal place-cell sequences during ripples, Nat Neurosci, 10, 1241 (2007)
[20] Dollé, L.; Sheynikhovich, D.; Girard, B.; Chavarriaga, R.; Guillot, A., Path planning versus cue responding: a bio-inspired model of switching between navigation strategies, Biol Cybern, 103, 299-317 (2010) · Zbl 1266.92021
[21] Dominey, PF, Complex sensory-motor sequence learning based on recurrent state representation and reinforcement learning, Biol Cybern, 73, 265-274 (1995) · Zbl 0828.92003
[22] Dominey, PF, Influences of temporal organization on sequence learning and transfer: comments on Stadler (1995) and Curran and Keele (1993), J Exp Psychol Learn Mem Cogn, 24, 14 (1998)
[23] Dominey, PF, A shared system for learning serial and temporal structure of sensori-motor sequences? Evidence from simulation and human experiments, Brain Res Cogn Brain Res, 6, 163-172 (1998)
[24] Dominey, PF; Ramus, F., Neural network processing of natural language: I. Sensitivity to serial, temporal and abstract structure of language in the infant, Lang Cognit Process, 15, 40 (2000)
[25] Dominey, PF; Arbib, MA; Joseph, JP, A model of corticostriatal plasticity for learning oculomotor associations and sequences, J Cogn Neurosci, 7, 25 (1995)
[26] Dominey, PF; Inui, T.; Hoen, M., Neural network processing of natural language: II. Towards a unified model of corticostriatal function in learning sentence comprehension and non-linguistic sequencing, Brain Lang, 109, 80-92 (2009)
[27] Euston, DR; Tatsuno, M.; McNaughton, BL, Fast-forward playback of recent memory sequences in prefrontal cortex during sleep, Science, 318, 1147-1150 (2007)
[28] Foster, DJ; Wilson, MA, Reverse replay of behavioural sequences in hippocampal place cells during the awake state, Nature, 440, 680-683 (2006)
[29] Gaussier, P.; Revel, A.; Banquet, J-P; Babeau, V., From view cells and place cells to cognitive map learning: processing stages of the hippocampal system, Biol Cybern, 86, 15-28 (2002) · Zbl 1104.92308
[30] Gaussier, P.; Banquet, J.; Sargolini, F.; Giovannangeli, C.; Save, E.; Poucet, B., A model of grid cells involving extra hippocampal path integration, and the hippocampal loop, J Integr Neurosci, 6, 447-476 (2007)
[31] Guazzelli, A.; Bota, M.; Corbacho, FJ; Arbib, MA, Affordances. Motivations, and the world graph theory, Adapt Behav, 6, 435-471 (1998)
[32] Gupta, AS; van der Meer, MA; Touretzky, DS; Redish, AD, Hippocampal replay is not a simple function of experience, Neuron, 65, 695-705 (2010)
[33] Hasselmo, ME, Temporally structured replay of neural activity in a model of entorhinal cortex, hippocampus and postsubiculum, Eur J Neurosci, 28, 1301-1315 (2008)
[34] Hinaut, X.; Dominey, PF, Real-time parallel processing of grammatical structure in the fronto-striatal system: a recurrent network simulation study using reservoir computing, PLoS ONE, 8, 1-18 (2013)
[35] Hoffman, KL; McNaughton, BL, Coordinated reactivation of distributed memory traces in primate neocortex, Science, 297, 2070-2073 (2002)
[36] Jaeger, H.; Haas, H., Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless communication, Science, 304, 78-80 (2004)
[37] Jaeger, H.; Lukosevicius, M.; Popovici, D.; Siewert, U., Optimization and applications of echo state networks with leaky-integrator neurons, Neural Netw, 20, 335-352 (2007) · Zbl 1132.68554
[38] Ji, D.; Wilson, MA, Coordinated memory replay in the visual cortex and hippocampus during sleep, Nat Neurosci, 10, 100-107 (2007)
[39] Johnson, A.; Redish, AD, Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning model, Neural Netw, 18, 1163-1171 (2005) · Zbl 1085.92006
[40] Lansink, CS; Goltstein, PM; Lankelma, JV; Joosten, RN; McNaughton, BL; Pennartz, CM, Preferential reactivation of motivationally relevant information in the ventral striatum, J Neurosci, 28, 6372-6382 (2008)
[41] Llofriu, M.; Tejera, G.; Contreras, M.; Pelc, T.; Fellous, J-M; Weitzenfeld, A., Goal-oriented robot navigation learning using a multi-scale space representation, Neural Netw, 72, 62-74 (2015)
[42] Lukosevicius, M.; Montavon, G.; Orr, GB; Müller, KR, A practical guide to applying echo state networks, Neural networks: tricks of the trade, 659-686 (2012), Berlin: Springer, Berlin
[43] Lukosevicius, M.; Jaeger, H., Reservoir computing approaches to recurrent neural network training, Comput Sci Rev, 3, 22 (2009) · Zbl 1302.68235
[44] Maass, W.; Natschlager, T.; Markram, H., Real-time computing without stable states: a new framework for neural computation based on perturbations, Neural Comput, 14, 2531-2560 (2002) · Zbl 1057.68618
[45] McClelland, JL; McNaughton, BL; O’Reilly, RC, Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory, Psychol Rev, 102, 419-457 (1995)
[46] Moser, EI; Moser, M-B; McNaughton, BL, Spatial representation in the hippocampal formation: a history, Nat Neurosci, 20, 1448 (2017)
[47] Nadel, L.; Moscovitch, M., Memory consolidation, retrograde amnesia and the hippocampal complex, Curr Opin Neurobiol, 7, 217-227 (1997)
[48] Nikolic, D.; Hausler, S.; Singer, W.; Maass, W., Distributed fading memory for stimulus properties in the primary visual cortex, PLoS Biol, 7, e1000260 (2009)
[49] Peyrache, A.; Khamassi, M.; Benchenane, K.; Wiener, SI; Battaglia, FP, Replay of rule-learning related neural patterns in the prefrontal cortex during sleep, Nat Neurosci, 12, 919-926 (2009)
[50] Pfeifer, R.; Lungarella, M.; Iida, F., Self-organization, embodiment, and biologically inspired robotics, Science, 318, 1088-1093 (2007)
[51] Pfeiffer, BE; Foster, DJ, Hippocampal place-cell sequences depict future paths to remembered goals, Nature, 497, 74 (2013)
[52] Redish, AD; Touretzky, DS, Cognitive maps beyond the hippocampus, Hippocampus, 7, 15-35 (1997)
[53] Rigotti, M.; Barak, O.; Warden, MR; Wang, X-J; Daw, ND, The importance of mixed selectivity in complex cognitive tasks, Nature, 497, 585-590 (2013)
[54] Ruder S. 2016. An overview of gradient descent optimization algorithms. arXiv preprintarXiv:1609.04747
[55] Singer, AC; Frank, LM, Rewarded outcomes enhance reactivation of experience in the Hippocampus, Neuron, 64, 910-921 (2009)
[56] Tejera, G.; Llofriu, M.; Barrera, A.; Weitzenfeld, A., Bio-inspired robotics: a spatial cognition model integrating place cells, grid cells and head direction cells, J Intell Rob Syst, 91, 85-99 (2018)
[57] Widrow B, Hoff ME (1960) Adaptive switching circuits. Stanford University, CA Stanford Electronics Labs
[58] Wilson, MA; McNaughton, BL, Reactivation of hippocampal ensemble memories during sleep, Science, 265, 676-679 (1994)
[59] Wylie, TR, The discrete Fréchet distance with applications (2013), Bozeman: College of Engineering, Montana State University-Bozeman, Bozeman
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.