Software Search - zbMATH Open

IMPALA

swMATH ID:	41064
Software Authors:	Espeholt, Lasse; Soyer, Hubert; Munos, Remi; Simonyan, Karen; Mnih, Volodymir; Ward, Tom; Doron, Yotam; Firoiu, Vlad; Harley, Tim; Dunning, Iain; Legg, Shane; Kavukcuoglu, Koray
Description:	IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures. In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters. A key challenge is to handle the increased amount of data and extended training time. We have developed a new distributed agent IMPALA (Importance Weighted Actor-Learner Architecture) that not only uses resources more efficiently in single-machine training but also scales to thousands of machines without sacrificing data efficiency or resource utilisation. We achieve stable learning at high throughput by combining decoupled acting and learning with a novel off-policy correction method called V-trace. We demonstrate the effectiveness of IMPALA for multi-task reinforcement learning on DMLab-30 (a set of 30 tasks from the DeepMind Lab environment (Beattie et al., 2016)) and Atari-57 (all available Atari games in Arcade Learning Environment (Bellemare et al., 2013a)). Our results show that IMPALA is able to achieve better performance than previous agents with less data, and crucially exhibits positive transfer between tasks as a result of its multi-task approach.
Homepage:	https://arxiv.org/abs/1802.01561
Source Code:	https://github.com/deepmind/scalable_agent
Related Software:	MuJoCo; OpenAI Gym; Adam; QT-Opt; AlexNet; ImageNet; PyTorch; PARAMIC; CARLA; SURREAL; GAZEBO; SUMO; Stable Baselines3; HOGWILD; UCI-ml; TensorFlow; MOPO; Safety Gym; Horizon; TEXPLORE
Cited in:	14 Documents

all top 5

Cited by 78 Authors

2	Hughes, Edward W.
2	Lanctot, Marc
1	Abbas, Zaheer
1	Alamri, Hamad
1	Arampatzis, Georgios
1	Azevedo, Américo
1	Bachrach, Yoram
1	Bard, Nolan
1	Başar, Tamer
1	Bellemare, Marc G.
1	Biedenkapp, André
1	Bouchachia, Abdelhamid
1	Bowling, Michael
1	Burch, Neil
1	Calandra, Roberto
1	Chandar, Sarath
1	Chen, Bing
1	Czarnecki, Wojciech Marian
1	Dašić, Dejan
1	Dulac-Arnold, Gabriel
1	Dumoulin, Vincent
1	Dunning, Iain
1	Economides, Athena E.
1	Eimer, Theresa
1	Everett, Richard
1	Fachantidis, Anestis
1	Faust, Aleksandra
1	Foerster, Jakob N.
1	Gowal, Sven
1	Graepel, Thore
1	Hester, Todd
1	Hutter, Frank
1	Ilić, Nemanja
1	Jacobsen, Andrew
1	Johanson, Michael
1	Karnakov, Petr
1	Koumoutsakos, Petros D.
1	Larochelle, Hugo
1	Lazaridis, Aristotelis
1	Lazaridou, Angeliki
1	Leibo, Joel Z.
1	Levine, Nir
1	Li, Jerry
1	Lindauer, Marius
1	Makarov, Alekseĭ Aleksandrovich
1	Mankowitz, Daniel J.
1	Martin, Sergio M.
1	Marzen, Sarah E.
1	Miao, Yingjie
1	Mohamad, Saad
1	Moitra, Subhodeep
1	Mourad, Shibl
1	Paduraru, Cosmin
1	Parisotto, Emilio
1	Parker-Holder, Jack
1	Patterson, Andrew
1	Petrović, Ranko
1	Pham, Thuan Q.
1	Rajan, Raghu
1	Rupprecht, Timothy
1	Schlegel, Matthew
1	Sha, Xingyu
1	Song, H. Francis
1	Song, Xingyou
1	Tan, Leonard
1	Tao, Xiaohui
1	Tomé De Andrade e. Silva, Manuel
1	Vlahavas, Ioannis P.
1	Vučetić, Miljan
1	Wälchli, Daniel
1	Wang, Yanzhi
1	White, Adam
1	White, Martha
1	You, Keyou
1	Zhang, Baohe
1	Zhang, Ji
1	Zhang, Jiaqi
1	Zhang, Kaiqing

all top 5

Cited in 9 Serials

3	Artificial Intelligence
3	The Journal of Artificial Intelligence Research (JAIR)
2	Machine Learning
1	Computer Methods in Applied Mechanics and Engineering
1	Automatica
1	Computers & Operations Research
1	Neural Networks
1	Journal of Theoretical Biology
1	Computer Science Review

all top 5

Cited in 6 Fields

12	Computer science (68-XX)
3	Game theory, economics, finance, and other social and behavioral sciences (91-XX)
2	Operations research, mathematical programming (90-XX)
2	Systems theory; control (93-XX)
1	Statistics (62-XX)
1	Biology and other natural sciences (92-XX)

IMPALA

Cited by 78 Authors

Cited in 9 Serials

Cited in 6 Fields

Citations by Year