User profiles for Konrad Zolna

Konrad Żołna

Research Scientist, DeepMind
Verified email at google.com
Cited by 2501

A generalist agent

S Reed, K Zolna, E Parisotto, SG Colmenarejo…�- arXiv preprint arXiv�…, 2022 - arxiv.org
Inspired by progress in large-scale language modeling, we apply a similar approach towards
building a single generalist agent beyond the realm of text outputs. The agent, which we …

Critic regularized regression

Z Wang, A Novikov, K Zolna, JS Merel…�- Advances in�…, 2020 - proceedings.neurips.cc
Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …

Hyperparameter selection for offline reinforcement learning

…, C Paduraru, A Michi, C Gulcehre, K Zolna…�- arXiv preprint arXiv�…, 2020 - arxiv.org
Offline reinforcement learning (RL purely from logged data) is an important avenue for
deploying RL techniques in real-world scenarios. However, existing hyperparameter selection …

Rl unplugged: A suite of benchmarks for offline reinforcement learning

…, T Paine, S G�mez, K Zolna…�- Advances in�…, 2020 - proceedings.neurips.cc
Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

Robocat: A self-improving foundation agent for robotic manipulation

K Bousmalis, G Vezzani, D Rao, C Devin…�- arXiv preprint arXiv�…, 2023 - arxiv.org
The ability to leverage heterogeneous robotic experience from different robots and tasks to
quickly master novel skills and embodiments has the potential to transform robot learning. …

Offline learning from demonstrations and unlabeled experience

K Zolna, A Novikov, K Konyushkova…�- arXiv preprint arXiv�…, 2020 - arxiv.org
Behavior cloning (BC) is often practical for robot learning because it allows a policy to be
trained offline without rewards, by supervised learning on expert demonstrations. However, BC …

Scaling data-driven robotics with reward sketching and batch reinforcement learning

…, K Konyushkova, S Reed, R Jeong, K Zolna…�- arXiv preprint arXiv�…, 2019 - arxiv.org
We present a framework for data-driven robotics that makes use of a large dataset of recorded
robot experience and scales to several tasks using learned reward functions. We show …

Genie: Generative interactive environments

…, S Ozair, S Reed, J Zhang, K Zolna…�- …�on Machine Learning, 2024 - openreview.net
We introduce Genie, the first *generative interactive environment* trained in an unsupervised
manner from unlabelled Internet videos. The model can be prompted to generate an …

Task-relevant adversarial imitation learning

K Zolna, S Reed, A Novikov…�- …�on Robot Learning, 2021 - proceedings.mlr.press
We show that a critical vulnerability in adversarial imitation is the tendency of discriminator
networks to learn spurious associations between visual features and expert labels. When the …

Fraternal dropout

K Zolna, D Arpit, D Suhubdy, Y Bengio�- arXiv preprint arXiv:1711.00066, 2017 - arxiv.org
Recurrent neural networks (RNNs) are important class of architectures among neural networks
useful for language modeling and sequential prediction. However, optimizing RNNs is …