Google Scholar

User profiles for Konrad Zolna

Konrad Żołna

Research Scientist, DeepMind

Verified email at google.com

Cited by 2501

[PDF] arxiv.org

A generalist agent

S Reed, K Zolna, E Parisotto, SG Colmenarejo…�- arXiv preprint arXiv�…, 2022 - arxiv.org

Inspired by progress in large-scale language modeling, we apply a similar approach towards
building a single generalist agent beyond the realm of text outputs. The agent, which we …

Save Cite Cited by 871 Related articles All 4 versions View as HTML

[PDF] neurips.cc

Critic regularized regression

Z Wang, A Novikov, K Zolna, JS Merel…�- Advances in�…, 2020 - proceedings.neurips.cc

Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …

Save Cite Cited by 322 Related articles All 9 versions View as HTML

[PDF] arxiv.org

Hyperparameter selection for offline reinforcement learning

…, C Paduraru, A Michi, C Gulcehre, K Zolna…�- arXiv preprint arXiv�…, 2020 - arxiv.org

Offline reinforcement learning (RL purely from logged data) is an important avenue for
deploying RL techniques in real-world scenarios. However, existing hyperparameter selection …

Save Cite Cited by 160 Related articles All 2 versions View as HTML

[PDF] neurips.cc

Rl unplugged: A suite of benchmarks for offline reinforcement learning

…, T Paine, S G�mez, K Zolna…�- Advances in�…, 2020 - proceedings.neurips.cc

Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

Save Cite Cited by 181 Related articles All 8 versions View as HTML

[PDF] arxiv.org

Robocat: A self-improving foundation agent for robotic manipulation

K Bousmalis, G Vezzani, D Rao, C Devin…�- arXiv preprint arXiv�…, 2023 - arxiv.org

The ability to leverage heterogeneous robotic experience from different robots and tasks to
quickly master novel skills and embodiments has the potential to transform robot learning. …

Save Cite Cited by 74 Related articles All 2 versions View as HTML

[PDF] arxiv.org

Offline learning from demonstrations and unlabeled experience

K Zolna, A Novikov, K Konyushkova…�- arXiv preprint arXiv�…, 2020 - arxiv.org

Behavior cloning (BC) is often practical for robot learning because it allows a policy to be
trained offline without rewards, by supervised learning on expert demonstrations. However, BC …

Save Cite Cited by 72 Related articles All 3 versions View as HTML

[PDF] arxiv.org

Scaling data-driven robotics with reward sketching and batch reinforcement learning

…, K Konyushkova, S Reed, R Jeong, K Zolna…�- arXiv preprint arXiv�…, 2019 - arxiv.org

We present a framework for data-driven robotics that makes use of a large dataset of recorded
robot experience and scales to several tasks using learned reward functions. We show …

Save Cite Cited by 144 Related articles All 2 versions View as HTML

[PDF] openreview.net

Genie: Generative interactive environments

…, S Ozair, S Reed, J Zhang, K Zolna…�- …�on Machine Learning, 2024 - openreview.net

We introduce Genie, the first *generative interactive environment* trained in an unsupervised
manner from unlabelled Internet videos. The model can be prompted to generate an …

Save Cite Cited by 51 Related articles All 3 versions View as HTML

[PDF] mlr.press

Task-relevant adversarial imitation learning

K Zolna, S Reed, A Novikov…�- …�on Robot Learning, 2021 - proceedings.mlr.press

We show that a critical vulnerability in adversarial imitation is the tendency of discriminator
networks to learn spurious associations between visual features and expert labels. When the …

Save Cite Cited by 62 Related articles All 5 versions View as HTML

[PDF] arxiv.org

Fraternal dropout

K Zolna, D Arpit, D Suhubdy, Y Bengio�- arXiv preprint arXiv:1711.00066, 2017 - arxiv.org

Recurrent neural networks (RNNs) are important class of architectures among neural networks
useful for language modeling and sequential prediction. However, optimizing RNNs is …

Save Cite Cited by 62 Related articles All 4 versions View as HTML

Create alert

Cite

Advanced search

Saved to My library

User profiles for Konrad Zolna

Konrad Żołna

A generalist agent

Critic regularized regression

Hyperparameter selection for offline reinforcement learning

Rl unplugged: A suite of benchmarks for offline reinforcement learning

Robocat: A self-improving foundation agent for robotic manipulation

Offline learning from demonstrations and unlabeled experience

Scaling data-driven robotics with reward sketching and batch reinforcement learning

Genie: Generative interactive environments

Task-relevant adversarial imitation learning

Fraternal dropout