Guez, Arthur

23 publications

AISTATS 2025 A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning Khimya Khetarpal, Zhaohan Daniel Guo, Bernardo Avila Pires, Yunhao Tang, Clare Lyle, Mark Rowland, Nicolas Heess, Diana L Borsa, Arthur Guez, Will Dabney

NeurIPSW 2024 A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning Khimya Khetarpal, Zhaohan Daniel Guo, Bernardo Avila Pires, Yunhao Tang, Clare Lyle, Mark Rowland, Nicolas Heess, Diana L Borsa, Arthur Guez, Will Dabney

ICLR 2022 COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation Jongmin Lee, Cosmin Paduraru, Daniel J Mankowitz, Nicolas Heess, Doina Precup, Kee-Eung Kim, Arthur Guez

NeurIPS 2022 Large-Scale Retrieval for Reinforcement Learning Peter Humphreys, Arthur Guez, Olivier Tieleman, Laurent Sifre, Theophane Weber, Timothy Lillicrap

ICLR 2022 Policy Improvement by Planning with Gumbel Ivo Danihelka, Arthur Guez, Julian Schrittwieser, David Silver

ICML 2022 Retrieval-Augmented Reinforcement Learning Anirudh Goyal, Abram Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adrià Puigdomènech Badia, Arthur Guez, Mehdi Mirza, Peter C Humphreys, Ksenia Konyushova, Michal Valko, Simon Osindero, Timothy Lillicrap, Nicolas Heess, Charles Blundell

ICML 2021 Counterfactual Credit Assignment in Model-Free Reinforcement Learning Thomas Mesnard, Theophane Weber, Fabio Viola, Shantanu Thakoor, Alaa Saade, Anna Harutyunyan, Will Dabney, Thomas S Stepleton, Nicolas Heess, Arthur Guez, Eric Moulines, Marcus Hutter, Lars Buesing, Remi Munos

ICML 2021 Muesli: Combining Improvements in Policy Optimization Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado Van Hasselt

ICLR 2021 On the Role of Planning in Model-Based Deep Reinforcement Learning Jessica B Hamrick, Abram L. Friesen, Feryal Behbahani, Arthur Guez, Fabio Viola, Sims Witherspoon, Thomas Anthony, Lars Holger Buesing, Petar Veličković, Theophane Weber

NeurIPS 2020 Value-Driven Hindsight Modelling Arthur Guez, Fabio Viola, Theophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess

ICML 2019 An Investigation of Model-Free Planning Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sebastien Racaniere, Theophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy Lillicrap

ICLR 2019 Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search Lars Buesing, Theophane Weber, Yori Zwols, Nicolas Heess, Sebastien Racaniere, Arthur Guez, Jean-Baptiste Lespiau

ICML 2018 Learning to Search with MCTSnets Arthur Guez, Theophane Weber, Ioannis Antonoglou, Karen Simonyan, Oriol Vinyals, Daan Wierstra, Remi Munos, David Silver

NeurIPS 2017 Imagination-Augmented Agents for Deep Reinforcement Learning Sébastien Racanière, Theophane Weber, David Reichert, Lars Buesing, Arthur Guez, Danilo Jimenez Rezende, Adrià Puigdomènech Badia, Oriol Vinyals, Nicolas Heess, Yujia Li, Razvan Pascanu, Peter Battaglia, Demis Hassabis, David Silver, Daan Wierstra

ICML 2017 The Predictron: End-to-End Learning and Planning David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris

AAAI 2016 Deep Reinforcement Learning with Double Q-Learning Hado van Hasselt, Arthur Guez, David Silver

AAAI 2016 Increasing the Action Gap: New Operators for Reinforcement Learning Marc G. Bellemare, Georg Ostrovski, Arthur Guez, Philip S. Thomas, Rémi Munos

NeurIPS 2016 Learning Values Across Many Orders of Magnitude Hado P van Hasselt, Arthur Guez, Matteo Hessel, Volodymyr Mnih, David Silver

NeurIPS 2014 Bayes-Adaptive Simulation-Based Search with Value Function Approximation Arthur Guez, Nicolas Heess, David Silver, Peter Dayan

JAIR 2013 Scalable and Efficient Bayes-Adaptive Reinforcement Learning Based on Monte-Carlo Tree Search Arthur Guez, David Silver, Peter Dayan

NeurIPS 2012 Efficient Bayes-Adaptive Reinforcement Learning Using Sample-Based Search Arthur Guez, David Silver, Peter Dayan

AAAI 2008 Adaptive Treatment of Epilepsy via Batch-Mode Reinforcement Learning Arthur Guez, Robert D. Vincent, Massimo Avoli, Joelle Pineau