Guez, Arthur

23 publications

AISTATS 2025 A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning Khimya Khetarpal, Zhaohan Daniel Guo, Bernardo Avila Pires, Yunhao Tang, Clare Lyle, Mark Rowland, Nicolas Heess, Diana L Borsa, Arthur Guez, Will Dabney
NeurIPSW 2024 A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning Khimya Khetarpal, Zhaohan Daniel Guo, Bernardo Avila Pires, Yunhao Tang, Clare Lyle, Mark Rowland, Nicolas Heess, Diana L Borsa, Arthur Guez, Will Dabney
ICLR 2022 COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation Jongmin Lee, Cosmin Paduraru, Daniel J Mankowitz, Nicolas Heess, Doina Precup, Kee-Eung Kim, Arthur Guez
NeurIPS 2022 Large-Scale Retrieval for Reinforcement Learning Peter Humphreys, Arthur Guez, Olivier Tieleman, Laurent Sifre, Theophane Weber, Timothy Lillicrap
ICLR 2022 Policy Improvement by Planning with Gumbel Ivo Danihelka, Arthur Guez, Julian Schrittwieser, David Silver
ICML 2022 Retrieval-Augmented Reinforcement Learning Anirudh Goyal, Abram Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adrià Puigdomènech Badia, Arthur Guez, Mehdi Mirza, Peter C Humphreys, Ksenia Konyushova, Michal Valko, Simon Osindero, Timothy Lillicrap, Nicolas Heess, Charles Blundell
ICML 2021 Counterfactual Credit Assignment in Model-Free Reinforcement Learning Thomas Mesnard, Theophane Weber, Fabio Viola, Shantanu Thakoor, Alaa Saade, Anna Harutyunyan, Will Dabney, Thomas S Stepleton, Nicolas Heess, Arthur Guez, Eric Moulines, Marcus Hutter, Lars Buesing, Remi Munos
ICML 2021 Muesli: Combining Improvements in Policy Optimization Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado Van Hasselt
ICLR 2021 On the Role of Planning in Model-Based Deep Reinforcement Learning Jessica B Hamrick, Abram L. Friesen, Feryal Behbahani, Arthur Guez, Fabio Viola, Sims Witherspoon, Thomas Anthony, Lars Holger Buesing, Petar Veličković, Theophane Weber
NeurIPS 2020 Value-Driven Hindsight Modelling Arthur Guez, Fabio Viola, Theophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess
ICML 2019 An Investigation of Model-Free Planning Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sebastien Racaniere, Theophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy Lillicrap
ICLR 2019 Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search Lars Buesing, Theophane Weber, Yori Zwols, Nicolas Heess, Sebastien Racaniere, Arthur Guez, Jean-Baptiste Lespiau
ICML 2018 Learning to Search with MCTSnets Arthur Guez, Theophane Weber, Ioannis Antonoglou, Karen Simonyan, Oriol Vinyals, Daan Wierstra, Remi Munos, David Silver
NeurIPS 2017 Imagination-Augmented Agents for Deep Reinforcement Learning Sébastien Racanière, Theophane Weber, David Reichert, Lars Buesing, Arthur Guez, Danilo Jimenez Rezende, Adrià Puigdomènech Badia, Oriol Vinyals, Nicolas Heess, Yujia Li, Razvan Pascanu, Peter Battaglia, Demis Hassabis, David Silver, Daan Wierstra
ICML 2017 The Predictron: End-to-End Learning and Planning David Silver, Hado Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris
AAAI 2016 Deep Reinforcement Learning with Double Q-Learning Hado van Hasselt, Arthur Guez, David Silver
AAAI 2016 Increasing the Action Gap: New Operators for Reinforcement Learning Marc G. Bellemare, Georg Ostrovski, Arthur Guez, Philip S. Thomas, Rémi Munos
NeurIPS 2016 Learning Values Across Many Orders of Magnitude Hado P van Hasselt, Arthur Guez, Arthur Guez, Matteo Hessel, Volodymyr Mnih, David Silver
NeurIPS 2016 Learning Values Across Many Orders of Magnitude Hado P van Hasselt, Arthur Guez, Arthur Guez, Matteo Hessel, Volodymyr Mnih, David Silver
NeurIPS 2014 Bayes-Adaptive Simulation-Based Search with Value Function Approximation Arthur Guez, Nicolas Heess, David Silver, Peter Dayan
JAIR 2013 Scalable and Efficient Bayes-Adaptive Reinforcement Learning Based on Monte-Carlo Tree Search Arthur Guez, David Silver, Peter Dayan
NeurIPS 2012 Efficient Bayes-Adaptive Reinforcement Learning Using Sample-Based Search Arthur Guez, David Silver, Peter Dayan
AAAI 2008 Adaptive Treatment of Epilepsy via Batch-Mode Reinforcement Learning Arthur Guez, Robert D. Vincent, Massimo Avoli, Joelle Pineau