Legg, Shane

18 publications

ICML 2024 Position: Levels of AGI for Operationalizing Progress on the Path to AGI Meredith Ringel Morris, Jascha Sohl-Dickstein, Noah Fiedel, Tris Warkentin, Allan Dafoe, Aleksandra Faust, Clement Farabet, Shane Legg

ICLR 2023 Neural Networks and the Chomsky Hierarchy Gregoire Deletang, Anian Ruoss, Jordi Grau-Moya, Tim Genewein, Li Kevin Wenliang, Elliot Catt, Chris Cundy, Marcus Hutter, Shane Legg, Joel Veness, Pedro A Ortega

TMLR 2022 Your Policy Regularizer Is Secretly an Adversary Rob Brekelmans, Tim Genewein, Jordi Grau-Moya, Gregoire Deletang, Markus Kunesch, Shane Legg, Pedro A Ortega

AAAI 2021 Agent Incentives: A Causal Perspective Tom Everitt, Ryan Carey, Eric D. Langlois, Pedro A. Ortega, Shane Legg

ICLR 2021 Quantifying Differences in Reward Functions Adam Gleave, Michael D Dennis, Shane Legg, Stuart Russell, Jan Leike

NeurIPS 2020 Avoiding Side Effects by Considering Future Tasks Victoria Krakovna, Laurent Orseau, Richard Ngo, Miljan Martic, Shane Legg

ICML 2020 Learning Human Objectives by Evaluating Hypothetical Behavior Siddharth Reddy, Anca Dragan, Sergey Levine, Shane Legg, Jan Leike

NeurIPS 2020 Meta-Trained Agents Implement Bayes-Optimal Agents Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro Ortega

IJCAI 2020 Pitfalls of Learning a Reward Function Online Stuart Armstrong, Jan Leike, Laurent Orseau, Shane Legg

ICML 2018 IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Vlad Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu

ICLR 2018 Noisy Networks for Exploration Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Matteo Hessel, Ian Osband, Alex Graves, Volodymyr Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

NeurIPS 2018 Reward Learning from Human Preferences and Demonstrations in Atari Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, Dario Amodei

NeurIPS 2017 Deep Reinforcement Learning from Human Preferences Paul F Christiano, Jan Leike, Tom Brown, Miljan Martic, Shane Legg, Dario Amodei

IJCAI 2017 Reinforcement Learning with a Corrupted Reward Channel Tom Everitt, Victoria Krakovna, Laurent Orseau, Shane Legg

ALT 2017 Soft-Bayes: Prod for Mixtures of Experts with Log-Loss Laurent Orseau, Tor Lattimore, Shane Legg

NeurIPS 2007 Temporal Difference Updating Without a Learning Rate Marcus Hutter, Shane Legg

ALT 2006 Is There an Elegant Universal Theory of Prediction? Shane Legg

IJCAI 2005 A Universal Measure of Intelligence for Artificial Agents Shane Legg, Marcus Hutter