Legg, Shane

18 publications

ICML 2024 Position: Levels of AGI for Operationalizing Progress on the Path to AGI Meredith Ringel Morris, Jascha Sohl-Dickstein, Noah Fiedel, Tris Warkentin, Allan Dafoe, Aleksandra Faust, Clement Farabet, Shane Legg
ICLR 2023 Neural Networks and the Chomsky Hierarchy Gregoire Deletang, Anian Ruoss, Jordi Grau-Moya, Tim Genewein, Li Kevin Wenliang, Elliot Catt, Chris Cundy, Marcus Hutter, Shane Legg, Joel Veness, Pedro A Ortega
TMLR 2022 Your Policy Regularizer Is Secretly an Adversary Rob Brekelmans, Tim Genewein, Jordi Grau-Moya, Gregoire Detetang, Markus Kunesch, Shane Legg, Pedro A Ortega
AAAI 2021 Agent Incentives: A Causal Perspective Tom Everitt, Ryan Carey, Eric D. Langlois, Pedro A. Ortega, Shane Legg
ICLR 2021 Quantifying Differences in Reward Functions Adam Gleave, Michael D Dennis, Shane Legg, Stuart Russell, Jan Leike
NeurIPS 2020 Avoiding Side Effects by Considering Future Tasks Victoria Krakovna, Laurent Orseau, Richard Ngo, Miljan Martic, Shane Legg
ICML 2020 Learning Human Objectives by Evaluating Hypothetical Behavior Siddharth Reddy, Anca Dragan, Sergey Levine, Shane Legg, Jan Leike
NeurIPS 2020 Meta-Trained Agents Implement Bayes-Optimal Agents Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro Ortega
IJCAI 2020 Pitfalls of Learning a Reward Function Online Stuart Armstrong, Jan Leike, Laurent Orseau, Shane Legg
ICML 2018 IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Vlad Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu
ICLR 2018 Noisy Networks for Exploration Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Matteo Hessel, Ian Osband, Alex Graves, Volodymyr Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg
NeurIPS 2018 Reward Learning from Human Preferences and Demonstrations in Atari Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, Dario Amodei
NeurIPS 2017 Deep Reinforcement Learning from Human Preferences Paul F Christiano, Jan Leike, Tom Brown, Miljan Martic, Shane Legg, Dario Amodei
IJCAI 2017 Reinforcement Learning with a Corrupted Reward Channel Tom Everitt, Victoria Krakovna, Laurent Orseau, Shane Legg
ALT 2017 Soft-Bayes: Prod for Mixtures of Experts with Log-Loss Laurent Orseau, Tor Lattimore, Shane Legg
NeurIPS 2007 Temporal Difference Updating Without a Learning Rate Marcus Hutter, Shane Legg
ALT 2006 Is There an Elegant Universal Theory of Prediction? Shane Legg
IJCAI 2005 A Universal Measure of Intelligence for Artificial Agents Shane Legg, Marcus Hutter