Grau-Moya, Jordi

14 publications

NeurIPS 2025 Understanding Prompt Tuning and In-Context Learning via Meta-Learning Tim Genewein, Li Kevin Wenliang, Jordi Grau-Moya, Anian Ruoss, Laurent Orseau, Marcus Hutter
NeurIPS 2024 Amortized Planning with Large-Scale Transformers: A Case Study on Chess Anian Ruoss, Grégoire Delétang, Sourabh Medapati, Jordi Grau-Moya, Li Kevin Wenliang, Elliot Catt, John Reid, Cannada A. Lewis, Joel Veness, Tim Genewein
ICLR 2024 Language Modeling Is Compression Gregoire Deletang, Anian Ruoss, Paul-Ambroise Duquenne, Elliot Catt, Tim Genewein, Christopher Mattern, Jordi Grau-Moya, Li Kevin Wenliang, Matthew Aitchison, Laurent Orseau, Marcus Hutter, Joel Veness
ICML 2024 Learning Universal Predictors Jordi Grau-Moya, Tim Genewein, Marcus Hutter, Laurent Orseau, Gregoire Deletang, Elliot Catt, Anian Ruoss, Li Kevin Wenliang, Christopher Mattern, Matthew Aitchison, Joel Veness
ICML 2023 Memory-Based Meta-Learning on Non-Stationary Distributions Tim Genewein, Gregoire Deletang, Anian Ruoss, Li Kevin Wenliang, Elliot Catt, Vincent Dutordoir, Jordi Grau-Moya, Laurent Orseau, Marcus Hutter, Joel Veness
ICLR 2023 Neural Networks and the Chomsky Hierarchy Gregoire Deletang, Anian Ruoss, Jordi Grau-Moya, Tim Genewein, Li Kevin Wenliang, Elliot Catt, Chris Cundy, Marcus Hutter, Shane Legg, Joel Veness, Pedro A Ortega
NeurIPS 2023 Self-Predictive Universal AI Elliot Catt, Jordi Grau-Moya, Marcus Hutter, Matthew Aitchison, Tim Genewein, Grégoire Delétang, Kevin Li, Joel Veness
TMLR 2022 Your Policy Regularizer Is Secretly an Adversary Rob Brekelmans, Tim Genewein, Jordi Grau-Moya, Gregoire Detetang, Markus Kunesch, Shane Legg, Pedro A Ortega
NeurIPS 2019 A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment Felix Leibfried, Sergio Pascual-Díaz, Jordi Grau-Moya
CoRL 2019 Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning Felix Leibfried, Jordi Grau-Moya
ICLR 2019 Soft Q-Learning with Mutual-Information Regularization Jordi Grau-Moya, Felix Leibfried, Peter Vrancx
IJCAI 2018 Balancing Two-Player Stochastic Games with Soft Q-Learning Jordi Grau-Moya, Felix Leibfried, Haitham Bou-Ammar
ECML-PKDD 2016 Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes Jordi Grau-Moya, Felix Leibfried, Tim Genewein, Daniel A. Braun
NeurIPS 2012 A Nonparametric Conjugate Prior Distribution for the Maximizing Argument of a Noisy Function Pedro Ortega, Jordi Grau-moya, Tim Genewein, David Balduzzi, Daniel Braun