Borsa, Diana

11 publications

JMLR 2025 Optimizing Return Distributions with Distributional Dynamic Programming Bernardo Ávila Pires, Mark Rowland, Diana Borsa, Zhaohan Daniel Guo, Khimya Khetarpal, André Barreto, David Abel, Rémi Munos, Will Dabney
NeurIPS 2023 A State Representation for Diminishing Rewards Ted Moskovitz, Samo Hromadka, Ahmed Touati, Diana Borsa, Maneesh Sahani
ICML 2022 Generalised Policy Improvement with Geometric Policy Composition Shantanu Thakoor, Mark Rowland, Diana Borsa, Will Dabney, Remi Munos, Andre Barreto
ICML 2022 Model-Value Inconsistency as a Signal for Epistemic Uncertainty Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram Friesen, Feryal Behbahani, Tom Schaul, Andre Barreto, Simon Osindero
AAAI 2021 Expected Eligibility Traces Hado van Hasselt, Sephora Madjiheurem, Matteo Hessel, David Silver, André Barreto, Diana Borsa
AISTATS 2020 Conditional Importance Sampling for Off-Policy Learning Mark Rowland, Anna Harutyunyan, Hado Hasselt, Diana Borsa, Tom Schaul, Remi Munos, Will Dabney
NeurIPS 2019 The Option Keyboard: Combining Skills in Reinforcement Learning Andre Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan Hunt, Shibl Mourad, David Silver, Doina Precup
AISTATS 2019 The Termination Critic Anna Harutyunyan, Will Dabney, Diana Borsa, Nicolas Heess, Remi Munos, Doina Precup
ICLR 2019 Universal Successor Features Approximators Diana Borsa, Andre Barreto, John Quan, Daniel J. Mankowitz, Hado van Hasselt, Remi Munos, David Silver, Tom Schaul
ICML 2018 Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement Andre Barreto, Diana Borsa, John Quan, Tom Schaul, David Silver, Matteo Hessel, Daniel Mankowitz, Augustin Zidek, Remi Munos
UAI 2016 Training Neural Nets to Aggregate Crowdsourced Responses Alex Gaunt, Diana Borsa, Yoram Bachrach