ML Anthology
D'Oro, Pierluca
18 publications
ICCV
2025
Controlling Multimodal LLMs via Reward-Guided Decoding
Oscar Mañas, Pierluca D'Oro, Koustuv Sinha, Adriana Romero-Soriano, Michal Drozdzal, Aishwarya Agrawal
ICLR
2025
MaestroMotif: Skill Design from Artificial Intelligence Feedback
Martin Klissarov, Mikael Henaff, Roberta Raileanu, Shagun Sodhani, Pascal Vincent, Amy Zhang, Pierre-Luc Bacon, Doina Precup, Marlos C. Machado, Pierluca D'Oro
TMLR
2025
Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
Simon Dufort-Labbé, Pierluca D'Oro, Evgenii Nikishin, Irina Rish, Pierre-Luc Bacon, Razvan Pascanu, Aristide Baratin
ICLRW
2025
Mol-MoE: Training Preference-Guided Routers for Molecule Generation
Diego Calanzone, Pierluca D'Oro, Pierre-Luc Bacon
ICLR
2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto, Pierluca D'Oro, Amy Zhang, Yuandong Tian, Michael Rabbat
ICMLW
2024
Controlling Large Language Model Agents with Entropic Activation Steering
Nate Rahn, Pierluca D'Oro, Marc G Bellemare
NeurIPSW
2024
Controlling Multimodal LLMs via Reward-Guided Decoding
Oscar Mañas, Pierluca D'Oro, Koustuv Sinha, Adriana Romero-Soriano, Michal Drozdzal, Aishwarya Agrawal
ICLR
2024
Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff
ICLR
2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin, Pierluca D'Oro, Evgenii Nikishin, Aaron Courville
NeurIPSW
2023
Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff
NeurIPSW
2023
Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff
NeurIPS
2023
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Nate Rahn, Pierluca D'Oro, Harley Wiltzer, Pierre-Luc Bacon, Marc Bellemare
ICLR
2023
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier
Pierluca D'Oro, Max Schwarzer, Evgenii Nikishin, Pierre-Luc Bacon, Marc G Bellemare, Aaron Courville
NeurIPSW
2022
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier
Pierluca D'Oro, Max Schwarzer, Evgenii Nikishin, Pierre-Luc Bacon, Marc G Bellemare, Aaron Courville
NeurIPSW
2022
Unleashing the Potential of Data Sharing in Ensemble Deep Reinforcement Learning
Zhixuan Lin, Pierluca D'Oro, Evgenii Nikishin, Aaron Courville
NeurIPSW
2022
Unleashing the Potential of Data Sharing in Ensemble Deep Reinforcement Learning
Zhixuan Lin, Pierluca D'Oro, Evgenii Nikishin, Aaron Courville
NeurIPSW
2021
Long-Term Credit Assignment via Model-Based Temporal Shortcuts
Michel Ma, Pierluca D'Oro, Yoshua Bengio, Pierre-Luc Bacon
NeurIPS
2020
How to Learn a Useful Critic? Model-Based Action-Gradient-Estimator Policy Optimization
Pierluca D'Oro, Wojciech Jaśkowski