D'Oro, Pierluca

18 publications

ICCV 2025 Controlling Multimodal LLMs via Reward-Guided Decoding Oscar Mañas, Pierluca D'Oro, Koustuv Sinha, Adriana Romero-Soriano, Michal Drozdzal, Aishwarya Agrawal
ICLR 2025 MaestroMotif: Skill Design from Artificial Intelligence Feedback Martin Klissarov, Mikael Henaff, Roberta Raileanu, Shagun Sodhani, Pascal Vincent, Amy Zhang, Pierre-Luc Bacon, Doina Precup, Marlos C. Machado, Pierluca D'Oro
TMLR 2025 Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons Simon Dufort-Labbé, Pierluca D'Oro, Evgenii Nikishin, Irina Rish, Pierre-Luc Bacon, Razvan Pascanu, Aristide Baratin
ICLRW 2025 Mol-MoE: Training Preference-Guided Routers for Molecule Generation Diego Calanzone, Pierluca D'Oro, Pierre-Luc Bacon
ICLR 2025 Towards General-Purpose Model-Free Reinforcement Learning Scott Fujimoto, Pierluca D'Oro, Amy Zhang, Yuandong Tian, Michael Rabbat
ICMLW 2024 Controlling Large Language Model Agents with Entropic Activation Steering Nate Rahn, Pierluca D'Oro, Marc G Bellemare
NeurIPSW 2024 Controlling Multimodal LLMs via Reward-Guided Decoding Oscar Mañas, Pierluca D'Oro, Koustuv Sinha, Adriana Romero-Soriano, Michal Drozdzal, Aishwarya Agrawal
ICLR 2024 Motif: Intrinsic Motivation from Artificial Intelligence Feedback Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff
ICLR 2024 The Curse of Diversity in Ensemble-Based Exploration Zhixuan Lin, Pierluca D'Oro, Evgenii Nikishin, Aaron Courville
NeurIPSW 2023 Motif: Intrinsic Motivation from Artificial Intelligence Feedback Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff
NeurIPSW 2023 Motif: Intrinsic Motivation from Artificial Intelligence Feedback Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff
NeurIPS 2023 Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control Nate Rahn, Pierluca D'Oro, Harley Wiltzer, Pierre-Luc Bacon, Marc Bellemare
ICLR 2023 Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier Pierluca D'Oro, Max Schwarzer, Evgenii Nikishin, Pierre-Luc Bacon, Marc G Bellemare, Aaron Courville
NeurIPSW 2022 Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier Pierluca D'Oro, Max Schwarzer, Evgenii Nikishin, Pierre-Luc Bacon, Marc G Bellemare, Aaron Courville
NeurIPSW 2022 Unleashing the Potential of Data Sharing in Ensemble Deep Reinforcement Learning Zhixuan Lin, Pierluca D'Oro, Evgenii Nikishin, Aaron Courville
NeurIPSW 2022 Unleashing the Potential of Data Sharing in Ensemble Deep Reinforcement Learning Zhixuan Lin, Pierluca D'Oro, Evgenii Nikishin, Aaron Courville
NeurIPSW 2021 Long-Term Credit Assignment via Model-Based Temporal Shortcuts Michel Ma, Pierluca D'Oro, Yoshua Bengio, Pierre-Luc Bacon
NeurIPS 2020 How to Learn a Useful Critic? Model-Based Action-Gradient-Estimator Policy Optimization Pierluca D'Oro, Wojciech Jaśkowski