Mazoure, Bogdan

16 publications

CVPR 2025. From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons. Andrew Szot, Bogdan Mazoure, Omar Attia, Aleksei Timofeev, Harsh Agrawal, Devon Hjelm, Zhe Gan, Zsolt Kira, Alexander Toshev
ICLR 2025. On the Modeling Capabilities of Large Language Models for Sequential Decision Making. Martin Klissarov, R Devon Hjelm, Alexander T Toshev, Bogdan Mazoure
NeurIPS 2024. Grounding Multimodal Large Language Models in Actions. Andrew Szot, Bogdan Mazoure, Harsh Agrawal, Devon Hjelm, Zsolt Kira, Alexander Toshev
ICLR 2024. Large Language Models as Generalizable Policies for Embodied Tasks. Andrew Szot, Max Schwarzer, Harsh Agrawal, Bogdan Mazoure, Rin Metcalf, Walter Talbott, Natalie Mackraz, R Devon Hjelm, Alexander T Toshev
ICMLW 2023. Accelerating Exploration and Representation Learning with Offline Pre-Training. Bogdan Mazoure, Jake Bruce, Doina Precup, Rob Fergus, Ankit Anand
CoRL 2023. Contrastive Value Learning: Implicit Models for Simple Offline RL. Bogdan Mazoure, Benjamin Eysenbach, Ofir Nachum, Jonathan Tompson
ICLR 2023. Learning About Progress from Experts. Jake Bruce, Ankit Anand, Bogdan Mazoure, Rob Fergus
NeurIPSW 2022. Contrastive Value Learning: Implicit Models for Simple Offline RL. Bogdan Mazoure, Benjamin Eysenbach, Ofir Nachum, Jonathan Tompson
ICLR 2022. Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL. Bogdan Mazoure, Ahmed M Ahmed, R Devon Hjelm, Andrey Kolobov, Patrick MacAlpine
NeurIPS 2022. Improving Zero-Shot Generalization in Offline Reinforcement Learning Using Generalized Similarity Functions. Bogdan Mazoure, Ilya Kostrikov, Ofir Nachum, Jonathan J Tompson
JAIR 2022. Low-Rank Representation of Reinforcement Learning Policies. Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup, Guillaume Rabusseau
AISTATS 2021. A Theoretical Analysis of Catastrophic Forgetting Through the NTK Overlap Matrix. Thang Doan, Mehdi Abbana Bennani, Bogdan Mazoure, Guillaume Rabusseau, Pierre Alquier
NeurIPS 2020. Deep Reinforcement and InfoMax Learning. Bogdan Mazoure, Remi Tachet des Combes, Thang Long Doan, Philip Bachman, R Devon Hjelm
AISTATS 2020. Efficient Planning Under Partial Observability with Unnormalized Q Functions and Spectral Learning. Tianyu Li, Bogdan Mazoure, Doina Precup, Guillaume Rabusseau
CoRL 2019. Leveraging Exploration in Off-Policy Algorithms via Normalizing Flows. Bogdan Mazoure, Thang Doan, Audrey Durand, Joelle Pineau, R Devon Hjelm
AAAI 2019. On-Line Adaptative Curriculum Learning for GANs. Thang Doan, João Monteiro, Isabela Albuquerque, Bogdan Mazoure, Audrey Durand, Joelle Pineau, R. Devon Hjelm