van Seijen, Harm

16 publications

ICLR 2024 Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning Harry Zhao, Safa Alver, Harm van Seijen, Romain Laroche, Doina Precup, Yoshua Bengio
NeurIPSW 2024 Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning Harry Zhao, Safa Alver, Harm van Seijen, Romain Laroche, Doina Precup, Yoshua Bengio
ICML 2023 Principled Offline RL in the Presence of Rich Exogenous Information Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Rajiv Didolkar, Dipendra Misra, Xin Li, Harm Van Seijen, Remi Tachet Des Combes, John Langford
NeurIPSW 2022 Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information Riashat Islam, Manan Tomar, Alex Lamb, Hongyu Zang, Yonathan Efroni, Dipendra Misra, Aniket Rajiv Didolkar, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford
ICLR 2022 Modular Lifelong Reinforcement Learning via Neural Composition Jorge A Mendez, Harm van Seijen, Eric Eaton
NeurIPSW 2022 Replay Buffer with Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning Ali Rahimi-Kalahroudi, Janarthanan Rajendran, Ida Momennejad, Harm van Seijen, Sarath Chandar
ICLRW 2022 Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods Yi Wan, Ali Rahimi-Kalahroudi, Janarthanan Rajendran, Ida Momennejad, Sarath Chandar, Harm van Seijen
ICLR 2021 Systematic Generalisation with Group Invariant Predictions Faruk Ahmed, Yoshua Bengio, Harm van Seijen, Aaron Courville
NeurIPS 2020 The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning Harm Van Seijen, Hadi Nekoei, Evan Racah, Sarath Chandar
ICML 2019 Dead-Ends and Secure Exploration in Reinforcement Learning Mehdi Fatemi, Shikhar Sharma, Harm Van Seijen, Samira Ebrahimi Kahou
NeurIPS 2019 Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning Harm Van Seijen, Mehdi Fatemi, Arash Tavakoli
AAAI 2018 On Value Function Representation of Long Horizon Problems Lucas Lehnert, Romain Laroche, Harm van Seijen
NeurIPS 2017 Hybrid Reward Architecture for Reinforcement Learning Harm Van Seijen, Mehdi Fatemi, Joshua Romoff, Romain Laroche, Tavian Barnes, Jeffrey Tsang
JMLR 2016 True Online Temporal-Difference Learning Harm van Seijen, A. Rupam Mahmood, Patrick M. Pilarski, Marlos C. Machado, Richard S. Sutton
ICML 2013 Planning by Prioritized Sweeping with Small Backups Harm Van Seijen, Rich Sutton
JMLR 2011 Exploiting Best-Match Equations for Efficient Reinforcement Learning Harm van Seijen, Shimon Whiteson, Hado van Hasselt, Marco Wiering