Bellemare, Marc G.
59 publications
NeurIPS
2025
Tapered Off-Policy REINFORCE: Stable and Efficient Reinforcement Learning for Large Language Models
ICML
2022
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning
NeurIPSW
2022
Variance Double-Down: The Small Batch Size Anomaly in Multistep Deep Reinforcement Learning
IJCAI
2019
An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents