Lebensold, Jonathan

3 publications

NeurIPS 2025. Tapered Off-Policy REINFORCE - Stable and Efficient Reinforcement Learning for Large Language Models. Nicolas Le Roux, Marc G Bellemare, Jonathan Lebensold, Arnaud Bergeron, Joshua Greaves, Alexandre Fréchette, Carolyne Pelletier, Eric Thibodeau-Laufer, Sándor Tóth, Sam Work
NeurIPSW 2024. Mitigating Downstream Model Risks via Model Provenance. Keyu Wang, Scott Schaffter, Abdullah Norozi Iranzad, Doina Precup, Jonathan Lebensold, Meg Risdal
AISTATS 2024. On the Privacy of Selection Mechanisms with Gaussian Noise. Jonathan Lebensold, Doina Precup, Borja Balle