Le Roux, Nicolas
38 publications
NeurIPS
2025
Tapered Off-Policy REINFORCE - Stable and Efficient Reinforcement Learning for Large Language Models
NeurIPSW
2023
Surrogate Minimization: An Optimization Algorithm for Training Large Neural Networks with Model Parallelism
AISTATS
2022
On the Convergence of Stochastic Extragradient for Bilinear Games Using Restarted Iteration Averaging
ICML
2021
Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization
AISTATS
2020
On the Interplay Between Noise and Curvature and Its Effect on Optimization and Generalization