Pouplin, Thomas

4 publications

ICML 2025 The Synergy of LLMs & RL Unlocks Offline Learning of Generalizable Language-Conditioned Policies with Low-Fidelity Data Thomas Pouplin, Kasia Kobalczyk, Hao Sun, Mihaela Van Der Schaar
NeurIPSW 2024 Improving LLM Generation with Inverse and Forward Alignment: Reward Modeling, Prompting, Fine-Tuning, and Inference-Time Optimization Hao Sun, Thomas Pouplin, Nicolás Astorga, Tennison Liu, Mihaela van der Schaar
NeurIPSW 2024 Improving LLM Generation with Inverse and Forward Alignment: Reward Modeling, Prompting, Fine-Tuning, and Inference-Time Optimization Hao Sun, Thomas Pouplin, Nicolás Astorga, Tennison Liu, Mihaela van der Schaar
ICML 2024 Relaxed Quantile Regression: Prediction Intervals for Asymmetric Noise Thomas Pouplin, Alan Jeffares, Nabeel Seedat, Mihaela Van Der Schaar