Coste, Thomas

9 publications

ICLRW 2025 AppVLM: A Lightweight Vision Language Model for Online App Control Georgios Papoudakis, Thomas Coste, Zhihao Wu, Jianye Hao, Jun Wang, Kun Shao
ICLR 2025 Lightweight Neural App Control Filippos Christianos, Georgios Papoudakis, Thomas Coste, Jianye Hao, Jun Wang, Kun Shao
NeurIPS 2025 Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control Georgios Papoudakis, Thomas Coste, Jianye Hao, Jun Wang, Kun Shao
ICLRW 2024 Bayesian Reward Models for LLM Alignment Adam X. Yang, Maxime Robeyns, Thomas Coste, Jun Wang, Haitham Bou Ammar, Laurence Aitchison
ICMLW 2024 Bayesian Reward Models for LLM Alignment Adam X. Yang, Maxime Robeyns, Thomas Coste, Zhengyan Shi, Jun Wang, Haitham Bou Ammar, Laurence Aitchison
NeurIPSW 2024 Lightweight Neural App Control Filippos Christianos, Georgios Papoudakis, Thomas Coste, Jianye Hao, Jun Wang, Kun Shao
ICLR 2024 Reward Model Ensembles Help Mitigate Overoptimization Thomas Coste, Usman Anwar, Robert Kirk, David Krueger
NeurIPSW 2023 Reward Model Ensembles Help Mitigate Overoptimization Thomas Coste, Usman Anwar, Robert Kirk, David Krueger
NeurIPSW 2023 Reward Model Ensembles Help Mitigate Overoptimization Thomas Coste, Usman Anwar, Robert Kirk, David Krueger