Carta, Thomas
6 publications
NeurIPSW
2024
SAC-GLAM: Improving Online RL for LLM Agents with Soft Actor-Critic and Hindsight Relabeling
NeurIPSW
2023
Codeplay: Autotelic Learning Through Collaborative Self-Play in Programming Environments
6 publications