Qu, Yuxiao

11 publications

ICLR 2026 RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems Yuxiao Qu, Anikait Singh, Yoonho Lee, Amrith Setlur, Ruslan Salakhutdinov, Chelsea Finn, Aviral Kumar
ICLR 2025 Harnessing Webpage UIs for Text-Rich Visual Understanding Junpeng Liu, Tianyue Ou, Yifan Song, Yuxiao Qu, Wai Lam, Chenyan Xiong, Wenhu Chen, Graham Neubig, Xiang Yue
ICML 2025 Optimizing Test-Time Compute via Meta Reinforcement Finetuning Yuxiao Qu, Matthew Y. R. Yang, Amrith Setlur, Lewis Tunstall, Edward Emanuel Beeching, Ruslan Salakhutdinov, Aviral Kumar
ICLRW 2025 Optimizing Test-Time Compute via Meta Reinforcement Finetuning Yuxiao Qu, Matthew Y. R. Yang, Amrith Setlur, Lewis Tunstall, Edward Emanuel Beeching, Ruslan Salakhutdinov, Aviral Kumar
ICLRW 2025 Optimizing Test-Time Compute via Meta Reinforcement Finetuning Yuxiao Qu, Matthew Y. R. Yang, Lewis Tunstall, Edward Emanuel Beeching, Ruslan Salakhutdinov
ICMLW 2024 Recursive Introspection: Teaching Foundation Model Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar
ICMLW 2024 Recursive Introspection: Teaching LLM Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar
ICMLW 2024 Recursive Introspection: Teaching LLM Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar
ICMLW 2024 Recursive Introspection: Teaching LLM Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar
NeurIPS 2024 Recursive Introspection: Teaching Language Model Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar
CoLLAs 2022 Simulation-Acquired Latent Action Spaces for Dynamics Generalization Nicholas Corrado, Yuxiao Qu, Josiah P. Hanna