Yang, Matthew Y. R.

6 publications

ICLR 2026 | E3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs | Amrith Setlur, Matthew Y. R. Yang, Charlie Victor Snell, Jeremiah Greer, Ian Wu, Virginia Smith, Max Simchowitz, Aviral Kumar
ICLR 2026 | InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning | Matthew Y. R. Yang, Hao Bai, Ian Wu, Gene Yang, Amrith Setlur, Aviral Kumar
ICML 2025 | Optimizing Test-Time Compute via Meta Reinforcement Finetuning | Yuxiao Qu, Matthew Y. R. Yang, Amrith Setlur, Lewis Tunstall, Edward Emanuel Beeching, Ruslan Salakhutdinov, Aviral Kumar
ICLRW 2025 | Optimizing Test-Time Compute via Meta Reinforcement Finetuning | Yuxiao Qu, Matthew Y. R. Yang, Amrith Setlur, Lewis Tunstall, Edward Emanuel Beeching, Ruslan Salakhutdinov, Aviral Kumar
ICLRW 2025 | Optimizing Test-Time Compute via Meta Reinforcement Finetuning | Yuxiao Qu, Matthew Y. R. Yang, Lewis Tunstall, Edward Emanuel Beeching, Ruslan Salakhutdinov
ICML 2024 | Disguised Copyright Infringement of Latent Diffusion Models | Yiwei Lu, Matthew Y. R. Yang, Zuoqiu Liu, Gautam Kamath, Yaoliang Yu