ML Anthology
Yang, Matthew Y. R.
6 publications
ICLR
2026
E3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs
Amrith Setlur, Matthew Y. R. Yang, Charlie Victor Snell, Jeremiah Greer, Ian Wu, Virginia Smith, Max Simchowitz, Aviral Kumar
ICLR
2026
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning
Matthew Y. R. Yang, Hao Bai, Ian Wu, Gene Yang, Amrith Setlur, Aviral Kumar
ICML
2025
Optimizing Test-Time Compute via Meta Reinforcement Finetuning
Yuxiao Qu, Matthew Y. R. Yang, Amrith Setlur, Lewis Tunstall, Edward Emanuel Beeching, Ruslan Salakhutdinov, Aviral Kumar
ICLRW
2025
Optimizing Test-Time Compute via Meta Reinforcement Finetuning
Yuxiao Qu, Matthew Y. R. Yang, Amrith Setlur, Lewis Tunstall, Edward Emanuel Beeching, Ruslan Salakhutdinov, Aviral Kumar
ICLRW
2025
Optimizing Test-Time Compute via Meta Reinforcement Finetuning
Yuxiao Qu, Matthew Y. R. Yang, Lewis Tunstall, Edward Emanuel Beeching, Ruslan Salakhutdinov
ICML
2024
Disguised Copyright Infringement of Latent Diffusion Models
Yiwei Lu, Matthew Y. R. Yang, Zuoqiu Liu, Gautam Kamath, Yaoliang Yu