ML Anthology
Authors
Search
About
Ling, Zhenqing
3 publications
ICLR
2026
BOTS: A Unified Framework for Bayesian Online Task Selection in LLM Reinforcement Finetuning
Qianli Shen
,
Daoyuan Chen
,
Yilun Huang
,
Zhenqing Ling
,
Yaliang Li
,
Bolin Ding
,
Jingren Zhou
NeurIPS
2025
Diversity as a Reward: Fine-Tuning LLMs on a Mixture of Domain-Undetermined Data
Zhenqing Ling
,
Daoyuan Chen
,
Liuyi Yao
,
Qianli Shen
,
Yaliang Li
,
Ying Shen
NeurIPS
2025
MindGYM: What Matters in Question Synthesis for Thinking-Centric Fine-Tuning?
Zhe Xu
,
Daoyuan Chen
,
Zhenqing Ling
,
Yaliang Li
,
Ying Shen