ML Anthology
Authors
Search
About
Zhang, Dylan
4 publications
TMLR
2025
Entropy-Regularized Process Reward Model
Hanning Zhang
,
Pengcheng Wang
,
Shizhe Diao
,
Yong Lin
,
Rui Pan
,
Hanze Dong
,
Dylan Zhang
,
Pavlo Molchanov
,
Tong Zhang
ICLRW
2025
Improving Influence-Based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities
Qirun Dai
,
Dylan Zhang
,
Jiaqi W. Ma
,
Hao Peng
NeurIPS
2025
The Best Instruction-Tuning Data Are Those That Fit
Dylan Zhang
,
Qirun Dai
,
Hao Peng
TMLR
2024
Transformer-Based Models Are Not yet Perfect at Learning to Emulate Structural Recursion
Dylan Zhang
,
Curt Tigges
,
Zory Zhang
,
Stella Biderman
,
Maxim Raginsky
,
Talia Ringer