Zhang, Dylan

4 publications

TMLR 2025 Entropy-Regularized Process Reward Model Hanning Zhang, Pengcheng Wang, Shizhe Diao, Yong Lin, Rui Pan, Hanze Dong, Dylan Zhang, Pavlo Molchanov, Tong Zhang
ICLRW 2025 Improving Influence-Based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities Qirun Dai, Dylan Zhang, Jiaqi W. Ma, Hao Peng
NeurIPS 2025 The Best Instruction-Tuning Data Are Those That Fit Dylan Zhang, Qirun Dai, Hao Peng
TMLR 2024 Transformer-Based Models Are Not yet Perfect at Learning to Emulate Structural Recursion Dylan Zhang, Curt Tigges, Zory Zhang, Stella Biderman, Maxim Raginsky, Talia Ringer