Zhu, Wenbo
7 publications
ICLR
2026
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
NeurIPS
2025
DuSA: Fast and Accurate Dual-Stage Sparse Attention Mechanism Accelerating Both Training and Inference
7 publications