Deng, Zehang

1 publications

ICLR 2026 Sharpness-Aware Minimization in Logit Space Efficiently Enhances Direct Preference Optimization Haocheng Luo, Zehang Deng, Thanh-Toan Do, Mehrtash Harandi, Dinh Phung, Trung Le