Zhang, Haode

1 publications

ICML 2025 Robust Reward Alignment via Hypothesis Space Batch Cutting Zhixian Xie, Haode Zhang, Yizhe Feng, Wanxin Jin