Du, Yuhao

3 publications

TMLR 2026 RLHF in an SFT Way: From Optimal Solution to Reward-Weighted Alignment Yuhao Du, Zhuo Li, Pengyu Cheng, Zhihong Chen, Yuejiao Xie, Xiang Wan, Anningzhe Gao
NeurIPS 2025 Intermediate Domain Alignment and Morphology Analogy for Patent-Product Image Retrieval Haifan Gong, Xuanye Zhang, Ruifei Zhang, Yun Su, Zhuo Li, Yuhao Du, Anningzhe Gao, Xiang Wan, Haofeng Li
NeurIPS 2023 Strategic Behavior in Two-Sided Matching Markets with Prediction-Enhanced Preference-Formation Stefania Ionescu, Yuhao Du, Kenneth Joseph, Ancsa Hannak