Wang, Mingzhi

4 publications

ICLR 2025. Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs. Zhaowei Zhang, Fengshuo Bai, Qizhi Chen, Chengdong Ma, Mingzhi Wang, Haoran Sun, Zilong Zheng, Yaodong Yang
ICML 2025. Falcon: Fast Visuomotor Policies via Partial Denoising. Haojun Chen, Minghao Liu, Chengdong Ma, Xiaojian Ma, Zailin Ma, Huimin Wu, Yuanpei Chen, Yifan Zhong, Mingzhi Wang, Qing Li, Yaodong Yang
ICLR 2025. Magnetic Preference Optimization: Achieving Last-Iterate Convergence for Language Model Alignment. Mingzhi Wang, Chengdong Ma, Qizhi Chen, Linjian Meng, Yang Han, Jiancong Xiao, Zhaowei Zhang, Jing Huo, Weijie J Su, Yaodong Yang
NeurIPS 2023. Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning. Stephen McAleer, Gabriele Farina, Gaoyue Zhou, Mingzhi Wang, Yaodong Yang, Tuomas Sandholm