ML Anthology
Authors
Search
About
Wang, Mingzhi
4 publications
ICLR
2025
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
Zhaowei Zhang
,
Fengshuo Bai
,
Qizhi Chen
,
Chengdong Ma
,
Mingzhi Wang
,
Haoran Sun
,
Zilong Zheng
,
Yaodong Yang
ICML
2025
Falcon: Fast Visuomotor Policies via Partial Denoising
Haojun Chen
,
Minghao Liu
,
Chengdong Ma
,
Xiaojian Ma
,
Zailin Ma
,
Huimin Wu
,
Yuanpei Chen
,
Yifan Zhong
,
Mingzhi Wang
,
Qing Li
,
Yaodong Yang
ICLR
2025
Magnetic Preference Optimization: Achieving Last-Iterate Convergence for Language Model Alignment
Mingzhi Wang
,
Chengdong Ma
,
Qizhi Chen
,
Linjian Meng
,
Yang Han
,
Jiancong Xiao
,
Zhaowei Zhang
,
Jing Huo
,
Weijie J Su
,
Yaodong Yang
NeurIPS
2023
Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning
Stephen McAleer
,
Gabriele Farina
,
Gaoyue Zhou
,
Mingzhi Wang
,
Yaodong Yang
,
Tuomas Sandholm