Yuan, Huizhuo

16 publications

ICLRW 2025 Game-Theoretic Regularized Self-Play Alignment of Large Language Models Xiaohang Tang, Sangwoong Yoon, Seongho Son, Huizhuo Yuan, Quanquan Gu, Ilija Bogunovic
ICML 2025 MARS: Unleashing the Power of Variance Reduction for Training Large Models Huizhuo Yuan, Yifeng Liu, Shuang Wu, Zhou Xun, Quanquan Gu
ICLR 2025 Self-Play Preference Optimization for Language Model Alignment Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu
NeurIPS 2025 Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression Yuning Shen, Lihao Wang, Huizhuo Yuan, Yan Wang, Bangji Yang, Quanquan Gu
NeurIPS 2025 Tensor Product Attention Is All You Need Yifan Zhang, Yifeng Liu, Huizhuo Yuan, Zhen Qin, Yang Yuan, Quanquan Gu, Andrew C Yao
NeurIPSW 2024 Accelerated Preference Optimization for Large Language Model Alignment Jiafan He, Huizhuo Yuan, Quanquan Gu
NeurIPS 2024 Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu
ICML 2024 Protein Conformation Generation via Force-Guided SE(3) Diffusion Models Yan Wang, Lihao Wang, Yuning Shen, Yiqun Wang, Huizhuo Yuan, Yue Wu, Quanquan Gu
ICML 2024 Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu
NeurIPS 2024 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu
ICMLW 2024 Self-Play Preference Optimization for Language Model Alignment Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu
NeurIPSW 2024 Self-Play Preference Optimization for Language Model Alignment Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu
ICLR 2023 A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning Zixiang Chen, Chris Junchi Li, Huizhuo Yuan, Quanquan Gu, Michael Jordan
ICML 2023 Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization Chris Junchi Li, Huizhuo Yuan, Gauthier Gidel, Quanquan Gu, Michael Jordan
ICML 2019 Differential Inclusions for Modeling Nonsmooth ADMM Variants: A Continuous Limit Theory Huizhuo Yuan, Yuren Zhou, Chris Junchi Li, Qingyun Sun
NeurIPS 2019 Efficient Smooth Non-Convex Stochastic Compositional Optimization via Stochastic Recursive Gradient Descent Wenqing Hu, Chris Junchi Li, Xiangru Lian, Ji Liu, Huizhuo Yuan