ML Anthology
Authors
Search
About
Yuan, Huizhuo
16 publications
ICLRW
2025
Game-Theoretic Regularized Self-Play Alignment of Large Language Models
Xiaohang Tang
,
Sangwoong Yoon
,
Seongho Son
,
Huizhuo Yuan
,
Quanquan Gu
,
Ilija Bogunovic
ICML
2025
MARS: Unleashing the Power of Variance Reduction for Training Large Models
Huizhuo Yuan
,
Yifeng Liu
,
Shuang Wu
,
Zhou Xun
,
Quanquan Gu
ICLR
2025
Self-Play Preference Optimization for Language Model Alignment
Yue Wu
,
Zhiqing Sun
,
Huizhuo Yuan
,
Kaixuan Ji
,
Yiming Yang
,
Quanquan Gu
NeurIPS
2025
Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression
Yuning Shen
,
Lihao Wang
,
Huizhuo Yuan
,
Yan Wang
,
Bangji Yang
,
Quanquan Gu
NeurIPS
2025
Tensor Product Attention Is All You Need
Yifan Zhang
,
Yifeng Liu
,
Huizhuo Yuan
,
Zhen Qin
,
Yang Yuan
,
Quanquan Gu
,
Andrew C Yao
NeurIPSW
2024
Accelerated Preference Optimization for Large Language Model Alignment
Jiafan He
,
Huizhuo Yuan
,
Quanquan Gu
NeurIPS
2024
Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time
Zixiang Chen
,
Huizhuo Yuan
,
Yongqian Li
,
Yiwen Kou
,
Junkai Zhang
,
Quanquan Gu
ICML
2024
Protein Conformation Generation via Force-Guided SE(3) Diffusion Models
Yan Wang
,
Lihao Wang
,
Yuning Shen
,
Yiqun Wang
,
Huizhuo Yuan
,
Yue Wu
,
Quanquan Gu
ICML
2024
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Zixiang Chen
,
Yihe Deng
,
Huizhuo Yuan
,
Kaixuan Ji
,
Quanquan Gu
NeurIPS
2024
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Huizhuo Yuan
,
Zixiang Chen
,
Kaixuan Ji
,
Quanquan Gu
ICMLW
2024
Self-Play Preference Optimization for Language Model Alignment
Yue Wu
,
Zhiqing Sun
,
Huizhuo Yuan
,
Kaixuan Ji
,
Yiming Yang
,
Quanquan Gu
NeurIPSW
2024
Self-Play Preference Optimization for Language Model Alignment
Yue Wu
,
Zhiqing Sun
,
Huizhuo Yuan
,
Kaixuan Ji
,
Yiming Yang
,
Quanquan Gu
ICLR
2023
A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
Zixiang Chen
,
Chris Junchi Li
,
Huizhuo Yuan
,
Quanquan Gu
,
Michael Jordan
ICML
2023
Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization
Chris Junchi Li
,
Huizhuo Yuan
,
Gauthier Gidel
,
Quanquan Gu
,
Michael Jordan
ICML
2019
Differential Inclusions for Modeling Nonsmooth ADMM Variants: A Continuous Limit Theory
Huizhuo Yuan
,
Yuren Zhou
,
Chris Junchi Li
,
Qingyun Sun
NeurIPS
2019
Efficient Smooth Non-Convex Stochastic Compositional Optimization via Stochastic Recursive Gradient Descent
Wenqing Hu
,
Chris Junchi Li
,
Xiangru Lian
,
Ji Liu
,
Huizhuo Yuan