ML Anthology
Ji, Kaixuan
7 publications
TMLR
2025
Reinforcement Learning from Human Feedback with Active Queries
Kaixuan Ji, Jiafan He, Quanquan Gu
ICLR
2025
Self-Play Preference Optimization for Language Model Alignment
Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu
ICLR
2024
Horizon-Free Reinforcement Learning in Adversarial Linear Mixture MDPs
Kaixuan Ji, Qingyue Zhao, Jiafan He, Weitong Zhang, Quanquan Gu
ICML
2024
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu
NeurIPS
2024
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu
ICMLW
2024
Self-Play Preference Optimization for Language Model Alignment
Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu
NeurIPSW
2024
Self-Play Preference Optimization for Language Model Alignment
Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu