ML Anthology
Authors
Search
About
Yang, Zhengyi
6 publications
ICML
2025
AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
Junkang Wu
,
Xue Wang
,
Zhengyi Yang
,
Jiancan Wu
,
Jinyang Gao
,
Bolin Ding
,
Xiang Wang
,
Xiangnan He
NeurIPS
2025
On Efficiency-Effectiveness Trade-Off of Diffusion-Based Recommenders
Wenyu Mao
,
Jiancan Wu
,
Guoqing Hu
,
Zhengyi Yang
,
Wei Ji
,
Xiang Wang
ICLR
2025
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
Junkang Wu
,
Yuexiang Xie
,
Zhengyi Yang
,
Jiancan Wu
,
Jiawei Chen
,
Jinyang Gao
,
Bolin Ding
,
Xiang Wang
,
Xiangnan He
NeurIPS
2024
$\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$
Junkang Wu
,
Yuexiang Xie
,
Zhengyi Yang
,
Jiancan Wu
,
Jinyang Gao
,
Bolin Ding
,
Xiang Wang
,
Xiangnan He
NeurIPS
2024
On SoftMax Direct Preference Optimization for Recommendation
Yuxin Chen
,
Junfei Tan
,
An Zhang
,
Zhengyi Yang
,
Leheng Sheng
,
Enzhi Zhang
,
Xiang Wang
,
Tat-Seng Chua
NeurIPS
2023
Generate What You Prefer: Reshaping Sequential Recommendation via Guided Diffusion
Zhengyi Yang
,
Jiancan Wu
,
Zhicai Wang
,
Xiang Wang
,
Yancheng Yuan
,
Xiangnan He