Tang, Wenpin

10 publications

ICLR 2025 MallowsPO: Fine-Tune Your LLM with Preference Dispersions Haoxian Chen, Hanyang Zhao, Henry Lam, David Yao, Wenpin Tang
JAIR 2025 Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey Genta Indra Winata, Hanyang Zhao, Anirban Das, Wenpin Tang, David D. Yao, Shi-Xiong Zhang, Sambit Sahu
ICLR 2025 RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization Hanyang Zhao, Genta Indra Winata, Anirban Das, Shi-Xiong Zhang, David Yao, Wenpin Tang, Sambit Sahu
ICML 2025 Score as Action: Fine Tuning Diffusion Generative Models by Continuous-Time Reinforcement Learning Hanyang Zhao, Haoxian Chen, Ji Zhang, David Yao, Wenpin Tang
ICLRW 2025 Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-Time Reinforcement Learning Hanyang Zhao, Haoxian Chen, Ji Zhang, David Yao, Wenpin Tang
NeurIPSW 2024 Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions Haoxian Chen, Hanyang Zhao, Henry Lam, David Yao, Wenpin Tang
JMLR 2023 Inference for Gaussian Processes with Matern Covariogram on Compact Riemannian Manifolds Didong Li, Wenpin Tang, Sudipto Banerjee
NeurIPS 2023 Policy Optimization for Continuous Reinforcement Learning Hanyang Zhao, Wenpin Tang, David Yao
ICML 2020 The Buckley-Osthus Model and the Block Preferential Attachment Model: Statistical Analysis and Application Wenpin Tang, Xin Guo, Fengmin Tang
ICML 2019 Mallows Ranking Models: Maximum Likelihood Estimate and Regeneration Wenpin Tang