Yang, Yingxiang

12 publications

ICML 2025 Reward-Augmented Data Enhances Direct Preference Alignment of LLMs Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang
ICLRW 2025 Reward-Augmented Data Enhances Direct Preference Alignment of LLMs Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang
ICLR 2024 Let Models Speak Ciphers: Multiagent Debate Through Embeddings Chau Pham, Boyi Liu, Yingxiang Yang, Zhengyu Chen, Tianyi Liu, Jianbo Yuan, Bryan A. Plummer, Zhaoran Wang, Hongxia Yang
NeurIPS 2024 Provably Mitigating Overoptimization in RLHF: Your SFT Loss Is Implicitly an Adversarial Regularizer Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose Blanchet, Zhaoran Wang
ICMLW 2024 Provably Mitigating Overoptimization in RLHF: Your SFT Loss Is Implicitly an Adversarial Regularizer Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose Blanchet, Zhaoran Wang
ICML 2022 Fourier Learning with Cyclical Data Yingxiang Yang, Zhihan Xiong, Tianyi Liu, Taiqing Wang, Chong Wang
NeurIPSW 2022 RLCG: When Reinforcement Learning Meets Coarse Graining Shenghao Wu, Tianyi Liu, Zhirui Wang, Wen Yan, Yingxiang Yang
NeurIPS 2020 The Devil Is in the Detail: A Framework for Macroscopic Prediction via Microscopic Models Yingxiang Yang, Negar Kiyavash, Le Song, Niao He
NeurIPS 2019 Learning Positive Functions with Pseudo Mirror Descent Yingxiang Yang, Haoxiang Wang, Negar Kiyavash, Niao He
NeurIPS 2018 Predictive Approximate Bayesian Computation via Saddle Points Yingxiang Yang, Bo Dai, Negar Kiyavash, Niao He
NeurIPS 2017 Online Learning for Multivariate Hawkes Processes Yingxiang Yang, Jalal Etesami, Niao He, Negar Kiyavash
AAAI 2016 Mobility Sequence Extraction and Labeling Using Sparse Cell Phone Data Yingxiang Yang, Peter Widhalm, Shounak Athavale, Marta C. González