Cheng, Pengyu

10 publications

TMLR 2026 RLHF in an SFT Way: From Optimal Solution to Reward-Weighted Alignment Yuhao Du, Zhuo Li, Pengyu Cheng, Zhihong Chen, Yuejiao Xie, Xiang Wan, Anningzhe Gao
NeurIPS 2024 Self-Playing Adversarial Language Game Enhances LLM Reasoning Pengyu Cheng, Yong Dai, Tianhao Hu, Han Xu, Zhisong Zhang, Lei Han, Nan Du, Xiaolong Li
AISTATS 2023 Estimating Total Correlation with Mutual Information Estimators Ke Bai, Pengyu Cheng, Weituo Hao, Ricardo Henao, Larry Carin
AISTATS 2023 Toward Fairness in Text Generation via Mutual Information Minimization Based on Importance Sampling Rui Wang, Pengyu Cheng, Ricardo Henao
ICLR 2021 FairFil: Contrastive Neural Debiasing Method for Pretrained Text Encoders Pengyu Cheng, Weituo Hao, Siyang Yuan, Shijing Si, Lawrence Carin
ICLR 2021 Improving Zero-Shot Voice Style Transfer via Disentangled Representation Learning Siyang Yuan, Pengyu Cheng, Ruiyi Zhang, Weituo Hao, Zhe Gan, Lawrence Carin
ICML 2020 CLUB: A Contrastive Log-Ratio Upper Bound of Mutual Information Pengyu Cheng, Weituo Hao, Shuyang Dai, Jiachang Liu, Zhe Gan, Lawrence Carin
AAAI 2020 Dynamic Embedding on Textual Networks via a Gaussian Process Pengyu Cheng, Yitong Li, Xinyuan Zhang, Liqun Chen, David E. Carlson, Lawrence Carin
NeurIPSW 2020 Estimating Total Correlation with Mutual Information Bounds Pengyu Cheng, Weituo Hao, Lawrence Carin
ICML 2019 Understanding and Accelerating Particle-Based Variational Inference Chang Liu, Jingwei Zhuo, Pengyu Cheng, Ruiyi Zhang, Jun Zhu