ML Anthology
Authors
Search
About
Cheng, Pengyu
10 publications
TMLR
2026
RLHF in an SFT Way: From Optimal Solution to Reward-Weighted Alignment
Yuhao Du
,
Zhuo Li
,
Pengyu Cheng
,
Zhihong Chen
,
Yuejiao Xie
,
Xiang Wan
,
Anningzhe Gao
NeurIPS
2024
Self-Playing Adversarial Language Game Enhances LLM Reasoning
Pengyu Cheng
,
Yong Dai
,
Tianhao Hu
,
Han Xu
,
Zhisong Zhang
,
Lei Han
,
Nan Du
,
Xiaolong Li
AISTATS
2023
Estimating Total Correlation with Mutual Information Estimators
Ke Bai
,
Pengyu Cheng
,
Weituo Hao
,
Ricardo Henao
,
Larry Carin
AISTATS
2023
Toward Fairness in Text Generation via Mutual Information Minimization Based on Importance Sampling
Rui Wang
,
Pengyu Cheng
,
Ricardo Henao
ICLR
2021
FairFil: Contrastive Neural Debiasing Method for Pretrained Text Encoders
Pengyu Cheng
,
Weituo Hao
,
Siyang Yuan
,
Shijing Si
,
Lawrence Carin
ICLR
2021
Improving Zero-Shot Voice Style Transfer via Disentangled Representation Learning
Siyang Yuan
,
Pengyu Cheng
,
Ruiyi Zhang
,
Weituo Hao
,
Zhe Gan
,
Lawrence Carin
ICML
2020
CLUB: A Contrastive Log-Ratio Upper Bound of Mutual Information
Pengyu Cheng
,
Weituo Hao
,
Shuyang Dai
,
Jiachang Liu
,
Zhe Gan
,
Lawrence Carin
AAAI
2020
Dynamic Embedding on Textual Networks via a Gaussian Process
Pengyu Cheng
,
Yitong Li
,
Xinyuan Zhang
,
Liqun Chen
,
David E. Carlson
,
Lawrence Carin
NeurIPSW
2020
Estimating Total Correlation with Mutual Information Bounds
Pengyu Cheng
,
Weituo Hao
,
Lawrence Carin
ICML
2019
Understanding and Accelerating Particle-Based Variational Inference
Chang Liu
,
Jingwei Zhuo
,
Pengyu Cheng
,
Ruiyi Zhang
,
Jun Zhu