Feng, Guhao

9 publications

ICML 2025 DPO Meets PPO: Reinforced Token Optimization for RLHF Han Zhong, Zikang Shan, Guhao Feng, Wei Xiong, Xinle Cheng, Li Zhao, Di He, Jiang Bian, Liwei Wang

NeurIPS 2025 Theoretical Benefit and Limitation of Diffusion Language Model Guhao Feng, Yihan Geng, Jian Guan, Wei Wu, Liwei Wang, Di He

ICMLW 2024 DPO Meets PPO: Reinforced Token Optimization for RLHF Han Zhong, Guhao Feng, Wei Xiong, Xinle Cheng, Li Zhao, Di He, Jiang Bian, Liwei Wang

ICML 2024 Do Efficient Transformers Really Save Computation? Kai Yang, Jan Ackermann, Zhenyu He, Guhao Feng, Bohang Zhang, Yunzhen Feng, Qiwei Ye, Di He, Liwei Wang

NeurIPS 2024 Rethinking Model-Based, Policy-Based, and Value-Based Reinforcement Learning via the Lens of Representation Complexity Guhao Feng, Han Zhong

ICMLW 2024 Rethinking Model-Based, Policy-Based, and Value-Based Reinforcement Learning via the Lens of Representation Complexity Guhao Feng, Han Zhong

ICML 2024 Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation Zhenyu He, Guhao Feng, Shengjie Luo, Kai Yang, Liwei Wang, Jingjing Xu, Zhi Zhang, Hongxia Yang, Di He

ICML 2023 A Complete Expressiveness Hierarchy for Subgraph GNNs via Subgraph Weisfeiler-Lehman Tests Bohang Zhang, Guhao Feng, Yiheng Du, Di He, Liwei Wang

NeurIPS 2023 Towards Revealing the Mystery Behind Chain of Thought: A Theoretical Perspective Guhao Feng, Bohang Zhang, Yuntian Gu, Haotian Ye, Di He, Liwei Wang