Zheng, Haizhong

11 publications

ICLR 2026 Jackpot: Align Actor-Policy Distribution for Scalable and Stable RL for LLM Zhuoming Chen, Hongyi Liu, Yang Zhou, Haizhong Zheng, Beidi Chen
ICLR 2026 OPPO: Accelerating PPO-Based RLHF via Pipeline Overlap Kaizhuo Yan, YingJie Yu, Yifan Yu, Haizhong Zheng, Fan Lai
ICLR 2026 Prosperity Before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs? Haizhong Zheng, Jiawei Zhao, Beidi Chen
NeurIPS 2025 Act Only When It Pays: Efficient Reinforcement Learning for LLM Reasoning via Selective Rollouts Haizhong Zheng, Yang Zhou, Brian R. Bartoldson, Bhavya Kailkhura, Fan Lai, Jiawei Zhao, Beidi Chen
ICLR 2025 ELFS: Label-Free Coreset Selection with Proxy Training Dynamics Haizhong Zheng, Elisa Tsai, Yifu Lu, Jiachen Sun, Brian R. Bartoldson, Bhavya Kailkhura, Atul Prakash
NeurIPS 2025 Kinetics: Rethinking Test-Time Scaling Law Ranajoy Sadhukhan, Zhuoming Chen, Haizhong Zheng, Beidi Chen
ICLR 2024 CALICO: Self-Supervised Camera-LiDAR Contrastive Pre-Training for BEV Perception Jiachen Sun, Haizhong Zheng, Qingzhao Zhang, Atul Prakash, Zhuoqing Mao, Chaowei Xiao
NeurIPS 2024 Learn to Be Efficient: Build Structured Sparsity in Large Language Models Haizhong Zheng, Xiaoyan Bai, Xueshen Liu, Z. Morley Mao, Beidi Chen, Fan Lai, Atul Prakash
ECCV 2024 Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation Haizhong Zheng, Jiachen Sun, Shutong Wu, Bhavya Kailkhura, Zhuoqing Morley Mao, Chaowei Xiao, Atul Prakash
ICLR 2023 Coverage-Centric Coreset Selection for High Pruning Rates Haizhong Zheng, Rui Liu, Fan Lai, Atul Prakash
CVPR 2020 Efficient Adversarial Training with Transferable Adversarial Examples Haizhong Zheng, Ziqi Zhang, Juncheng Gu, Honglak Lee, Atul Prakash