Liu, Runze

8 publications

NeurIPS 2025. Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration. Junqi Gao, Zhichang Guo, Dazhi Zhang, Dong Li, Runze Liu, Pengfei Li, Kai Tian, Biqing Qi
ICLRW 2025. Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling. Runze Liu, Junqi Gao, Jian Zhao, Kaiyan Zhang, Xiu Li, Biqing Qi, Wanli Ouyang, Bowen Zhou
ICLR 2025. Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint. Jiafei Lyu, Mengbei Yan, Zhongjian Qiao, Runze Liu, Xiaoteng Ma, Deheng Ye, Jing-Wen Yang, Zongqing Lu, Xiu Li
AAAI 2025. RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors. Fengshuo Bai, Runze Liu, Yali Du, Ying Wen, Yaodong Yang
ICML 2024. PEARL: Zero-Shot Cross-Task Preference Alignment and Robust Reward Learning for Robotic Manipulation. Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li
ICLR 2024. SEABO: A Simple Search-Based Method for Offline Imitation Learning. Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu
NeurIPSW 2023. Zero-Shot Cross-Task Preference Alignment for Offline RL via Optimal Transport. Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li
NeurIPS 2022. Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-Based Reinforcement Learning. Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang