Qiu, Wenjie

8 publications

ICLR 2026 Preference-Based Policy Optimization from Sparse-Reward Offline Dataset Wenjie Qiu, Guofeng Cui, Shicheng Liu, Yuanlin Duan, He Zhu
NeurIPS 2025 Explainable Reinforcement Learning from Human Feedback to Improve Alignment Shicheng Liu, Siyuan Xu, Wenjie Qiu, Hangfan Zhang, Minghui Zhu
NeurIPS 2025 Learning from Demonstrations via Capability-Aware Goal Sampling Yuanlin Duan, Yuning Wang, Wenjie Qiu, He Zhu
NeurIPS 2025 MetaBox-V2: A Unified Benchmark Platform for Meta-Black-Box Optimization Zeyuan Ma, Yue-Jiao Gong, Hongshu Guo, Wenjie Qiu, Sijie Ma, Hongqiao Lian, Jiajun Zhan, Kaixu Chen, Chen Wang, Zhiyang Huang, Zechuan Huang, Guojun Peng, Ran Cheng, Yining Ma
ICLR 2025 Q-Adapter: Customizing Pre-Trained LLMs to New Preferences with Forgetting Mitigation Yi-Chen Li, Fuxiang Zhang, Wenjie Qiu, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu, Bo An
ICML 2024 Debiased Offline Representation Learning for Fast Online Adaptation in Non-Stationary Dynamics Xinyu Zhang, Wenjie Qiu, Yi-Chen Li, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu
NeurIPS 2023 Instructing Goal-Conditioned Reinforcement Learning Agents with Temporal Logic Objectives Wenjie Qiu, Wensen Mao, He Zhu
ICLR 2022 Programmatic Reinforcement Learning Without Oracles Wenjie Qiu, He Zhu