Jia, Qing-Shan

6 publications

ICML 2025 CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries Ni Mu, Hao Hu, Xiao Hu, Yiqin Yang, Bo Xu, Qing-Shan Jia
IJCAI 2025 S-EPOA: Overcoming the Indistinguishability of Segments with Skill-Driven Preference-Based Reinforcement Learning Ni Mu, Yao Luan, Yiqin Yang, Bo Xu, Qing-Shan Jia
NeurIPS 2025 STAIR: Addressing Stage Misalignment Through Temporal-Aligned Preference Reinforcement Learning Yao Luan, Ni Mu, Yiqin Yang, Bo Xu, Qing-Shan Jia
ICLR 2024 Query-Policy Misalignment in Preference-Based Reinforcement Learning Xiao Hu, Jianxiong Li, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang
ICLR 2023 Mind the Gap: Offline Policy Optimization for Imperfect Rewards Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang
ICMLW 2023 Query-Policy Misalignment in Preference-Based Reinforcement Learning Xiao Hu, Jianxiong Li, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang