ML Anthology
Jia, Qing-Shan
6 publications
ICML 2025
CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries
Ni Mu, Hao Hu, Xiao Hu, Yiqin Yang, Bo Xu, Qing-Shan Jia
IJCAI 2025
S-EPOA: Overcoming the Indistinguishability of Segments with Skill-Driven Preference-Based Reinforcement Learning
Ni Mu, Yao Luan, Yiqin Yang, Bo Xu, Qing-Shan Jia
NeurIPS 2025
STAIR: Addressing Stage Misalignment Through Temporal-Aligned Preference Reinforcement Learning
Yao Luan, Ni Mu, Yiqin Yang, Bo Xu, Qing-Shan Jia
ICLR 2024
Query-Policy Misalignment in Preference-Based Reinforcement Learning
Xiao Hu, Jianxiong Li, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang
ICLR 2023
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang
ICMLW 2023
Query-Policy Misalignment in Preference-Based Reinforcement Learning
Xiao Hu, Jianxiong Li, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang