Luan, Yao

2 publications

IJCAI 2025 S-EPOA: Overcoming the Indistinguishability of Segments with Skill-Driven Preference-Based Reinforcement Learning Ni Mu, Yao Luan, Yiqin Yang, Bo Xu, Qing-Shan Jia
NeurIPS 2025 STAIR: Addressing Stage Misalignment Through Temporal-Aligned Preference Reinforcement Learning Yao Luan, Ni Mu, Yiqin Yang, Bo Xu, Qing-Shan Jia