Yang, Yiqin

17 publications

ICML 2025 CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries Ni Mu, Hao Hu, Xiao Hu, Yiqin Yang, Bo Xu, Qing-Shan Jia
NeurIPS 2025 DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning Runpeng Xie, Quanwei Wang, Hao Hu, Zherui Zhou, Ni Mu, Xiyun Li, Yiqin Yang, Shuang Xu, Qianchuan Zhao, Bo Xu
ICLR 2025 Episodic Novelty Through Temporal Distance Yuhua Jiang, Qihan Liu, Yiqin Yang, Xiaoteng Ma, Dianyu Zhong, Hao Hu, Jun Yang, Bin Liang, Bo Xu, Chongjie Zhang, Qianchuan Zhao
ICLR 2025 Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset Yiqin Yang, Quanwei Wang, Chenghao Li, Hao Hu, Chengjie Wu, Yuhua Jiang, Dianyu Zhong, Ziyou Zhang, Qianchuan Zhao, Chongjie Zhang, Bo Xu
IJCAI 2025 S-EPOA: Overcoming the Indistinguishability of Segments with Skill-Driven Preference-Based Reinforcement Learning Ni Mu, Yao Luan, Yiqin Yang, Bo Xu, Qing-Shan Jia
NeurIPS 2025 STAIR: Addressing Stage Misalignment Through Temporal-Aligned Preference Reinforcement Learning Yao Luan, Ni Mu, Yiqin Yang, Bo Xu, Qing-Shan Jia
ICML 2024 Bayesian Design Principles for Offline-to-Online Reinforcement Learning Hao Hu, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang
NeurIPSW 2024 Episodic Novelty Through Temporal Distance Yuhua Jiang, Qihan Liu, Yiqin Yang, Xiaoteng Ma, Dianyu Zhong, Bo Xu, Jun Yang, Bin Liang, Chongjie Zhang, Qianchuan Zhao
AAAI 2024 Learning Diverse Risk Preferences in Population-Based Self-Play Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li, Yiqin Yang, Jun Yang, Bin Liang, Qianchuan Zhao
AAAI 2024 No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning Dianyu Zhong, Yiqin Yang, Qianchuan Zhao
ICML 2024 Planning, Fast and Slow: Online Reinforcement Learning with Action-Free Offline Data via Multiscale Planners Chengjie Wu, Hao Hu, Yiqin Yang, Ning Zhang, Chongjie Zhang
AAAI 2023 Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery Yiqin Yang, Hao Hu, Wenzhe Li, Siyuan Li, Jun Yang, Qianchuan Zhao, Chongjie Zhang
ICLR 2023 The Provable Benefit of Unsupervised Data Sharing for Offline Reinforcement Learning Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang
NeurIPS 2023 Unsupervised Behavior Extraction via Random Intent Priors Hao Hu, Yiqin Yang, Jianing Ye, Ziqing Mai, Chongjie Zhang
ICLR 2022 Offline Reinforcement Learning with Value-Based Episodic Memory Xiaoteng Ma, Yiqin Yang, Hao Hu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang, Qihan Liu
ICML 2022 On the Role of Discount Factor in Offline Reinforcement Learning Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang
NeurIPS 2021 Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning Yiqin Yang, Xiaoteng Ma, Chenghao Li, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang, Qianchuan Zhao