Hu, Hao

26 publications

ICML 2025 CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries Ni Mu, Hao Hu, Xiao Hu, Yiqin Yang, Bo Xu, Qing-Shan Jia
NeurIPS 2025 DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning Runpeng Xie, Quanwei Wang, Hao Hu, Zherui Zhou, Ni Mu, Xiyun Li, Yiqin Yang, Shuang Xu, Qianchuan Zhao, Bo Xu
ICLR 2025 Episodic Novelty Through Temporal Distance Yuhua Jiang, Qihan Liu, Yiqin Yang, Xiaoteng Ma, Dianyu Zhong, Hao Hu, Jun Yang, Bin Liang, Bo Xu, Chongjie Zhang, Qianchuan Zhao
ICLR 2025 Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset Yiqin Yang, Quanwei Wang, Chenghao Li, Hao Hu, Chengjie Wu, Yuhua Jiang, Dianyu Zhong, Ziyou Zhang, Qianchuan Zhao, Chongjie Zhang, Bo Xu
CVPR 2025 LOGICZSL: Exploring Logic-Induced Representation for Compositional Zero-Shot Learning Peng Wu, Xiankai Lu, Hao Hu, Yongqin Xian, Jianbing Shen, Wenguan Wang
ICML 2024 Bayesian Design Principles for Offline-to-Online Reinforcement Learning Hao Hu, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang
ICML 2024 Planning, Fast and Slow: Online Reinforcement Learning with Action-Free Offline Data via Multiscale Planners Chengjie Wu, Hao Hu, Yiqin Yang, Ning Zhang, Chongjie Zhang
ICML 2024 Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang
ICLR 2024 Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets Yihuan Mao, Chengjie Wu, Xi Chen, Hao Hu, Ji Jiang, Tianze Zhou, Tangjie Lv, Changjie Fan, Zhipeng Hu, Yi Wu, Yujing Hu, Chongjie Zhang
AAAI 2023 Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery Yiqin Yang, Hao Hu, Wenzhe Li, Siyuan Li, Jun Yang, Qianchuan Zhao, Chongjie Zhang
NeurIPS 2023 Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang
NeurIPSW 2023 Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang
ICLR 2023 The Provable Benefit of Unsupervised Data Sharing for Offline Reinforcement Learning Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang
NeurIPS 2023 Unsupervised Behavior Extraction via Random Intent Priors Hao Hu, Yiqin Yang, Jianing Ye, Ziqing Mai, Chongjie Zhang
ICML 2023 What Is Essential for Unseen Goal Generalization of Offline Goal-Conditioned RL? Rui Yang, Lin Yong, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang
ECML-PKDD 2022 Learnable Masked Tokens for Improved Transferability of Self-Supervised Vision Transformers Hao Hu, Federico Baldassarre, Hossein Azizpour
ICLR 2022 Offline Reinforcement Learning with Value-Based Episodic Memory Xiaoteng Ma, Yiqin Yang, Hao Hu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang, Qihan Liu
ICML 2022 On the Role of Discount Factor in Offline Reinforcement Learning Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang
AAAI 2022 Optimizing Binary Decision Diagrams with MaxSAT for Classification Hao Hu, Marie-José Huguet, Mohamed Siala
ICML 2021 Generalizable Episodic Memory for Deep Reinforcement Learning Hao Hu, Jianing Ye, Guangxiang Zhu, Zhizhou Ren, Chongjie Zhang
ICML 2021 MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration Jin Zhang, Jianhao Wang, Hao Hu, Tong Chen, Yingfeng Chen, Changjie Fan, Chongjie Zhang
NeurIPS 2021 On the Estimation Bias in Double Q-Learning Zhizhou Ren, Guangxiang Zhu, Hao Hu, Beining Han, Jianglun Chen, Chongjie Zhang
IJCAI 2020 Learning Optimal Decision Trees with MaxSAT and Its Integration in AdaBoost Hao Hu, Mohamed Siala, Emmanuel Hebrard, Marie-José Huguet
AAAI 2019 Learning to Adaptively Scale Recurrent Neural Networks Hao Hu, Liqiang Wang, Guo-Jun Qi
ICML 2017 State-Frequency Memory Recurrent Neural Networks Hao Hu, Guo-Jun Qi
CVPRW 2017 Temporal Domain Neural Encoder for Video Representation Learning Hao Hu, Zhaowen Wang, Joon-Young Lee, Zhe Lin, Guo-Jun Qi