Liu, Zihe

3 publications

ICLR 2026. Tricks or Traps? A Deep Dive into RL for LLM Reasoning. Zihe Liu, Jiashun Liu, Yancheng He, Weixun Wang, Jiaheng Liu, Ling Pan, Xinyu Hu, Shaopan Xiong, Ju Huang, Jian Hu, Shengyi Huang, Siran Yang, Jiamang Wang, Wenbo Su, Bo Zheng
IJCAI 2024. A Behavior-Aware Approach for Deep Reinforcement Learning in Non-Stationary Environments Without Known Change Points. Zihe Liu, Jie Lu, Guangquan Zhang, Junyu Xuan
UAI 2024. Functional Wasserstein Variational Policy Optimization. Junyu Xuan, Mengjing Wu, Zihe Liu, Jie Lu