Liu, Jiashun

12 publications

ICLR 2026 Asymmetric Proximal Policy Optimization: Mini-Critics Boost LLM Reasoning Jiashun Liu, Johan Obando-Ceron, Han Lu, Yancheng He, Weixun Wang, Wenbo Su, Bo Zheng, Pablo Samuel Castro, Aaron Courville, Ling Pan
ICLR 2026 The Rank and Gradient Lost in Non-Stationarity: Sample Weight Decay for Mitigating Plasticity Loss in Reinforcement Learning Zihao Wu, Hongyao Tang, Yi Ma, Jiashun Liu, Yan Zheng, Jianye Hao
ICLR 2026 Tricks or Traps? A Deep Dive into RL for LLM Reasoning Zihe Liu, Jiashun Liu, Yancheng He, Weixun Wang, Jiaheng Liu, Ling Pan, Xinyu Hu, Shaopan Xiong, Ju Huang, Jian Hu, Shengyi Huang, Siran Yang, Jiamang Wang, Wenbo Su, Bo Zheng
AAAI 2025 Flow Factorization for Efficient Generative Flow Networks Jiashun Liu, Chunhui Li, Cheng-Hao Liu, Dianbo Liu, Qingpeng Cai, Ling Pan
NeurIPS 2025 Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization Ziqi Wang, Jiashun Liu, Ling Pan
NeurIPS 2025 Measure Gradients, Not Activations! Enhancing Neuronal Activity in Deep Reinforcement Learning Jiashun Liu, Zihao Wu, Johan Obando-Ceron, Pablo Samuel Castro, Aaron Courville, Ling Pan
ICLR 2025 Neuroplastic Expansion in Deep Reinforcement Learning Jiashun Liu, Johan Samir Obando Ceron, Aaron Courville, Ling Pan
ICML 2025 The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning Jiashun Liu, Johan Obando-Ceron, Pablo Samuel Castro, Aaron Courville, Ling Pan
CVPR 2024 Generate Subgoal Images Before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts Fei Ni, Jianye Hao, Shiguang Wu, Longxin Kou, Jiashun Liu, Yan Zheng, Bin Wang, Yuzheng Zhuang
UAI 2024 Hybrid CtrlFormer: Learning Adaptive Search Space Partition for Hybrid Action Control via Transformer-Based Monte Carlo Tree Search Jiashun Liu, Xiaotian Hao, Jianye Hao, Yan Zheng, Yujing Hu, Changjie Fan, Tangjie Lv, Zhipeng Hu
ICML 2024 Unlock the Cognitive Generalization of Deep Reinforcement Learning via Granular Ball Representation Jiashun Liu, Jianye Hao, Yi Ma, Shuyin Xia
NeurIPS 2024 Unlock the Intermittent Control Ability of Model Free Reinforcement Learning Jiashun Liu, Jianye Hao, Xiaotian Hao, Yi Ma, Yan Zheng, Yujing Hu, Tangjie Lv