Zhu, Hanlin

13 publications

ICML 2025. Avoiding Catastrophe in Online Learning by Asking for Help. Benjamin Plaut, Hanlin Zhu, Stuart Russell
NeurIPS 2025. Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers. Yixiao Huang, Hanlin Zhu, Tianyu Guo, Jiantao Jiao, Somayeh Sojoudi, Michael I. Jordan, Stuart Russell, Song Mei
NeurIPS 2025. Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought. Hanlin Zhu, Shibo Hao, Zhiting Hu, Jiantao Jiao, Stuart Russell, Yuandong Tian
ICML 2025. Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning. Dijia Su, Hanlin Zhu, Yingchen Xu, Jiantao Jiao, Yuandong Tian, Qinqing Zheng
ICLR 2024. On Representation Complexity of Model-Based and Model-Free Reinforcement Learning. Hanlin Zhu, Baihe Huang, Stuart Russell
NeurIPS 2024. Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics. Hanlin Zhu, Baihe Huang, Shaolun Zhang, Michael Jordan, Jiantao Jiao, Yuandong Tian, Stuart Russell
NeurIPS 2023. Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning. Hanlin Zhu, Paria Rashidinejad, Jiantao Jiao
ICMLW 2023. Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning. Hanlin Zhu, Paria Rashidinejad, Jiantao Jiao
ICLR 2023. Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian. Paria Rashidinejad, Hanlin Zhu, Kunhe Yang, Stuart Russell, Jiantao Jiao
NeurIPS 2023. Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability. Hanlin Zhu, Amy Zhang
AISTATS 2023. Provably Efficient Reinforcement Learning via Surprise Bound. Hanlin Zhu, Ruosong Wang, Jason Lee
NeurIPSW 2023. Towards Optimal Statistical Watermarking. Baihe Huang, Banghua Zhu, Hanlin Zhu, Jason Lee, Jiantao Jiao, Michael Jordan
COLT 2021. Average-Case Communication Complexity of Statistical Problems. Cyrus Rashtchian, David Woodruff, Peng Ye, Hanlin Zhu