ML Anthology
Zhu, Hanlin
13 publications
ICML
2025
Avoiding Catastrophe in Online Learning by Asking for Help
Benjamin Plaut, Hanlin Zhu, Stuart Russell
NeurIPS
2025
Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers
Yixiao Huang, Hanlin Zhu, Tianyu Guo, Jiantao Jiao, Somayeh Sojoudi, Michael I. Jordan, Stuart Russell, Song Mei
NeurIPS
2025
Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought
Hanlin Zhu, Shibo Hao, Zhiting Hu, Jiantao Jiao, Stuart Russell, Yuandong Tian
ICML
2025
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
Dijia Su, Hanlin Zhu, Yingchen Xu, Jiantao Jiao, Yuandong Tian, Qinqing Zheng
ICLR
2024
On Representation Complexity of Model-Based and Model-Free Reinforcement Learning
Hanlin Zhu, Baihe Huang, Stuart Russell
NeurIPS
2024
Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics
Hanlin Zhu, Baihe Huang, Shaolun Zhang, Michael Jordan, Jiantao Jiao, Yuandong Tian, Stuart Russell
NeurIPS
2023
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
Hanlin Zhu, Paria Rashidinejad, Jiantao Jiao
ICMLW
2023
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
Hanlin Zhu, Paria Rashidinejad, Jiantao Jiao
ICLR
2023
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian
Paria Rashidinejad, Hanlin Zhu, Kunhe Yang, Stuart Russell, Jiantao Jiao
NeurIPS
2023
Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability
Hanlin Zhu, Amy Zhang
AISTATS
2023
Provably Efficient Reinforcement Learning via Surprise Bound
Hanlin Zhu, Ruosong Wang, Jason Lee
NeurIPSW
2023
Towards Optimal Statistical Watermarking
Baihe Huang, Banghua Zhu, Hanlin Zhu, Jason Lee, Jiantao Jiao, Michael Jordan
COLT
2021
Average-Case Communication Complexity of Statistical Problems
Cyrus Rashtchian, David Woodruff, Peng Ye, Hanlin Zhu