Zhu, Hanlin
15 publications
ICLR
2026
Emergence of Superposition: Unveiling the Training Dynamics of Chain of Continuous Thought
NeurIPS
2025
Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers
NeurIPS
2023
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning