Jiao, Jiantao
42 publications
NeurIPS
2025
Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers
NeurIPSW
2024
Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
NeurIPS
2023
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
ICLR
2023
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian
ICLRW
2023
Principled Reinforcement Learning with Human Feedback from Pairwise or $k$-Wise Comparisons