Wang, Yingjie
18 publications
ICLR
2026
Towards a Theoretical Understanding of In-Context Learning: Stability and Non-I.I.D Generalisation
NeurIPS
2025
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning
18 publications