Wang, Boshi
8 publications
ICMLW
2024
Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
NeurIPS
2024
Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization
8 publications