Gromov, Andrey
16 publications
NeurIPS
2024
Learning to Grok: Emergence of In-Context Learning and Skill Composition in Modular Arithmetic Tasks
ICMLW
2024
Learning to Grok: Emergence of In-Context Learning and Skill Composition in Modular Arithmetic Tasks
ICLRW
2024
Towards an Improved Understanding and Utilization of Maximum Manifold Capacity Representations
NeurIPSW
2023
Associative Memory Under the Probabilistic Lens: Improved Transformers & Dynamic Memory Creation