Zhu, Libin
9 publications
ICML
2025
Emergence in Non-Neural Models: Grokking Modular Arithmetic via Average Gradient Outer Product
NeurIPSW
2024
Emergence in Non-Neural Models: Grokking Modular Arithmetic via Average Gradient Outer Product
NeurIPS
2022
Transition to Linearity of General Neural Networks with Directed Acyclic Graph Architecture