Belkin, Mikhail
66 publications
COLT
2025
A Gap Between the Gaussian RKHS and Neural Networks: An Infinite-Center Asymptotic Analysis
ICML
2025
Emergence in Non-Neural Models: Grokking Modular Arithmetic via Average Gradient Outer Product
NeurIPSW
2024
Emergence in Non-Neural Models: Grokking Modular Arithmetic via Average Gradient Outer Product
ICLR
2024
More Is Better: When Infinite Overparameterization Is Optimal and Overfitting Is Obligatory
NeurIPSW
2023
On Feature Learning of Recursive Feature Machines and Automatic Relevance Determination
ICLR
2021
Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification Tasks
NeurIPS
2021
Risk Bounds for Over-Parameterized Maximum Margin Classification on Sub-Gaussian Mixtures
COLT
2018
Approximation Beats Concentration? an Approximation View on Inference with Smooth Radial Kernels
NeurIPS
2018
Overfitting or Perfect Fitting? Risk Bounds for Classification and Regression Rules That Interpolate
COLT
2014
The More, the Merrier: The Blessing of Dimensionality for Learning Large Gaussian Mixtures
COLT
2012
Toward Understanding Complex Spaces: Graph Laplacians on Manifolds with Singularities and Boundaries