Han, Andi
29 publications
ICML
2025
Efficient Optimization with Orthogonality Constraint: A Randomized Riemannian Submanifold Method
NeurIPS
2025
Generalization Bound of Gradient Flow Through Training Trajectory and Data-Dependent Kernel
ICLR
2025
On the Optimization and Generalization of Two-Layer Transformers with Sign Gradient Descent
ICLRW
2025
SpecSTG: A Fast Spectral Diffusion Framework for Probabilistic Spatio-Temporal Traffic Forecasting
NeurIPS
2024
Provably Transformers Harness Multi-Concept Word Semantics for Efficient In-Context Learning
NeurIPS
2024
SLTrain: A Sparse Plus Low Rank Approach for Parameter and Memory Efficient Pretraining
ACML
2023
A New Perspective on the Expressive Equivalence Between Graph Convolution and Attention Models
TMLR
2023
Improved Differentially Private Riemannian Optimization: Fast Sampling and Variance Reduction