Shi, Zhenmei
40 publications
AISTATS
2025
Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-Context by Multi-Step Gradient Descent
AISTATS
2025
Fourier Circuits in Neural Networks and Transformers: A Case Study of Modular Arithmetic with Multiple Inputs
ICML
2025
Fundamental Limits of Visual Autoregressive Transformers: Universal Approximation Abilities
NeurIPS
2025
Kernel Regression in Structured Non-IID Settings: Theory and Implications for Denoising Score Learning
CPAL
2025
The Computational Limits of State-Space Models and Mamba via the Lens of Circuit Complexity
AISTATS
2025
When Can We Solve the Weighted Low Rank Approximation Problem in Truly Subquadratic Time?
NeurIPS
2024
Is a Picture Worth a Thousand Words? Delving into Spatial Reasoning for Vision Language Models