Liang, Yingyu
77 publications
AISTATS
2025
Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-Context by Multi-Step Gradient Descent
AISTATS
2025
Fourier Circuits in Neural Networks and Transformers: A Case Study of Modular Arithmetic with Multiple Inputs
ICML
2025
Fundamental Limits of Visual Autoregressive Transformers: Universal Approximation Abilities
NeurIPS
2025
Kernel Regression in Structured Non-IID Settings: Theory and Implications for Denoising Score Learning
CPAL
2025
The Computational Limits of State-Space Models and Mamba via the Lens of Circuit Complexity
AISTATS
2025
When Can We Solve the Weighted Low Rank Approximation Problem in Truly Subquadratic Time?
ICLR
2023
The Trade-Off Between Universality and Label Efficiency of Representations from Contrastive Learning
ICMLW
2022
The Trade-Off Between Label Efficiency and Universality of Representations from Contrastive Learning
NeurIPS
2021
Detecting Errors and Estimating Accuracy on Unlabeled Data with Self-Training Ensembles
NeurIPS
2020
Functional Regularization for Representation Learning: A Unified Theoretical Perspective
NeurIPS
2019
Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers
NeurIPS
2019
N-Gram Graph: Simple Unsupervised Representation for Graphs, with Applications to Molecules
NeurIPS
2018
Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data
ICML
2017
Provable Alternating Gradient Descent for Non-Negative Matrix Factorization with Strong Correlations