Kakade, Sham M.
120 publications
ICLR
2025
Flash Inference: Near Linear Time Inference for Long Convolution Sequence Models and Beyond
ICLRW
2025
Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions
NeurIPS
2024
Matching the Statistical Query Lower Bound for $k$-Sparse Parity Problems with Sign Stochastic Gradient Descent
JMLR
2021
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift
NeurIPS
2019
The Step Decay Schedule: A near Optimal, Geometrically Decaying Learning Rate Procedure for Least Squares
NeurIPS
2013
When Are Overcomplete Topic Models Identifiable? Uniqueness of Tensor Tucker Decompositions with Structured Sparsity