Sra, Suvrit
102 publications
ICML
2024
Transformers Implement Functional Gradient Descent to Learn Non-Linear Functions in Context
NeurIPSW
2023
Sion's Minimax Theorem in Geodesic Metric Spaces and a Riemannian Extragradient Algorithm
NeurIPS
2023
Transformers Learn to Implement Preconditioned Gradient Descent for In-Context Learning
ICML
2022
Neural Network Weights Do Not Converge to Stationary Points: An Invariant Measure Perspective
NeurIPS
2021
Three Operator Splitting with Subgradients, Stochastic Gradients, and Adaptive Learning Rates
ICML
2020
Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
NeurIPS
2020
SGD with Shuffling: Optimal Rates Without Component Convexity and Large Epoch Requirements
NeurIPS
2016
Fast Mixing Markov Chains for Strongly Rayleigh Measures, DPPs, and Constrained Sampling
NeurIPS
2013
Geometric Optimisation on Positive Definite Matrices for Elliptically Contoured Distributions
NeurIPS
2012
A New Metric on the Manifold of Kernel Matrices with Application to Matrix Geometric Means
ICCV
2011
Efficient Similarity Search for Covariance Matrices via the Jensen-Bregman LogDet Divergence