ML Anthology
Reddi, Sashank
20 publications
NeurIPS
2023
What Is the Inductive Bias of Flatness Regularization? A Study of Deep Matrix Factorization Models
Khashayar Gatmiry, Zhiyuan Li, Tengyu Ma, Sashank Reddi, Stefanie Jegelka, Ching-Yao Chuang
ICML
2022
In Defense of Dual-Encoders for Neural Ranking
Aditya Menon, Sadeep Jayasumana, Ankit Singh Rawat, Seungyeon Kim, Sashank Reddi, Sanjiv Kumar
ICML
2022
Private Adaptive Optimization with Side Information
Tian Li, Manzil Zaheer, Sashank Reddi, Virginia Smith
ICML
2022
Robust Training of Neural Networks Using Scale Invariant Architectures
Zhiyuan Li, Srinadh Bhojanapalli, Manzil Zaheer, Sashank Reddi, Sanjiv Kumar
AISTATS
2021
RankDistil: Knowledge Distillation for Ranking
Sashank Reddi, Rama Kumar Pasumarthi, Aditya Menon, Ankit Singh Rawat, Felix Yu, Seungyeon Kim, Andreas Veit, Sanjiv Kumar
ICML
2021
A Statistical Perspective on Distillation
Aditya K Menon, Ankit Singh Rawat, Sashank Reddi, Seungyeon Kim, Sanjiv Kumar
NeurIPS
2021
Breaking the Centralized Barrier for Cross-Device Federated Learning
Sai Praneeth Karimireddy, Martin Jaggi, Satyen Kale, Mehryar Mohri, Sashank Reddi, Sebastian U Stich, Ananda Theertha Suresh
ICML
2021
Disentangling Sampling and Labeling Bias for Learning in Large-Output Spaces
Ankit Singh Rawat, Aditya K Menon, Wittawat Jitkrittum, Sadeep Jayasumana, Felix Yu, Sashank Reddi, Sanjiv Kumar
NeurIPS
2021
Efficient Training of Retrieval Models Using Negative Cache
Erik Lindgren, Sashank Reddi, Ruiqi Guo, Sanjiv Kumar
ICML
2021
Federated Composite Optimization
Honglin Yuan, Manzil Zaheer, Sashank Reddi
ICLR
2020
Large Batch Optimization for Deep Learning: Training BERT in 76 Minutes
Yang You, Jing Li, Sashank Reddi, Jonathan Hseu, Sanjiv Kumar, Srinadh Bhojanapalli, Xiaodan Song, James Demmel, Kurt Keutzer, Cho-Jui Hsieh
ICLR
2020
Learning to Learn by Zeroth-Order Oracle
Yangjun Ruan, Yuanhao Xiong, Sashank Reddi, Sanjiv Kumar, Cho-Jui Hsieh
ICML
2020
Low-Rank Bottleneck in Multi-Head Attention Models
Srinadh Bhojanapalli, Chulhee Yun, Ankit Singh Rawat, Sashank Reddi, Sanjiv Kumar
NeurIPS
2020
O(n) Connections Are Expressive Enough: Universal Approximability of Sparse Transformers
Chulhee Yun, Yin-Wen Chang, Srinadh Bhojanapalli, Ankit Singh Rawat, Sashank Reddi, Sanjiv Kumar
ICML
2020
SCAFFOLD: Stochastic Controlled Averaging for Federated Learning
Sai Praneeth Karimireddy, Satyen Kale, Mehryar Mohri, Sashank Reddi, Sebastian Stich, Ananda Theertha Suresh
NeurIPS
2020
Why Are Adaptive Methods Good for Attention Models?
Jingzhao Zhang, Sai Praneeth Karimireddy, Andreas Veit, Seungyeon Kim, Sashank Reddi, Sanjiv Kumar, Suvrit Sra
NeurIPS
2019
Breaking the Glass Ceiling for Embedding-Based Classifiers for Large Output Spaces
Chuan Guo, Ali Mousavi, Xiang Wu, Daniel N Holtmann-Rice, Satyen Kale, Sashank Reddi, Sanjiv Kumar
ICML
2019
Escaping Saddle Points with Adaptive Gradient Methods
Matthew Staib, Sashank Reddi, Satyen Kale, Sanjiv Kumar, Suvrit Sra
NeurIPS
2019
Multilabel Reductions: What Is My Loss Optimising?
Aditya K Menon, Ankit Singh Rawat, Sashank Reddi, Sanjiv Kumar
NeurIPS
2018
Adaptive Methods for Nonconvex Optimization
Manzil Zaheer, Sashank Reddi, Devendra Sachan, Satyen Kale, Sanjiv Kumar