Vaswani, Sharan
32 publications
NeurIPSW
2024
Improving OOD Generalization of Pre-Trained Encoders via Aligned Embedding-Space Ensembles
NeurIPSW
2024
Improving OOD Generalization of Pre-Trained Encoders via Aligned Embedding-Space Ensembles
NeurIPS
2024
Small Steps No More: Global Convergence of Stochastic Gradient Bandits for Arbitrary Learning Rates
NeurIPSW
2023
Surrogate Minimization: An Optimization Algorithm for Training Large Neural Networks with Model Parallelism