Srivatsa, Vikranth

3 publications

ICLR 2025. Preble: Efficient Distributed Prompt Scheduling for LLM Serving. Vikranth Srivatsa, Zijian He, Reyna Abhyankar, Dongming Li, Yiying Zhang
ICML 2024. InferCept: Efficient Intercept Support for Augmented Large Language Model Inference. Reyna Abhyankar, Zijian He, Vikranth Srivatsa, Hao Zhang, Yiying Zhang
NeurIPSW 2021. The Effect of Model Size on Worst-Group Generalization. Alan Le Pham, Eunice Chan, Vikranth Srivatsa, Dhruba Ghosh, Yaoqing Yang, Yaodong Yu, Ruiqi Zhong, Joseph E. Gonzalez, Jacob Steinhardt