Abhyankar, Reyna

2 publications

ICLR 2025. Preble: Efficient Distributed Prompt Scheduling for LLM Serving. Vikranth Srivatsa, Zijian He, Reyna Abhyankar, Dongming Li, Yiying Zhang.
ICML 2024. InferCept: Efficient Intercept Support for Augmented Large Language Model Inference. Reyna Abhyankar, Zijian He, Vikranth Srivatsa, Hao Zhang, Yiying Zhang.