Jain, Gaurav

2 publications

ICML 2025. Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies. Nadav Timor, Jonathan Mamou, Daniel Korat, Moshe Berchansky, Gaurav Jain, Oren Pereg, Moshe Wasserblat, David Harel
ICML 2025. Dialogue Without Limits: Constant-Sized KV Caches for Extended Response in LLMs. Ravi Ghadia, Avinash Kumar, Gaurav Jain, Prashant J. Nair, Poulami Das