ML Anthology
Authors
Search
About
Jha, Siddharth
2 publications
ICMLW
2024
Characterizing Prompt Compression Methods for Long Context Inference
Siddharth Jha
,
Lutfi Eren Erdogan
,
Sehoon Kim
,
Kurt Keutzer
,
Amir Gholami
ICMLW
2024
Learned Best-Effort LLM Serving
Siddharth Jha
,
Coleman Richard Charles Hooper
,
Xiaoxuan Liu
,
Sehoon Kim
,
Kurt Keutzer