Cheng, Ellie Y

2 publications

ICML 2025 Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding Tian Jin, Ellie Y Cheng, Zachary Ankner, Nikunj Saunshi, Blake M Elias, Amir Yazdanbakhsh, Jonathan Ragan-Kelley, Suvinay Subramanian, Michael Carbin
ICLRW 2024 Expressing and Exploiting Parallelism in Language Model Decoding Tian Jin, Ellie Y Cheng, Michael Carbin