ML Anthology
Wu, Chun-Feng
1 publication
NeurIPS 2023
$s^3$: Increasing GPU Utilization During Generative Inference for Higher Throughput
Yunho Jin, Chun-Feng Wu, David Brooks, Gu-Yeon Wei