Wu, Chun-Feng

1 publications

NeurIPS 2023 $s^3$: Increasing GPU Utilization During Generative Inference for Higher Throughput Yunho Jin, Chun-Feng Wu, David Brooks, Gu-Yeon Wei