Kamahori, Keisuke

2 publications

ICLR 2025 Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models Keisuke Kamahori, Tian Tang, Yile Gu, Kan Zhu, Baris Kasikci
ICLRW 2024 Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models Keisuke Kamahori, Yile Gu, Kan Zhu, Baris Kasikci