ML Anthology
Authors
Search
About
Kamahori, Keisuke
2 publications
ICLR
2025
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models
Keisuke Kamahori
,
Tian Tang
,
Yile Gu
,
Kan Zhu
,
Baris Kasikci
ICLRW
2024
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models
Keisuke Kamahori
,
Yile Gu
,
Kan Zhu
,
Baris Kasikci