Zhou, Yuxin

1 publication

ICML 2025. FloE: On-the-Fly MoE Inference on Memory-Constrained GPU. Yuxin Zhou, Zheng Li, Jun Zhang, Jue Wang, Yiping Wang, Zhongle Xie, Ke Chen, Lidan Shou