Zhang, Yiqi

2 publications

ICLR 2026 MoNE: Replacing Redundant Experts with Lightweight Novices for Structured Pruning of MoE Geng Zhang, Han Yuxuan, Yuxuan Lou, Yiqi Zhang, Wangbo Zhao, Yang You
NeurIPS 2024 SpeedLoader: An I/O Efficient Scheme for Heterogeneous and Distributed LLM Operation Yiqi Zhang, Yang You