Dan, Ou

1 publication

ICLR 2026. SERE: Similarity-Based Expert Re-Routing for Efficient Batch Decoding in MoE Models. Juntong Wu, Jialiang Cheng, Fuyu Lv, Ou Dan, Li Yuan.