ML Anthology
Wu, Juntong
1 publication
ICLR 2026
SERE: Similarity-Based Expert Re-Routing for Efficient Batch Decoding in MoE Models
Juntong Wu, Jialiang Cheng, Fuyu Lv, Ou Dan, Li Yuan