Wei, Zhongyu
21 publications
ICLR
2026
Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models
NeurIPS
2025
Multi-Agent KTO: Enhancing Strategic Interactions of Large Language Model in Language Game