Yang, Jaehoon

1 publications

ICLR 2026 Libra: Effective yet Efficient Load Balancing for Large-Scale MoE Inference Jaehoon Yang, Yushin Kim, Seokwon Moon, Yeonhong Park, Jae W. Lee