Chen, Zhangyu

1 publications

ICLR 2026 DualMap: Enabling Both Cache Affinity and Load Balancing for Distributed LLM Serving Ying Yuan, Pengfei Zuo, Bo Wang, Zhangyu Chen, Zhipeng Tan, Zhou Yu