Zhang, Xianwei

1 publication

NeurIPS 2025. DynaPipe: Dynamic Layer Redistribution for Efficient Serving of LLMs with Pipeline Parallelism. HongXin Xu, Tianyu Guo, Xianwei Zhang