Borisov, Boyko

1 publications

ICLRW 2025 DeltaMoE: Memory-Efficient Inference for Merged Mixture of Experts with Delta Compression Boyko Borisov, Xiaozhe Yao, Nezihe Merve Gürel, Ana Klimovic