Wang, Yanling

1 publication

ICLR 2025. Streamlining Redundant Layers to Compress Large Language Models. Xiaodong Chen, Yuxuan Hu, Jing Zhang, Yanling Wang, Cuiping Li, Hong Chen