ML Anthology
Authors
Search
About
Yu, Zhongzhi
8 publications
ICML
2025
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
Dachuan Shi
,
Yonggan Fu
,
Xiangchi Yuan
,
Zhongzhi Yu
,
Haoran You
,
Sixu Li
,
Xin Dong
,
Jan Kautz
,
Pavlo Molchanov
,
Yingyan Celine Lin
NeurIPS
2024
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment
Yonggan Fu
,
Zhongzhi Yu
,
Junwei Li
,
Jiayi Qian
,
Yongan Zhang
,
Xiangchi Yuan
,
Dachuan Shi
,
Roman Yakunin
,
Yingyan Lin
ICML
2024
Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models Without Training Through Attention Calibration
Zhongzhi Yu
,
Zheng Wang
,
Yonggan Fu
,
Huihong Shi
,
Khalid Shaikh
,
Yingyan Celine Lin
CVPR
2023
Hint-Aug: Drawing Hints from Foundation Vision Transformers Towards Boosted Few-Shot Parameter-Efficient Tuning
Zhongzhi Yu
,
Shang Wu
,
Yonggan Fu
,
Shunyao Zhang
,
Yingyan Lin
ICML
2023
Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning
Zhongzhi Yu
,
Yang Zhang
,
Kaizhi Qian
,
Cheng Wan
,
Yonggan Fu
,
Yongan Zhang
,
Yingyan Celine Lin
NeurIPS
2022
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing
Yonggan Fu
,
Yang Zhang
,
Kaizhi Qian
,
Zhifan Ye
,
Zhongzhi Yu
,
Cheng-I Jeff Lai
,
Celine Lin
AAAI
2022
MIA-Former: Efficient and Robust Vision Transformers via Multi-Grained Input-Adaptation
Zhongzhi Yu
,
Yonggan Fu
,
Sicheng Li
,
Chaojian Li
,
Yingyan Lin
ICLR
2021
HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark
Chaojian Li
,
Zhongzhi Yu
,
Yonggan Fu
,
Yongan Zhang
,
Yang Zhao
,
Haoran You
,
Qixuan Yu
,
Yue Wang
,
Cong Hao
,
Yingyan Lin