Zhang, Yuxin
25 publications
ICML
2025
Determining Layer-Wise Sparsity for Large Language Models Through a Theoretical Perspective
NeurIPS
2025
Discovering Important Experts for Mixture-of-Experts Models Pruning Through a Theoretical Perspective
ICML
2025
GS-Bias: Global-Spatial Bias Learner for Single-Image Test-Time Adaptation of Vision-Language Models
NeurIPS
2025
Spotlight Attention: Towards Efficient LLM Generation via Non-Linear Hashing-Based KV Cache Retrieval
AAAI
2025
TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning