Lin, Yingyan
34 publications
NeurIPS
2024
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment
NeurIPS
2024
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
NeurIPS
2021
Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found Within Randomly Initialized Networks