Shi, Huihong

4 publications

NeurIPS 2024 ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization Haoran You, Yipin Guo, Yichao Fu, Wei Zhou, Huihong Shi, Xiaofan Zhang, Souvik Kundu, Amir Yazdanbakhsh, Yingyan Lin
ICML 2024 Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models Without Training Through Attention Calibration Zhongzhi Yu, Zheng Wang, Yonggan Fu, Huihong Shi, Khalid Shaikh, Yingyan Celine Lin
NeurIPS 2023 ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer Haoran You, Huihong Shi, Yipin Guo, Yingyan Lin
TMLR 2022 Max-Affine Spline Insights into Deep Network Pruning Haoran You, Randall Balestriero, Zhihan Lu, Yutong Kou, Huihong Shi, Shunyao Zhang, Shang Wu, Yingyan Lin, Richard Baraniuk