You, Haoran

16 publications

CVPR 2025 Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training Lexington Whalen, Zhenbang Du, Haoran You, Chaojian Li, Sixu Li, Yingyan Lin
ICML 2025 LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models Dachuan Shi, Yonggan Fu, Xiangchi Yuan, Zhongzhi Yu, Haoran You, Sixu Li, Xin Dong, Jan Kautz, Pavlo Molchanov, Yingyan Celine Lin
CVPR 2025 Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers Haoran You, Connelly Barnes, Yuqian Zhou, Yan Kang, Zhenbang Du, Wei Zhou, Lingzhi Zhang, Yotam Nitzan, Xiaoyang Liu, Zhe Lin, Eli Shechtman, Sohrab Amirghodsi, Yingyan Celine Lin
NeurIPS 2024 ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization Haoran You, Yipin Guo, Yichao Fu, Wei Zhou, Huihong Shi, Xiaofan Zhang, Souvik Kundu, Amir Yazdanbakhsh, Yingyan Lin
ICML 2024 When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models Haoran You, Yichao Fu, Zheng Wang, Amir Yazdanbakhsh, Yingyan Celine Lin
CVPR 2023 Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference Haoran You, Yunyang Xiong, Xiaoliang Dai, Bichen Wu, Peizhao Zhang, Haoqi Fan, Peter Vajda, Yingyan Lin
NeurIPS 2023 ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer Haoran You, Huihong Shi, Yipin Guo, Yingyan Lin
AAAI 2022 Early-Bird GCNs: Graph-Network Co-Optimization Towards More Efficient GCN Training and Inference via Drawing Early-Bird Lottery Tickets Haoran You, Zhihan Lu, Zijian Zhou, Yonggan Fu, Yingyan Lin
TMLR 2022 Max-Affine Spline Insights into Deep Network Pruning Haoran You, Randall Balestriero, Zhihan Lu, Yutong Kou, Huihong Shi, Shunyao Zhang, Shang Wu, Yingyan Lin, Richard Baraniuk
ICML 2022 ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks Haoran You, Baopu Li, Shi Huihong, Yonggan Fu, Yingyan Lin
ECCV 2022 SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning Haoran You, Baopu Li, Zhanyi Sun, Xu Ouyang, Yingyan Lin
ICLR 2021 HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark Chaojian Li, Zhongzhi Yu, Yonggan Fu, Yongan Zhang, Yang Zhao, Haoran You, Qixuan Yu, Yue Wang, Cong Hao, Yingyan Lin
ICLR 2020 Drawing Early-Bird Tickets: Toward More Efficient Training of Deep Networks Haoran You, Chaojian Li, Pengfei Xu, Yonggan Fu, Yue Wang, Xiaohan Chen, Richard G. Baraniuk, Zhangyang Wang, Yingyan Lin
NeurIPS 2020 FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training Yonggan Fu, Haoran You, Yang Zhao, Yue Wang, Chaojian Li, Kailash Gopalakrishnan, Zhangyang Wang, Yingyan Lin
ECCV 2020 HALO: Hardware-Aware Learning to Optimize Chaojian Li, Tianlong Chen, Haoran You, Zhangyang Wang, Yingyan Lin
NeurIPS 2020 ShiftAddNet: A Hardware-Inspired Deep Network Haoran You, Xiaohan Chen, Yongan Zhang, Chaojian Li, Sicheng Li, Zihao Liu, Zhangyang Wang, Yingyan Lin