Peng, Hongwu

5 publications

ICLR 2025. RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs. Xi Xie, Yuebo Luo, Hongwu Peng, Caiwen Ding
NeurIPS 2024. Learning from Teaching Regularization: Generalizable Correlations Should Be Easy to Imitate. Can Jin, Tong Che, Hongwu Peng, Yiyuan Li, Dimitris N. Metaxas, Marco Pavone
ICML 2024. Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads. Tianle Cai, Yuhong Li, Zhengyang Geng, Hongwu Peng, Jason D. Lee, Deming Chen, Tri Dao
ICCV 2023. AutoReP: Automatic ReLU Replacement for Fast Private Network Inference. Hongwu Peng, Shaoyi Huang, Tong Zhou, Yukui Luo, Chenghong Wang, Zigeng Wang, Jiahui Zhao, Xi Xie, Ang Li, Tony Geng, Kaleel Mahmood, Wujie Wen, Xiaolin Xu, Caiwen Ding
NeurIPS 2023. LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference. Hongwu Peng, Ran Ran, Yukui Luo, Jiahui Zhao, Shaoyi Huang, Kiran Thorat, Tong Geng, Chenghong Wang, Xiaolin Xu, Wujie Wen, Caiwen Ding