Gong, Ruihao

20 publications

AAAI 2025 AtomNet: Designing Tiny Models from Operators Under Extreme MCU Constraints Zhiwei Dong, Mingzhu Shen, Shihao Bai, Xiuying Wei, Jinyang Guo, Ruihao Gong, Song-Lu Chen, Xianglong Liu, Xu-Cheng Yin
ICML 2025 DA-KD: Difficulty-Aware Knowledge Distillation for Efficient Large Language Models Changyi He, Yifu Ding, Jinyang Guo, Ruihao Gong, Haotong Qin, Xianglong Liu
ICML 2025 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration Yushi Huang, Zining Wang, Ruihao Gong, Jing Liu, Xinjie Zhang, Jinyang Guo, Xianglong Liu, Jun Zhang
NeurIPS 2025 Hierarchical Balance Packing: Towards Efficient Supervised Fine-Tuning for Long-Context LLM Yongqiang Yao, Jingru Tan, Kaihuan Liang, Feizhao Zhang, Jiahao Hu, Shuo Wu, Yazhe Niu, Ruihao Gong, Dahua Lin, Ningyi Xu
ICML 2025 OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniverse Computation Balance Yongqiang Yao, Jingru Tan, Feizhao Zhang, Jiahao Hu, Yazhe Niu, Jin Xin, Bo Li, Pengfei Liu, Ruihao Gong, Dahua Lin, Ningyi Xu
ICML 2024 Compressing Large Language Models by Joint Sparsification and Quantization Jinyang Guo, Jianyu Wu, Zining Wang, Jiaheng Liu, Ge Yang, Yifu Ding, Ruihao Gong, Haotong Qin, Xianglong Liu
AAAI 2024 Fast and Controllable Post-Training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes Ruihao Gong, Yang Yong, Zining Wang, Jinyang Guo, Xiuying Wei, Yuqing Ma, Xianglong Liu
ICLR 2024 QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models Jing Liu, Ruihao Gong, Xiuying Wei, Zhiwei Dong, Jianfei Cai, Bohan Zhuang
AAAI 2024 Selective Focus: Investigating Semantics Sensitivity in Post-Training Quantization for Lane Detection Yunqian Fan, Xiuying Wei, Ruihao Gong, Yuqing Ma, Xiangguo Zhang, Qi Zhang, Xianglong Liu
CVPR 2024 TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models Yushi Huang, Ruihao Gong, Jing Liu, Tianlong Chen, Xianglong Liu
CVPR 2023 Annealing-Based Label-Transfer Learning for Open World Object Detection Yuqing Ma, Hainan Li, Zhange Zhang, Jinyang Guo, Shanghang Zhang, Ruihao Gong, Xianglong Liu
CVPR 2023 Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization Aishan Liu, Shiyu Tang, Siyuan Liang, Ruihao Gong, Boxi Wu, Xianglong Liu, Dacheng Tao
ICCV 2023 Lossy and Lossless (L²) Post-Training Model Size Compression Yumeng Shi, Shihao Bai, Xiuying Wei, Ruihao Gong, Jianlei Yang
NeurIPS 2022 Outlier Suppression: Pushing the Limit of Low-Bit Transformer Language Models Xiuying Wei, Yunchen Zhang, Xiangguo Zhang, Ruihao Gong, Shanghang Zhang, Qi Zhang, Fengwei Yu, Xianglong Liu
ICLR 2022 QDrop: Randomly Dropping Quantization for Extremely Low-Bit Post-Training Quantization Xiuying Wei, Ruihao Gong, Yuhang Li, Xianglong Liu, Fengwei Yu
ICML 2021 A Free Lunch from ANN: Towards Efficient, Accurate Spiking Neural Networks Calibration Yuhang Li, Shikuang Deng, Xin Dong, Ruihao Gong, Shi Gu
ICLR 2021 BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction Yuhang Li, Ruihao Gong, Xu Tan, Yang Yang, Peng Hu, Qi Zhang, Fengwei Yu, Wei Wang, Shi Gu
CVPR 2021 Diversifying Sample Generation for Accurate Data-Free Quantization Xiangguo Zhang, Haotong Qin, Yifu Ding, Ruihao Gong, Qinghua Yan, Renshuai Tao, Yuhang Li, Fengwei Yu, Xianglong Liu
ICCV 2021 MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing Yuhang Li, Feng Zhu, Ruihao Gong, Mingzhu Shen, Xin Dong, Fengwei Yu, Shaoqing Lu, Shi Gu
ICCV 2021 Once Quantization-Aware Training: High Performance Extremely Low-Bit Architecture Search Mingzhu Shen, Feng Liang, Ruihao Gong, Yuhang Li, Chuming Li, Chen Lin, Fengwei Yu, Junjie Yan, Wanli Ouyang