Niu, Wei

30 publications

CVPR 2025 AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Yuanbin Man, Ying Huang, Chengming Zhang, Bingzhe Li, Wei Niu, Miao Yin
CVPR 2025 GaussianSpa: An "Optimizing-Sparsifying" Simplification Framework for Compact and High-Quality 3D Gaussian Splatting Yangming Zhang, Wenqi Jia, Wei Niu, Miao Yin
AAAI 2025 LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers Xuan Shen, Zhao Song, Yufa Zhou, Bo Chen, Yanyu Li, Yifan Gong, Kai Zhang, Hao Tan, Jason Kuen, Henghui Ding, Zhihao Shu, Wei Niu, Pu Zhao, Yanzhi Wang, Jiuxiang Gu
CVPR 2025 QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge Xuan Shen, Weize Ma, Jing Liu, Changdi Yang, Rui Ding, Quanyi Wang, Henghui Ding, Wei Niu, Yanzhi Wang, Pu Zhao, Jun Lin, Jiuxiang Gu
ICLR 2025 Sparse Learning for State Space Models on Mobile Xuan Shen, Hangyu Zheng, Yifan Gong, Zhenglun Kong, Changdi Yang, Zheng Zhan, Yushu Wu, Xue Lin, Yanzhi Wang, Pu Zhao, Wei Niu
AAAI 2025 Toward Adaptive Large Language Models Structured Pruning via Hybrid-Grained Weight Importance Assessment Jun Liu, Zhenglun Kong, Pu Zhao, Changdi Yang, Xuan Shen, Hao Tang, Geng Yuan, Wei Niu, Wenbin Zhang, Xue Lin, Dong Huang, Yanzhi Wang
ECCV 2024 Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design Gen Li, Zhihao Shu, Jie Ji, Minghai Qin, Fatemeh Afghah, Wei Niu, Xiaolong Ma
NeurIPS 2024 Exploring Token Pruning in Vision State Space Models Zheng Zhan, Zhenglun Kong, Yifan Gong, Yushu Wu, Zichong Meng, Hangyu Zheng, Xuan Shen, Stratis Ioannidis, Wei Niu, Pu Zhao, Yanzhi Wang
NeurIPS 2024 Fast and Memory-Efficient Video Diffusion Using Streamlined Inference Zheng Zhan, Yushu Wu, Yifan Gong, Zichong Meng, Zhenglun Kong, Changdi Yang, Geng Yuan, Pu Zhao, Wei Niu, Yanzhi Wang
ICLR 2024 NeurRev: Train Better Sparse Neural Network Practically via Neuron Revitalization Gen Li, Lu Yin, Jie Ji, Wei Niu, Minghai Qin, Bin Ren, Linke Guo, Shiwei Liu, Xiaolong Ma
NeurIPS 2024 Real-Time Core-Periphery Guided ViT with Smart Data Layout Selection on Mobile Devices Zhihao Shu, Xiaowei Yu, Zihao Wu, Wenqi Jia, Yinchen Shi, Miao Yin, Tianming Liu, Dajiang Zhu, Wei Niu
CVPR 2023 Pruning Parameterization with Bi-Level Optimization for Efficient Semantic Segmentation on the Edge Changdi Yang, Pu Zhao, Yanyu Li, Wei Niu, Jiexiong Guan, Hao Tang, Minghai Qin, Bin Ren, Xue Lin, Yanzhi Wang
CVPR 2023 Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting Gen Li, Jie Ji, Minghai Qin, Wei Niu, Bin Ren, Fatemeh Afghah, Linke Guo, Xiaolong Ma
AAAI 2023 Towards Real-Time Segmentation on the Edge Yanyu Li, Changdi Yang, Pu Zhao, Geng Yuan, Wei Niu, Jiexiong Guan, Hao Tang, Minghai Qin, Qing Jin, Bin Ren, Xue Lin, Yanzhi Wang
ECCV 2022 Compiler-Aware Neural Architecture Search for On-Mobile Real-Time Super-Resolution Yushu Wu, Yifan Gong, Pu Zhao, Yanyu Li, Zheng Zhan, Wei Niu, Hao Tang, Minghai Qin, Bin Ren, Yanzhi Wang
IJCAI 2022 Real-Time Portrait Stylization on the Edge Yanyu Li, Xuan Shen, Geng Yuan, Jiexiong Guan, Wei Niu, Hao Tang, Bin Ren, Yanzhi Wang
ECCV 2022 SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Wei Niu, Mengshu Sun, Xuan Shen, Geng Yuan, Bin Ren, Hao Tang, Minghai Qin, Yanzhi Wang
NeurIPS 2022 SparCL: Sparse Continual Learning on the Edge Zifeng Wang, Zheng Zhan, Yifan Gong, Geng Yuan, Wei Niu, Tong Jian, Bin Ren, Stratis Ioannidis, Yanzhi Wang, Jennifer Dy
AAAI 2021 A Compression-Compilation Co-Design Framework Towards Real-Time Object Detection on Mobile Devices Yuxuan Cai, Geng Yuan, Hongjia Li, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang
IJCAI 2021 A Compression-Compilation Framework for On-Mobile Real-Time BERT Applications Wei Niu, Zhenglun Kong, Geng Yuan, Weiwen Jiang, Jiexiong Guan, Caiwen Ding, Pu Zhao, Sijia Liu, Bin Ren, Yanzhi Wang
ICCV 2021 Achieving On-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search Zheng Zhan, Yifan Gong, Pu Zhao, Geng Yuan, Wei Niu, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David Kaeli, Bin Ren, Xue Lin, Yanzhi Wang
NeurIPS 2021 MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge Geng Yuan, Xiaolong Ma, Wei Niu, Zhengang Li, Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, Siyue Wang, Minghai Qin, Bin Ren, Yanzhi Wang, Sijia Liu, Xue Lin
CVPR 2021 NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration Zhengang Li, Geng Yuan, Wei Niu, Pu Zhao, Yanyu Li, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, Xue Lin
AAAI 2021 RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices Wei Niu, Mengshu Sun, Zhengang Li, Jou-An Chen, Jiexiong Guan, Xipeng Shen, Yanzhi Wang, Sijia Liu, Xue Lin, Bin Ren
IJCAI 2021 Towards Fast and Accurate Multi-Person Pose Estimation on Mobile Devices Xuan Shen, Geng Yuan, Wei Niu, Xiaolong Ma, Jiexiong Guan, Zhengang Li, Bin Ren, Yanzhi Wang
AAAI 2021 YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang
ECCV 2020 An Image Enhancing Pattern-Based Sparsity for Real-Time Inference on Mobile Devices Xiaolong Ma, Wei Niu, Tianyun Zhang, Sijia Liu, Sheng Lin, Hongjia Li, Wujie Wen, Xiang Chen, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang
AAAI 2020 PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices Xiaolong Ma, Fu-Ming Guo, Wei Niu, Xue Lin, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang
IJCAI 2020 Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization Wei Niu, Pu Zhao, Zheng Zhan, Xue Lin, Yanzhi Wang, Bin Ren
AAAI 2018 Location-Sensitive User Profiling Using Crowdsourced Labels Wei Niu, James Caverlee, Haokai Lu