Yuan, Zhihang

24 publications

CVPR 2025 A Closer Look at Time Steps Is Worthy of Triple Speed-up for Diffusion Model Training Kai Wang, Mingjia Shi, Yukun Zhou, Zekai Li, Zhihang Yuan, Yuzhang Shang, Xiaojiang Peng, Hanwang Zhang, Yang You
ICCV 2025 DLFR-Gen: Diffusion-Based Video Generation with Dynamic Latent Frame Rate Zhihang Yuan, Rui Xie, Yuzhang Shang, Hanling Zhang, Siyuan Wang, Shengen Yan, Guohao Dai, Yu Wang
ICCV 2025 DiTFastAttnV2: Head-Wise Attention Compression for Multi-Modality Diffusion Transformers Hanling Zhang, Rundong Su, Zhihang Yuan, Pengtao Chen, Mingzhu Shen, Yibo Fan, Shengen Yan, Guohao Dai, Yu Wang
ICCV 2025 EA-ViT: Efficient Adaptation for Elastic Vision Transformer Chen Zhu, Wangbo Zhao, Huiwen Zhang, Yuhao Zhou, Weidong Tang, Shuo Wang, Zhihang Yuan, Yuzhang Shang, Xiaojiang Peng, Kai Wang, Dawei Yang
ICLR 2025 MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods Zukang Xu, Yuxuan Yue, Xing Hu, Dawei Yang, Zhihang Yuan, Zixu Jiang, Zhixuan Chen, JiangyongYu, Xuchen, Sifan Zhou
ICML 2025 MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance Zhixuan Chen, Xing Hu, Dawei Yang, Zukang Xu, Xu Chen, Zhihang Yuan, Sifan Zhou, Jiangyong Yu
ICML 2025 MxMoE: Mixed-Precision Quantization for MoE with Accuracy and Performance Co-Design Haojie Duanmu, Xiuhong Li, Zhihang Yuan, Size Zheng, Jiangfei Duan, Xingcheng Zhang, Dahua Lin
ICLR 2025 OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting Xing Hu, Yuan Cheng, Dawei Yang, Zhixuan Chen, Zukang Xu, JiangyongYu, Xuchen, Zhihang Yuan, Zhe Jiang, Sifan Zhou
CVPR 2025 PillarHist: A Quantization-Aware Pillar Feature Encoder Based on Height-Aware Histogram Sifan Zhou, Zhihang Yuan, Dawei Yang, Xing Hu, Jian Qian, Ziyu Zhao
ICCV 2025 QuEST: Low-Bit Diffusion Model Quantization via Efficient Selective Finetuning Haoxuan Wang, Yuzhang Shang, Zhihang Yuan, Junyi Wu, Junchi Yan, Yan Yan
NeurIPS 2025 R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing Tianyu Fu, Yi Ge, Yichen You, Enshu Liu, Zhihang Yuan, Guohao Dai, Shengen Yan, Huazhong Yang, Yu Wang
ICML 2025 RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization Chen Xu, Yuxuan Yue, Zukang Xu, Xing Hu, Jiangyong Yu, Zhixuan Chen, Sifan Zhou, Zhihang Yuan, Dawei Yang
NeurIPS 2025 SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-Critical Expert Identification ZhengLin Lai, Mengyao Liao, Bingzhe Wu, Dong Xu, Zebin Zhao, Zhihang Yuan, Chao Fan, Jianqiang Li
NeurIPS 2024 DiTFastAttn: Attention Compression for Diffusion Transformer Models Zhihang Yuan, Hanling Zhang, Pu Lu, Xuefei Ning, Linfeng Zhang, Tianchen Zhao, Shengen Yan, Guohao Dai, Yu Wang
ICML 2024 Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding Chuanhao Sun, Zhihang Yuan, Kai Xu, Luo Mai, Siddharth N, Shuo Chen, Mahesh K. Marina
NeurIPSW 2024 LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization Rui Xie, Tianchen Zhao, Zhihang Yuan, Rui Wan, Wenxi Gao, Zhenhua Zhu, Xuefei Ning, Yu Wang
ICLR 2024 PB-LLM: Partially Binarized Large Language Models Zhihang Yuan, Yuzhang Shang, Zhen Dong
ICMLW 2023 Benchmarking the Reliability of Post-Training Quantization: A Particular Focus on Worst-Case Performance Zhihang Yuan, Jiawei Liu, Jiaxiang Wu, Dawei Yang, Qiang Wu, Guangyu Sun, Wenyu Liu, Xinggang Wang, Bingzhe Wu
NeurIPS 2023 MIM4DD: Mutual Information Maximization for Dataset Distillation Yuzhang Shang, Zhihang Yuan, Yan Yan
CVPR 2023 PD-Quant: Post-Training Quantization Based on Prediction Difference Metric Jiawei Liu, Lin Niu, Zhihang Yuan, Dawei Yang, Xinggang Wang, Wenyu Liu
CVPR 2023 Post-Training Quantization on Diffusion Models Yuzhang Shang, Zhihang Yuan, Bin Xie, Bingzhe Wu, Yan Yan
NeurIPS 2022 Latency-Aware Spatial-Wise Dynamic Networks Yizeng Han, Zhihang Yuan, Yifan Pu, Chenhao Xue, Shiji Song, Guangyu Sun, Gao Huang
ECCV 2022 PTQ4ViT: Post-Training Quantization for Vision Transformers with Twin Uniform Quantization Zhihang Yuan, Chenhao Xue, Yiqi Chen, Qiang Wu, Guangyu Sun
ECCV 2020 S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search Zhihang Yuan, Bingzhe Wu, Guangyu Sun, Zheng Liang, Shiwan Zhao, Weichen Bi