Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-Based Token Pruning
Abstract
The excellent performance of diffusion models in image generation is always accompanied by overlarge computation costs, which have prevented the application of diffusion models in edge devices and interactive applications. Previous works mainly focus on using fewer sampling steps and compressing the denoising network of diffusion models, while this paper proposes to accelerate diffusion models by introducing SiTo, a similarity-based token pruning method that adaptive prunes the redundant tokens in the input data. SiTo is designed to maximize the similarity between model prediction with and without token pruning by using cheap and hardware-friendly operations, leading to significant acceleration ratios without performance drop, and even sometimes improvements in the generation quality. For instance, the zero-shot evaluation shows SiTo leads to 1.90x and 1.75x acceleration on COCO30K and ImageNet with 1.33 and 1.15 FID reduction at the same time. Besides, SiTo has no training requirements and does not require any calibration data, making it plug-and-play in real-world applications.
Cite
Text
Zhang et al. "Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-Based Token Pruning." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I9.33071Markdown
[Zhang et al. "Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-Based Token Pruning." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhang2025aaai-training-a/) doi:10.1609/AAAI.V39I9.33071BibTeX
@inproceedings{zhang2025aaai-training-a,
title = {{Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-Based Token Pruning}},
author = {Zhang, Evelyn and Tang, Jiayi and Ning, Xuefei and Zhang, Linfeng},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {9878-9886},
doi = {10.1609/AAAI.V39I9.33071},
url = {https://mlanthology.org/aaai/2025/zhang2025aaai-training-a/}
}