SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow
Abstract
Diffusion models excel in high-quality generation but suffer from slow inference due to iterative sampling. While recent methods have successfully transformed diffusion models into one-step generators, they neglect model size reduction, limiting their applicability in compute-constrained scenarios. This paper aims to develop small, efficient one-step diffusion models based on the powerful rectified flow framework, by exploring joint compression of inference steps and model size. The rectified flow framework trains one-step generative models using two operations, reflow and distillation. Compared with the original framework, squeezing the model size brings two new challenges: (1) the initialization mismatch between large teachers and small students during reflow; (2) the underperformance of naive distillation on small student models. To overcome these issues, we propose Annealing Reflow and Flow-Guided Distillation, which together comprise our framework. With our novel framework, we train a one-step diffusion model with an FID of 5.02 and 15.7M parameters, outperforming the previous state-of-the-art one-step diffusion model (FID=6.47, 19.4M parameters) on CIFAR10. On ImageNet 64×64 and FFHQ 64×64, our method yields small one-step diffusion models that are comparable to larger models, showcasing the effectiveness of our method in creating compact, efficient one-step diffusion models.
Cite
Text
Zhu et al. "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73007-8_20Markdown
[Zhu et al. "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/zhu2024eccv-slimflow/) doi:10.1007/978-3-031-73007-8_20BibTeX
@inproceedings{zhu2024eccv-slimflow,
title = {{SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow}},
author = {Zhu, Yuanzhi and Liu, Xingchao and Liu, Qiang},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-73007-8_20},
url = {https://mlanthology.org/eccv/2024/zhu2024eccv-slimflow/}
}