SenseFlow: Scaling Distribution Matching for Flow-Based Text-to-Image Distillation
Abstract
The Distribution Matching Distillation (DMD) has been successfully applied to text-to-image diffusion models such as Stable Diffusion (SD) 1.5. However, vanilla DMD suffers from convergence difficulties on large-scale flow-based text-to-image models, such as SD 3.5 and FLUX. In this paper, we first analyze the issues when applying vanilla DMD on large-scale models. Then, to overcome the scalability challenge, we propose implicit distribution alignment (IDA) to constrain the divergence between the generator and the fake distribution. Furthermore, we propose intra-segment guidance (ISG) to relocate the timestep denoising importance from the teacher model. With IDA alone, DMD converges for SD 3.5; employing both IDA and ISG, DMD converges for SD 3.5 and FLUX.1 dev. Together with a scaled VFM-based discriminator, our final model, dubbed **SenseFlow**, achieves superior performance in distillation for both diffusion based text-to-image models such as SDXL, and flow-matching models such as SD 3.5 Large and FLUX.1 dev. The source code is available at https://github.com/XingtongGe/SenseFlow.
Cite
Text
Ge et al. "SenseFlow: Scaling Distribution Matching for Flow-Based Text-to-Image Distillation." International Conference on Learning Representations, 2026.Markdown
[Ge et al. "SenseFlow: Scaling Distribution Matching for Flow-Based Text-to-Image Distillation." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/ge2026iclr-senseflow/)BibTeX
@inproceedings{ge2026iclr-senseflow,
title = {{SenseFlow: Scaling Distribution Matching for Flow-Based Text-to-Image Distillation}},
author = {Ge, Xingtong and Zhang, Xin and Xu, Tongda and Zhang, Yi and Zhang, Xinjie and Wang, Yan and Zhang, Jun},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mlanthology.org/iclr/2026/ge2026iclr-senseflow/}
}