STORK: Faster Diffusion and Flow Matching Sampling by Resolving Both Stiffness and Structure-Dependence

Abstract

Diffusion models (DMs) and flow-matching models have demonstrated remarkable performance in image and video generation. However, such models require a significant number of function evaluations (NFEs) during sampling, leading to costly inference. Consequently, quality-preserving fast sampling methods that require fewer NFEs have been an active area of research. However, prior training-free sampling methods fail to simultaneously address two key challenges: the stiffness of the ODE (i.e., the non-straightness of the velocity field) and dependence on the semi-linear structure of the DM ODE (which limits their direct applicability to flow-matching models). In this work, we introduce the Stabilized Taylor Orthogonal Runge–Kutta (STORK) method, addressing both design concerns. We demonstrate that STORK consistently improves the quality of diffusion and flow-matching sampling for image and video generation.

Cite

Text

Tan et al. "STORK: Faster Diffusion and Flow Matching Sampling by Resolving Both Stiffness and Structure-Dependence." International Conference on Learning Representations, 2026.

Markdown

[Tan et al. "STORK: Faster Diffusion and Flow Matching Sampling by Resolving Both Stiffness and Structure-Dependence." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/tan2026iclr-stork/)

BibTeX

@inproceedings{tan2026iclr-stork,
  title     = {{STORK: Faster Diffusion and Flow Matching Sampling by Resolving Both Stiffness and Structure-Dependence}},
  author    = {Tan, Zheng and Wang, Weizhen and Bertozzi, Andrea L. and Ryu, Ernest K.},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/tan2026iclr-stork/}
}