Flow Along the K-Amplitude for Generative Modeling

Abstract

In this work, we propose a novel generative learning paradigm, K-Flow, an algorithm that flows along the $K$-amplitude. Here k is a scaling parameter that organizes frequency bands (or projected coefficients), and amplitude describes the norm of such projected coefficients. By incorporating the $K$-amplitude decomposition, K-Flow enables flow matching across the scaling parameter as time. We discuss three venues of six properties of K-Flow, from theoretical foundations, energy and temporal dynamics, and practical applications, respectively. Specifically, from the practical usage perspective, K-Flow allows steerable generation by controlling the information at different scales. To demonstrate the effectiveness of K-Flow, we conduct experiments on unconditional image generation and class-conditional image generation. Additionally, we conduct three ablation studies to demonstrate how K-Flow steers the scaling parameter to control the resolution of image generation.

Cite

Text

Du et al. "Flow Along the K-Amplitude for Generative Modeling." ICLR 2025 Workshops: DeLTa, 2025.

Markdown

[Du et al. "Flow Along the K-Amplitude for Generative Modeling." ICLR 2025 Workshops: DeLTa, 2025.](https://mlanthology.org/iclrw/2025/du2025iclrw-flow/)

BibTeX

@inproceedings{du2025iclrw-flow,
  title     = {{Flow Along the K-Amplitude for Generative Modeling}},
  author    = {Du, Weitao and Chang, Shuning and Tang, Jiasheng and Rong, Yu and Wang, Fan and Liu, Shengchao},
  booktitle = {ICLR 2025 Workshops: DeLTa},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/du2025iclrw-flow/}
}