Purrception: Variational Flow Matching for Vector-Quantized Image Generation

Abstract

We introduce Purrception, a variational flow matching approach for vector-quantized image generation that provides explicit categorical supervision while maintaining continuous transport dynamics. Our method adapts Variational Flow Matching to vector-quantized latents by learning categorical posteriors over codebook indices while computing velocity fields in the continuous embedding space. This combines the geometric awareness of continuous methods with the discrete supervision of categorical approaches, enabling uncertainty quantification over plausible codes and temperature-controlled generation. We evaluate Purrception on ImageNet-1k $256 \times 256$ generation. Training converges faster than both continuous flow matching and discrete flow matching baselines while achieving competitive FID scores with state-of-the-art models. This demonstrates that Variational Flow Matching can effectively bridge continuous transport and discrete supervision for improved training efficiency in image generation.

Cite

Text

Matișan et al. "Purrception: Variational Flow Matching for Vector-Quantized Image Generation." International Conference on Learning Representations, 2026.

Markdown

[Matișan et al. "Purrception: Variational Flow Matching for Vector-Quantized Image Generation." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/matisan2026iclr-purrception/)

BibTeX

@inproceedings{matisan2026iclr-purrception,
  title     = {{Purrception: Variational Flow Matching for Vector-Quantized Image Generation}},
  author    = {Matișan, Răzvan-Andrei and Hu, Vincent Tao and Bartosh, Grigory and Ommer, Björn and Snoek, Cees G. M. and Welling, Max and van de Meent, Jan-Willem and Derakhshani, Mohammad Mahdi and Eijkelboom, Floor},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/matisan2026iclr-purrception/}
}