SoftCFG: Uncertainty-Guided Stable Guidance for Visual Autoregressive Model

Xu, Dongli; Tiulpin, Aleksei; Blaschko, Matthew B.

SoftCFG: Uncertainty-Guided Stable Guidance for Visual Autoregressive Model

Dongli Xu, Aleksei Tiulpin, Matthew B. Blaschko

ICLR 2026

/iclr/2026/xu2026iclr-softcfg/

Abstract

Autoregressive (AR) models have emerged as powerful tools for image generation by modeling images as sequences of discrete tokens. While Classifier-Free Guidance (CFG) has been adopted to improve conditional generation, its application in AR models faces two key issues: guidance diminishing, where the conditional–unconditional gap quickly vanishes as decoding progresses, and over-guidance, where strong conditions distort visual coherence. To address these challenges, we propose SoftCFG, an uncertainty-guided inference method that distributes adaptive perturbations across all tokens in the sequence. The key idea behind SoftCFG is to let each generated token contribute certainty-weighted guidance, ensuring that the signal persists across steps while resolving conflicts between text guidance and visual context. To further stabilize long-sequence generation, we introduce Step Normalization, which bounds cumulative perturbations of SoftCFG. Our method is training-free, model-agnostic, and seamlessly integrates with existing AR pipelines. Experiments show that SoftCFG significantly improves image quality over standard CFG and achieves state-of-the-art FID on ImageNet 256 × 256 among autoregressive models.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Xu et al. "SoftCFG: Uncertainty-Guided Stable Guidance for Visual Autoregressive Model." International Conference on Learning Representations, 2026.

Markdown

[Xu et al. "SoftCFG: Uncertainty-Guided Stable Guidance for Visual Autoregressive Model." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/xu2026iclr-softcfg/)

BibTeX

@inproceedings{xu2026iclr-softcfg,
  title     = {{SoftCFG: Uncertainty-Guided Stable Guidance for Visual Autoregressive Model}},
  author    = {Xu, Dongli and Tiulpin, Aleksei and Blaschko, Matthew B.},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/xu2026iclr-softcfg/}
}