Rainbow Padding: Mitigating Early Termination in Instruction-Tuned Diffusion LLMs

Kim, BumJun; Jeon, Dongjae; Kim, Dueun; Jeung, Wonje; No, Albert

Rainbow Padding: Mitigating Early Termination in Instruction-Tuned Diffusion LLMs

BumJun Kim, Dongjae Jeon, Dueun Kim, Wonje Jeung, Albert No

ICLR 2026

/iclr/2026/kim2026iclr-rainbow/

Abstract

Diffusion large language models (dLLMs) have emerged as a promising alternative to autoregressive models, offering flexible generation orders and strong performance on complex reasoning tasks. However, instruction-tuned dLLMs exhibit a critical vulnerability we term \<eos\> overflow: as allocated sequence length increases, responses paradoxically become shorter, collapsing into early termination or degenerating into streams of \<eos\> tokens. Although noticed in practice, this issue has not been systematically analyzed. We trace its root cause to the dual role of \<eos\> as both termination and padding, which concentrates probability mass on \<eos\> at later positions and propagates backward to trigger early termination. To address this, we introduce Rainbow Padding, a simple remedy that replaces repeated \<eos\> placeholders with a repeating cycle of distinct padding tokens, distributing probability mass and breaking \<eos\> dominance. Experiments show that Rainbow Padding substantially improves length robustness and output quality, with as few as seven padding tokens to prevent early termination. Moreover, the method integrates efficiently into existing instruction-tuned models: LoRA fine-tuning for a single epoch on minimal data yields significant improvements, making this solution highly practical. The project is available at ~\url{https://ai-isl.github.io/rainbow-padding}

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Kim et al. "Rainbow Padding: Mitigating Early Termination in Instruction-Tuned Diffusion LLMs." International Conference on Learning Representations, 2026.

Markdown

[Kim et al. "Rainbow Padding: Mitigating Early Termination in Instruction-Tuned Diffusion LLMs." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/kim2026iclr-rainbow/)

BibTeX

@inproceedings{kim2026iclr-rainbow,
  title     = {{Rainbow Padding: Mitigating Early Termination in Instruction-Tuned Diffusion LLMs}},
  author    = {Kim, BumJun and Jeon, Dongjae and Kim, Dueun and Jeung, Wonje and No, Albert},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/kim2026iclr-rainbow/}
}