ORION: Decoupling and Alignment for Unified Autoregressive Understanding and Generation

Abstract

Unified multimodal Large Language Models (MLLMs) hold great promise for seamlessly integrating understanding and generation. However, monolithic autoregressive architectures, despite their elegance and conversational fluency, suffer from a fundamental semantic–structural conflict: optimizing for low-level reconstructability in generation leads to catastrophic forgetting of high-level semantic understanding. We present ORION, a unified framework that resolves this conflict through Decoupling and Alignment. A non-linear vision head decouples structural pressures from shared representations, while a novel Representation Consistency Loss explicitly aligns semantics during generation. Together with a curated progressive training recipe and high-quality multimodal data, our method enables balanced optimization of both capabilities. Built purely on a monolithic autoregressive backbone without task-specific separate parameters, ORION achieves performance on par with or exceeding recent state-of-the-art unified models that rely on more complex designs. These results validate monolithic autoregression as a simple, effective, and competitive path toward truly integrated multimodal intelligence.

Cite

Text

Hu et al. "ORION: Decoupling and Alignment for Unified Autoregressive Understanding and Generation." International Conference on Learning Representations, 2026.

Markdown

[Hu et al. "ORION: Decoupling and Alignment for Unified Autoregressive Understanding and Generation." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/hu2026iclr-orion/)

BibTeX

@inproceedings{hu2026iclr-orion,
  title     = {{ORION: Decoupling and Alignment for Unified Autoregressive Understanding and Generation}},
  author    = {Hu, Taihang and Chen, Mengting and Lan, Jinsong and Zhu, Xiaoyong and Zhang, Kaifu and Cheng, Ming-Ming and Zheng, Bo and Wang, Yaxing},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/hu2026iclr-orion/}
}