Diffusion-RainbowPA: Improvements Integrated Preference Alignment for Diffusion-Based Text-to-Image Generation

Abstract

Although rapidly increasing capabilities of text-to-image (T2I) models have profound implications across various industries, they concurrently suffer from numerous shortcomings, necessitating the implementation of effective alignment strategies with human preference. Diffusion-DPO and SPO have emerged as robust approaches for aligning diffusion-based T2I models with human preference feedback. However, they tend to suffer from text-image misalignment, aesthetic overfitting and low-quality generation. To tackle such matters, we improve the alignment paradigm through a tripartite perspective, which are the calibration enhancement (Calibration Enhanced Preference Alignment), the overfitting mitigation (Identical Preference Alignment, Jensen-Shannon Divergence Constraint) and the performance optimization (Margin Strengthened Preference Alignment, SFT-like Regularization). Furthermore, combining them with the step-aware preference alignment paradigm, we propose the Diffusion-RainbowPA, a suite of total six improvements that collectively improve the alignment performance of Diffusion-DPO. With comprehensive alignment performance evaluation and comparison, it is demonstrated that Diffusion-RainbowPA outperforms current state-of-the-art methods. We also conduct ablation studies on the introduced components that reveal incorporation of each has positively enhanced alignment performance.

Cite

Text

Sun et al. "Diffusion-RainbowPA: Improvements Integrated Preference Alignment for Diffusion-Based Text-to-Image Generation." Transactions on Machine Learning Research, 2025.

Markdown

[Sun et al. "Diffusion-RainbowPA: Improvements Integrated Preference Alignment for Diffusion-Based Text-to-Image Generation." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/sun2025tmlr-diffusionrainbowpa/)

BibTeX

@article{sun2025tmlr-diffusionrainbowpa,
  title     = {{Diffusion-RainbowPA: Improvements Integrated Preference Alignment for Diffusion-Based Text-to-Image Generation}},
  author    = {Sun, Haoyuan and Liang, Bin and Xia, Bo and Wu, Jiaqi and Zhao, Yifei and Qin, Kai and Chang, Yongzhe and Wang, Xueqian},
  journal   = {Transactions on Machine Learning Research},
  year      = {2025},
  url       = {https://mlanthology.org/tmlr/2025/sun2025tmlr-diffusionrainbowpa/}
}