CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation
Abstract
Diffusion models have emerged as a dominant approach for text-to-image generation. Key components such as the human preference alignment and classifier-free guidance play a crucial role in ensuring generation quality. However, their independent application in current text-to-image models continues to face significant challenges in achieving strong text-image alignment, high generation quality, and consistency with human aesthetic standards. In this work, we for the first time, explore facilitating the collaboration of human performance alignment and test-time sampling to unlock the potential of text-to-image models. Consequently, we introduce CHATS (Combining Human-Aligned optimization and Test-time Sampling), a novel generative framework that separately models the preferred and dispreferred distributions and employs a proxy-prompt-based sampling strategy to utilize the useful information contained in both distributions. We observe that CHATS exhibits exceptional data efficiency, achieving strong performance with only a small, high-quality funetuning dataset. Extensive experiments demonstrate that CHATS surpasses traditional preference alignment methods, setting new state-of-the-art across various standard benchmarks. The code is publicly available at github.com/AIDC-AI/CHATS.
Cite
Text
Fu et al. "CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Fu et al. "CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/fu2025icml-chats/)BibTeX
@inproceedings{fu2025icml-chats,
title = {{CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation}},
author = {Fu, Minghao and Wang, Guo-Hua and Cao, Liangfu and Chen, Qing-Guo and Xu, Zhao and Luo, Weihua and Zhang, Kaifu},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {17852-17874},
volume = {267},
url = {https://mlanthology.org/icml/2025/fu2025icml-chats/}
}