GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
Abstract
Posters serve an essential function in marketing and advertising by improving visual communication and brand visibility, thus significantly contributing to industrial design. With the latest developments in controllable T2I diffusion models, research interest has surged in text rendering within synthesized images. Although text rendering accuracy has seen advancements, automatic poster generation remains a relatively untapped area. This paper presents an automatic poster generation framework featuring text rendering capabilities through the use of LLMs. Our framework employs a triple-cross attention mechanism based on alignment learning to achieve precise text placement within detailed contextual backgrounds. Moreover, it supports adjustable fonts, varying image resolutions, and poster rendering with textual prompts in both English and Chinese. Additionally, we present a comprehensive bilingual image-text dataset, GlyphDraw-3M, comprising 3 million image-text pairs, each with OCR annotations and resolutions exceeding 1024. Our method utilizes the SDXL architecture, and extensive experiments confirm its ability to generate posters with intricate and context-rich backgrounds.
Cite
Text
Ma et al. "GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I6.32636Markdown
[Ma et al. "GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/ma2025aaai-glyphdraw/) doi:10.1609/AAAI.V39I6.32636BibTeX
@inproceedings{ma2025aaai-glyphdraw,
title = {{GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models}},
author = {Ma, Jian and Deng, Yonglin and Chen, Chen and Du, Nanyang and Lu, Haonan and Yang, Zhenyu},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {5955-5963},
doi = {10.1609/AAAI.V39I6.32636},
url = {https://mlanthology.org/aaai/2025/ma2025aaai-glyphdraw/}
}