BLT: Bidirectional Layout Transformer for Controllable Layout Generation
Abstract
Creating visual layouts is a critical step in graphic design. Automatic generation of such layouts is essential for scalable and diverse visual designs. To advance conditional layout generation, we introduce BLT, a bidirectional layout transformer. BLT differs from previous work on transformers in adopting non-autoregressive transformers. In training, BLT learns to predict the masked attributes by attending to surrounding attributes in two directions. During inference, BLT first generates a draft layout from the input and then iteratively refines it into a high-quality layout by masking out low-confident attributes. The masks generated in both training and inference are controlled by a new hierarchical sampling policy. We verify the proposed model on six benchmarks of diverse design tasks. Experimental results demonstrate two benefits compared to the state-of-the-art layout transformer models. First, our model empowers layout transformers to fulfill controllable layout generation. Second, it achieves up to 10x speedup in generating a layout at inference time than the layout transformer baseline. Code is released at https://shawnkx.github.io/blt.
Cite
Text
Kong et al. "BLT: Bidirectional Layout Transformer for Controllable Layout Generation." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19790-1_29Markdown
[Kong et al. "BLT: Bidirectional Layout Transformer for Controllable Layout Generation." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/kong2022eccv-blt/) doi:10.1007/978-3-031-19790-1_29BibTeX
@inproceedings{kong2022eccv-blt,
title = {{BLT: Bidirectional Layout Transformer for Controllable Layout Generation}},
author = {Kong, Xiang and Jiang, Lu and Chang, Huiwen and Zhang, Han and Hao, Yuan and Gong, Haifeng and Essa, Irfan},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-19790-1_29},
url = {https://mlanthology.org/eccv/2022/kong2022eccv-blt/}
}