BLT: Bidirectional Layout Transformer for Controllable Layout Generation

Abstract

Creating visual layouts is a critical step in graphic design. Automatic generation of such layouts is essential for scalable and diverse visual designs. To advance conditional layout generation, we introduce BLT, a bidirectional layout transformer. BLT differs from previous work on transformers in adopting non-autoregressive transformers. In training, BLT learns to predict the masked attributes by attending to surrounding attributes in two directions. During inference, BLT first generates a draft layout from the input and then iteratively refines it into a high-quality layout by masking out low-confident attributes. The masks generated in both training and inference are controlled by a new hierarchical sampling policy. We verify the proposed model on six benchmarks of diverse design tasks. Experimental results demonstrate two benefits compared to the state-of-the-art layout transformer models. First, our model empowers layout transformers to fulfill controllable layout generation. Second, it achieves up to 10x speedup in generating a layout at inference time than the layout transformer baseline. Code is released at https://shawnkx.github.io/blt.

Cite

Text

Kong et al. "BLT: Bidirectional Layout Transformer for Controllable Layout Generation." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19790-1_29

Markdown

[Kong et al. "BLT: Bidirectional Layout Transformer for Controllable Layout Generation." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/kong2022eccv-blt/) doi:10.1007/978-3-031-19790-1_29

BibTeX

@inproceedings{kong2022eccv-blt,
  title     = {{BLT: Bidirectional Layout Transformer for Controllable Layout Generation}},
  author    = {Kong, Xiang and Jiang, Lu and Chang, Huiwen and Zhang, Han and Hao, Yuan and Gong, Haifeng and Essa, Irfan},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19790-1_29},
  url       = {https://mlanthology.org/eccv/2022/kong2022eccv-blt/}
}