LayoutTransformer: Scene Layout Generation with Conceptual and Spatial Diversity
Abstract
When translating text inputs into layouts or images, existing works typically require explicit descriptions of each object in a scene, including their spatial information or the associated relationships. To better exploit the text input, so that implicit objects or relationships can be properly inferred during layout generation, we propose a LayoutTransformer Network (LT-Net) in this paper. Given a scene-graph input, our LT-Net uniquely encodes the semantic features for exploiting their co-occurrences and implicit relationships. This allows one to manipulate conceptually diverse yet plausible layout outputs. Moreover, the decoder of our LT-Net translates the encoded contextual features into bounding boxes with self-supervised relation consistency preserved. By fitting their distributions to Gaussian mixture models, spatially-diverse layouts can be additionally produced by LT-Net. We conduct extensive experiments on the datasets of MS-COCO and Visual Genome, and confirm the effectiveness and plausibility of our LT-Net over recent layout generation models.
Cite
Text
Yang et al. "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial Diversity." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00373Markdown
[Yang et al. "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial Diversity." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/yang2021cvpr-layouttransformer/) doi:10.1109/CVPR46437.2021.00373BibTeX
@inproceedings{yang2021cvpr-layouttransformer,
title = {{LayoutTransformer: Scene Layout Generation with Conceptual and Spatial Diversity}},
author = {Yang, Cheng-Fu and Fan, Wan-Cyuan and Yang, Fu-En and Wang, Yu-Chiang Frank},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2021},
pages = {3732-3741},
doi = {10.1109/CVPR46437.2021.00373},
url = {https://mlanthology.org/cvpr/2021/yang2021cvpr-layouttransformer/}
}