Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation

Abstract

In this paper, we address the task of semantic-guided scene generation. One open challenge widely observed in global image-level generation methods is the difficulty of generating small objects and detailed local texture. To tackle this issue, in this work we consider learning the scene generation in a local context, and correspondingly design a local class-specific generative network with semantic maps as a guidance, which separately constructs and learns sub-generators concentrating on the generation of different classes, and is able to provide more scene details. To learn more discriminative class-specific feature representations for the local generation, a novel classification module is also proposed. To combine the advantage of both global image-level and the local class-specific generation, a joint generation network is designed with an attention fusion module and a dual-discriminator structure embedded. Extensive experiments on two scene image generation tasks show superior generation performance of the proposed model. State-of-the-art results are established by large margins on both tasks and on challenging public benchmarks. The source code and trained models are available at https://github.com/Ha0Tang/LGGAN.

Cite

Text

Tang et al. "Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00789

Markdown

[Tang et al. "Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/tang2020cvpr-local/) doi:10.1109/CVPR42600.2020.00789

BibTeX

@inproceedings{tang2020cvpr-local,
  title     = {{Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation}},
  author    = {Tang, Hao and Xu, Dan and Yan, Yan and Torr, Philip H.S. and Sebe, Nicu},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00789},
  url       = {https://mlanthology.org/cvpr/2020/tang2020cvpr-local/}
}