View Independent Generative Adversarial Network for Novel View Synthesis

Abstract

Synthesizing novel views from a 2D image requires to infer 3D structure and project it back to 2D from a new viewpoint. In this paper, we propose an encoder-decoder based generative adversarial network VI-GAN to tackle this problem. Our method is to let the network, after seeing many images of objects belonging to the same category in different views, obtain essential knowledge of intrinsic properties of the objects. To this end, an encoder is designed to extract view-independent feature that characterizes intrinsic properties of the input image, which includes 3D structure, color, texture etc. We also make the decoder hallucinate the image of a novel view based on the extracted feature and an arbitrary user-specific camera pose. Extensive experiments demonstrate that our model can synthesize high-quality images in different views with continuous camera poses, and is general for various applications.

Cite

Text

Xu et al. "View Independent Generative Adversarial Network for Novel View Synthesis." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019. doi:10.1109/ICCV.2019.00788

Markdown

[Xu et al. "View Independent Generative Adversarial Network for Novel View Synthesis." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.](https://mlanthology.org/iccv/2019/xu2019iccv-view/) doi:10.1109/ICCV.2019.00788

BibTeX

@inproceedings{xu2019iccv-view,
  title     = {{View Independent Generative Adversarial Network for Novel View Synthesis}},
  author    = {Xu, Xiaogang and Chen, Ying-Cong and Jia, Jiaya},
  booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year      = {2019},
  doi       = {10.1109/ICCV.2019.00788},
  url       = {https://mlanthology.org/iccv/2019/xu2019iccv-view/}
}