LoLep: Single-View View Synthesis with Locally-Learned Planes and Self-Attention Occlusion Inference

Abstract

We propose a novel method, LoLep, which regresses Locally-Learned planes from a single RGB image to represent scenes accurately, thus generating better novel views. Without the depth information, regressing appropriate plane locations is a challenging problem. To solve this issue, we pre-partition the disparity space into bins and design a disparity sampler to regress local offsets for multiple planes in each bin. However, only using such a sampler makes the network not convergent; we further propose two optimizing strategies that combine with different disparity distributions of datasets and propose an occlusion-aware reprojection loss as a simple yet effective geometric supervision technique. We also introduce a self-attention mechanism to improve occlusion inference and present a Block-Sampling Self-Attention (BS-SA) module to address the problem of applying self-attention to large feature maps. We demonstrate the effectiveness of our approach and generate state-of-the-art results on different datasets. Compared to MINE, our approach has an LPIPS reduction of 4.8% 9.0% and an RV reduction of 74.9% 83.5%. We also evaluate the performance on real-world images and demonstrate the benefits. We will release the source code at the time of publication.

Cite

Text

Wang et al. "LoLep: Single-View View Synthesis with Locally-Learned Planes and Self-Attention Occlusion Inference." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.00995

Markdown

[Wang et al. "LoLep: Single-View View Synthesis with Locally-Learned Planes and Self-Attention Occlusion Inference." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/wang2023iccv-lolep/) doi:10.1109/ICCV51070.2023.00995

BibTeX

@inproceedings{wang2023iccv-lolep,
  title     = {{LoLep: Single-View View Synthesis with Locally-Learned Planes and Self-Attention Occlusion Inference}},
  author    = {Wang, Cong and Wang, Yu-Ping and Manocha, Dinesh},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {10841-10851},
  doi       = {10.1109/ICCV51070.2023.00995},
  url       = {https://mlanthology.org/iccv/2023/wang2023iccv-lolep/}
}