GPR-Net: Multi-View Layout Estimation via a Geometry-Aware Panorama Registration Network

Abstract

We present a room layout estimation framework that jointly learns wide baseline panorama registration and layout estimation given a pair of 360° panoramas. To effectively tackle the wide baseline registration problem, we introduce a novel end-to-end supervised Geometry-aware Panorama Registration Network or GPR-Net that exploits the layout geometry and computes fine-grained correspondences on the layout boundary, instead of the global pixel-space. GPR-Net consists of two main parts. The geometry transformer learns a set of 1D horizon features sampled on the panorama. These 1D feature maps encode geometric cues describing the ceiling-wall and floor-wall layout boundaries, and the correspondence and co-visibility between layout boundaries. These learned geometric cues are further used for direct regression of relative pose (translation and rotation) with a pose transformer. The final layout is then obtained by registering the two layouts using the estimated pose and taking the union of the two individual layouts derived from the estimated layout boundary maps. Experimental results indicate that our method achieves state-of-the-art performance in both panorama registration and layout estimation on a large-scale indoor panorama dataset ZInD [3]. Our code is available online1.

Cite

Text

Su et al. "GPR-Net: Multi-View Layout Estimation via a Geometry-Aware Panorama Registration Network." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023. doi:10.1109/CVPRW59228.2023.00688

Markdown

[Su et al. "GPR-Net: Multi-View Layout Estimation via a Geometry-Aware Panorama Registration Network." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023.](https://mlanthology.org/cvprw/2023/su2023cvprw-gprnet/) doi:10.1109/CVPRW59228.2023.00688

BibTeX

@inproceedings{su2023cvprw-gprnet,
  title     = {{GPR-Net: Multi-View Layout Estimation via a Geometry-Aware Panorama Registration Network}},
  author    = {Su, Jheng-Wei and Peng, Chi-Han and Wonka, Peter and Chu, Hung-Kuo},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2023},
  pages     = {6469-6478},
  doi       = {10.1109/CVPRW59228.2023.00688},
  url       = {https://mlanthology.org/cvprw/2023/su2023cvprw-gprnet/}
}