Seg2Reg: Differentiable 2D Segmentation to 1d Regression Rendering for 360 Room Layout Reconstruction

Abstract

State-of-the-art single-view 360 room layout reconstruction methods formulate the problem as a high-level 1D (per-column) regression task. On the other hand traditional low-level 2D layout segmentation is simpler to learn and can represent occluded regions but it requires complex post-processing for the targeting layout polygon and sacrifices accuracy. We present Seg2Reg to render 1D layout depth regression from the 2D segmentation map in a differentiable and occlusion-aware way marrying the merits of both sides. Specifically our model predicts floor-plan density for the input equirectangular 360 image. Formulating the 2D layout representation as a density field enables us to employ 'flattened' volume rendering to form 1D layout depth regression. In addition we propose a novel 3D warping augmentation on layout to improve generalization. Finally we re-implement recent room layout reconstruction methods into our codebase for benchmarking and explore modern backbones and training techniques to serve as the strong baseline. The code is at https: //PanoLayoutStudio.github.io .

Cite

Text

Sun et al. "Seg2Reg: Differentiable 2D Segmentation to 1d Regression Rendering for 360 Room Layout Reconstruction." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00993

Markdown

[Sun et al. "Seg2Reg: Differentiable 2D Segmentation to 1d Regression Rendering for 360 Room Layout Reconstruction." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/sun2024cvpr-seg2reg/) doi:10.1109/CVPR52733.2024.00993

BibTeX

@inproceedings{sun2024cvpr-seg2reg,
  title     = {{Seg2Reg: Differentiable 2D Segmentation to 1d Regression Rendering for 360 Room Layout Reconstruction}},
  author    = {Sun, Cheng and Tai, Wei-En and Shih, Yu-Lin and Chen, Kuan-Wei and Syu, Yong-Jing and The, Kent Selwyn and Wang, Yu-Chiang Frank and Chen, Hwann-Tzong},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {10435-10445},
  doi       = {10.1109/CVPR52733.2024.00993},
  url       = {https://mlanthology.org/cvpr/2024/sun2024cvpr-seg2reg/}
}