Seg2Reg: Differentiable 2D Segmentation to 1d Regression Rendering for 360 Room Layout Reconstruction
Abstract
State-of-the-art single-view 360 room layout reconstruction methods formulate the problem as a high-level 1D (per-column) regression task. On the other hand traditional low-level 2D layout segmentation is simpler to learn and can represent occluded regions but it requires complex post-processing for the targeting layout polygon and sacrifices accuracy. We present Seg2Reg to render 1D layout depth regression from the 2D segmentation map in a differentiable and occlusion-aware way marrying the merits of both sides. Specifically our model predicts floor-plan density for the input equirectangular 360 image. Formulating the 2D layout representation as a density field enables us to employ 'flattened' volume rendering to form 1D layout depth regression. In addition we propose a novel 3D warping augmentation on layout to improve generalization. Finally we re-implement recent room layout reconstruction methods into our codebase for benchmarking and explore modern backbones and training techniques to serve as the strong baseline. The code is at https: //PanoLayoutStudio.github.io .
Cite
Text
Sun et al. "Seg2Reg: Differentiable 2D Segmentation to 1d Regression Rendering for 360 Room Layout Reconstruction." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00993Markdown
[Sun et al. "Seg2Reg: Differentiable 2D Segmentation to 1d Regression Rendering for 360 Room Layout Reconstruction." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/sun2024cvpr-seg2reg/) doi:10.1109/CVPR52733.2024.00993BibTeX
@inproceedings{sun2024cvpr-seg2reg,
title = {{Seg2Reg: Differentiable 2D Segmentation to 1d Regression Rendering for 360 Room Layout Reconstruction}},
author = {Sun, Cheng and Tai, Wei-En and Shih, Yu-Lin and Chen, Kuan-Wei and Syu, Yong-Jing and The, Kent Selwyn and Wang, Yu-Chiang Frank and Chen, Hwann-Tzong},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {10435-10445},
doi = {10.1109/CVPR52733.2024.00993},
url = {https://mlanthology.org/cvpr/2024/sun2024cvpr-seg2reg/}
}