Structural Deep Metric Learning for Room Layout Estimation
Abstract
In this paper, we propose a structural deep metric learning (SDML) method for room layout estimation, which aims to recover the 3D spatial layout of a cluttered indoor scene from a monocular RGB image. Different from existing room layout estimation methods that solve a regression or per-pixel classification problem, we formulate the room layout estimation problem from a metric learning perspective where we explicitly model the structural relations across different images. We propose to learn a latent embedding space where the Euclidean distance can characterize the actual structural difference between the layouts of two rooms. We then minimize the discrepancy between an image and its ground-truth layout in the learned embedding space. We employ a metric model and a layout encoder to map the RGB images and the ground-truth layouts to the embedding space, respectively, and a layout decoder to map the embeddings to the corresponding layouts, where the whole framework is trained in an end-to-end manner. We perform experiments on the widely used Hedau and LSUN datasets and achieve state-of-the-art performance.
Cite
Text
Zheng et al. "Structural Deep Metric Learning for Room Layout Estimation." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58523-5_43Markdown
[Zheng et al. "Structural Deep Metric Learning for Room Layout Estimation." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/zheng2020eccv-structural/) doi:10.1007/978-3-030-58523-5_43BibTeX
@inproceedings{zheng2020eccv-structural,
title = {{Structural Deep Metric Learning for Room Layout Estimation}},
author = {Zheng, Wenzhao and Lu, Jiwen and Zhou, Jie},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2020},
doi = {10.1007/978-3-030-58523-5_43},
url = {https://mlanthology.org/eccv/2020/zheng2020eccv-structural/}
}