Structural Deep Metric Learning for Room Layout Estimation

Zheng, Wenzhao; Lu, Jiwen; Zhou, Jie

doi:10.1007/978-3-030-58523-5_43

Structural Deep Metric Learning for Room Layout Estimation

Wenzhao Zheng, Jiwen Lu, Jie Zhou

ECCV 2020

doi:10.1007/978-3-030-58523-5_43 /eccv/2020/zheng2020eccv-structural/

Abstract

In this paper, we propose a structural deep metric learning (SDML) method for room layout estimation, which aims to recover the 3D spatial layout of a cluttered indoor scene from a monocular RGB image. Different from existing room layout estimation methods that solve a regression or per-pixel classification problem, we formulate the room layout estimation problem from a metric learning perspective where we explicitly model the structural relations across different images. We propose to learn a latent embedding space where the Euclidean distance can characterize the actual structural difference between the layouts of two rooms. We then minimize the discrepancy between an image and its ground-truth layout in the learned embedding space. We employ a metric model and a layout encoder to map the RGB images and the ground-truth layouts to the embedding space, respectively, and a layout decoder to map the embeddings to the corresponding layouts, where the whole framework is trained in an end-to-end manner. We perform experiments on the widely used Hedau and LSUN datasets and achieve state-of-the-art performance.

PDF ECCV Semantic Scholar

Cite

Text

Zheng et al. "Structural Deep Metric Learning for Room Layout Estimation." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58523-5_43

Markdown

[Zheng et al. "Structural Deep Metric Learning for Room Layout Estimation." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/zheng2020eccv-structural/) doi:10.1007/978-3-030-58523-5_43

BibTeX

@inproceedings{zheng2020eccv-structural,
  title     = {{Structural Deep Metric Learning for Room Layout Estimation}},
  author    = {Zheng, Wenzhao and Lu, Jiwen and Zhou, Jie},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2020},
  doi       = {10.1007/978-3-030-58523-5_43},
  url       = {https://mlanthology.org/eccv/2020/zheng2020eccv-structural/}
}