Dual Graphical Models for Relational Modeling of Indoor Object Categories

Abstract

There are three levels for indoor scene understanding, pixel level labeling, object level recognition and scene level holistic understanding. The three levels provide complementary bottom-up scene representation. Traditional research often addresses these three tasks separately where the three levels of semantic data are seldom jointly considered. We propose a new method to bridge the three semantic levels by using dual graphical models for relational modeling of object categories in indoor scenes. The vertical placement model captures top-down object configuration by which the visible pixels of some accessory objects could be used to infer the presence of a supportive object underneath. The horizontal placement model reveals how multiple object categories are related to each other on the ground in different indoor scenes. The experimental results show improvements on the bounding box accuracy using both vertical and horizontal placement models from pixel level labeling.

Cite

Text

Guo et al. "Dual Graphical Models for Relational Modeling of Indoor Object Categories." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019. doi:10.1109/CVPRW.2019.00132

Markdown

[Guo et al. "Dual Graphical Models for Relational Modeling of Indoor Object Categories." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019.](https://mlanthology.org/cvprw/2019/guo2019cvprw-dual/) doi:10.1109/CVPRW.2019.00132

BibTeX

@inproceedings{guo2019cvprw-dual,
  title     = {{Dual Graphical Models for Relational Modeling of Indoor Object Categories}},
  author    = {Guo, Lin and Fan, Guoliang and Sheng, Weihua},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2019},
  pages     = {1007-1013},
  doi       = {10.1109/CVPRW.2019.00132},
  url       = {https://mlanthology.org/cvprw/2019/guo2019cvprw-dual/}
}