Understanding Bayesian Rooms Using Composite 3D Object Models
Abstract
We develop a comprehensive Bayesian generative model for understanding indoor scenes. While it is common in this domain to approximate objects with 3D bounding boxes, we propose using strong representations with finer granularity. For example, we model a chair as a set of four legs, a seat and a backrest. We find that modeling detailed geometry improves recognition and reconstruction, and enables more refined use of appearance for scene understanding. We demonstrate this with a new likelihood function that rewards 3D object hypotheses whose 2D projection is more uniform in color distribution. Such a measure would be confused by background pixels if we used a bounding box to represent a concave object like a chair. Complex objects are modeled using a set or re-usable 3D parts, and we show that this representation captures much of the variation among object instances with relatively few parameters. We also designed specific data-driven inference mechanisms for each part that are shared by all objects containing that part, which helps make inference transparent to the modeler. Further, we show how to exploit contextual relationships to detect more objects, by, for example,proposing chairs around and underneath tables. We present results showing the benefits of each of these innovations. The performance of our approach often exceeds that of state-of-the-art methods on the two tasks of room layout estimation and object recognition, as evaluated on two bench mark data sets used in this domain.
Cite
Text
Del Pero et al. "Understanding Bayesian Rooms Using Composite 3D Object Models." Conference on Computer Vision and Pattern Recognition, 2013. doi:10.1109/CVPR.2013.27Markdown
[Del Pero et al. "Understanding Bayesian Rooms Using Composite 3D Object Models." Conference on Computer Vision and Pattern Recognition, 2013.](https://mlanthology.org/cvpr/2013/pero2013cvpr-understanding/) doi:10.1109/CVPR.2013.27BibTeX
@inproceedings{pero2013cvpr-understanding,
title = {{Understanding Bayesian Rooms Using Composite 3D Object Models}},
author = {Del Pero, Luca and Bowdish, Joshua and Kermgard, Bonnie and Hartley, Emily and Barnard, Kobus},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2013},
doi = {10.1109/CVPR.2013.27},
url = {https://mlanthology.org/cvpr/2013/pero2013cvpr-understanding/}
}