Object Detection by 3D Aspectlets and Occlusion Reasoning
Abstract
We propose a novel framework for detecting multiple objects from a single image and reasoning about occlusions between objects. We address this problem from a 3D perspective in order to handle the various occlusion patterns that can occur between objects. We introduce the concept of "3D aspectlets" based on a piecewise planar object representation. A 3D aspectlet represents a portion of the object that provides evidence for a partial observation of the object. A new probabilistic model (which we call the spatial layout model) is proposed to combine the bottom-up evidence from 3D aspectlets and the top-down occlusion reasoning to help object detection. Experiments are conducted on two new challenging datasets with various degrees of occlusion to demonstrate that, by contextualizing objects in their 3D geometric configuration with respect to the observer, our method is able to obtain competitive detection results even in the presence of severe occlusions. Moreover, we demonstrate the ability of the model to estimate the locations of objects in 3D and predict the occlusion order between objects in images.
Cite
Text
Xiang and Savarese. "Object Detection by 3D Aspectlets and Occlusion Reasoning." IEEE/CVF International Conference on Computer Vision Workshops, 2013. doi:10.1109/ICCVW.2013.75
Markdown
[Xiang and Savarese. "Object Detection by 3D Aspectlets and Occlusion Reasoning." IEEE/CVF International Conference on Computer Vision Workshops, 2013.](https://mlanthology.org/iccvw/2013/xiang2013iccvw-object/) doi:10.1109/ICCVW.2013.75
BibTeX
@inproceedings{xiang2013iccvw-object,
title = {{Object Detection by 3D Aspectlets and Occlusion Reasoning}},
author = {Xiang, Yu and Savarese, Silvio},
booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
year = {2013},
pages = {530-537},
doi = {10.1109/ICCVW.2013.75},
url = {https://mlanthology.org/iccvw/2013/xiang2013iccvw-object/}
}