3D LayoutCRF for Multi-View Object Class Recognition and Segmentation
Abstract
We introduce an approach to accurately detect and segment partially occluded objects in various viewpoints and scales. Our main contribution is a novel framework for combining object-level descriptions (such as position, shape, and color) with pixel-level appearance, boundary, and occlusion reasoning. In training, we exploit a rough 3D object model to learn physically localized part appearances. To find and segment objects in an image, we generate proposals based on the appearance and layout of local parts. The proposals are then refined after incorporating object-level information, and overlapping objects compete for pixels to produce a final description and segmentation of objects in the scene. A further contribution is a novel instance penalty, which is handled very efficiently during inference. We experimentally validate our approach on the challenging PASCAL'06 car database.
Cite
Text
Hoiem et al. "3D LayoutCRF for Multi-View Object Class Recognition and Segmentation." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2007. doi:10.1109/CVPR.2007.383045Markdown
[Hoiem et al. "3D LayoutCRF for Multi-View Object Class Recognition and Segmentation." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2007.](https://mlanthology.org/cvpr/2007/hoiem2007cvpr-d/) doi:10.1109/CVPR.2007.383045BibTeX
@inproceedings{hoiem2007cvpr-d,
title = {{3D LayoutCRF for Multi-View Object Class Recognition and Segmentation}},
author = {Hoiem, Derek and Rother, Carsten and Winn, John M.},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2007},
doi = {10.1109/CVPR.2007.383045},
url = {https://mlanthology.org/cvpr/2007/hoiem2007cvpr-d/}
}