Reconstructing PASCAL VOC

Abstract

We address the problem of populating object category detection datasets with dense, per-object 3D reconstructions, bootstrapped from class labels, ground truth figure-ground segmentations and a small set of keypoint annotations. Our proposed algorithm first estimates camera viewpoint using rigid structure-from-motion, then reconstructs object shapes by optimizing over visual hull proposals guided by loose within-class shape similarity assumptions. The visual hull sampling process attempts to intersect an object's projection cone with the cones of minimal subsets of other similar objects among those pictured from certain vantage points. We show that our method is able to produce convincing per-object 3D reconstructions on one of the most challenging existing object-category detection datasets, PASCAL VOC. Our results may re-stimulate once popular geometry-oriented model-based recognition approaches.

Cite

Text

Vicente et al. "Reconstructing PASCAL VOC." Conference on Computer Vision and Pattern Recognition, 2014. doi:10.1109/CVPR.2014.13

Markdown

[Vicente et al. "Reconstructing PASCAL VOC." Conference on Computer Vision and Pattern Recognition, 2014.](https://mlanthology.org/cvpr/2014/vicente2014cvpr-reconstructing/) doi:10.1109/CVPR.2014.13

BibTeX

@inproceedings{vicente2014cvpr-reconstructing,
  title     = {{Reconstructing PASCAL VOC}},
  author    = {Vicente, Sara and Carreira, Joao and Agapito, Lourdes and Batista, Jorge},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2014},
  doi       = {10.1109/CVPR.2014.13},
  url       = {https://mlanthology.org/cvpr/2014/vicente2014cvpr-reconstructing/}
}