Predicting Multiple Structured Visual Interpretations

Dey, Debadeepta; Ramakrishna, Varun; Hebert, Martial; Bagnell, J. Andrew

doi:10.1109/ICCV.2015.337

Predicting Multiple Structured Visual Interpretations

Debadeepta Dey, Varun Ramakrishna, Martial Hebert, J. Andrew Bagnell

ICCV 2015

doi:10.1109/ICCV.2015.337 /iccv/2015/dey2015iccv-predicting/

Abstract

We present a simple approach for producing a small number of structured visual outputs which have high recall, for a variety of tasks including monocular pose estimation and semantic scene segmentation. Current state-of-the-art approaches learn a single model and modify inference procedures to produce a small number of diverse predictions. We take the alternate route of modifying the learning procedure to directly optimize for good, high recall sequences of structured-output predictors. Our approach introduces no new parameters, naturally learns diverse predictions and is not tied to any specific structured learning or inference procedure. We leverage recent advances in the contextual submodular maximization literature to learn a sequence of predictors and empirically demonstrate the simplicity and performance of our approach on multiple challenging vision tasks including achieving state-of-the-art results on multiple predictions for monocular pose-estimation and image foreground/background segmentation.

PDF ICCV Semantic Scholar

Cite

Text

Dey et al. "Predicting Multiple Structured Visual Interpretations." International Conference on Computer Vision, 2015. doi:10.1109/ICCV.2015.337

Markdown

[Dey et al. "Predicting Multiple Structured Visual Interpretations." International Conference on Computer Vision, 2015.](https://mlanthology.org/iccv/2015/dey2015iccv-predicting/) doi:10.1109/ICCV.2015.337

BibTeX

@inproceedings{dey2015iccv-predicting,
  title     = {{Predicting Multiple Structured Visual Interpretations}},
  author    = {Dey, Debadeepta and Ramakrishna, Varun and Hebert, Martial and Bagnell, J. Andrew},
  booktitle = {International Conference on Computer Vision},
  year      = {2015},
  doi       = {10.1109/ICCV.2015.337},
  url       = {https://mlanthology.org/iccv/2015/dey2015iccv-predicting/}
}