Object Detection with Grammar Models

Abstract

Compositional models provide an elegant formalism for representing the visual appearance of highly variable objects. While such models are appealing from a theoretical point of view, it has been difficult to demonstrate that they lead to performance advantages on challenging datasets. Here we develop a grammar model for person detection and show that it outperforms previous high-performance systems on the PASCAL benchmark. Our model represents people using a hierarchy of deformable parts, variable structure and an explicit model of occlusion for partially visible objects. To train the model, we introduce a new discriminative framework for learning structured prediction models from weakly-labeled data.

Cite

Text

Girshick et al. "Object Detection with Grammar Models." Neural Information Processing Systems, 2011.

Markdown

[Girshick et al. "Object Detection with Grammar Models." Neural Information Processing Systems, 2011.](https://mlanthology.org/neurips/2011/girshick2011neurips-object/)

BibTeX

@inproceedings{girshick2011neurips-object,
  title     = {{Object Detection with Grammar Models}},
  author    = {Girshick, Ross B. and Felzenszwalb, Pedro F. and McAllester, David A.},
  booktitle = {Neural Information Processing Systems},
  year      = {2011},
  pages     = {442-450},
  url       = {https://mlanthology.org/neurips/2011/girshick2011neurips-object/}
}