Discriminatively Trained And-or Tree Models for Object Detection
Abstract
This paper presents a method of learning reconfigurable And-Or Tree (AOT) models discriminatively from weakly annotated data for object detection. To explore the appearance and geometry space of latent structures effectively, we first quantize the image lattice using an overcomplete set of shape primitives, and then organize them into a directed acyclic And-Or Graph (AOG) by exploiting their compositional relations. We allow overlaps between child nodes when combining them into a parent node, which is equivalent to introducing an appearance Or-node implicitly for the overlapped portion. The learning of an AOT model consists of three components: (i) Unsupervised sub-category learning (i.e., branches of an object Or-node) with the latent structures in AOG being integrated out. (ii) Weaklysupervised part configuration learning (i.e., seeking the globally optimal parse trees in AOG for each sub-category). To search the globally optimal parse tree in AOG efficiently, we propose a dynamic programming (DP) algorithm. (iii) Joint appearance and structural parameters training under latent structural SVM framework. In experiments, our method is tested on PASCAL VOC 2007 and 2010 detection benchmarks of 20 object classes and outperforms comparable state-of-the-art methods.
Cite
Text
Song et al. "Discriminatively Trained And-or Tree Models for Object Detection." Conference on Computer Vision and Pattern Recognition, 2013. doi:10.1109/CVPR.2013.421Markdown
[Song et al. "Discriminatively Trained And-or Tree Models for Object Detection." Conference on Computer Vision and Pattern Recognition, 2013.](https://mlanthology.org/cvpr/2013/song2013cvpr-discriminatively/) doi:10.1109/CVPR.2013.421BibTeX
@inproceedings{song2013cvpr-discriminatively,
title = {{Discriminatively Trained And-or Tree Models for Object Detection}},
author = {Song, Xi and Wu, Tianfu and Jia, Yunde and Zhu, Song-Chun},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2013},
doi = {10.1109/CVPR.2013.421},
url = {https://mlanthology.org/cvpr/2013/song2013cvpr-discriminatively/}
}