Growing a Bag of Systems Tree for Fast and Accurate Classification

Abstract

The bag-of-systems (BoS) representation is a descriptor of motion in a video, where dynamic texture (DT) codewords represent the typical motion patterns in spatio-temporal patches extracted from the video. The efficacy of the BoS descriptor depends on the richness of the codebook, which directly depends on the number of codewords in the codebook. However, for even modest sized codebooks, mapping videos onto the codebook results in a heavy computational load. In this paper we propose the BoS Tree, which constructs a bottom-up hierarchy of codewords that enables efficient mapping of videos to the BoS codebook. By leveraging the tree structure to efficiently index the codewords, the BoS Tree allows for fast look-ups in the codebook and enables the practical use of larger, richer codebooks. We demonstrate the effectiveness of BoS Trees on classification of three video datasets, as well as on annotation of a music dataset.

Cite

Text

Coviello et al. "Growing a Bag of Systems Tree for Fast and Accurate Classification." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012. doi:10.1109/CVPR.2012.6247900

Markdown

[Coviello et al. "Growing a Bag of Systems Tree for Fast and Accurate Classification." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012.](https://mlanthology.org/cvpr/2012/coviello2012cvpr-growing/) doi:10.1109/CVPR.2012.6247900

BibTeX

@inproceedings{coviello2012cvpr-growing,
  title     = {{Growing a Bag of Systems Tree for Fast and Accurate Classification}},
  author    = {Coviello, Emanuele and Mumtaz, Adeel and Chan, Antoni B. and Lanckriet, Gert R. G.},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2012},
  pages     = {1979-1986},
  doi       = {10.1109/CVPR.2012.6247900},
  url       = {https://mlanthology.org/cvpr/2012/coviello2012cvpr-growing/}
}