Binarized Mode Seeking for Scalable Visual Pattern Discovery
Abstract
This paper studies visual pattern discovery in large-scale image collections via binarized mode seeking, where images can only be represented as binary codes for efficient storage and computation. We address this problem from the perspective of mode seeking in binary space. First, we propose binary mean shift (bMS), which discovers frequent patterns via mode seeking directly in binary space; a binomial-based kernel and a binary constraint are introduced for binarized analysis. Second, we extend bMS to a more general form, contrastive binary mean shift (cbMS), which maximizes the contrastive density in binary space to find informative patterns that are both frequent and discriminative for the dataset. With the binarized algorithm and optimization, our methods deliver significant computation (50X) and storage (32X) improvements over standard techniques operating in Euclidean space, without a large drop in performance. Furthermore, cbMS discovers more informative patterns by suppressing modes with low discriminative power. We evaluate our methods on both the annotated ILSVRC (1M images) and the un-annotated blind Flickr (10M images) datasets, demonstrating the scalability and effectiveness of our algorithms for discovering frequent and informative patterns in large-scale collections.
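To make the idea of mode seeking in binary space concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a kernel weight of the form `p**d * (1 - p)**(n_bits - d)` for Hamming distance `d` (one plausible reading of "binomial-based kernel") and enforces the binary constraint with a weighted per-bit majority vote. The function names and the parameter `p` are hypothetical. The helper `storage_ratio` illustrates one way a 32X storage figure could arise, assuming a 32-bit float per dimension is replaced by a single bit.

```python
import numpy as np


def hamming_distances(codes, center):
    """Hamming distance from each row of `codes` to `center` (0/1 arrays)."""
    return np.count_nonzero(codes != center, axis=1)


def binary_mean_shift_step(codes, center, p=0.25):
    """One illustrative update step in Hamming space.

    Assumed kernel (not taken from the paper): w_i = p**d_i * (1-p)**(n_bits-d_i),
    so codes closer to the current center receive larger weight when p < 0.5.
    """
    n_bits = codes.shape[1]
    d = hamming_distances(codes, center)
    # Work in log space so long codes do not underflow to zero weight.
    log_w = d * np.log(p) + (n_bits - d) * np.log1p(-p)
    w = np.exp(log_w - log_w.max())
    weighted_mean = (w[:, None] * codes).sum(axis=0) / w.sum()
    # Binary constraint: round the weighted mean per bit (ties go to 1).
    return (weighted_mean >= 0.5).astype(np.uint8)


def seek_mode(codes, start, p=0.25, max_iter=50):
    """Iterate the update from `start` until the center stops changing."""
    center = start.copy()
    for _ in range(max_iter):
        new_center = binary_mean_shift_step(codes, center, p)
        if np.array_equal(new_center, center):
            break
        center = new_center
    return center


def storage_ratio(codes):
    """Bytes needed as float32 features divided by bytes as packed bits."""
    packed = np.packbits(codes, axis=1)
    return codes.astype(np.float32).nbytes / packed.nbytes
```

Starting `seek_mode` from a noisy code near a dense cluster should move the center toward that cluster's dominant bit pattern, while distant clusters contribute almost nothing because their kernel weights decay geometrically with Hamming distance. When the code length is a multiple of 8, `storage_ratio` evaluates to 32 under the float32 assumption above.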
Cite
Text
Zhang et al. "Binarized Mode Seeking for Scalable Visual Pattern Discovery." Conference on Computer Vision and Pattern Recognition, 2017. doi:10.1109/CVPR.2017.722
Markdown
[Zhang et al. "Binarized Mode Seeking for Scalable Visual Pattern Discovery." Conference on Computer Vision and Pattern Recognition, 2017.](https://mlanthology.org/cvpr/2017/zhang2017cvpr-binarized/) doi:10.1109/CVPR.2017.722
BibTeX
@inproceedings{zhang2017cvpr-binarized,
title = {{Binarized Mode Seeking for Scalable Visual Pattern Discovery}},
author = {Zhang, Wei and Cao, Xiaochun and Wang, Rui and Guo, Yuanfang and Chen, Zhineng},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2017},
doi = {10.1109/CVPR.2017.722},
url = {https://mlanthology.org/cvpr/2017/zhang2017cvpr-binarized/}
}