Summarizing Visual Data Using Bidirectional Similarity

Abstract

We propose a principled approach to summarization of visual data (images or video) based on optimization of a well-defined similarity measure. The problem we consider is re-targeting (or summarization) of image/video data into smaller sizes. A good ldquovisual summaryrdquo should satisfy two properties: (1) it should contain as much as possible visual information from the input data; (2) it should introduce as few as possible new visual artifacts that were not in the input data (i.e., preserve visual coherence). We propose a bi-directional similarity measure which quantitatively captures these two requirements: Two signals S and T are considered visually similar if all patches of S (at multiple scales) are contained in T, and vice versa. The problem of summarization/re-targeting is posed as an optimization problem of this bi-directional similarity measure. We show summarization results for image and video data. We further show that the same approach can be used to address a variety of other problems, including automatic cropping, completion and synthesis of visual data, image collage, object removal, photo reshuffling and more.

Cite

Text

Simakov et al. "Summarizing Visual Data Using Bidirectional Similarity." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008. doi:10.1109/CVPR.2008.4587842

Markdown

[Simakov et al. "Summarizing Visual Data Using Bidirectional Similarity." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008.](https://mlanthology.org/cvpr/2008/simakov2008cvpr-summarizing/) doi:10.1109/CVPR.2008.4587842

BibTeX

@inproceedings{simakov2008cvpr-summarizing,
  title     = {{Summarizing Visual Data Using Bidirectional Similarity}},
  author    = {Simakov, Denis and Caspi, Yaron and Shechtman, Eli and Irani, Michal},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2008},
  doi       = {10.1109/CVPR.2008.4587842},
  url       = {https://mlanthology.org/cvpr/2008/simakov2008cvpr-summarizing/}
}