Dynamic Visual Search Using Inner-Scene Similarity: Algorithms and Inherent Limitations
Abstract
A dynamic visual search framework based mainly on inner-scene similarity is proposed. Algorithms as well as measures quantifying the difficulty of search tasks are suggested. Given a number of candidates (e.g. sub-images), our basic hypothesis is that more visually similar candidates are more likely to have the same identity. Both deterministic and stochastic approaches, relying on this hypothesis, are used to quantify this intuition. Under the deterministic approach, we suggest a measure similar to Kolmogorov’s ε -covering that quantifies the difficulty of a search task and bounds the performance of all search algorithms. We also suggest a simple algorithm that meets this bound. Under the stochastic approach, we model the identities of the candidates as correlated random variables and characterize the task using its second order statistics. We derive a search procedure based on minimum MSE linear estimation. Simple extensions enable the algorithm to use top-down and/or bottom-up information, when available.
Cite
Text
Avraham and Lindenbaum. "Dynamic Visual Search Using Inner-Scene Similarity: Algorithms and Inherent Limitations." European Conference on Computer Vision, 2004. doi:10.1007/978-3-540-24671-8_5Markdown
[Avraham and Lindenbaum. "Dynamic Visual Search Using Inner-Scene Similarity: Algorithms and Inherent Limitations." European Conference on Computer Vision, 2004.](https://mlanthology.org/eccv/2004/avraham2004eccv-dynamic/) doi:10.1007/978-3-540-24671-8_5BibTeX
@inproceedings{avraham2004eccv-dynamic,
title = {{Dynamic Visual Search Using Inner-Scene Similarity: Algorithms and Inherent Limitations}},
author = {Avraham, Tamar and Lindenbaum, Michael},
booktitle = {European Conference on Computer Vision},
year = {2004},
pages = {58-70},
doi = {10.1007/978-3-540-24671-8_5},
url = {https://mlanthology.org/eccv/2004/avraham2004eccv-dynamic/}
}