Efficiently Selecting Regions for Scene Understanding
Abstract
Recent advances in scene understanding and related tasks have highlighted the importance of using regions to reason about high-level scene structure. Typically, the regions are selected beforehand and then an energy function is defined over them. This two step process suffers from the following deficiencies: (i) the regions may not match the boundaries of the scene entities, thereby introducing errors; and (ii) as the regions are obtained without any knowledge of the energy function, they may not be suitable for the task at hand. We address these problems by designing an efficient approach for obtaining the best set of regions in terms of the energy function itself. Each iteration of our algorithm selects regions from a large dictionary by solving an accurate linear programming relaxation via dual decomposition. The dictionary of regions is constructed by merging and intersecting segments obtained from multiple bottom-up over-segmentations. To demonstrate the usefulness of our algorithm, we consider the task of scene segmentation and show significant improvements over state of the art methods.
Cite
Text
Kumar and Koller. "Efficiently Selecting Regions for Scene Understanding." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010. doi:10.1109/CVPR.2010.5540072Markdown
[Kumar and Koller. "Efficiently Selecting Regions for Scene Understanding." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010.](https://mlanthology.org/cvpr/2010/kumar2010cvpr-efficiently/) doi:10.1109/CVPR.2010.5540072BibTeX
@inproceedings{kumar2010cvpr-efficiently,
title = {{Efficiently Selecting Regions for Scene Understanding}},
author = {Kumar, M. Pawan and Koller, Daphne},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2010},
pages = {3217-3224},
doi = {10.1109/CVPR.2010.5540072},
url = {https://mlanthology.org/cvpr/2010/kumar2010cvpr-efficiently/}
}