Top-Down Pairwise Potentials for Piecing Together Multi-Class Segmentation Puzzles

Abstract

Top-down class-specific knowledge is crucial for accurate image segmentation, as low-level color and texture cues alone are insufficient to identify true object boundaries. However, existing methods such as conditional random field models (CRFs) generally impose the class-specific knowledge only at the "node" level, evaluating class membership probabilities at the (super)pixels that define the random field graph. We introduce a strategy for pairwise potential functions that capture top-down information, where we prefer to assign the same label to adjacent regions when the entropy reduction that would result from their merging is high. By measuring how the certainty of the object-level classifiers changes when considering the appearance description extracted from adjacent regions, we can "piece together" objects whose heterogenous texture would prevent both the too-local node potentials and conventional bottom-up smoothness terms from recognizing the object. We show how this idea can be used as either an affinity function for agglomerative clustering, or a pairwise potential for a CRF model. Experiments with two datasets show that the proposed entropy-guided affinity function has a clear positive impact on multi-class segmentation.

Cite

Text

Vijayanarasimhan and Grauman. "Top-Down Pairwise Potentials for Piecing Together Multi-Class Segmentation Puzzles." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2010. doi:10.1109/CVPRW.2010.5543728

Markdown

[Vijayanarasimhan and Grauman. "Top-Down Pairwise Potentials for Piecing Together Multi-Class Segmentation Puzzles." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2010.](https://mlanthology.org/cvprw/2010/vijayanarasimhan2010cvprw-topdown/) doi:10.1109/CVPRW.2010.5543728

BibTeX

@inproceedings{vijayanarasimhan2010cvprw-topdown,
  title     = {{Top-Down Pairwise Potentials for Piecing Together Multi-Class Segmentation Puzzles}},
  author    = {Vijayanarasimhan, Sudheendra and Grauman, Kristen},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2010},
  pages     = {25-32},
  doi       = {10.1109/CVPRW.2010.5543728},
  url       = {https://mlanthology.org/cvprw/2010/vijayanarasimhan2010cvprw-topdown/}
}