SuperParsing: Scalable Nonparametric Image Parsing with Superpixels

Abstract

This paper presents a simple and effective nonparametric approach to the problem of image parsing, or labeling image regions (in our case, superpixels produced by bottom-up segmentation) with their categories. This approach requires no training, and it can easily scale to datasets with tens of thousands of images and hundreds of labels. It works by scene-level matching with global image descriptors, followed by superpixel-level matching with local features and efficient Markov random field (MRF) optimization for incorporating neighborhood context. Our MRF setup can also compute a simultaneous labeling of image regions into semantic classes (e.g., tree, building, car) and geometric classes (sky, vertical, ground). Our system outperforms the state-of-the-art nonparametric method based on SIFT Flow on a dataset of 2,688 images and 33 labels. In addition, we report per-pixel rates on a larger dataset of 15,150 images and 170 labels. To our knowledge, this is the first complete evaluation of image parsing on a dataset of this size, and it establishes a new benchmark for the problem.

Cite

Text

Tighe and Lazebnik. "SuperParsing: Scalable Nonparametric Image Parsing with Superpixels." European Conference on Computer Vision, 2010. doi:10.1007/978-3-642-15555-0_26

Markdown

[Tighe and Lazebnik. "SuperParsing: Scalable Nonparametric Image Parsing with Superpixels." European Conference on Computer Vision, 2010.](https://mlanthology.org/eccv/2010/tighe2010eccv-superparsing/) doi:10.1007/978-3-642-15555-0_26

BibTeX

@inproceedings{tighe2010eccv-superparsing,
  title     = {{SuperParsing: Scalable Nonparametric Image Parsing with Superpixels}},
  author    = {Tighe, Joseph and Lazebnik, Svetlana},
  booktitle = {European Conference on Computer Vision},
  year      = {2010},
  pages     = {352-365},
  doi       = {10.1007/978-3-642-15555-0_26},
  url       = {https://mlanthology.org/eccv/2010/tighe2010eccv-superparsing/}
}