Segmentation from a Box

Abstract

Drawing a box around an intended segmentation target has become both a popular user interface and a common output for learning-driven detection algorithms. Despite the ubiquity of using a box to define a segmentation target, it is unclear in the literature whether a box is sufficient to define a unique segmentation or whether segmentation from a box is ill-posed without higher-level (semantic) knowledge of the intended target. We examine this issue by conducting a study of 14 subjects who are asked to segment a boxed target in a set of 50 real images for which they have no semantic attachment. We find that the subjects do indeed perceive and trace almost the same segmentations as each other, despite the inhomogeneity of the image intensities, irregular shapes of the segmentation targets and weakness of the target boundaries. Since the subjects produce the same segmentation, we conclude that the problem is well-posed and then provide a new segmentation algorithm from a box which achieves results close to the perceived target.

Cite

Text

Grady et al. "Segmentation from a Box." IEEE/CVF International Conference on Computer Vision, 2011. doi:10.1109/ICCV.2011.6126264

Markdown

[Grady et al. "Segmentation from a Box." IEEE/CVF International Conference on Computer Vision, 2011.](https://mlanthology.org/iccv/2011/grady2011iccv-segmentation/) doi:10.1109/ICCV.2011.6126264

BibTeX

@inproceedings{grady2011iccv-segmentation,
  title     = {{Segmentation from a Box}},
  author    = {Grady, Leo J. and Jolly, Marie-Pierre and Seitz, Aaron R.},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2011},
  pages     = {367-374},
  doi       = {10.1109/ICCV.2011.6126264},
  url       = {https://mlanthology.org/iccv/2011/grady2011iccv-segmentation/}
}