Deep Watershed Transform for Instance Segmentation

Abstract

Most contemporary approaches to instance segmentation use complex pipelines involving conditional random fields, recurrent neural networks, object proposals, or template matching schemes. In this paper, we present a simple yet powerful end-to-end convolutional neural network to tackle this task. Our approach combines intuitions from the classical watershed transform and modern deep learning to produce an energy map of the image where object instances are unambiguously represented as energy basins. We then perform a cut at a single energy level to directly yield connected components corresponding to object instances. Our model achieves more than double the performance over the state-of-the-art on the challenging Cityscapes Instance Level Segmentation task.

Cite

Text

Bai and Urtasun. "Deep Watershed Transform for Instance Segmentation." Conference on Computer Vision and Pattern Recognition, 2017. doi:10.1109/CVPR.2017.305

Markdown

[Bai and Urtasun. "Deep Watershed Transform for Instance Segmentation." Conference on Computer Vision and Pattern Recognition, 2017.](https://mlanthology.org/cvpr/2017/bai2017cvpr-deep/) doi:10.1109/CVPR.2017.305

BibTeX

@inproceedings{bai2017cvpr-deep,
  title     = {{Deep Watershed Transform for Instance Segmentation}},
  author    = {Bai, Min and Urtasun, Raquel},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2017},
  doi       = {10.1109/CVPR.2017.305},
  url       = {https://mlanthology.org/cvpr/2017/bai2017cvpr-deep/}
}