Simultaneous Multi-Body Stereo and Segmentation

Abstract

This paper presents a novel multi-body multi-view stereo method that simultaneously recovers dense depth maps and performs segmentation from a monocular image sequence. Unlike traditional multi-view stereo approaches, which generally handle a single static scene or object, we show that depth estimation and segmentation can be jointly modeled and globally solved in an energy minimization framework for commonplace scenes containing multiple independently moving rigid objects. Our major contribution is a new multi-body stereo model that integrates color, geometry, and layer constraints for spatio-temporal depth recovery and automatic object segmentation. A two-pass optimization scheme is proposed to progressively update the estimates. Our method is applied to a variety of challenging examples.

Cite

Text

Zhang et al. "Simultaneous Multi-Body Stereo and Segmentation." IEEE/CVF International Conference on Computer Vision, 2011. doi:10.1109/ICCV.2011.6126322

Markdown

[Zhang et al. "Simultaneous Multi-Body Stereo and Segmentation." IEEE/CVF International Conference on Computer Vision, 2011.](https://mlanthology.org/iccv/2011/zhang2011iccv-simultaneous/) doi:10.1109/ICCV.2011.6126322

BibTeX

@inproceedings{zhang2011iccv-simultaneous,
  title     = {{Simultaneous Multi-Body Stereo and Segmentation}},
  author    = {Zhang, Guofeng and Jia, Jiaya and Bao, Hujun},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2011},
  pages     = {826--833},
  doi       = {10.1109/ICCV.2011.6126322},
  url       = {https://mlanthology.org/iccv/2011/zhang2011iccv-simultaneous/}
}