Multi-Object Discovery by Low-Dimensional Object Motion

Abstract

Recent work in unsupervised multi-object segmentation shows impressive results by predicting motion from a single image despite the inherent ambiguity in predicting motion without the next image. On the other hand, the set of possible motions for an image can be constrained to a low-dimensional space by considering the scene structure and moving objects in it. We propose to model pixel-wise geometry and object motion to remove ambiguity in reconstructing flow from a single image. Specifically, we divide the image into coherently moving regions and use depth to construct flow bases that best explain the observed flow in each region. We achieve state-of-the-art results in unsupervised multi-object segmentation on synthetic and real-world datasets by modeling the scene structure and object motion. Our evaluation of the predicted depth maps shows reliable performance in monocular depth estimation.

Cite

Text

Safadoust and Güney. "Multi-Object Discovery by Low-Dimensional Object Motion." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.00074

Markdown

[Safadoust and Güney. "Multi-Object Discovery by Low-Dimensional Object Motion." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/safadoust2023iccv-multiobject/) doi:10.1109/ICCV51070.2023.00074

BibTeX

@inproceedings{safadoust2023iccv-multiobject,
  title     = {{Multi-Object Discovery by Low-Dimensional Object Motion}},
  author    = {Safadoust, Sadra and Güney, Fatma},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {734-744},
  doi       = {10.1109/ICCV51070.2023.00074},
  url       = {https://mlanthology.org/iccv/2023/safadoust2023iccv-multiobject/}
}