Unsupervised Learning of Translation Invariant Occlusive Components

Abstract

We study unsupervised learning of occluding objects in images of visual scenes. The derived learning algorithm is based on a probabilistic generative model which parameterizes object shapes, object features and the background. No assumptions are made for the object orders in depth or the objects' planar positions. Parameter optimization is thus subject to the large combinatorics of depth orders and positions. Previous approaches constrained this combinatorics but were still only able to learn a very small number of objects. By applying a novel variational EM approach, we show that even without constraints on the object combinatorics, a relatively large number of objects can be learned. In different numerical experiments, our unsupervised approach extracts explicit object representations with object masks and object features closely aligned with the true objects in the scenes. We investigate the robustness of the approach and the use of the learned representations for inference. Furthermore, we demonstrate generality of the approach by applying it to grayscale images, color-vector images, and Gabor-vector images as well as to motion trajectory data for which the extracted components correspond to motion primitives.

Cite

Text

Dai and Lücke. "Unsupervised Learning of Translation Invariant Occlusive Components." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012. doi:10.1109/CVPR.2012.6247953

Markdown

[Dai and Lücke. "Unsupervised Learning of Translation Invariant Occlusive Components." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012.](https://mlanthology.org/cvpr/2012/dai2012cvpr-unsupervised/) doi:10.1109/CVPR.2012.6247953

BibTeX

@inproceedings{dai2012cvpr-unsupervised,
  title     = {{Unsupervised Learning of Translation Invariant Occlusive Components}},
  author    = {Dai, Zhenwen and Lücke, Jörg},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2012},
  pages     = {2400-2407},
  doi       = {10.1109/CVPR.2012.6247953},
  url       = {https://mlanthology.org/cvpr/2012/dai2012cvpr-unsupervised/}
}