Multi-Object Representation Learning via Feature Connectivity and Object-Centric Regularization
Abstract
Discovering object-centric representations from images has the potential to greatly improve the robustness, sample efficiency and interpretability of machine learning algorithms. Current works on multi-object images typically follow a generative approach that optimizes for input reconstruction and fail to scale to real-world datasets despite significant increases in model capacity. We address this limitation by proposing a novel method that leverages feature connectivity to cluster neighboring pixels likely to belong to the same object. We further design two object-centric regularization terms to refine object representations in the latent space, enabling our approach to scale to complex real-world images. Experimental results on simulated, real-world, complex texture and common object images demonstrate a substantial improvement in the quality of discovered objects compared to state-of-the-art methods, as well as the sample efficiency and generalizability of our approach. We also show that the discovered object-centric representations can accurately predict key object properties in downstream tasks, highlighting the potential of our method to advance the field of multi-object representation learning.
Cite
Text
Foo et al. "Multi-Object Representation Learning via Feature Connectivity and Object-Centric Regularization." Neural Information Processing Systems, 2023.Markdown
[Foo et al. "Multi-Object Representation Learning via Feature Connectivity and Object-Centric Regularization." Neural Information Processing Systems, 2023.](https://mlanthology.org/neurips/2023/foo2023neurips-multiobject/)BibTeX
@inproceedings{foo2023neurips-multiobject,
title = {{Multi-Object Representation Learning via Feature Connectivity and Object-Centric Regularization}},
author = {Foo, Alex and Hsu, Wynne and Lee, Mong Li},
booktitle = {Neural Information Processing Systems},
year = {2023},
url = {https://mlanthology.org/neurips/2023/foo2023neurips-multiobject/}
}