Object Discovery in Videos as Foreground Motion Clustering
Abstract
We consider the problem of providing dense segmentation masks for object discovery in videos. We formulate the object discovery problem as foreground motion clustering, where the goal is to cluster foreground pixels in videos into different objects. We introduce a novel pixel-trajectory recurrent neural network that learns feature embeddings of foreground pixel trajectories linked across time. By clustering the pixel trajectories using the learned feature embeddings, our method establishes correspondences between foreground object masks across video frames. To demonstrate the effectiveness of our framework for object discovery, we conduct experiments on commonly used datasets for motion segmentation, where we achieve state-of-the-art performance.
Cite
Text
Xie et al. "Object Discovery in Videos as Foreground Motion Clustering." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.01023Markdown
[Xie et al. "Object Discovery in Videos as Foreground Motion Clustering." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/xie2019cvpr-object/) doi:10.1109/CVPR.2019.01023BibTeX
@inproceedings{xie2019cvpr-object,
title = {{Object Discovery in Videos as Foreground Motion Clustering}},
author = {Xie, Christopher and Xiang, Yu and Harchaoui, Zaid and Fox, Dieter},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2019},
doi = {10.1109/CVPR.2019.01023},
url = {https://mlanthology.org/cvpr/2019/xie2019cvpr-object/}
}