Principled Fusion of High-Level Model and Low-Level Cues for Motion Segmentation
Abstract
High-level generative models provide elegant descriptions of videos and are commonly used as the inference framework in many unsupervised motion segmentation schemes. However, approximate inference in these models often require ad-hoc initialization to avoid local minima issues. Low-level cues, obtained independently from the high-level model, can constrain the search space and reduce the chance of inference algorithms falling into a local minima. This paper introduces a novel principled fusion framework where, local hierarchical superpixels segmentation of images are used to capture local motion. The low-level cues such as local motion, on their own, not adequate to obtain full motion segmentation as occlusion needs to be handled globally. We fuse the low-level motion cues with the high-level model in a principled manner to surmount the shortcomings of using only the high-level model or low-level cues to perform motion segmentation. The fused model contains both continuous and discrete variables which forms a number of Markov Random fields. Variational approximation or belief propagation algorithms cannot be applied due to the complex interactions between the variables. Hence, approximate inference is performed using expectation propagation (EP) algorithm. The scheme is demonstrated by performing motion segmentation in two video sequences.
Cite
Text
Thayananthan et al. "Principled Fusion of High-Level Model and Low-Level Cues for Motion Segmentation." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008. doi:10.1109/CVPR.2008.4587438Markdown
[Thayananthan et al. "Principled Fusion of High-Level Model and Low-Level Cues for Motion Segmentation." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008.](https://mlanthology.org/cvpr/2008/thayananthan2008cvpr-principled/) doi:10.1109/CVPR.2008.4587438BibTeX
@inproceedings{thayananthan2008cvpr-principled,
title = {{Principled Fusion of High-Level Model and Low-Level Cues for Motion Segmentation}},
author = {Thayananthan, Arasanathan and Iwasaki, Masahiro and Cipolla, Roberto},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2008},
doi = {10.1109/CVPR.2008.4587438},
url = {https://mlanthology.org/cvpr/2008/thayananthan2008cvpr-principled/}
}