Topical Video Object Discovery from Key Frames by Modeling Word Co-Occurrence Prior
Abstract
A topical video object refers to an object that is frequently highlighted in a video. It could be, e.g., the product logo and the leading actor/actress in a TV commercial. We propose a topic model that incorporates a word co-occurrence prior for efficient discovery of topical video objects from a set of key frames. Previous work using topic models, such as Latent Dirichelet Allocation (LDA), for video object discovery often takes a bag-of-visual-words representation, which ignored important co-occurrence information among the local features. We show that such data driven co-occurrence information from bottom-up can conveniently be incorporated in LDA with a Gaussian Markov prior, which combines top down probabilistic topic modeling with bottom up priors in a unified model. Our experiments on challenging videos demonstrate that the proposed approach can discover different types of topical objects despite variations in scale, view-point, color and lighting changes, or even partial occlusions. The efficacy of the co-occurrence prior is clearly demonstrated when comparing with topic models without such priors.
Cite
Text
Zhao et al. "Topical Video Object Discovery from Key Frames by Modeling Word Co-Occurrence Prior." Conference on Computer Vision and Pattern Recognition, 2013. doi:10.1109/CVPR.2013.210Markdown
[Zhao et al. "Topical Video Object Discovery from Key Frames by Modeling Word Co-Occurrence Prior." Conference on Computer Vision and Pattern Recognition, 2013.](https://mlanthology.org/cvpr/2013/zhao2013cvpr-topical/) doi:10.1109/CVPR.2013.210BibTeX
@inproceedings{zhao2013cvpr-topical,
title = {{Topical Video Object Discovery from Key Frames by Modeling Word Co-Occurrence Prior}},
author = {Zhao, Gangqiang and Yuan, Junsong and Hua, Gang},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2013},
doi = {10.1109/CVPR.2013.210},
url = {https://mlanthology.org/cvpr/2013/zhao2013cvpr-topical/}
}