Semi-Supervised Learning of Feature Hierarchies for Object Detection in a Video
Abstract
We propose a novel approach to boost the performance of generic object detectors on videos by learning video-specific features using a deep neural network. The insight behind our proposed approach is that an object appearing in different frames of a video clip should share similar features, which can be learned to build better detectors. Unlike many supervised detector adaptation or detection-by-tracking methods, our method does not require any extra annotations or utilize temporal correspondence. We start with the high-confidence detections from a generic detector, then iteratively learn new video-specific features and refine the detection scores. In order to learn discriminative and compact features, we propose a new feature learning method using a deep neural network based on auto-encoders. It differs from the existing unsupervised feature learning methods in two ways: first, it optimizes both discriminative and generative properties of the features simultaneously, which gives our features better discriminative ability; second, our learned features are more compact, while the unsupervised feature learning methods usually learn a redundant set of over-complete features. Extensive experimental results on person and horse detection show that significant performance improvement can be achieved with our proposed method.
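The abstract does not state the training objective, so the following is only a minimal sketch of what "optimizing discriminative and generative properties simultaneously" could look like for a single auto-encoder layer. The weighted-sum form, the tied decoder weights, the logistic classifier on the hidden code, and all names (`joint_loss`, `lam`, `W`, `v`) are assumptions made for illustration, not the authors' formulation. The label `y` is assumed to come from a high-confidence detection of the generic detector (1 for object, 0 for background).

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def encode(x, W, b):
    # Hidden code h = sigmoid(W x + b). Using fewer hidden units than
    # inputs yields a compact (not over-complete) code.
    return [sigmoid(sum(w * xj for w, xj in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

def decode(h, W, c):
    # Assumed tied weights: reconstruction = W^T h + c.
    n_in = len(W[0])
    return [sum(W[i][j] * h[i] for i in range(len(h))) + c[j]
            for j in range(n_in)]

def joint_loss(x, y, W, b, c, v, v0, lam=1.0):
    """Hypothetical joint objective: reconstruction + lam * classification."""
    h = encode(x, W, b)
    x_hat = decode(h, W, c)
    # Generative term: squared reconstruction error.
    recon = sum((a - r) ** 2 for a, r in zip(x, x_hat))
    # Discriminative term: logistic loss of a linear classifier on h.
    p = sigmoid(sum(vi * hi for vi, hi in zip(v, h)) + v0)
    eps = 1e-12  # guards against log(0)
    disc = -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))
    return recon + lam * disc
```

Setting `lam=0` in this sketch recovers a purely unsupervised auto-encoder loss, which is one way to see how the discriminative term changes what the features are trained for.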
Cite
Text
Yang et al. "Semi-Supervised Learning of Feature Hierarchies for Object Detection in a Video." Conference on Computer Vision and Pattern Recognition, 2013. doi:10.1109/CVPR.2013.216
Markdown
[Yang et al. "Semi-Supervised Learning of Feature Hierarchies for Object Detection in a Video." Conference on Computer Vision and Pattern Recognition, 2013.](https://mlanthology.org/cvpr/2013/yang2013cvpr-semisupervised/) doi:10.1109/CVPR.2013.216
BibTeX
@inproceedings{yang2013cvpr-semisupervised,
title = {{Semi-Supervised Learning of Feature Hierarchies for Object Detection in a Video}},
author = {Yang, Yang and Shu, Guang and Shah, Mubarak},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2013},
doi = {10.1109/CVPR.2013.216},
url = {https://mlanthology.org/cvpr/2013/yang2013cvpr-semisupervised/}
}