Robust Object Recognition in RGB-D Egocentric Videos Based on Sparse Affine Hull Kernel

Abstract

In this paper, we propose a novel kernel function for recognizing objects in RGB-D egocentric videos. To exploit the varied object appearance in a video, we take a set-based recognition approach and represent the target object using the set of frames contained in the video. Our kernel function measures the similarity of two sets by the minimum distance between the sparse affine hulls of the two sets. It also allows convenient integration of heterogeneous data modalities beyond RGB and depth. We extensively evaluate the proposed method on three benchmark datasets, including two RGB-D object datasets and one thermal/visible face dataset. On all three datasets, the proposed method outperforms state-of-the-art methods.
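The set-to-set comparison described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it computes the minimum distance between the plain affine hulls of two frame-feature sets by least squares, leaves out the sparsity constraint named in the abstract, and turns the distance into a similarity with a Gaussian form, which is an assumption made here. The names `affine_hull_distance`, `hull_similarity`, and `gamma` are hypothetical.

```python
import numpy as np

def affine_hull_distance(X, Y):
    """Minimum Euclidean distance between the affine hulls of two sets.

    X has shape (n_x, d) and Y has shape (n_y, d); each row is one
    frame's feature vector. No sparsity penalty is applied (assumption:
    the paper's sparse variant would constrain the coefficients).
    """
    mu_x, mu_y = X.mean(axis=0), Y.mean(axis=0)
    # Columns are directions spanning each hull around its mean.
    Dx = (X - mu_x).T
    Dy = (Y - mu_y).T
    # A point on hull X is mu_x + Dx @ a, on hull Y is mu_y + Dy @ b.
    # Their difference is A @ [a; b] - rhs, so minimise its norm.
    A = np.hstack([Dx, -Dy])
    rhs = mu_y - mu_x
    coef, *_ = np.linalg.lstsq(A, rhs, rcond=None)
    return float(np.linalg.norm(A @ coef - rhs))

def hull_similarity(X, Y, gamma=1.0):
    """Gaussian-style similarity from the hull distance (assumed form)."""
    d = affine_hull_distance(X, Y)
    return float(np.exp(-gamma * d ** 2))

# Two parallel lines in 2-D, one unit apart.
X = np.array([[0.0, 0.0], [1.0, 0.0]])
Y = np.array([[0.0, 1.0], [2.0, 1.0]])
dist = affine_hull_distance(X, Y)
sim = hull_similarity(X, Y)
```

Because an affine hull extends beyond the observed frames, two sets whose hulls intersect get distance zero even when no individual frames coincide; restricting the hull with a sparsity constraint, as the abstract describes, is one way to limit that effect.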

Cite

Text

Wan and Aggarwal. "Robust Object Recognition in RGB-D Egocentric Videos Based on Sparse Affine Hull Kernel." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2015. doi:10.1109/CVPRW.2015.7301302

Markdown

[Wan and Aggarwal. "Robust Object Recognition in RGB-D Egocentric Videos Based on Sparse Affine Hull Kernel." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2015.](https://mlanthology.org/cvprw/2015/wan2015cvprw-robust/) doi:10.1109/CVPRW.2015.7301302

BibTeX

@inproceedings{wan2015cvprw-robust,
  title     = {{Robust Object Recognition in RGB-D Egocentric Videos Based on Sparse Affine Hull Kernel}},
  author    = {Wan, Shaohua and Aggarwal, J. K.},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2015},
  pages     = {97--104},
  doi       = {10.1109/CVPRW.2015.7301302},
  url       = {https://mlanthology.org/cvprw/2015/wan2015cvprw-robust/}
}