Untangling Object-View Manifold for Multiview Recognition and Pose Estimation

Abstract

The problem of multi-view/view-invariant recognition remains one of the most fundamental challenges to the progress of the computer vision. In this paper we consider the problem of modeling the combined object-viewpoint manifold. The shape and appearance of an object in a given image is a function of its category, style within category, viewpoint, and several other factors. The visual manifold (in any chosen feature representation space) given all these variability collectively is very hard and even impossible to model. We propose an efficient computational framework that can untangle such a complex manifold, and achieve a model that separates a view-invariant category representation, from category-invariant pose representation. We outperform the state of the art in the three widely used multiview dataset, for both category recognition, and pose estimation.

Cite

Text

Bakry and Elgammal. "Untangling Object-View Manifold for Multiview Recognition and Pose Estimation." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-10593-2_29

Markdown

[Bakry and Elgammal. "Untangling Object-View Manifold for Multiview Recognition and Pose Estimation." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/bakry2014eccv-untangling/) doi:10.1007/978-3-319-10593-2_29

BibTeX

@inproceedings{bakry2014eccv-untangling,
  title     = {{Untangling Object-View Manifold for Multiview Recognition and Pose Estimation}},
  author    = {Bakry, Amr and Elgammal, Ahmed M.},
  booktitle = {European Conference on Computer Vision},
  year      = {2014},
  pages     = {434-449},
  doi       = {10.1007/978-3-319-10593-2_29},
  url       = {https://mlanthology.org/eccv/2014/bakry2014eccv-untangling/}
}