3D Visual Phrases for Landmark Recognition

Abstract

In this paper, we study the problem of landmark recognition and propose to leverage 3D visual phrases to improve the performance. A 3D visual phrase is a triangular facet on the surface of a reconstructed 3D landmark model. In contrast to existing 2D visual phrases which are mainly based on co-occurrence statistics in 2D image planes, such 3D visual phrases explicitly characterize the spatial structure of a 3D object (landmark), and are highly robust to projective transformations due to viewpoint changes. We present an effective solution to discover, describe, and detect 3D visual phrases. The experiments on 10 landmarks have achieved promising results, which demonstrate that our approach provides a good balance between precision and recall of landmark recognition while reducing the dependence on post-verification to reject false positives.

Cite

Text

Hao et al. "3D Visual Phrases for Landmark Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012. doi:10.1109/CVPR.2012.6248104

Markdown

[Hao et al. "3D Visual Phrases for Landmark Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012.](https://mlanthology.org/cvpr/2012/hao2012cvpr-d/) doi:10.1109/CVPR.2012.6248104

BibTeX

@inproceedings{hao2012cvpr-d,
  title     = {{3D Visual Phrases for Landmark Recognition}},
  author    = {Hao, Qiang and Cai, Rui and Li, Zhiwei and Zhang, Lei and Pang, Yanwei and Wu, Feng},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2012},
  pages     = {3594-3601},
  doi       = {10.1109/CVPR.2012.6248104},
  url       = {https://mlanthology.org/cvpr/2012/hao2012cvpr-d/}
}