Adversarial Networks for Camera Pose Regression and Refinement

Abstract

Despite recent advances on the topic of direct camera pose regression using neural networks, accurately estimating the camera pose of a single RGB image still remains a challenging task. To address this problem, we introduce a novel framework based, in its core, on the idea of implicitly learning the joint distribution of RGB images and their corresponding camera poses using a discriminator network and adversarial learning. Our method allows not only to regress the camera pose from a single image, however, also offers a solely RGB-based solution for camera pose refinement using the discriminator network. Further, we show that our method can effectively be used to optimize the predicted camera poses and thus improve the localization accuracy. To this end, we validate our proposed method on the publicly available 7-Scenes dataset improving upon the results of direct camera pose regression methods.

Cite

Text

Bui et al. "Adversarial Networks for Camera Pose Regression and Refinement." IEEE/CVF International Conference on Computer Vision Workshops, 2019. doi:10.1109/ICCVW.2019.00470

Markdown

[Bui et al. "Adversarial Networks for Camera Pose Regression and Refinement." IEEE/CVF International Conference on Computer Vision Workshops, 2019.](https://mlanthology.org/iccvw/2019/bui2019iccvw-adversarial/) doi:10.1109/ICCVW.2019.00470

BibTeX

@inproceedings{bui2019iccvw-adversarial,
  title     = {{Adversarial Networks for Camera Pose Regression and Refinement}},
  author    = {Bui, Mai and Baur, Christoph and Navab, Nassir and Ilic, Slobodan and Albarqouni, Shadi},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2019},
  pages     = {3778-3787},
  doi       = {10.1109/ICCVW.2019.00470},
  url       = {https://mlanthology.org/iccvw/2019/bui2019iccvw-adversarial/}
}