KFNet: Learning Temporal Camera Relocalization Using Kalman Filtering
Abstract
Temporal camera relocalization estimates the pose with respect to each video frame in sequence, as opposed to one-shot relocalization which focuses on a still image. Even though the time dependency has been taken into account, current temporal relocalization methods still generally underperform the state-of-the-art one-shot approaches in terms of accuracy. In this work, we improve the temporal relocalization method by using a network architecture that incorporates Kalman filtering (KFNet) for online camera relocalization. In particular, KFNet extends the scene coordinate regression problem to the time domain in order to recursively establish 2D and 3D correspondences for the pose determination. The network architecture design and the loss formulation are based on Kalman filtering in the context of Bayesian learning. Extensive experiments on multiple relocalization benchmarks demonstrate the high accuracy of KFNet at the top of both one-shot and temporal relocalization approaches.
Cite
Text
Zhou et al. "KFNet: Learning Temporal Camera Relocalization Using Kalman Filtering." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00497Markdown
[Zhou et al. "KFNet: Learning Temporal Camera Relocalization Using Kalman Filtering." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/zhou2020cvpr-kfnet/) doi:10.1109/CVPR42600.2020.00497BibTeX
@inproceedings{zhou2020cvpr-kfnet,
title = {{KFNet: Learning Temporal Camera Relocalization Using Kalman Filtering}},
author = {Zhou, Lei and Luo, Zixin and Shen, Tianwei and Zhang, Jiahui and Zhen, Mingmin and Yao, Yao and Fang, Tian and Quan, Long},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2020},
doi = {10.1109/CVPR42600.2020.00497},
url = {https://mlanthology.org/cvpr/2020/zhou2020cvpr-kfnet/}
}