Continual Learning for Image-Based Camera Localization
Abstract
For several emerging technologies such as augmented reality, autonomous driving and robotics, visual localization is a critical component. Directly regressing camera pose/3D scene coordinates from the input image using deep neural networks has shown great potential. However, such methods assume a stationary data distribution with all scenes simultaneously available during training. In this paper, we approach the problem of visual localization in a continual learning setup -- whereby the model is trained on scenes in an incremental manner. Our results show that similar to the classification domain, non-stationary data induces catastrophic forgetting in deep networks for visual localization. To address this issue, a strong baseline based on storing and replaying images from a fixed buffer is proposed. Furthermore, we propose a new sampling method based on coverage score (Buff-CS) that adapts the existing sampling strategies in the buffering process to the problem of visual localization. Results demonstrate consistent improvements over standard buffering methods on two challenging datasets -- 7Scenes, 12Scenes, and also 19Scenes by combining the former scenes.
Cite
Text
Wang et al. "Continual Learning for Image-Based Camera Localization." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.00324Markdown
[Wang et al. "Continual Learning for Image-Based Camera Localization." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/wang2021iccv-continual/) doi:10.1109/ICCV48922.2021.00324BibTeX
@inproceedings{wang2021iccv-continual,
title = {{Continual Learning for Image-Based Camera Localization}},
author = {Wang, Shuzhe and Laskar, Zakaria and Melekhov, Iaroslav and Li, Xiaotian and Kannala, Juho},
booktitle = {International Conference on Computer Vision},
year = {2021},
pages = {3252-3262},
doi = {10.1109/ICCV48922.2021.00324},
url = {https://mlanthology.org/iccv/2021/wang2021iccv-continual/}
}