Recurrent Homography Estimation Using Homography-Guided Image Warping and Focus Transformer
Abstract
We propose the Recurrent homography estimation framework using Homography-guided image Warping and Focus transformer (FocusFormer), named RHWF. Both being appropriately absorbed into the recurrent framework, the homography-guided image warping progressively enhances the feature consistency and the attention-focusing mechanism in FocusFormer aggregates the intra-inter correspondence in a global->nonlocal->local manner. Thanks to the above strategies, RHWF ranks top in accuracy on a variety of datasets, including the challenging cross-resolution and cross-modal ones. Meanwhile, benefiting from the recurrent framework, RHWF achieves parameter efficiency despite the transformer architecture. Compared to previous state-of-the-art approaches LocalTrans and IHN, RHWF reduces the mean average corner error (MACE) by about 70% and 38.1% on the MSCOCO dataset, while saving the parameter costs by 86.5% and 24.6%. Similar to the previous works, RHWF can also be arranged in 1-scale for efficiency and 2-scale for accuracy, with the 1-scale RHWF already outperforming most of the previous methods. Source code is available at https://github.com/imdumpl78/RHWF.
Cite
Text
Cao et al. "Recurrent Homography Estimation Using Homography-Guided Image Warping and Focus Transformer." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.00948Markdown
[Cao et al. "Recurrent Homography Estimation Using Homography-Guided Image Warping and Focus Transformer." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/cao2023cvpr-recurrent/) doi:10.1109/CVPR52729.2023.00948BibTeX
@inproceedings{cao2023cvpr-recurrent,
title = {{Recurrent Homography Estimation Using Homography-Guided Image Warping and Focus Transformer}},
author = {Cao, Si-Yuan and Zhang, Runmin and Luo, Lun and Yu, Beinan and Sheng, Zehua and Li, Junwei and Shen, Hui-Liang},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2023},
pages = {9833-9842},
doi = {10.1109/CVPR52729.2023.00948},
url = {https://mlanthology.org/cvpr/2023/cao2023cvpr-recurrent/}
}