CLKN: Cascaded Lucas-Kanade Networks for Image Alignment

Abstract

This paper proposes a data-driven approach for image alignment. Our main contribution is a novel network architecture that combines the strengths of convolutional neural networks (CNNs) and the Lucas-Kanade algorithm. The main component of this architecture is a Lucas-Kanade layer that performs the inverse compositional algorithm on convolutional feature maps. To train our network, we develop a cascaded feature learning method that incorporates the coarse-to-fine strategy into the training process. This method learns a pyramid representation of convolutional features in a cascaded manner and yields a cascaded network that performs coarse-to-fine alignment on the feature pyramids. We apply our model to the task of homography estimation, and perform training and evaluation on a large labeled dataset generated from the MS-COCO dataset. Experimental results show that the proposed approach significantly outperforms the other methods.

Cite

Text

Chang et al. "CLKN: Cascaded Lucas-Kanade Networks for Image Alignment." Conference on Computer Vision and Pattern Recognition, 2017. doi:10.1109/CVPR.2017.402

Markdown

[Chang et al. "CLKN: Cascaded Lucas-Kanade Networks for Image Alignment." Conference on Computer Vision and Pattern Recognition, 2017.](https://mlanthology.org/cvpr/2017/chang2017cvpr-clkn/) doi:10.1109/CVPR.2017.402

BibTeX

@inproceedings{chang2017cvpr-clkn,
  title     = {{CLKN: Cascaded Lucas-Kanade Networks for Image Alignment}},
  author    = {Chang, Che-Han and Chou, Chun-Nan and Chang, Edward Y.},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2017},
  doi       = {10.1109/CVPR.2017.402},
  url       = {https://mlanthology.org/cvpr/2017/chang2017cvpr-clkn/}
}