Dense and Low-Rank Gaussian CRFs Using Deep Embeddings

Abstract

In this work we introduce a structured prediction model that endows the Deep Gaussian Conditional Random Field (G-CRF) with a densely connected graph structure. We keep memory and computational complexity under control by expressing the pairwise interactions as inner products of low-dimensional, learnable embeddings. The G-CRF system matrix is therefore low-rank, allowing us to solve the resulting system in a few milliseconds on the GPU by using conjugate gradients. As in G-CRF, inference is exact, the unary and pairwise terms are jointly trained end-to-end by using analytic expressions for the gradients, while we also develop even faster, Potts-type variants of our embeddings. We show that the learned embeddings capture pixel-to-pixel affinities in a task-specific manner, while our approach achieves state of the art results on three challenging benchmarks, namely semantic segmentation, human part segmentation, and saliency estimation. Our implementation is fully GPU based, built on top of the Caffe library, and is available at https://github.com/siddharthachandra/gcrf-v2.0

Cite

Text

Chandra et al. "Dense and Low-Rank Gaussian CRFs Using Deep Embeddings." International Conference on Computer Vision, 2017. doi:10.1109/ICCV.2017.546

Markdown

[Chandra et al. "Dense and Low-Rank Gaussian CRFs Using Deep Embeddings." International Conference on Computer Vision, 2017.](https://mlanthology.org/iccv/2017/chandra2017iccv-dense/) doi:10.1109/ICCV.2017.546

BibTeX

@inproceedings{chandra2017iccv-dense,
  title     = {{Dense and Low-Rank Gaussian CRFs Using Deep Embeddings}},
  author    = {Chandra, Siddhartha and Usunier, Nicolas and Kokkinos, Iasonas},
  booktitle = {International Conference on Computer Vision},
  year      = {2017},
  doi       = {10.1109/ICCV.2017.546},
  url       = {https://mlanthology.org/iccv/2017/chandra2017iccv-dense/}
}