Top-Down Learning for Structured Labeling with Convolutional Pseudoprior

Abstract

Current practice in convolutional neural networks (CNN) remains largely bottom-up and the role of top-down process in CNN for pattern analysis and visual inference is not very clear. In this paper, we propose a new method for structured labeling by developing convolutional pseudo-prior (ConvPP) on the ground-truth labels. Our method has several interesting properties: (1) compared with classical machine learning algorithms like CRFs and Structural SVM, ConvPP automatically learns rich convolutional kernels to capture both short- and long- range contexts; (2) compared with cascade classifiers like Auto-Context, ConvPP avoids the iterative steps of learning a series of discriminative classifiers and automatically learns contextual configurations; (3) compared with recent efforts combing CNN models with CRFs and RNNs, ConvPP learns convolution in the labeling space with much improved modeling capability and less manual specification; (4) compared with Bayesian models like MRFs, ConvPP capitalizes on the rich representation power of convolution by automatically learning priors built on convolutional filters. We accomplish our task using pseudo-likelihood approximation to the prior under a novel fixed-point network structure that facilitates an end-to-end learning process. We show state-of-the-art results on sequential labeling and image labeling benchmarks.

Cite

Text

Xie et al. "Top-Down Learning for Structured Labeling with Convolutional Pseudoprior." European Conference on Computer Vision, 2016. doi:10.1007/978-3-319-46493-0_19

Markdown

[Xie et al. "Top-Down Learning for Structured Labeling with Convolutional Pseudoprior." European Conference on Computer Vision, 2016.](https://mlanthology.org/eccv/2016/xie2016eccv-top/) doi:10.1007/978-3-319-46493-0_19

BibTeX

@inproceedings{xie2016eccv-top,
  title     = {{Top-Down Learning for Structured Labeling with Convolutional Pseudoprior}},
  author    = {Xie, Saining and Huang, Xun and Tu, Zhuowen},
  booktitle = {European Conference on Computer Vision},
  year      = {2016},
  pages     = {302-317},
  doi       = {10.1007/978-3-319-46493-0_19},
  url       = {https://mlanthology.org/eccv/2016/xie2016eccv-top/}
}