Matching-CNN Meets KNN: Quasi-Parametric Human Parsing

Liu, Si; Liang, Xiaodan; Liu, Luoqi; Shen, Xiaohui; Yang, Jianchao; Xu, Changsheng; Lin, Liang; Cao, Xiaochun; Yan, Shuicheng

doi:10.1109/CVPR.2015.7298748

Matching-CNN Meets KNN: Quasi-Parametric Human Parsing

Si Liu, Xiaodan Liang, Luoqi Liu, Xiaohui Shen, Jianchao Yang, Changsheng Xu, Liang Lin, Xiaochun Cao, Shuicheng Yan

CVPR 2015

doi:10.1109/CVPR.2015.7298748 /cvpr/2015/liu2015cvpr-matchingcnn/

Abstract

Both parametric and non-parametric approaches have demonstrated encouraging performances in the human parsing task, namely segmenting a human image into several semantic regions (e.g., hat, bag, left arm, face). In this work, we aim to develop a new solution with the advantages of both methodologies, namely supervision from annotated data and the flexibility to use newly annotated (possibly uncommon) images, and present a quasi-parametric human parsing model. Under the classic KNN-based nonparametric framework, the parametric Matching Convolutional Neural Network (M-CNN) is proposed to predict the matching confidence and displacement of the best matched region in the testing image for a particular semantic region in one KNN image. Given a testing image, we first retrieve its KNN images from the annotated/manually-parsed human image corpus. Then each semantic region in each KNN image is matched with confidence to the testing image using M-CNN, and the matched regions from all KNN images are further fused, followed by a superpixel smoothing procedure to obtain the ultimate human parsing result. The M-CNN differs from the classic CNN in that the tailored cross image matching filters are introduced to characterize the matching between the testing image and the semantic region of a KNN image. The cross image matching filters are defined at different convolution layers, each aiming to capture a particular range of displacements. Comprehensive evaluations over a large dataset with 7,700 annotated human images well demonstrate the significant performance gain from the quasi-parametric model over the state-of-the-arts, for the human parsing task.

PDF CVPR Semantic Scholar

Cite

Text

Liu et al. "Matching-CNN Meets KNN: Quasi-Parametric Human Parsing." Conference on Computer Vision and Pattern Recognition, 2015. doi:10.1109/CVPR.2015.7298748

Markdown

[Liu et al. "Matching-CNN Meets KNN: Quasi-Parametric Human Parsing." Conference on Computer Vision and Pattern Recognition, 2015.](https://mlanthology.org/cvpr/2015/liu2015cvpr-matchingcnn/) doi:10.1109/CVPR.2015.7298748

BibTeX

@inproceedings{liu2015cvpr-matchingcnn,
  title     = {{Matching-CNN Meets KNN: Quasi-Parametric Human Parsing}},
  author    = {Liu, Si and Liang, Xiaodan and Liu, Luoqi and Shen, Xiaohui and Yang, Jianchao and Xu, Changsheng and Lin, Liang and Cao, Xiaochun and Yan, Shuicheng},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2015},
  doi       = {10.1109/CVPR.2015.7298748},
  url       = {https://mlanthology.org/cvpr/2015/liu2015cvpr-matchingcnn/}
}