Pose Pooling Kernels for Sub-Category Recognition

Abstract

The ability to normalize pose based on super-category landmarks can significantly improve models of individual categories when training data are limited. Previous methods have considered the use of volumetric or morphable models for faces and for certain classes of articulated objects. We consider methods which impose fewer representational assumptions on categories of interest, and exploit contemporary detection schemes which consider the ensemble of responses of detectors trained for specific pose-keypoint configurations. We develop representations for poselet-based pose normalization using both explicit warping and implicit pooling as mechanisms. Our method defines a pose normalized similarity or kernel function that is suitable for nearest-neighbor or kernel-based learning methods.

Cite

Text

Zhang et al. "Pose Pooling Kernels for Sub-Category Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012. doi:10.1109/CVPR.2012.6248364

Markdown

[Zhang et al. "Pose Pooling Kernels for Sub-Category Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012.](https://mlanthology.org/cvpr/2012/zhang2012cvpr-pose/) doi:10.1109/CVPR.2012.6248364

BibTeX

@inproceedings{zhang2012cvpr-pose,
  title     = {{Pose Pooling Kernels for Sub-Category Recognition}},
  author    = {Zhang, Ning and Farrell, Ryan and Darrell, Trevor},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2012},
  pages     = {3665-3672},
  doi       = {10.1109/CVPR.2012.6248364},
  url       = {https://mlanthology.org/cvpr/2012/zhang2012cvpr-pose/}
}