Appearance Consensus Driven Self-Supervised Human Mesh Recovery

Abstract

We present a self-supervised human mesh recovery framework to infer human pose and shape from monocular images in the absence of any paired supervision. Recent advances have shifted the interest towards directly regressing parameters of a parametric human model by supervising them on large-scale, images with 2D landmark annotations. This limits the generalizability of such approaches to operate on samples from unlabeled wild environments. Acknowledging this we propose a novel appearance consensus driven self-supervised objective. To effectively disentangle the foreground (FG) human we rely on image pairs depicting the same person (consistent FG) in varied pose and background (BG) which are obtained from unlabeled wild videos. The proposed FG appearance consistency objective makes use of a novel, differentiable extit{Color-recovery} module to obtain vertex colors without involving any trainable appearance extraction network; via efficient realization of color-picking and reflectional symmetry. We achieve state-of-the-art results on the standard model-based 3D pose estimation benchmarks at comparable supervision levels. Furthermore, the resulting colored mesh prediction opens up usage of our framework for a variety of appearance-related tasks beyond pose and shape estimation, thus establishing our superior generalizability.

Cite

Text

Kundu et al. "Appearance Consensus Driven Self-Supervised Human Mesh Recovery." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58452-8_46

Markdown

[Kundu et al. "Appearance Consensus Driven Self-Supervised Human Mesh Recovery." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/kundu2020eccv-appearance/) doi:10.1007/978-3-030-58452-8_46

BibTeX

@inproceedings{kundu2020eccv-appearance,
  title     = {{Appearance Consensus Driven Self-Supervised Human Mesh Recovery}},
  author    = {Kundu, Jogendra Nath and Rakesh, Mugalodi and Jampani, Varun and Venkatesh, Rahul Mysore and Babu, R. Venkatesh},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2020},
  doi       = {10.1007/978-3-030-58452-8_46},
  url       = {https://mlanthology.org/eccv/2020/kundu2020eccv-appearance/}
}