QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query

Abstract

We propose a sparse end-to-end multi-person pose regression framework, termed QueryPose, which can directly predict multi-person keypoint sequences from the input image. The existing end-to-end methods rely on dense representations to preserve the spatial detail and structure for precise keypoint localization. However, the dense paradigm introduces complex and redundant post-processes during inference. In our framework, each human instance is encoded by several learnable spatial-aware part-level queries associated with an instance-level query. First, we propose the Spatial Part Embedding Generation Module (SPEGM) that considers the local spatial attention mechanism to generate several spatial-sensitive part embeddings, which contain spatial details and structural information for enhancing the part-level queries. Second, we introduce the Selective Iteration Module (SIM) to adaptively update the sparse part-level queries via the generated spatial-sensitive part embeddings stage-by-stage. Based on the two proposed modules, the part-level queries are able to fully encode the spatial details and structural information for precise keypoint regression. With the bipartite matching, QueryPose avoids the hand-designed post-processes. Without bells and whistles, QueryPose surpasses the existing dense end-to-end methods with 73.6 AP on MS COCO mini-val set and 72.7 AP on CrowdPose test set. Code is available at https://github.com/buptxyb666/QueryPose.

Cite

Text

Xiao et al. "QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query." Neural Information Processing Systems, 2022.

Markdown

[Xiao et al. "QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/xiao2022neurips-querypose/)

BibTeX

@inproceedings{xiao2022neurips-querypose,
  title     = {{QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query}},
  author    = {Xiao, Yabo and Su, Kai and Wang, Xiaojuan and Yu, Dongdong and Jin, Lei and He, Mingshu and Yuan, Zehuan},
  booktitle = {Neural Information Processing Systems},
  year      = {2022},
  url       = {https://mlanthology.org/neurips/2022/xiao2022neurips-querypose/}
}