Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection

Chen, Jie; Li, Zhiheng; Luo, Jiebo; Xu, Chenliang

doi:10.1109/CVPR42600.2020.00992

CVPR 2020

Abstract

We address weakly-supervised video actor-action segmentation (VAAS), which extends general video object segmentation (VOS) to additionally consider the action labels of the actors. The most successful VOS methods synthesize a pool of pseudo-annotations (PAs) and then refine them iteratively. However, they face three challenges: how to select high-quality PAs from a massive pool, how to set an appropriate stopping condition for weakly-supervised training, and how to initialize PAs for VAAS. To overcome these challenges, we propose a general Weakly-Supervised framework with a Wise Selection of training samples and model evaluation criterion (WS^2). Instead of blindly trusting PAs of inconsistent quality, WS^2 employs a learning-based selection to pick effective PAs and a novel region integrity criterion as the stopping condition for weakly-supervised training. In addition, a 3D-Conv GCAM is devised to adapt to the VAAS task. Extensive experiments show that WS^2 achieves state-of-the-art performance on both weakly-supervised VOS and VAAS tasks and is on par with the best fully-supervised method on VAAS.

Cite

Text

Chen et al. "Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00992

Markdown

[Chen et al. "Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/chen2020cvpr-learning/) doi:10.1109/CVPR42600.2020.00992

BibTeX

@inproceedings{chen2020cvpr-learning,
  title     = {{Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection}},
  author    = {Chen, Jie and Li, Zhiheng and Luo, Jiebo and Xu, Chenliang},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00992},
  url       = {https://mlanthology.org/cvpr/2020/chen2020cvpr-learning/}
}