Interactive Labeling for Human Pose Estimation in Surveillance Videos
Abstract
Automatically detecting and estimating the movement of persons in real-world uncooperative scenarios is very challenging in great part due to limited and unreliably annotated data. For instance annotating a single human body pose for activity recognition requires 40-60 seconds in complex sequences, leading to long-winded and costly annotation processes. Therefore increasing the sizes of annotated datasets through crowdsourcing or automated annotation is often used at a great financial costs, without reliable validation processes and inadequate annotation tools greatly impacting the annotation quality. In this work we combine multiple techniques into a single web-based general-purpose annotation application. Pre-trained machine learning models enable annotators to interactively detect pedestrians, re-identify them throughout the sequence, estimate their poses, and correct annotation suggestions in the same interface. Annotations are then inter- and extrapolated between frames. The application is evaluated through several user studies and the results are extensively analyzed. Experiments demonstrate a 55% reduction in annotation time for less complex scenarios while simultaneously decreasing perceived annotator workload.
Cite
Text
Cormier et al. "Interactive Labeling for Human Pose Estimation in Surveillance Videos." IEEE/CVF International Conference on Computer Vision Workshops, 2021. doi:10.1109/ICCVW54120.2021.00190Markdown
[Cormier et al. "Interactive Labeling for Human Pose Estimation in Surveillance Videos." IEEE/CVF International Conference on Computer Vision Workshops, 2021.](https://mlanthology.org/iccvw/2021/cormier2021iccvw-interactive/) doi:10.1109/ICCVW54120.2021.00190BibTeX
@inproceedings{cormier2021iccvw-interactive,
title = {{Interactive Labeling for Human Pose Estimation in Surveillance Videos}},
author = {Cormier, Mickael and Röpke, Fabian and Golda, Thomas and Beyerer, Jürgen},
booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
year = {2021},
pages = {1649-1658},
doi = {10.1109/ICCVW54120.2021.00190},
url = {https://mlanthology.org/iccvw/2021/cormier2021iccvw-interactive/}
}