An Improved Model for Segmentation and Recognition of Fine-Grained Activities with Application to Surgical Training Tasks
Abstract
Automated segmentation and recognition of fine-grained activities is important for enabling new applications in industrial automation, human-robot collaboration, and surgical training. Many existing approaches to activity recognition assume that a video has already been segmented and perform classification using an abstract representation based on spatio-temporal features. While some approaches perform joint activity segmentation and recognition, they typically suffer from a poor modeling of the transitions between actions and a representation that does not incorporate contextual information about the scene. In this paper, we propose a model for action segmentation and recognition that improves upon existing work in two directions. First, we develop a variation of the Skip-Chain Conditional Random Field that captures long-range state transitions between actions by using higher-order temporal relationships. Second, we argue that in constrained environments, where the relevant set of objects is known, it is better to develop features using high-level object relationships that have semantic meaning instead of relying on abstract features. We apply our approach to a set of tasks common for training in robotic surgery: suturing, knot tying, and needle passing, and show that our method increases micro and macro accuracy by 18.46% and 44.13% relative to the state of the art on a widely used robotic surgery dataset.
Cite
Text
Lea et al. "An Improved Model for Segmentation and Recognition of Fine-Grained Activities with Application to Surgical Training Tasks." IEEE/CVF Winter Conference on Applications of Computer Vision, 2015. doi:10.1109/WACV.2015.154Markdown
[Lea et al. "An Improved Model for Segmentation and Recognition of Fine-Grained Activities with Application to Surgical Training Tasks." IEEE/CVF Winter Conference on Applications of Computer Vision, 2015.](https://mlanthology.org/wacv/2015/lea2015wacv-improved/) doi:10.1109/WACV.2015.154BibTeX
@inproceedings{lea2015wacv-improved,
title = {{An Improved Model for Segmentation and Recognition of Fine-Grained Activities with Application to Surgical Training Tasks}},
author = {Lea, Colin and Hager, Gregory D. and Vidal, René},
booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
year = {2015},
pages = {1123-1129},
doi = {10.1109/WACV.2015.154},
url = {https://mlanthology.org/wacv/2015/lea2015wacv-improved/}
}