An Improved Model for Segmentation and Recognition of Fine-Grained Activities with Application to Surgical Training Tasks

Lea, Colin; Hager, Gregory D.; Vidal, René

doi:10.1109/WACV.2015.154

WACV 2015 pp. 1123-1129

Abstract

Automated segmentation and recognition of fine-grained activities are important for enabling new applications in industrial automation, human-robot collaboration, and surgical training. Many existing approaches to activity recognition assume that a video has already been segmented and perform classification using an abstract representation based on spatio-temporal features. While some approaches perform joint activity segmentation and recognition, they typically suffer from poor modeling of the transitions between actions and from a representation that does not incorporate contextual information about the scene. In this paper, we propose a model for action segmentation and recognition that improves upon existing work in two directions. First, we develop a variation of the Skip-Chain Conditional Random Field that captures long-range state transitions between actions by using higher-order temporal relationships. Second, we argue that in constrained environments, where the relevant set of objects is known, it is better to develop features using high-level object relationships that have semantic meaning instead of relying on abstract features. We apply our approach to three tasks commonly used for training in robotic surgery (suturing, knot tying, and needle passing) and show that our method increases micro and macro accuracy by 18.46% and 44.13%, respectively, relative to the state of the art on a widely used robotic surgery dataset.
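The abstract reports both micro and macro accuracy without defining them. The sketch below illustrates the distinction under the common convention, assumed here rather than taken from the paper, that micro accuracy is the fraction of all frames labeled correctly and macro accuracy is the unweighted mean of per-class accuracies. The function name `micro_macro_accuracy` and the action labels are hypothetical.

```python
from collections import defaultdict


def micro_macro_accuracy(y_true, y_pred):
    """Return (micro, macro) accuracy for two equal-length label sequences.

    Assumed convention (not confirmed by the source):
      micro = correct frames / all frames
      macro = mean over classes of (correct frames of class / frames of class)
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label sequences must be non-empty and equal in length")

    correct = defaultdict(int)  # per-class count of correctly labeled frames
    total = defaultdict(int)    # per-class count of ground-truth frames
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1

    micro = sum(correct.values()) / len(y_true)
    macro = sum(correct[c] / total[c] for c in total) / len(total)
    return micro, macro


# Hypothetical per-frame labels: a frequent action and a rare one.
labels_true = ["reach", "reach", "reach", "reach", "pull", "pull"]
labels_pred = ["reach", "reach", "reach", "reach", "reach", "pull"]
micro, macro = micro_macro_accuracy(labels_true, labels_pred)
```

Under this convention an error on a rare action lowers macro accuracy more than micro accuracy, which is one reason the two figures can differ widely when action classes are imbalanced.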

Cite

Text

Lea et al. "An Improved Model for Segmentation and Recognition of Fine-Grained Activities with Application to Surgical Training Tasks." IEEE/CVF Winter Conference on Applications of Computer Vision, 2015. doi:10.1109/WACV.2015.154

Markdown

[Lea et al. "An Improved Model for Segmentation and Recognition of Fine-Grained Activities with Application to Surgical Training Tasks." IEEE/CVF Winter Conference on Applications of Computer Vision, 2015.](https://mlanthology.org/wacv/2015/lea2015wacv-improved/) doi:10.1109/WACV.2015.154

BibTeX

@inproceedings{lea2015wacv-improved,
  title     = {{An Improved Model for Segmentation and Recognition of Fine-Grained Activities with Application to Surgical Training Tasks}},
  author    = {Lea, Colin and Hager, Gregory D. and Vidal, René},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2015},
  pages     = {1123-1129},
  doi       = {10.1109/WACV.2015.154},
  url       = {https://mlanthology.org/wacv/2015/lea2015wacv-improved/}
}