Scene-Dependent Intention Recognition for Task Communication with Reduced Human-Robot Interaction

Abstract

In order for assistive robots to collaborate effectively with humans, they must be endowed with the ability to perceive scenes and more importantly, recognize human intentions. These intentions are often inferred from observed physical actions and direct communication from fully-functional individuals. For individuals with reduced capabilities, it may be difficult or impossible to perform physical actions or easily communicate. Therefore, their intentions must be inferred differently. To this end, we propose an intention recognition framework that is appropriate for persons with limited physical capabilities. This framework determines and learns human intentions based on scene objects, the actions that can be performed on them, and past interaction history. It is based on a Markov model formulation entitled Object-Action Intention Networks, which constitute the crux of a computer vision-based human-robot collaborative system that reduces the necessary interactions for communicating tasks to a robot. Evaluations were conducted on multiple scenes comprised of multiple possible object categories and actions. We achieve approximately 81% reduction in interactions overall after learning, when compared to other intention recognition approaches.

Cite

Text

Duncan et al. "Scene-Dependent Intention Recognition for Task Communication with Reduced Human-Robot Interaction." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-16199-0_51

Markdown

[Duncan et al. "Scene-Dependent Intention Recognition for Task Communication with Reduced Human-Robot Interaction." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/duncan2014eccv-scene/) doi:10.1007/978-3-319-16199-0_51

BibTeX

@inproceedings{duncan2014eccv-scene,
  title     = {{Scene-Dependent Intention Recognition for Task Communication with Reduced Human-Robot Interaction}},
  author    = {Duncan, Kester and Sarkar, Sudeep and Alqasemi, Redwan and Dubey, Rajiv V.},
  booktitle = {European Conference on Computer Vision},
  year      = {2014},
  pages     = {730-745},
  doi       = {10.1007/978-3-319-16199-0_51},
  url       = {https://mlanthology.org/eccv/2014/duncan2014eccv-scene/}
}