How to Track Your Dragon: A Multi-Attentional Framework for Real-Time RGB-D 6-DOF Object Pose Tracking

Abstract

We present a novel multi-attentional convolutional architecture to tackle the problem of real-time RGB-D 6D object pose tracking of single, known objects. Such a problem poses multiple challenges originating both from the objects' nature and their interaction with their environment, which previous approaches have failed to fully address. The proposed framework encapsulates methods for background clutter and occlusion handling by integrating multiple parallel soft spatial attention modules into a multitask Convolutional Neural Network (CNN) architecture. Moreover, we consider the special geometrical properties of both the object's 3D model and the pose space, and we use a more sophisticated approach for data augmentation for training. The provided experimental results confirm the effectiveness of the proposed multi-attentional architecture, as it improves the State-of-the-Art (SoA) tracking performance by an average score of 40.5% for translation and 57.5% for rotation, when testing on the dataset presented in [1], the most complete dataset designed, up to date, for the problem of RGB-D object tracking.

Cite

Text

Marougkas et al. "How to Track Your Dragon: A Multi-Attentional Framework for Real-Time RGB-D 6-DOF Object Pose Tracking." European Conference on Computer Vision Workshops, 2020. doi:10.1007/978-3-030-66096-3_45

Markdown

[Marougkas et al. "How to Track Your Dragon: A Multi-Attentional Framework for Real-Time RGB-D 6-DOF Object Pose Tracking." European Conference on Computer Vision Workshops, 2020.](https://mlanthology.org/eccvw/2020/marougkas2020eccvw-track/) doi:10.1007/978-3-030-66096-3_45

BibTeX

@inproceedings{marougkas2020eccvw-track,
  title     = {{How to Track Your Dragon: A Multi-Attentional Framework for Real-Time RGB-D 6-DOF Object Pose Tracking}},
  author    = {Marougkas, Isidoros and Koutras, Petros and Kardaris, Nikolaos and Retsinas, Georgios and Chalvatzaki, Georgia and Maragos, Petros},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2020},
  pages     = {682-699},
  doi       = {10.1007/978-3-030-66096-3_45},
  url       = {https://mlanthology.org/eccvw/2020/marougkas2020eccvw-track/}
}