End-to-End Multi-Task Learning with Attention

Abstract

We propose a novel multi-task learning architecture, which allows learning of task-specific feature-level attention. Our design, the Multi-Task Attention Network (MTAN), consists of a single shared network containing a global feature pool, together with a soft-attention module for each task. These modules allow for learning of task-specific features from the global features, whilst simultaneously allowing for features to be shared across different tasks. The architecture can be trained end-to-end and can be built upon any feed-forward neural network, is simple to implement, and is parameter efficient. We evaluate our approach on a variety of datasets, across both image-to-image predictions and image classification tasks. We show that our architecture is state-of-the-art in multi-task learning compared to existing methods, and is also less sensitive to various weighting schemes in the multi-task loss function. Code is available at https://github.com/lorenmt/mtan.

Cite

Text

Liu et al. "End-to-End Multi-Task Learning with Attention." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00197

Markdown

[Liu et al. "End-to-End Multi-Task Learning with Attention." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/liu2019cvpr-endtoend/) doi:10.1109/CVPR.2019.00197

BibTeX

@inproceedings{liu2019cvpr-endtoend,
  title     = {{End-to-End Multi-Task Learning with Attention}},
  author    = {Liu, Shikun and Johns, Edward and Davison, Andrew J.},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2019},
  doi       = {10.1109/CVPR.2019.00197},
  url       = {https://mlanthology.org/cvpr/2019/liu2019cvpr-endtoend/}
}