MILA: Multi-Task Learning from Videos via Efficient Inter-Frame Attention
Abstract
Prior work in multi-task learning has mainly focused on predictions on a single image. In this work, we present a new approach for multi-task learning from videos via efficient inter-frame local attention (MILA). Our approach contains a novel inter-frame attention module which allows learning of task-specific attention across frames. We embed the attention module in a "slow-fast" architecture, where the slow network runs on sparsely sampled keyframes and the fast shallow network runs on non-keyframes at a high frame rate. We also propose an effective adversarial learning strategy to encourage the slow and fast net-work to learn similar features to well align keyframes and non-keyframes. Our approach ensures low-latency multi-task learning while maintaining high quality predictions. MILA obatins competitive accuracy compared to state-of-the-art on two multi-task learning benchmarks while reducing the number of floating point operations (FLOPs) by up to 70%. In addition, our attention based feature propagation method (ILA) outperforms prior work in terms of task accuracy while also reducing up to 90% of FLOPs.
Cite
Text
Kim et al. "MILA: Multi-Task Learning from Videos via Efficient Inter-Frame Attention." IEEE/CVF International Conference on Computer Vision Workshops, 2021. doi:10.1109/ICCVW54120.2021.00251Markdown
[Kim et al. "MILA: Multi-Task Learning from Videos via Efficient Inter-Frame Attention." IEEE/CVF International Conference on Computer Vision Workshops, 2021.](https://mlanthology.org/iccvw/2021/kim2021iccvw-mila/) doi:10.1109/ICCVW54120.2021.00251BibTeX
@inproceedings{kim2021iccvw-mila,
title = {{MILA: Multi-Task Learning from Videos via Efficient Inter-Frame Attention}},
author = {Kim, Donghyun and Lan, Tian and Zou, Chuhang and Xu, Ning and Plummer, Bryan A. and Sclaroff, Stan and Eledath, Jayan and Medioni, Gérard G.},
booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
year = {2021},
pages = {2219-2229},
doi = {10.1109/ICCVW54120.2021.00251},
url = {https://mlanthology.org/iccvw/2021/kim2021iccvw-mila/}
}