MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders
Abstract
Multi-task dense scene understanding, which learns a model for multiple dense prediction tasks, has a wide range of application scenarios. Modeling long-range dependency and enhancing cross-task interactions are crucial to multi-task dense prediction. In this paper, we propose MTMamba, a novel Mamba-based architecture for multi-task scene understanding. It contains two types of core blocks: self-task Mamba (STM) block and cross-task Mamba (CTM) block. STM handles long-range dependency by leveraging Mamba, while CTM explicitly models task interactions to facilitate information exchange across tasks. Experiments on NYUDv2 and PASCAL-Context datasets demonstrate the superior performance of MTMamba over Transformer-based and CNN-based methods. Notably, on the PASCAL-Context dataset, MTMamba achieves improvements of +2.08, +5.01, and +4.90 over the previous best methods in the tasks of semantic segmentation, human parsing, and object boundary detection, respectively. The code is available at https://github.com/EnVision-Research/MTMamba.
Cite
Text
Lin et al. "MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72897-6_18Markdown
[Lin et al. "MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/lin2024eccv-mtmamba/) doi:10.1007/978-3-031-72897-6_18BibTeX
@inproceedings{lin2024eccv-mtmamba,
title = {{MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders}},
author = {Lin, Baijiong and Jiang, Weisen and Chen, Pengguang and Zhang, Yu and Liu, Shu and Chen, Yingcong},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-72897-6_18},
url = {https://mlanthology.org/eccv/2024/lin2024eccv-mtmamba/}
}