Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation

Abstract

In this paper, we propose a novel joint Task-Recursive Learning (TRL) framework for the closing-loop semantic segmentation and monocular depth estimation tasks. TRL can recursively refine the results of both tasks through serialized task-level interactions. In order to mutually-boost for each other, we encapsulate the interaction into a specific Task-Attentional Module (TAM) to adaptively enhance some counterpart patterns of both tasks. Further, to make the inference more credible, we propagate previous learning experiences on both tasks into the next network evolution by explicitly concatenating previous responses. The sequence of task-level interactions are finally evolved along a coarse-to-fine scale space such that the required details may be reconstructed progressively. Extensive experiments on NYU-Depth v2 and SUN RGB-D datasets demonstrate that our method achieves state-of-the-art results for monocular depth estimation and semantic segmentation.

Cite

Text

Zhang et al. "Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation." Proceedings of the European Conference on Computer Vision (ECCV), 2018. doi:10.1007/978-3-030-01249-6_15

Markdown

[Zhang et al. "Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation." Proceedings of the European Conference on Computer Vision (ECCV), 2018.](https://mlanthology.org/eccv/2018/zhang2018eccv-joint/) doi:10.1007/978-3-030-01249-6_15

BibTeX

@inproceedings{zhang2018eccv-joint,
  title     = {{Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation}},
  author    = {Zhang, Zhenyu and Cui, Zhen and Xu, Chunyan and Jie, Zequn and Li, Xiang and Yang, Jian},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2018},
  doi       = {10.1007/978-3-030-01249-6_15},
  url       = {https://mlanthology.org/eccv/2018/zhang2018eccv-joint/}
}