MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation
Abstract
Test-time adaptation approaches have recently emerged as a practical solution for handling domain shift without access to the source domain data. In this paper, we propose and explore a new multi-modal extension of test-time adaptation for 3D semantic segmentation. We find that, directly applying existing methods usually results in performance instability at test time, because multi-modal input is not considered jointly. To design a framework that can take full advantage of multi-modality, where each modality provides regularized self-supervisory signals to other modalities, we propose two complementary modules within and across the modalities. First, Intra-modal Pseudo-label Generation (Intra-PG) is introduced to obtain reliable pseudo labels within each modality by aggregating information from two models that are both pre-trained on source data but updated with target data at different paces. Second, Intermodal Pseudo-label Refinement (Inter-PR) adaptively selects more reliable pseudo labels from different modalities based on a proposed consistency scheme. Experiments demonstrate that our regularized pseudo labels produce stable self-learning signals in numerous multi-modal test-time adaptation scenarios for 3D semantic segmentation. Visit our project website at https://www.nec-labs.com/ mas/MM-TTA.
Cite
Text
Shin et al. "MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.01642Markdown
[Shin et al. "MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/shin2022cvpr-mmtta/) doi:10.1109/CVPR52688.2022.01642BibTeX
@inproceedings{shin2022cvpr-mmtta,
title = {{MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation}},
author = {Shin, Inkyu and Tsai, Yi-Hsuan and Zhuang, Bingbing and Schulter, Samuel and Liu, Buyu and Garg, Sparsh and Kweon, In So and Yoon, Kuk-Jin},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2022},
pages = {16928-16937},
doi = {10.1109/CVPR52688.2022.01642},
url = {https://mlanthology.org/cvpr/2022/shin2022cvpr-mmtta/}
}