Low-Cost Multispectral Scene Analysis with Modality Distillation
Abstract
Despite its robust performance under various illumination conditions, multispectral scene analysis has not been widely deployed due to two strong practical limitations: 1) thermal cameras, especially high-resolution ones are much more expensive than conventional visible cameras; 2) the most commonly adopted multispectral architectures, two-stream neural networks, nearly double the inference time of a regular mono-spectral model which makes them impractical in embedded environments. In this work, we aim to tackle these two limitations by proposing a novel knowledge distillation framework named Modality Distillation (MD). The proposed framework distils the knowledge from a high thermal resolution two-stream network with feature-level fusion to a low thermal resolution one-stream network with image-level fusion. We show on different multispectral scene analysis benchmarks that our method can effectively allow the use of low-resolution thermal sensors with more compact one-stream networks.
Cite
Text
Zhang et al. "Low-Cost Multispectral Scene Analysis with Modality Distillation." Winter Conference on Applications of Computer Vision, 2022.Markdown
[Zhang et al. "Low-Cost Multispectral Scene Analysis with Modality Distillation." Winter Conference on Applications of Computer Vision, 2022.](https://mlanthology.org/wacv/2022/zhang2022wacv-lowcost/)BibTeX
@inproceedings{zhang2022wacv-lowcost,
title = {{Low-Cost Multispectral Scene Analysis with Modality Distillation}},
author = {Zhang, Heng and Fromont, Elisa and Lefèvre, Sébastien and Avignon, Bruno},
booktitle = {Winter Conference on Applications of Computer Vision},
year = {2022},
pages = {803-812},
url = {https://mlanthology.org/wacv/2022/zhang2022wacv-lowcost/}
}