Cross Modal Distillation for Supervision Transfer
Abstract
In this work we propose a technique that transfers supervision between images from different modalities. We use learned representations from a large labeled modality as supervisory signal for training representations for a new unlabeled paired modality. Our method enables learning of rich representations for unlabeled modalities and can be used as a pre-training procedure for new modalities with limited labeled data. We transfer supervision from labeled RGB images to unlabeled depth and optical flow images and demonstrate large improvements for both these cross modal supervision transfers.
Cite
Text
Gupta et al. "Cross Modal Distillation for Supervision Transfer." Conference on Computer Vision and Pattern Recognition, 2016. doi:10.1109/CVPR.2016.309Markdown
[Gupta et al. "Cross Modal Distillation for Supervision Transfer." Conference on Computer Vision and Pattern Recognition, 2016.](https://mlanthology.org/cvpr/2016/gupta2016cvpr-cross/) doi:10.1109/CVPR.2016.309BibTeX
@inproceedings{gupta2016cvpr-cross,
title = {{Cross Modal Distillation for Supervision Transfer}},
author = {Gupta, Saurabh and Hoffman, Judy and Malik, Jitendra},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2016},
doi = {10.1109/CVPR.2016.309},
url = {https://mlanthology.org/cvpr/2016/gupta2016cvpr-cross/}
}