AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data

Abstract

Semi-supervised learning (SSL) has seen great strides when labeled data is scarce but unlabeled data is abundant. Critically, most recent work assume that such unlabeled data is drawn from the same distribution as the labeled data. In this work, we show that state-of-the-art SSL algorithms suffer a degradation in performance in the presence of unlabeled auxiliary data that does not necessarily possess the same class distribution as the labeled set. We term this problem as Auxiliary-SSL and propose AuxMix, an algorithm that leverages self-supervised learning tasks to learn generic features in order to mask auxiliary data that are not semantically similar to the labeled set. We also propose to regularize learning by maximizing the predicted entropy for dissimilar auxiliary samples. We show an improvement of 5% over existing baselines on a ResNet-50 model when trained on CIFAR10 dataset with 4k labeled samples and all unlabeled data is drawn from the Tiny-Imagenet dataset. We report competitive results on several datasets and conduct ablation studies.

Cite

Text

Banitalebi-Dehkordi et al. "AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00445

Markdown

[Banitalebi-Dehkordi et al. "AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/banitalebidehkordi2022cvprw-auxmix/) doi:10.1109/CVPRW56347.2022.00445

BibTeX

@inproceedings{banitalebidehkordi2022cvprw-auxmix,
  title     = {{AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data}},
  author    = {Banitalebi-Dehkordi, Amin and Gujjar, Pratik and Zhang, Yong},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2022},
  pages     = {3998-4005},
  doi       = {10.1109/CVPRW56347.2022.00445},
  url       = {https://mlanthology.org/cvprw/2022/banitalebidehkordi2022cvprw-auxmix/}
}