Discriminability-Enforcing Loss to Improve Representation Learning

Abstract

During training, deep neural networks implicitly learn to represent the input data samples through a hierarchy of features, where the size of the hierarchy is determined by the number of layers. In this paper, we focus on enforcing the discriminative power of the high-level representations, which are typically learned by the deeper layers (closer to the output). To this end, we introduce a new loss term inspired by the Gini impurity, which is aimed at minimizing the entropy (increasing the discriminative power) of individual high-level features with respect to the class labels. Although our Gini loss induces highly discriminative features, it does not ensure that the distribution of the high-level features matches the distribution of the classes. As such, we introduce another loss term to minimize the Kullback–Leibler divergence between the two distributions. We conduct experiments on two image classification data sets (CIFAR-100 and Caltech 101), considering multiple neural architectures ranging from convolutional networks (ResNet-17, ResNet-18, ResNet-50) to transformers (CvT). Our empirical results show that models trained with our novel loss terms integrated into the training objective consistently outperform the models trained with cross-entropy alone, without increasing the inference time.
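The two loss terms described in the abstract can be illustrated with a small sketch. The abstract does not give the exact formulation, so everything below is an assumption made for illustration: the function names (`class_mass`, `gini_loss`, `kl_loss`), the use of summed non-negative activations as per-class "mass" for each feature, and the direction of the KL divergence are all hypothetical and may differ from the authors' method.

```python
import math

def class_mass(features, labels, num_classes):
    """Sum each feature's non-negative activation per class: mass[j][c].

    Assumption: activations are clamped at zero, as after a ReLU.
    """
    num_features = len(features[0])
    mass = [[0.0] * num_classes for _ in range(num_features)]
    for row, label in zip(features, labels):
        for j, value in enumerate(row):
            mass[j][label] += max(value, 0.0)
    return mass

def gini_loss(mass, eps=1e-12):
    """Mean Gini impurity, 1 - sum_c p[j][c]^2, averaged over features.

    p[j][c] is the share of feature j's total activation that falls on class c.
    The value is near 0 when every feature fires for a single class.
    """
    total = 0.0
    for row in mass:
        s = sum(row) + eps
        total += 1.0 - sum((m / s) ** 2 for m in row)
    return total / len(mass)

def kl_loss(mass, labels, num_classes, eps=1e-12):
    """KL(p || q) between class frequencies p and per-class feature mass q.

    Assumption: q[c] is the share of all feature activation assigned to class c.
    """
    n = len(labels)
    p = [labels.count(c) / n for c in range(num_classes)]
    col = [sum(row[c] for row in mass) for c in range(num_classes)]
    total = sum(col) + eps
    q = [m / total for m in col]
    return sum(pc * math.log(pc / (qc + eps)) for pc, qc in zip(p, q) if pc > 0)

# Two features that both fire only on class 0: each feature is perfectly
# discriminative (low Gini impurity), yet class 1 receives no feature mass,
# so the KL term is large. This is the gap the second loss term targets.
labels = [0, 0, 1, 1]
collapsed = class_mass([[1, 1], [1, 1], [0, 0], [0, 0]], labels, 2)
print(gini_loss(collapsed), kl_loss(collapsed, labels, 2))
```

In a training setup, these terms would presumably be added to the cross-entropy loss with weighting coefficients, which are hyperparameters not specified in the abstract. Since the terms only affect the training objective, the network evaluated at inference is unchanged, consistent with the abstract's claim about inference time.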

Cite

Text

Croitoru et al. "Discriminability-Enforcing Loss to Improve Representation Learning." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00292

Markdown

[Croitoru et al. "Discriminability-Enforcing Loss to Improve Representation Learning." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/croitoru2022cvprw-discriminabilityenforcing/) doi:10.1109/CVPRW56347.2022.00292

BibTeX

@inproceedings{croitoru2022cvprw-discriminabilityenforcing,
  title     = {{Discriminability-Enforcing Loss to Improve Representation Learning}},
  author    = {Croitoru, Florinel-Alin and Grigore, Diana-Nicoleta and Ionescu, Radu Tudor},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2022},
  pages     = {2597-2601},
  doi       = {10.1109/CVPRW56347.2022.00292},
  url       = {https://mlanthology.org/cvprw/2022/croitoru2022cvprw-discriminabilityenforcing/}
}