Rethinking Feature Distribution for Loss Functions in Image Classification
Abstract
We propose a large-margin Gaussian Mixture (L-GM) loss for deep neural networks in classification tasks. Different from the softmax cross-entropy loss, our proposal is established on the assumption that the deep features of the training set follow a Gaussian Mixture distribution. By involving a classification margin and a likelihood regularization, the L-GM loss facilitates both a high classification performance and an accurate modeling of the training feature distribution. As such, the L-GM loss is superior to the softmax loss and its major variants in the sense that besides classification, it can be readily used to distinguish abnormal inputs, such as the adversarial examples, based on their features' likelihood to the training feature distribution. Extensive experiments on various recognition benchmarks like MNIST, CIFAR, ImageNet and LFW, as well as on adversarial examples demonstrate the effectiveness of our proposal.
Cite
Text
Wan et al. "Rethinking Feature Distribution for Loss Functions in Image Classification." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00950Markdown
[Wan et al. "Rethinking Feature Distribution for Loss Functions in Image Classification." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/wan2018cvpr-rethinking/) doi:10.1109/CVPR.2018.00950BibTeX
@inproceedings{wan2018cvpr-rethinking,
title = {{Rethinking Feature Distribution for Loss Functions in Image Classification}},
author = {Wan, Weitao and Zhong, Yuanyi and Li, Tianpeng and Chen, Jiansheng},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2018},
doi = {10.1109/CVPR.2018.00950},
url = {https://mlanthology.org/cvpr/2018/wan2018cvpr-rethinking/}
}