Feature Space Augmentation for Long-Tailed Data
Abstract
Real-world data often follow a long-tailed distribution as the frequency of each class is typically different. For example, a dataset can have a large number of under-represented classes and a few classes with more than sufficient data. However, a model to represent the dataset is usually expected to have reasonably homogeneous performances across classes. Introducing class-balanced loss and advanced methods on data re-sampling and augmentation are among the best practices to alleviate the data imbalance problem. However, the other part of the problem about the under-represented classes will have to rely on additional knowledge to recover the missing information.In this work, we present a novel approach to address the long-tailed problem by augmenting the under-represented classes in the feature space with the features learned from the classes with ample samples. In particular, we decompose the features of each class into a class-generic component and a class-specific component using class activation map. Novel samples of under-represented classes are then generated on the fly during training stages by fusing the class-specific features from the under-represented classes with the class-generic features from confusing classes. Our results on different datasets such as iNaturalist and a long-tailed version of CIFAR have shown superior performance compared to common practice and the state of the arts.
Cite
Text
Chu et al. "Feature Space Augmentation for Long-Tailed Data." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58526-6_41Markdown
[Chu et al. "Feature Space Augmentation for Long-Tailed Data." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/chu2020eccv-feature/) doi:10.1007/978-3-030-58526-6_41BibTeX
@inproceedings{chu2020eccv-feature,
title = {{Feature Space Augmentation for Long-Tailed Data}},
author = {Chu, Peng and Bian, Xiao and Liu, Shaopeng and Ling, Haibin},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2020},
doi = {10.1007/978-3-030-58526-6_41},
url = {https://mlanthology.org/eccv/2020/chu2020eccv-feature/}
}