Effect of Stage Training for Long-Tailed Multi-Label Image Classification

Abstract

In this study, we focus on the multi-stage training approach for training image classification models in the ICCV CVAMD 2023 Shared Task CXR-LT: Multi-Label LongTailed Classification on Chest X-Rays. In the proposed approach, the input image size and batch size are adjusted at each stage of the training process. In the first stage, we use a smaller input image size and a larger batch size for model training. Following that, we increase the image size and reduce the batch size in the second stage. A thorough search of the related literature did not yield validations of a similar approach for data with a long-tailed distribution. We successfully balance accelerated model training and performance by combining the proposed technique with various enhancements, such as oversampling, postprocessing using view positions, and ensemble methods, despite using a smaller model architecture and smaller input image size.

Cite

Text

Yamagishi and Hanaoka. "Effect of Stage Training for Long-Tailed Multi-Label Image Classification." IEEE/CVF International Conference on Computer Vision Workshops, 2023. doi:10.1109/ICCVW60793.2023.00287

Markdown

[Yamagishi and Hanaoka. "Effect of Stage Training for Long-Tailed Multi-Label Image Classification." IEEE/CVF International Conference on Computer Vision Workshops, 2023.](https://mlanthology.org/iccvw/2023/yamagishi2023iccvw-effect/) doi:10.1109/ICCVW60793.2023.00287

BibTeX

@inproceedings{yamagishi2023iccvw-effect,
  title     = {{Effect of Stage Training for Long-Tailed Multi-Label Image Classification}},
  author    = {Yamagishi, Yosuke and Hanaoka, Shohei},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2023},
  pages     = {2713-2720},
  doi       = {10.1109/ICCVW60793.2023.00287},
  url       = {https://mlanthology.org/iccvw/2023/yamagishi2023iccvw-effect/}
}