Effect of Stage Training for Long-Tailed Multi-Label Image Classification
Abstract
In this study, we focus on the multi-stage training approach for training image classification models in the ICCV CVAMD 2023 Shared Task CXR-LT: Multi-Label LongTailed Classification on Chest X-Rays. In the proposed approach, the input image size and batch size are adjusted at each stage of the training process. In the first stage, we use a smaller input image size and a larger batch size for model training. Following that, we increase the image size and reduce the batch size in the second stage. A thorough search of the related literature did not yield validations of a similar approach for data with a long-tailed distribution. We successfully balance accelerated model training and performance by combining the proposed technique with various enhancements, such as oversampling, postprocessing using view positions, and ensemble methods, despite using a smaller model architecture and smaller input image size.
Cite
Text
Yamagishi and Hanaoka. "Effect of Stage Training for Long-Tailed Multi-Label Image Classification." IEEE/CVF International Conference on Computer Vision Workshops, 2023. doi:10.1109/ICCVW60793.2023.00287Markdown
[Yamagishi and Hanaoka. "Effect of Stage Training for Long-Tailed Multi-Label Image Classification." IEEE/CVF International Conference on Computer Vision Workshops, 2023.](https://mlanthology.org/iccvw/2023/yamagishi2023iccvw-effect/) doi:10.1109/ICCVW60793.2023.00287BibTeX
@inproceedings{yamagishi2023iccvw-effect,
title = {{Effect of Stage Training for Long-Tailed Multi-Label Image Classification}},
author = {Yamagishi, Yosuke and Hanaoka, Shohei},
booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
year = {2023},
pages = {2713-2720},
doi = {10.1109/ICCVW60793.2023.00287},
url = {https://mlanthology.org/iccvw/2023/yamagishi2023iccvw-effect/}
}