Self-Accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method

Abstract

This study introduces a novel approach to bone age assessment (BAA) utilizing a multi-view and multi-task classification model based on the Sauvegrain method, which assesses a maturity score for several landmarks in the elbow and predicts bone age. A straightforward solution to automating the Sauvegrain method leveraging deep neural networks is to train classifiers independently to score each region of interest, but this approach restricts the analysis to isolated anatomical details and increases computational costs. To address these challenges, we propose a self-accumulative vision transformer (SAT) designed to manage anisotropic behaviors commonly encountered in multi-view, multi-task scenarios. The SAT enhances feature integration by employing two key strategies: token replay , which uses residual connections to maintain semantic representations of tokens from the same landmark, and regional attention bias , a modified self-attention mechanism that focuses on intra-region details. Extensive experiments show that the SAT not only effectively captures the interconnections between landmarks but also assimilates global morphological features, reducing the mean absolute error in BAA by 0.11 compared to prior methods. Furthermore, the proposed method has four times reduced parameters than an ensemble of individual classifiers of the previous work. These improvements in our model highlight its increased efficiency and accuracy, offering a valuable advancement for clinical applications in the field of BAA. Code is available at https://github.com/hongchunchoi/SAT .

Cite

Text

Choi et al. "Self-Accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-93806-1_13

Markdown

[Choi et al. "Self-Accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/choi2024eccvw-selfaccumulative/) doi:10.1007/978-3-031-93806-1_13

BibTeX

@inproceedings{choi2024eccvw-selfaccumulative,
  title     = {{Self-Accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method}},
  author    = {Choi, Hongjun and Na, Dongbin and Cho, Kyungjin and Bae, Byunguk and Kong, Seo Taek and Ahn, Hyunjoon and Choi, Sungchul and Kim, Jaeyoung},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {160-176},
  doi       = {10.1007/978-3-031-93806-1_13},
  url       = {https://mlanthology.org/eccvw/2024/choi2024eccvw-selfaccumulative/}
}