Self-Accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method

Choi, Hongjun; Na, Dongbin; Cho, Kyungjin; Bae, Byunguk; Kong, Seo Taek; Ahn, Hyunjoon; Choi, Sungchul; Kim, Jaeyoung

doi:10.1007/978-3-031-93806-1_13

ECCVW 2024 pp. 160-176

Abstract

This study introduces a novel approach to bone age assessment (BAA) that uses a multi-view, multi-task classification model based on the Sauvegrain method, which assigns a maturity score to several landmarks in the elbow and predicts bone age from them. A straightforward way to automate the Sauvegrain method with deep neural networks is to train an independent classifier to score each region of interest, but this approach restricts the analysis to isolated anatomical details and increases computational cost. To address these challenges, we propose a self-accumulative vision transformer (SAT) designed to manage the anisotropic behaviors commonly encountered in multi-view, multi-task scenarios. The SAT enhances feature integration through two key strategies: token replay, which uses residual connections to maintain the semantic representations of tokens from the same landmark, and regional attention bias, a modified self-attention mechanism that focuses on intra-region details. Extensive experiments show that the SAT not only captures the interconnections between landmarks but also assimilates global morphological features, reducing the mean absolute error in BAA by 0.11 compared to prior methods. Furthermore, the proposed method has four times fewer parameters than the ensemble of individual classifiers used in previous work. These improvements highlight the model's efficiency and accuracy, offering a valuable advancement for clinical applications of BAA. Code is available at https://github.com/hongchunchoi/SAT.
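The abstract describes regional attention bias only at a high level, so the following is a minimal sketch of one way such a mechanism could look, not the authors' implementation. It assumes the bias is a scalar added to the attention logits of token pairs that belong to the same landmark region. The names `regional_attention`, `intra_region_mass`, `region_ids`, and `bias` are hypothetical, and queries, keys, and values are taken to be the raw tokens with no learned projections.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def regional_attention(tokens, region_ids, bias=1.0):
    """Single-head self-attention with an additive same-region bias.

    Assumption (not from the paper): the bias is a scalar added to the
    logit of every token pair that shares a region id.
    tokens: list of n vectors; region_ids: list of n ints (hypothetical).
    """
    n, dim = len(tokens), len(tokens[0])
    scale = math.sqrt(dim)
    weights = []
    for i in range(n):
        logits = []
        for j in range(n):
            # Scaled dot-product score between token i and token j.
            score = sum(a * b for a, b in zip(tokens[i], tokens[j])) / scale
            if region_ids[i] == region_ids[j]:
                score += bias  # favor tokens from the same region
            logits.append(score)
        weights.append(softmax(logits))
    # Each output token is the attention-weighted sum of all tokens.
    outputs = [
        [sum(weights[i][j] * tokens[j][d] for j in range(n)) for d in range(dim)]
        for i in range(n)
    ]
    return outputs, weights


def intra_region_mass(weights, region_ids):
    """Total attention each token places on tokens of its own region."""
    return [
        sum(w for w, r in zip(row, region_ids) if r == region_ids[i])
        for i, row in enumerate(weights)
    ]


# Toy input: three regions with three tokens each, 4-dimensional features.
random.seed(0)
region_ids = [0, 0, 0, 1, 1, 1, 2, 2, 2]
tokens = [[random.gauss(0.0, 1.0) for _ in range(4)] for _ in region_ids]

outputs, biased = regional_attention(tokens, region_ids, bias=2.0)
_, plain = regional_attention(tokens, region_ids, bias=0.0)
```

Comparing `intra_region_mass(biased, region_ids)` with `intra_region_mass(plain, region_ids)` shows that a positive bias shifts attention toward tokens of the same region while still leaving some weight on the other regions, which is the qualitative behavior the abstract attributes to the mechanism.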

Cite

Text

Choi et al. "Self-Accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-93806-1_13

Markdown

[Choi et al. "Self-Accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/choi2024eccvw-selfaccumulative/) doi:10.1007/978-3-031-93806-1_13

BibTeX

@inproceedings{choi2024eccvw-selfaccumulative,
  title     = {{Self-Accumulative Vision Transformer for Bone Age Assessment Using the Sauvegrain Method}},
  author    = {Choi, Hongjun and Na, Dongbin and Cho, Kyungjin and Bae, Byunguk and Kong, Seo Taek and Ahn, Hyunjoon and Choi, Sungchul and Kim, Jaeyoung},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {160--176},
  doi       = {10.1007/978-3-031-93806-1_13},
  url       = {https://mlanthology.org/eccvw/2024/choi2024eccvw-selfaccumulative/}
}