Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology

Abstract

Multiple instance learning (MIL) is the most widely used framework in computational pathology encompassing sub-typing diagnosis prognosis and more. However the existing MIL paradigm typically requires an offline instance feature extractor such as a pre-trained ResNet or a foundation model. This approach lacks the capability for feature fine-tuning within the specific downstream tasks limiting its adaptability and performance. To address this issue we propose a Re-embedded Regional Transformer (RRT) for re-embedding the instance features online which captures fine-grained local features and establishes connections across different regions. Unlike existing works that focus on pre-training powerful feature extractor or designing sophisticated instance aggregator RRT is tailored to re-embed instance features online. It serves as a portable module that can seamlessly integrate into mainstream MIL models. Extensive experimental results on common computational pathology tasks validate that: 1) feature re-embedding improves the performance of MIL models based on ResNet-50 features to the level of foundation model features and further enhances the performance of foundation model features; 2) the RRT can introduce more significant performance improvements to various MIL models; 3) RRT-MIL as an RRT-enhanced AB-MIL outperforms other latest methods by a large margin. The code is available at: https://github.com/DearCaat/RRT-MIL.

Cite

Text

Tang et al. "Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01078

Markdown

[Tang et al. "Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/tang2024cvpr-feature/) doi:10.1109/CVPR52733.2024.01078

BibTeX

@inproceedings{tang2024cvpr-feature,
  title     = {{Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology}},
  author    = {Tang, Wenhao and Zhou, Fengtao and Huang, Sheng and Zhu, Xiang and Zhang, Yi and Liu, Bo},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {11343-11352},
  doi       = {10.1109/CVPR52733.2024.01078},
  url       = {https://mlanthology.org/cvpr/2024/tang2024cvpr-feature/}
}