Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology
Abstract
Multiple instance learning (MIL) is the most widely used framework in computational pathology encompassing sub-typing diagnosis prognosis and more. However the existing MIL paradigm typically requires an offline instance feature extractor such as a pre-trained ResNet or a foundation model. This approach lacks the capability for feature fine-tuning within the specific downstream tasks limiting its adaptability and performance. To address this issue we propose a Re-embedded Regional Transformer (RRT) for re-embedding the instance features online which captures fine-grained local features and establishes connections across different regions. Unlike existing works that focus on pre-training powerful feature extractor or designing sophisticated instance aggregator RRT is tailored to re-embed instance features online. It serves as a portable module that can seamlessly integrate into mainstream MIL models. Extensive experimental results on common computational pathology tasks validate that: 1) feature re-embedding improves the performance of MIL models based on ResNet-50 features to the level of foundation model features and further enhances the performance of foundation model features; 2) the RRT can introduce more significant performance improvements to various MIL models; 3) RRT-MIL as an RRT-enhanced AB-MIL outperforms other latest methods by a large margin. The code is available at: https://github.com/DearCaat/RRT-MIL.
Cite
Text
Tang et al. "Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01078Markdown
[Tang et al. "Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/tang2024cvpr-feature/) doi:10.1109/CVPR52733.2024.01078BibTeX
@inproceedings{tang2024cvpr-feature,
title = {{Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology}},
author = {Tang, Wenhao and Zhou, Fengtao and Huang, Sheng and Zhu, Xiang and Zhang, Yi and Liu, Bo},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {11343-11352},
doi = {10.1109/CVPR52733.2024.01078},
url = {https://mlanthology.org/cvpr/2024/tang2024cvpr-feature/}
}