Frequency-Balanced Retinal Representation Learning with Mutual Information Regularization
Abstract
We propose a frequency-oriented perspective on retinal representation learning by analyzing masked autoencoders (MAE) through the lens of spatial frequency. Our analysis shows that MAE favors low-frequency content while under-encoding diagnostically critical high-frequency structures in retinal images. Because retinal pathology often manifests in high-frequency detail, this bias limits diagnostic performance and motivates frequency-balanced representations. Within a mutual-information (MI) formulation of MAE, we introduce the Frequency-Balanced Retinal Masked Autoencoder (RetMAE), which augments the reconstruction objective with an MI regularizer that suppresses low-frequency redundancy and accentuates clinically salient high-frequency information. Without altering the architecture, RetMAE learns frequency-balanced features that surpass those of MAE-based retinal encoders in both quantitative and qualitative evaluations. These results suggest that a frequency-oriented view provides a principled foundation for future advances in ophthalmic modeling. It offers new insight into how MAE’s reconstruction objective amplifies ViT’s low-pass tendencies in spatially heterogeneous retinal images, and it enables a simple MI-based correction that improves retinal encoders.
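To make the notion of low- versus high-frequency content concrete, the following is a minimal sketch of how one might split an image's spectral energy into two radial bands with a 2-D FFT. It is an illustration only and is not the authors' analysis or the RetMAE regularizer; the function name `band_energy_fractions` and the `cutoff` value of 0.25 cycles/pixel are assumptions introduced here.

```python
import numpy as np


def band_energy_fractions(image, cutoff=0.25):
    """Return (low, high) fractions of spectral energy for a 2-D grayscale image.

    `cutoff` is a hypothetical radial threshold in cycles/pixel
    (the Nyquist limit along each axis is 0.5).
    """
    image = np.asarray(image, dtype=np.float64)
    image = image - image.mean()  # drop the DC component so it does not dominate

    # Power spectrum of the image.
    power = np.abs(np.fft.fft2(image)) ** 2

    # Radial spatial frequency of every FFT bin, in cycles/pixel.
    fy = np.fft.fftfreq(image.shape[0])[:, None]
    fx = np.fft.fftfreq(image.shape[1])[None, :]
    radius = np.sqrt(fx ** 2 + fy ** 2)

    total = power.sum()
    if total == 0:  # constant image: no energy in either band
        return 0.0, 0.0

    low = power[radius <= cutoff].sum() / total
    return float(low), float(1.0 - low)


# Synthetic stand-ins (not retinal data): a slow sinusoid and a checkerboard.
x = np.arange(64)
smooth = np.sin(2 * np.pi * x / 64)[None, :] * np.ones((64, 1))
checker = (np.indices((64, 64)).sum(axis=0) % 2).astype(float)

low_smooth, high_smooth = band_energy_fractions(smooth)
low_checker, high_checker = band_energy_fractions(checker)
```

Under these assumptions, the slow sinusoid places nearly all of its energy in the low band and the checkerboard nearly all of it in the high band. Applying the same split to an input and to its reconstruction would be one simple way to check whether fine detail is being under-represented.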
Cite
Text
Lee et al. "Frequency-Balanced Retinal Representation Learning with Mutual Information Regularization." International Conference on Learning Representations, 2026.
Markdown
[Lee et al. "Frequency-Balanced Retinal Representation Learning with Mutual Information Regularization." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/lee2026iclr-frequencybalanced/)
BibTeX
@inproceedings{lee2026iclr-frequencybalanced,
title = {{Frequency-Balanced Retinal Representation Learning with Mutual Information Regularization}},
author = {Lee, Seunghoon and Kang, Seongjae and Park, Inhyuk and Kwon, Gitaek and Baek, Jihyeon and Park, Doohyun},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mlanthology.org/iclr/2026/lee2026iclr-frequencybalanced/}
}