Fast and Low-Cost Genomic Foundation Models via Outlier Removal
Abstract
To address the challenge of scarce computational resources in genomic modeling, we introduce GERM, a genomic foundation model optimized for accessibility and adaptability. GERM improves upon models like DNABERT-2 by eliminating outliers that hinder low-rank adaptation and post-training quantization, enhancing both efficiency and robustness. We replace the vanilla attention layer with an outlier-free mechanism inspired by associative memory models. By removing outliers during both pre-training and fine-tuning, this approach accelerates adaptation, reduces computational costs, and enhances quantization robustness within acceptable loss margins. Additionally, we propose GERM-T, a strategy that employs small-step continual learning within the outlier-free framework, leveraging original checkpoints to avoid retraining from scratch. Empirically, GERM improves fine-tuning performance by 37.98% and quantization by 64.34% over the baseline model. It also reduces average kurtosis by 92.14% and maximum infinity norm by 82.77%. Compared to leading methods, GERM consistently delivers superior performance, offering a practical solution for genomic modeling in resource-constrained settings.
Cite
Text
Luo et al. "Fast and Low-Cost Genomic Foundation Models via Outlier Removal." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Luo et al. "Fast and Low-Cost Genomic Foundation Models via Outlier Removal." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/luo2025icml-fast/)BibTeX
@inproceedings{luo2025icml-fast,
title = {{Fast and Low-Cost Genomic Foundation Models via Outlier Removal}},
author = {Luo, Haozheng and Qiu, Chenghao and Su, Maojiang and Zhou, Zhihan and Mehta, Zoe and Ye, Guo and Hu, Jerry Yao-Chieh and Liu, Han},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {41254-41289},
volume = {267},
url = {https://mlanthology.org/icml/2025/luo2025icml-fast/}
}