CryoFM: A Flow-Based Foundation Model for Cryo-EM Densities

Abstract

Cryo-electron microscopy (cryo-EM) is a powerful technique in structural biology and drug discovery, enabling the study of biomolecules at high resolution. Significant advancements by structural biologists using cryo-EM have led to the production of around 40k protein density maps at various resolutions. However, cryo-EM data processing algorithms have yet to fully benefit from our knowledge of biomolecular density maps, with only a few recent models being data-driven but limited to specific tasks. In this study, we present CryoFM, a foundation model designed as a generative model, learning the distribution of high-quality density maps and generalizing effectively to downstream tasks. Built on flow matching, CryoFM is trained to accurately capture the prior distribution of biomolecular density maps. Furthermore, we introduce a flow posterior sampling method that leverages CryoFM as a flexible prior for several downstream tasks in cryo-EM and cryo-electron tomography (cryo-ET) without the need for fine-tuning, achieving state-of-the-art performance on most tasks and demonstrating its potential as a foundational model for broader applications in these fields.

Cite

Text

Zhou et al. "CryoFM: A Flow-Based Foundation Model for Cryo-EM Densities." International Conference on Learning Representations, 2025.

Markdown

[Zhou et al. "CryoFM: A Flow-Based Foundation Model for Cryo-EM Densities." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/zhou2025iclr-cryofm/)

BibTeX

@inproceedings{zhou2025iclr-cryofm,
  title     = {{CryoFM: A Flow-Based Foundation Model for Cryo-EM Densities}},
  author    = {Zhou, Yi and Li, Yilai and Yuan, Jing and Gu, Quanquan},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/zhou2025iclr-cryofm/}
}