AlphaFold Meets Flow Matching for Generating Protein Ensembles

Abstract

Recent breakthroughs in protein structure prediction have pointed to structural ensembles as the next frontier in the computational understanding of protein structure. At the same time, iterative refinement techniques such as diffusion have driven significant advancements in generative modeling. We explore the synergy of these developments by combining AlphaFold and ESMFold with flow matching, a powerful modern generative modeling framework, in order to sample the conformational landscape of proteins. When trained on the PDB and evaluated on proteins with multiple recent structures, our method produces ensembles with similar precision and greater diversity compared to MSA subsampling. When further fine-tuned on coarse-grained molecular dynamics trajectories, our model generalizes to unseen proteins and accurately predicts conformational flexbility, captures the joint distribution of atomic positions, and models higher-order physiochemical properties such as intermittent contacts and solvent exposure. These results open exciting avenues in the computational prediction of conformational flexibility.

Cite

Text

Jing et al. "AlphaFold Meets Flow Matching for Generating Protein Ensembles." NeurIPS 2023 Workshops: AI4D3, 2023.

Markdown

[Jing et al. "AlphaFold Meets Flow Matching for Generating Protein Ensembles." NeurIPS 2023 Workshops: AI4D3, 2023.](https://mlanthology.org/neuripsw/2023/jing2023neuripsw-alphafold/)

BibTeX

@inproceedings{jing2023neuripsw-alphafold,
  title     = {{AlphaFold Meets Flow Matching for Generating Protein Ensembles}},
  author    = {Jing, Bowen and Berger, Bonnie and Jaakkola, Tommi},
  booktitle = {NeurIPS 2023 Workshops: AI4D3},
  year      = {2023},
  url       = {https://mlanthology.org/neuripsw/2023/jing2023neuripsw-alphafold/}
}