FragFM: Efficient Fragment-Based Molecular Generation via Discrete Flow Matching
Abstract
We introduce FragFM, a novel fragment-based discrete flow matching framework for molecular graph generation. FragFM generates molecules at the fragment level, leveraging a coarse-to-fine autoencoding mechanism to reconstruct atom-level details. This approach reduces computational complexity while maintaining high chemical validity, enabling more efficient and scalable molecular generation. We benchmark FragFM against state-of-the-art diffusion- and flow-based models on standard molecular generation benchmarks and natural product datasets, demonstrating superior performance in validity, property control, and sampling efficiency. Notably, FragFM achieves over 99% validity with significantly fewer sampling steps, improving scalability while preserving molecular diversity. These results highlight the potential of fragment-based generative modeling for large-scale, property-aware molecular design, paving the way for more efficient exploration of chemical space.
Cite
Text
Lee et al. "FragFM: Efficient Fragment-Based Molecular Generation via Discrete Flow Matching." ICLR 2025 Workshops: GEM, 2025.
Markdown
[Lee et al. "FragFM: Efficient Fragment-Based Molecular Generation via Discrete Flow Matching." ICLR 2025 Workshops: GEM, 2025.](https://mlanthology.org/iclrw/2025/lee2025iclrw-fragfm/)
BibTeX
@inproceedings{lee2025iclrw-fragfm,
title = {{FragFM: Efficient Fragment-Based Molecular Generation via Discrete Flow Matching}},
author = {Lee, Joongwon and Kim, Seonghwan and Kim, Woo Youn},
booktitle = {ICLR 2025 Workshops: GEM},
year = {2025},
url = {https://mlanthology.org/iclrw/2025/lee2025iclrw-fragfm/}
}