BounDr.E: Predicting Drug-Likeness via Biomedical Knowledge Alignment and EM-like One-Class Boundary Optimization
Abstract
The advent of generative AI now enables large-scale $\textit{de novo}$ design of molecules, but identifying viable drug candidates among them remains an open problem. Existing drug-likeness prediction methods often rely on ambiguous negative sets or purely structural features, limiting their ability to accurately classify drugs from non-drugs. In this work, we introduce BounDr.E: a novel modeling of drug-likeness as a compact space surrounding approved drugs through a dynamic one-class boundary approach. Specifically, we enrich the chemical space through biomedical knowledge alignment, and then iteratively tighten the drug-like boundary by pushing non-drug-like compounds outside via an Expectation-Maximization (EM)-like process. Empirically, BounDr.E achieves 10% F1-score improvement over the previous state-of-the-art and demonstrates robust cross-dataset performance, including zero-shot toxic compound filtering. Additionally, we showcase its effectiveness through comprehensive case studies in large-scale $\textit{in silico}$ screening. Our codes and constructed benchmark data under various schemes are provided at: https://github.com/eugenebang/boundr_e.
Cite
Text
Bang et al. "BounDr.E: Predicting Drug-Likeness via Biomedical Knowledge Alignment and EM-like One-Class Boundary Optimization." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Bang et al. "BounDr.E: Predicting Drug-Likeness via Biomedical Knowledge Alignment and EM-like One-Class Boundary Optimization." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/bang2025icml-boundr/)BibTeX
@inproceedings{bang2025icml-boundr,
title = {{BounDr.E: Predicting Drug-Likeness via Biomedical Knowledge Alignment and EM-like One-Class Boundary Optimization}},
author = {Bang, Dongmin and Sung, Inyoung and Piao, Yinhua and Lee, Sangseon and Kim, Sun},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {2858-2893},
volume = {267},
url = {https://mlanthology.org/icml/2025/bang2025icml-boundr/}
}