Scoreformer: A Surrogate Model for Large-Scale Prediction of Docking Scores
Abstract
In this study, we present ScoreFormer, a novel graph transformer model designed to accurately predict molecular docking scores, thereby optimizing high-throughput virtual screening (HTVS) in drug discovery. The architecture integrates Principal Neighborhood Aggregation (PNA) and Learnable Random Walk Positional Encodings (LRWPE), enhancing the model's ability to understand complex molecular structures and their relationship with their respective docking scores. This approach significantly surpasses traditional HTVS methods and recent Graph Neural Network (GNN) models in both recovery and efficiency due to a wider coverage of the chemical space and enhanced performance. Our results demonstrate that ScoreFormer achieves competitive performance in docking score prediction and offers a substantial 1.65-fold reduction in inference time compared to existing models. We evaluated ScoreFormer across multiple datasets under various conditions, confirming its robustness and reliability in identifying potential drug candidates rapidly.
Cite
Text
Serrano et al. "Scoreformer: A Surrogate Model for Large-Scale Prediction of Docking Scores." ICML 2024 Workshops: ML4LMS, 2024.Markdown
[Serrano et al. "Scoreformer: A Surrogate Model for Large-Scale Prediction of Docking Scores." ICML 2024 Workshops: ML4LMS, 2024.](https://mlanthology.org/icmlw/2024/serrano2024icmlw-scoreformer/)BibTeX
@inproceedings{serrano2024icmlw-scoreformer,
title = {{Scoreformer: A Surrogate Model for Large-Scale Prediction of Docking Scores}},
author = {Serrano, Alvaro Ciudad and Morales-Pastor, Adrian and Malo, Laura and Filella-Merce, Isaac and Guallar, Victor and Molina, Alexis},
booktitle = {ICML 2024 Workshops: ML4LMS},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/serrano2024icmlw-scoreformer/}
}