Holistic Molecular Representation Learning via Multi-View Fragmentation
Abstract
Learning chemically meaningful representations from unlabeled molecules plays a vital role in AI-based drug design and discovery. In response to this, several self-supervised learning methods have been developed, focusing either on global (e.g., graph-level) or local (e.g., motif-level) information of molecular graphs. However, it is still unclear which approach is more effective for learning better molecular representations. In this paper, we propose a novel holistic self-supervised molecular representation learning framework that effectively learns both global and local molecular information. Our key idea is to utilize fragmentation, which decomposes a molecule into a set of chemically meaningful fragments (e.g., functional groups), to associate a global graph structure to a set of local substructures, thereby preserving chemical properties and learn both information via contrastive learning between them. Additionally, we also consider the 3D geometry of molecules as another view for contrastive learning. We demonstrate that our framework outperforms prior molecular representation learning methods across various molecular property prediction tasks.
Cite
Text
Kim et al. "Holistic Molecular Representation Learning via Multi-View Fragmentation." Transactions on Machine Learning Research, 2024.Markdown
[Kim et al. "Holistic Molecular Representation Learning via Multi-View Fragmentation." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/kim2024tmlr-holistic/)BibTeX
@article{kim2024tmlr-holistic,
title = {{Holistic Molecular Representation Learning via Multi-View Fragmentation}},
author = {Kim, Seojin and Nam, Jaehyun and Kim, Junsu and Lee, Hankook and Ahn, Sungsoo and Shin, Jinwoo},
journal = {Transactions on Machine Learning Research},
year = {2024},
url = {https://mlanthology.org/tmlr/2024/kim2024tmlr-holistic/}
}