Graph-Based Molecular Representation Learning

Abstract

Molecular representation learning (MRL) is a key step to build the connection between machine learning and chemical science. In particular, it encodes molecules as numerical vectors preserving the molecular structures and features, on top of which the downstream tasks (e.g., property prediction) can be performed. Recently, MRL has achieved considerable progress, especially in methods based on deep molecular graph learning. In this survey, we systematically review these graph-based molecular representation techniques, especially the methods incorporating chemical domain knowledge. Specifically, we first introduce the features of 2D and 3D molecular graphs. Then we summarize and categorize MRL methods into three groups based on their input. Furthermore, we discuss some typical chemical applications supported by MRL. To facilitate studies in this fast-developing area, we also list the benchmarks and commonly used datasets in the paper. Finally, we share our thoughts on future research directions.

Cite

Text

Guo et al. "Graph-Based Molecular Representation Learning." International Joint Conference on Artificial Intelligence, 2023. doi:10.24963/IJCAI.2023/744

Markdown

[Guo et al. "Graph-Based Molecular Representation Learning." International Joint Conference on Artificial Intelligence, 2023.](https://mlanthology.org/ijcai/2023/guo2023ijcai-graph/) doi:10.24963/IJCAI.2023/744

BibTeX

@inproceedings{guo2023ijcai-graph,
  title     = {{Graph-Based Molecular Representation Learning}},
  author    = {Guo, Zhichun and Guo, Kehan and Nan, Bozhao and Tian, Yijun and Iyer, Roshni G. and Ma, Yihong and Wiest, Olaf and Zhang, Xiangliang and Wang, Wei and Zhang, Chuxu and Chawla, Nitesh V.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {6638-6646},
  doi       = {10.24963/IJCAI.2023/744},
  url       = {https://mlanthology.org/ijcai/2023/guo2023ijcai-graph/}
}