Molecule Design by Latent Prompt Transformer
Abstract
This work explores the challenging problem of molecule design by framing it as a conditional generative modeling task, where target biological properties or desired chemical constraints serve as conditioning variables.We propose the Latent Prompt Transformer (LPT), a novel generative model comprising three components: (1) a latent vector with a learnable prior distribution modeled by a neural transformation of Gaussian white noise; (2) a molecule generation model based on a causal Transformer, which uses the latent vector as a prompt; and (3) a property prediction model that predicts a molecule's target properties and/or constraint values using the latent prompt. LPT can be learned by maximum likelihood estimation on molecule-property pairs. During property optimization, the latent prompt is inferred from target properties and constraints through posterior sampling and then used to guide the autoregressive molecule generation.After initial training on existing molecules and their properties, we adopt an online learning algorithm to progressively shift the model distribution towards regions that support desired target properties. Experiments demonstrate that LPT not only effectively discovers useful molecules across single-objective, multi-objective, and structure-constrained optimization tasks, but also exhibits strong sample efficiency.
Cite
Text
Kong et al. "Molecule Design by Latent Prompt Transformer." Neural Information Processing Systems, 2024. doi:10.52202/079017-2826Markdown
[Kong et al. "Molecule Design by Latent Prompt Transformer." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/kong2024neurips-molecule/) doi:10.52202/079017-2826BibTeX
@inproceedings{kong2024neurips-molecule,
title = {{Molecule Design by Latent Prompt Transformer}},
author = {Kong, Deqian and Huang, Yuhao and Xie, Jianwen and Honig, Edouardo and Xu, Ming and Xue, Shuanghong and Lin, Pei and Zhou, Sanping and Zhong, Sheng and Zheng, Nanning and Wu, Ying Nian},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-2826},
url = {https://mlanthology.org/neurips/2024/kong2024neurips-molecule/}
}