Graph Diffusion Policy Optimization

Abstract

Recent research has made significant progress in optimizing diffusion models for downstream objectives, an important pursuit in fields such as graph generation for drug design. However, directly applying these methods to graphs presents challenges and results in suboptimal performance. This paper introduces graph diffusion policy optimization (GDPO), a novel approach that optimizes graph diffusion models for arbitrary (e.g., non-differentiable) objectives using reinforcement learning. GDPO is based on an eager policy gradient tailored for graph diffusion models, developed through careful analysis, and offers promising performance improvements. Experimental results show that GDPO achieves state-of-the-art performance in various graph generation tasks with complex and diverse objectives. Code is available at https://github.com/sail-sg/GDPO.
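The abstract does not spell out the form of the eager policy gradient, so the snippet below is only a rough illustration of the general idea it builds on: a REINFORCE-style policy gradient over the denoising trajectory of a discrete graph diffusion sampler, with a non-differentiable reward on the final graph. All names (`GraphDenoiser`, `reward_fn`, `policy_gradient_step`) and the toy objective are hypothetical stand-ins, not the GDPO implementation.

```python
# Hedged sketch: REINFORCE-style update for a toy discrete graph diffusion sampler.
# Not the GDPO method; names and the reward are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class GraphDenoiser(nn.Module):
    """Toy denoiser: predicts categorical logits over edge types for each node pair."""
    def __init__(self, num_nodes: int, num_edge_types: int, hidden: int = 64):
        super().__init__()
        self.num_nodes = num_nodes
        self.num_edge_types = num_edge_types
        self.net = nn.Sequential(
            nn.Linear(num_nodes * num_nodes * num_edge_types + 1, hidden),
            nn.ReLU(),
            nn.Linear(hidden, num_nodes * num_nodes * num_edge_types),
        )

    def forward(self, g_onehot: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
        x = torch.cat([g_onehot.flatten(1), t.float().unsqueeze(1)], dim=1)
        return self.net(x).view(-1, self.num_nodes, self.num_nodes, self.num_edge_types)


def reward_fn(g_onehot: torch.Tensor) -> torch.Tensor:
    # Hypothetical non-differentiable objective: prefer graphs with ~50% edge density.
    density = (g_onehot.argmax(-1) > 0).float().mean(dim=(1, 2))
    return -(density - 0.5).abs()


def policy_gradient_step(model, optimizer, batch: int = 8, steps: int = 10):
    """One REINFORCE update: sample denoising trajectories, score the final graphs,
    and weight the accumulated log-probabilities by the (mean-centered) reward."""
    n, e = model.num_nodes, model.num_edge_types
    g = torch.randint(0, e, (batch, n, n))          # fully-noised graphs
    log_prob_sum = torch.zeros(batch)
    for t in reversed(range(1, steps + 1)):
        logits = model(F.one_hot(g, e).float(), torch.full((batch,), t))
        dist = torch.distributions.Categorical(logits=logits)
        g = dist.sample()                           # next (less noisy) graph
        log_prob_sum = log_prob_sum + dist.log_prob(g).sum(dim=(1, 2))
    r = reward_fn(F.one_hot(g, e).float())          # reward needs no gradient
    loss = -((r - r.mean()) * log_prob_sum).mean()  # REINFORCE with a mean baseline
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item(), r.mean().item()


model = GraphDenoiser(num_nodes=6, num_edge_types=2)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss, avg_reward = policy_gradient_step(model, opt)
```

Because the reward is only evaluated on the sampled final graph, nothing in this loop requires the objective to be differentiable, which is the property the paper exploits; the mean baseline is just a standard variance-reduction choice, not something claimed by the abstract.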

Cite

Text

Liu et al. "Graph Diffusion Policy Optimization." Neural Information Processing Systems, 2024. doi:10.52202/079017-0305

Markdown

[Liu et al. "Graph Diffusion Policy Optimization." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/liu2024neurips-graph-a/) doi:10.52202/079017-0305

BibTeX

@inproceedings{liu2024neurips-graph-a,
  title     = {{Graph Diffusion Policy Optimization}},
  author    = {Liu, Yijing and Du, Chao and Pang, Tianyu and Li, Chongxuan and Lin, Min and Chen, Wei},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-0305},
  url       = {https://mlanthology.org/neurips/2024/liu2024neurips-graph-a/}
}