Towards Unified and Lossless Latent Space for 3D Molecular Latent Diffusion Modeling
Abstract
3D molecule generation is crucial for drug discovery and material science, requiring models to process complex multi-modalities, including atom types, chemical bonds, and 3D coordinates. A key challenge is integrating these modalities of different shapes while maintaining SE(3) equivariance for 3D coordinates. To achieve this, existing approaches typically maintain separate latent spaces for invariant and equivariant modalities, reducing efficiency in both training and sampling. In this work, we propose **U**nified Variational **A**uto-**E**ncoder for **3D** Molecular Latent Diffusion Modeling (**UAE-3D**), a multi-modal VAE that compresses 3D molecules into latent sequences from a unified latent space, while maintaining near-zero reconstruction error. This unified latent space eliminates the complexities of handling multi-modality and equivariance when performing latent diffusion modeling. We demonstrate this by employing the Diffusion Transformer--a general-purpose diffusion model without any molecular inductive bias--for latent generation. Extensive experiments on GEOM-Drugs and QM9 datasets demonstrate that our method significantly establishes new benchmarks in both *de novo* and conditional 3D molecule generation, achieving leading efficiency and quality. On GEOM-Drugs, it reduces FCD by 72.6% over the previous best result, while achieving over 70% relative average improvements in geometric fidelity. Our code is released at [https://github.com/lyc0930/UAE-3D/](https://github.com/lyc0930/UAE-3D/).
Cite
Text
Luo et al. "Towards Unified and Lossless Latent Space for 3D Molecular Latent Diffusion Modeling." Advances in Neural Information Processing Systems, 2025.Markdown
[Luo et al. "Towards Unified and Lossless Latent Space for 3D Molecular Latent Diffusion Modeling." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/luo2025neurips-unified/)BibTeX
@inproceedings{luo2025neurips-unified,
title = {{Towards Unified and Lossless Latent Space for 3D Molecular Latent Diffusion Modeling}},
author = {Luo, Yanchen and Liu, Zhiyuan and Zhao, Yi and Li, Sihang and Cai, Hengxing and Kawaguchi, Kenji and Chua, Tat-Seng and Zhang, Yang and Wang, Xiang},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/luo2025neurips-unified/}
}