A Self-Attention Ansatz for Ab-Initio Quantum Chemistry

Abstract

We present a novel neural network architecture using self-attention, the Wavefunction Transformer (PsiFormer), which can be used as an approximation (or "Ansatz") for solving the many-electron Schrödinger equation, the fundamental equation for quantum chemistry and material science. This equation can be solved *from first principles*, requiring no external training data. In recent years, deep neural networks like the FermiNet and PauliNet have been used to significantly improve the accuracy of these first-principle calculations, but they lack an attention-like mechanism for gating interactions between electrons. Here we show that the PsiFormer can be used as a drop-in replacement for these other neural networks, often dramatically improving the accuracy of the calculations. On larger molecules especially, the ground state energy can be improved by dozens of kcal/mol, a qualitative leap over previous methods. This demonstrates that self-attention networks can learn complex quantum mechanical correlations between electrons, and are a promising route to reaching unprecedented accuracy in chemical calculations on larger systems.

Cite

Text

von Glehn et al. "A Self-Attention Ansatz for Ab-Initio Quantum Chemistry." International Conference on Learning Representations, 2023.

Markdown

[von Glehn et al. "A Self-Attention Ansatz for Ab-Initio Quantum Chemistry." International Conference on Learning Representations, 2023.](https://mlanthology.org/iclr/2023/vonglehn2023iclr-selfattention/)

BibTeX

@inproceedings{vonglehn2023iclr-selfattention,
  title     = {{A Self-Attention Ansatz for Ab-Initio Quantum Chemistry}},
  author    = {von Glehn, Ingrid and Spencer, James S and Pfau, David},
  booktitle = {International Conference on Learning Representations},
  year      = {2023},
  url       = {https://mlanthology.org/iclr/2023/vonglehn2023iclr-selfattention/}
}