A Self-Attention Ansatz for Ab-Initio Quantum Chemistry
Abstract
We present a novel neural network architecture using self-attention, the Wavefunction Transformer (PsiFormer), which can be used as an approximation (or "Ansatz") for solving the many-electron Schrödinger equation, the fundamental equation for quantum chemistry and material science. This equation can be solved *from first principles*, requiring no external training data. In recent years, deep neural networks like the FermiNet and PauliNet have been used to significantly improve the accuracy of these first-principle calculations, but they lack an attention-like mechanism for gating interactions between electrons. Here we show that the PsiFormer can be used as a drop-in replacement for these other neural networks, often dramatically improving the accuracy of the calculations. On larger molecules especially, the ground state energy can be improved by dozens of kcal/mol, a qualitative leap over previous methods. This demonstrates that self-attention networks can learn complex quantum mechanical correlations between electrons, and are a promising route to reaching unprecedented accuracy in chemical calculations on larger systems.
Cite
Text
von Glehn et al. "A Self-Attention Ansatz for Ab-Initio Quantum Chemistry." International Conference on Learning Representations, 2023.Markdown
[von Glehn et al. "A Self-Attention Ansatz for Ab-Initio Quantum Chemistry." International Conference on Learning Representations, 2023.](https://mlanthology.org/iclr/2023/vonglehn2023iclr-selfattention/)BibTeX
@inproceedings{vonglehn2023iclr-selfattention,
title = {{A Self-Attention Ansatz for Ab-Initio Quantum Chemistry}},
author = {von Glehn, Ingrid and Spencer, James S and Pfau, David},
booktitle = {International Conference on Learning Representations},
year = {2023},
url = {https://mlanthology.org/iclr/2023/vonglehn2023iclr-selfattention/}
}