Adaptive Pseudo-Labeling for Quantum Calculations

Abstract

Machine learning models have recently shown promise in predicting molecular quantum chemical properties. However, the path to real-life adoption requires (1) learning under low-resource constraint and (2) out-of-distribution generalization to unseen, structurally diverse molecules. We observe that these two challenges originate from label scarcity issue. We hypothesize that pseudo-labeling on vast array of unlabeled molecules can serve as proxies as gold-label to greatly expand the training labeled data. The challenge in pseudo-labeling is to prevent the bad pseudo-labels from biasing the model. We develop a simple and effective strategy Pseudo-Sigma that can assign pseudo-labels, detect bad pseud-labels through evidential uncertainty, and then prevent them from biasing the model using adaptive weighting. Empirically, Pseudo-Sigma improves quantum calculations accuracy across full data, low data and out-of-distribution settings.

Cite

Text

Huang et al. "Adaptive Pseudo-Labeling for Quantum Calculations." NeurIPS 2021 Workshops: AI4Science, 2021.

Markdown

[Huang et al. "Adaptive Pseudo-Labeling for Quantum Calculations." NeurIPS 2021 Workshops: AI4Science, 2021.](https://mlanthology.org/neuripsw/2021/huang2021neuripsw-adaptive/)

BibTeX

@inproceedings{huang2021neuripsw-adaptive,
  title     = {{Adaptive Pseudo-Labeling for Quantum Calculations}},
  author    = {Huang, Kexin and Sresht, Vishnu and Rai, Brajesh and Bordyuh, Mykola},
  booktitle = {NeurIPS 2021 Workshops: AI4Science},
  year      = {2021},
  url       = {https://mlanthology.org/neuripsw/2021/huang2021neuripsw-adaptive/}
}