Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science

Abstract

The Open MatSci ML Toolkit is a flexible, self-contained and scalable Python-based framework to apply deep learning models and methods on scientific data with a specific focus on materials science and the OpenCatalyst Dataset. The primary components of our toolkit include: 1.Scalable computation of experiments leveraging PyTorch Lightning across different computation capabilities (laptop, server, cluster) and hardware platforms (CPU, GPU, XPU) without sacrificing performance in the compute and modeling; 2. Support for DGL for rapid graph neural network development. By sharing this toolkit with the research community via open-source release, we aim to: 1. Ease of use for new machine learning researchers and practitioners that want get started on interacting with the OpenCatalyst dataset which currently makes up the largest computational materials science dataset; 2. Enable the scientific community to apply advanced machine learning tools to high-impact scientific challenges, such as modeling of materials behavior for climate change applications.

Cite

Text

Miret et al. "Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science." NeurIPS 2022 Workshops: AI4Mat, 2022.

Markdown

[Miret et al. "Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science." NeurIPS 2022 Workshops: AI4Mat, 2022.](https://mlanthology.org/neuripsw/2022/miret2022neuripsw-open/)

BibTeX

@inproceedings{miret2022neuripsw-open,
  title     = {{Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science}},
  author    = {Miret, Santiago and Lee, Kin Long Kelvin and Gonzales, Carmelo and Nassar, Marcel and Sadowski, Krzysztof},
  booktitle = {NeurIPS 2022 Workshops: AI4Mat},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/miret2022neuripsw-open/}
}