Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science
Abstract
The Open MatSci ML Toolkit is a flexible, self-contained and scalable Python-based framework to apply deep learning models and methods on scientific data with a specific focus on materials science and the OpenCatalyst Dataset. The primary components of our toolkit include: 1.Scalable computation of experiments leveraging PyTorch Lightning across different computation capabilities (laptop, server, cluster) and hardware platforms (CPU, GPU, XPU) without sacrificing performance in the compute and modeling; 2. Support for DGL for rapid graph neural network development. By sharing this toolkit with the research community via open-source release, we aim to: 1. Ease of use for new machine learning researchers and practitioners that want get started on interacting with the OpenCatalyst dataset which currently makes up the largest computational materials science dataset; 2. Enable the scientific community to apply advanced machine learning tools to high-impact scientific challenges, such as modeling of materials behavior for climate change applications.
Cite
Text
Miret et al. "Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science." NeurIPS 2022 Workshops: AI4Mat, 2022.Markdown
[Miret et al. "Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science." NeurIPS 2022 Workshops: AI4Mat, 2022.](https://mlanthology.org/neuripsw/2022/miret2022neuripsw-open/)BibTeX
@inproceedings{miret2022neuripsw-open,
title = {{Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science}},
author = {Miret, Santiago and Lee, Kin Long Kelvin and Gonzales, Carmelo and Nassar, Marcel and Sadowski, Krzysztof},
booktitle = {NeurIPS 2022 Workshops: AI4Mat},
year = {2022},
url = {https://mlanthology.org/neuripsw/2022/miret2022neuripsw-open/}
}