MatKG-2: Unveiling Precise Material Science Ontology Through Autonomous Committees of LLMs
Abstract
This paper introduces MatKG-2, a Material Science knowledge graph autonomously generated through a Large Language Model (LLM) driven pipeline. Building on the groundwork of MatKG, MatKG-2 employs a novel 'committee of large language models' approach to extract and classify knowledge triples with an established ontology. Unlike the previous version, which relied on statistical co-occurrence, MatKG-2 offers more nuanced, ontology-based relationships. Using open LLMs such as Llama2 7b and Bloom 1b/7b, the study offers reproducibility and broad community engagement. By using 4-bit and 8-bit quantized versions for fine-tuning and inference, MatKG-2 is also more computationally tractable and therefore compatible with most commercially available GPUs. Our work highlights the potential of MatKG-2 in supporting Material Science data infrastructure and in contributing to the semantic web.
Cite
Text
Venugopal and Olivetti. "MatKG-2: Unveiling Precise Material Science Ontology Through Autonomous Committees of LLMs." NeurIPS 2023 Workshops: AI4Mat, 2023.Markdown
[Venugopal and Olivetti. "MatKG-2: Unveiling Precise Material Science Ontology Through Autonomous Committees of LLMs." NeurIPS 2023 Workshops: AI4Mat, 2023.](https://mlanthology.org/neuripsw/2023/venugopal2023neuripsw-matkg2/)BibTeX
@inproceedings{venugopal2023neuripsw-matkg2,
title = {{MatKG-2: Unveiling Precise Material Science Ontology Through Autonomous Committees of LLMs}},
author = {Venugopal, Vineeth and Olivetti, Elsa},
booktitle = {NeurIPS 2023 Workshops: AI4Mat},
year = {2023},
url = {https://mlanthology.org/neuripsw/2023/venugopal2023neuripsw-matkg2/}
}