Training a Scientific Reasoning Model for Chemistry

Abstract

Reasoning models are large language models that use extra "thought tokens" before answering, providing both higher accuracy and explicit reasoning for their response. A major question has been whether language model reasoning generalizes beyond mathematics, programming, and logic, where most previous work has focused. We demonstrate that reasoning models can be post-trained in scientific domains without additional domain pretraining, and require substantially less data compared to contemporary domain-specific models. We report ether0, a 24B parameter LLM (based on Mistral-Small-24B) that can reason in natural language and respond with chemical structures. This reasoning model was trained with reinforcement learning on 577,790 experimentally-grounded chemistry tasks involving synthesized organic molecules. Our model outperforms all previous general-purpose chemistry models, frontier models, and humans, and is more data efficient relative to specialized models. We anticipate that this method can be applied to train highly data-efficient language models specialized for predictive and generative tasks across a wide variety of scientific domains.

Cite

Text

Narayanan et al. "Training a Scientific Reasoning Model for Chemistry." Advances in Neural Information Processing Systems, 2025.

Markdown

[Narayanan et al. "Training a Scientific Reasoning Model for Chemistry." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/narayanan2025neurips-training/)

BibTeX

@inproceedings{narayanan2025neurips-training,
  title     = {{Training a Scientific Reasoning Model for Chemistry}},
  author    = {Narayanan, Siddharth and Braza, James D. and Griffiths, Ryan-Rhys and Bou, Albert and Wellawatte, Geemi and Ramos, Mayk Caldas and Mitchener, Ludovico and Pieler, Michael Martin and Rodriques, Samuel G and White, Andrew},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/narayanan2025neurips-training/}
}