DeepMath-103k: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

He, Zhiwei; Liang, Tian; Xu, Jiahao; Liu, Qiuzhi; Chen, Xingyu; Wang, Yue; Song, Linfeng; Yu, Dian; Liang, Zhenwen; Wang, Wenxuan; Zhang, Zhuosheng; Wang, Rui; Tu, Zhaopeng; Mi, Haitao; Yu, Dong

DeepMath-103k: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Zhiwei He, Tian Liang, Jiahao Xu, Qiuzhi Liu, Xingyu Chen, Yue Wang, Linfeng Song, Dian Yu, Zhenwen Liang, Wenxuan Wang, Zhuosheng Zhang, Rui Wang, Zhaopeng Tu, Haitao Mi, Dong Yu

ICLR 2026

/iclr/2026/he2026iclr-deepmath103k/

Abstract

Reinforcement learning (RL) with large language models shows promise in complex reasoning. However, its progress is hindered by the lack of large-scale training data that is sufficiently challenging, contamination-free and verifiable. To solve this problem, we introduce DeepMath-103K, a large-scale mathematical dataset designed with high difficulty (primarily levels 5-9), rigorous decontamination against numerous benchmarks, and verifiable answers for rule-based RL reward. It further includes three distinct R1 solutions adaptable for diverse training paradigms such as supervised fine-tuning. Spanning a wide range of mathematical topics, DeepMath-103K fosters the development of generalizable and advancing reasoning. Notably, models trained on DeepMath-103K achieve leading results on challenging mathematical benchmarks and demonstrate generalization beyond math such as biology, physics and chemistry, underscoring its broad efficacy.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

He et al. "DeepMath-103k: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning." International Conference on Learning Representations, 2026.

Markdown

[He et al. "DeepMath-103k: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/he2026iclr-deepmath103k/)

BibTeX

@inproceedings{he2026iclr-deepmath103k,
  title     = {{DeepMath-103k: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning}},
  author    = {He, Zhiwei and Liang, Tian and Xu, Jiahao and Liu, Qiuzhi and Chen, Xingyu and Wang, Yue and Song, Linfeng and Yu, Dian and Liang, Zhenwen and Wang, Wenxuan and Zhang, Zhuosheng and Wang, Rui and Tu, Zhaopeng and Mi, Haitao and Yu, Dong},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/he2026iclr-deepmath103k/}
}