GraphOmni: A Comprehensive and Extensible Benchmark Framework for Large Language Models on Graph-Theoretic Tasks

Xu, Hao; Jian, Xiangru; Zhao, Xinjian; Pang, Wei; Zhang, Chao; Wang, Suyuchen; Zhang, Qixin; Dong, Zhengyuan; Monteiro, Joao; Liu, Bang; Sun, Qiuzhuang; Yu, Tianshu

GraphOmni: A Comprehensive and Extensible Benchmark Framework for Large Language Models on Graph-Theoretic Tasks

Hao Xu, Xiangru Jian, Xinjian Zhao, Wei Pang, Chao Zhang, Suyuchen Wang, Qixin Zhang, Zhengyuan Dong, Joao Monteiro, Bang Liu, Qiuzhuang Sun, Tianshu Yu

ICLR 2026

/iclr/2026/xu2026iclr-graphomni/

Abstract

This paper introduces GraphOmni, a comprehensive benchmark designed to evaluate the reasoning capabilities of LLMs on graph-theoretic tasks articulated in natural language. GraphOmni spans diverse graph types, serialization formats, and prompting schemes, substantially extending upon prior efforts in both scope and depth. Through systematic evaluation, we uncover critical interactions among these dimensions, revealing their decisive impact on model performance. Our experiments show that state-of-the-art closed-source models such as Claude-3.5 and o4-mini consistently lead overall, yet still leave considerable headroom, while open-source models display pronounced sensitivity to various design choices. Beyond the standard scope, larger graphs, real-world graphs, and additional NP-hard tasks are further discussed. We further analyze efficiency via output token usage, highlighting cost–accuracy trade-offs, and introduce a reinforcement learning-based optimizer that adaptively selects factor combinations, reducing evaluation cost by 75\% while retaining strong accuracy. This flexible and extensible benchmark not only deepens understanding of LLM performance on structured graph reasoning but also establishes a robust foundation for advancing model design and evaluation.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Xu et al. "GraphOmni: A Comprehensive and Extensible Benchmark Framework for Large Language Models on Graph-Theoretic Tasks." International Conference on Learning Representations, 2026.

Markdown

[Xu et al. "GraphOmni: A Comprehensive and Extensible Benchmark Framework for Large Language Models on Graph-Theoretic Tasks." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/xu2026iclr-graphomni/)

BibTeX

@inproceedings{xu2026iclr-graphomni,
  title     = {{GraphOmni: A Comprehensive and Extensible Benchmark Framework for Large Language Models on Graph-Theoretic Tasks}},
  author    = {Xu, Hao and Jian, Xiangru and Zhao, Xinjian and Pang, Wei and Zhang, Chao and Wang, Suyuchen and Zhang, Qixin and Dong, Zhengyuan and Monteiro, Joao and Liu, Bang and Sun, Qiuzhuang and Yu, Tianshu},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/xu2026iclr-graphomni/}
}