Petrui, Cezara

1 publications

NeurIPS 2025 RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics Jie Zhang, Cezara Petrui, Kristina Nikolić, Florian Tramèr