Nikolić, Kristina

3 publications

NeurIPS 2025 RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics Jie Zhang, Cezara Petrui, Kristina Nikolić, Florian Tramèr
ICML 2025 The Jailbreak Tax: How Useful Are Your Jailbreak Outputs? Kristina Nikolić, Luze Sun, Jie Zhang, Florian Tramèr
ICLRW 2025 The Jailbreak Tax: How Useful Are Your Jailbreak Outputs? Kristina Nikolić, Luze Sun, Jie Zhang, Florian Tramèr