Nikolić, Kristina

5 publications

ICLR 2026 Modal Aphasia: Can Unified Multimodal Models Describe Images from Memory? Michael Aerni, Joshua Swanson, Kristina Nikolić, Florian Tramèr
ICLR 2026 Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLMs Alexander Panfilov, Evgenii Kortukov, Kristina Nikolić, Matthias Bethge, Sebastian Lapuschkin, Wojciech Samek, Ameya Prabhu, Maksym Andriushchenko, Jonas Geiping
NeurIPS 2025 RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics Jie Zhang, Cezara Petrui, Kristina Nikolić, Florian Tramèr
ICML 2025 The Jailbreak Tax: How Useful Are Your Jailbreak Outputs? Kristina Nikolić, Luze Sun, Jie Zhang, Florian Tramèr
ICLRW 2025 The Jailbreak Tax: How Useful Are Your Jailbreak Outputs? Kristina Nikolić, Luze Sun, Jie Zhang, Florian Tramèr