Kran, Esben

4 publications

ICLR 2025. DarkBench: Benchmarking Dark Patterns in Large Language Models. Esben Kran, Hieu Minh Nguyen, Akash Kundu, Sami Jawhar, Jinsuk Park, Mateusz Maria Jurewicz
AAAI 2025. Multi-Agent Security Tax: Trading Off Security and Collaboration Capabilities in Multi-Agent Systems. Pierre Peigné, Mikolaj Kniejski, Filip Sondej, Matthieu David, Jason Hoelscher-Obermaier, Christian Schröder de Witt, Esben Kran
NeurIPSW 2023. DeepDecipher: Accessing and Investigating Neuron Activation in Large Language Models. Albert Garde, Esben Kran, Fazl Barez
ICLRW 2023. N2G: A Scalable Approach for Quantifying Interpretable Neuron Representation in LLMs. Alex Foote, Neel Nanda, Esben Kran, Ioannis Konstas, Fazl Barez