ML Anthology
Authors
Search
About
Kran, Esben
4 publications
ICLR
2025
DarkBench: Benchmarking Dark Patterns in Large Language Models
Esben Kran
,
Hieu Minh Nguyen
,
Akash Kundu
,
Sami Jawhar
,
Jinsuk Park
,
Mateusz Maria Jurewicz
AAAI
2025
Multi-Agent Security Tax: Trading Off Security and Collaboration Capabilities in Multi-Agent Systems
Pierre Peigné
,
Mikolaj Kniejski
,
Filip Sondej
,
Matthieu David
,
Jason Hoelscher-Obermaier
,
Christian Schröder de Witt
,
Esben Kran
NeurIPSW
2023
DeepDecipher: Accessing and Investigating Neuron Activation in Large Language Models
Albert Garde
,
Esben Kran
,
Fazl Barez
ICLRW
2023
N2G: A Scalable Approach for Quantifying Interpretable Neuron Representation in LLMs
Alex Foote
,
Neel Nanda
,
Esben Kran
,
Ioannis Konstas
,
Fazl Barez