Lapuschkin, Sebastian
31 publications
ICLR
2025
Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence
NeurIPS
2025
The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation
AAAI
2024
From Hope to Safety: Unlearning Biases of Deep Models via Gradient Penalization in Latent Space
ECCVW
2024
Pruning by Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers
NeurIPSW
2023
Sanity Checks Revisited: An Exploration to Repair the Model Parameter Randomisation Test
TMLR
2023
The Meta-Evaluation Problem in Explainable AI: Identifying Reliable Estimators with MetaQuantus