Dreyer, Maximilian
7 publications
ICLR
2025
Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence
AAAI
2024
From Hope to Safety: Unlearning Biases of Deep Models via Gradient Penalization in Latent Space