Dreyer, Maximilian
8 publications
ICLR
2025
Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence
AAAI
2024
From Hope to Safety: Unlearning Biases of Deep Models via Gradient Penalization in Latent Space
8 publications