Delobelle, Pieter

2 publications

ICML 2024 Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models Xavier Suau, Pieter Delobelle, Katherine Metcalf, Armand Joulin, Nicholas Apostoloff, Luca Zappella, Pau Rodriguez
ECML-PKDD 2022 FairDistillation: Mitigating Stereotyping in Language Models Pieter Delobelle, Bettina Berendt