Dumas, Clément

4 publications

ICLR 2026 Narrow Finetuning Leaves Clearly Readable Traces in Activation Differences Julian Minder, Clément Dumas, Stewart Slocum, Helena Casademunt, Cameron Holmes, Robert West, Neel Nanda
NeurIPS 2025 Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning Julian Minder, Clément Dumas, Caden Juang, Bilal Chughtai, Neel Nanda
ICLRW 2025 Robustly Identifying Concepts Introduced During Chat Fine-Tuning Using Crosscoders Julian Minder, Clément Dumas, Bilal Chughtai, Neel Nanda
ICMLW 2024 How Do Llamas Process Multilingual Text? a Latent Exploration Through Activation Patching Clément Dumas, Veniamin Veselovsky, Giovanni Monea, Robert West, Chris Wendler