Verdun, Claudio Mayrink
13 publications
ICLR
2026
Temporal Sparse Autoencoders: Leveraging the Sequential Nature of Language for Interpretability
NeurIPS
2025
HeavyWater and SimplexWater: Distortion-Free LLM Watermarks for Low-Entropy Distributions
ECCV
2024
Imaging with Confidence: Uncertainty Quantification for High-Dimensional Undersampled MR Images
NeurIPS
2024
Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models