Gloaguen, Thibaud

10 publications

ICLR 2026. Fewer Weights, More Problems: A Practical Attack on LLM Pruning. Kazuki Egashira, Robin Staab, Thibaud Gloaguen, Mark Vero, Martin Vechev
ICLR 2026. LLM Fingerprinting via Semantically Conditioned Watermarks. Thibaud Gloaguen, Robin Staab, Nikola Jovanović, Martin Vechev
ICLR 2026. Watch Your Steps: Dormant Adversarial Behaviors That Activate upon LLM Finetuning. Thibaud Gloaguen, Mark Vero, Robin Staab, Martin Vechev
ICLR 2026. Watermarking Diffusion Language Models. Thibaud Gloaguen, Robin Staab, Nikola Jovanović, Martin Vechev
ICLR 2025. Black-Box Detection of Language Model Watermarks. Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev
ICML 2025. Discovering Spoofing Attempts on Language Model Watermarks. Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev
ICLRW 2025. Discovering Spoofing Attempts on Language Model Watermarks. Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev
ICLRW 2025. Towards Watermarking of Open-Source LLMs. Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev
ICMLW 2024. Black-Box Detection of Language Model Watermarks. Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev
ICMLW 2024. Black-Box Detection of Language Model Watermarks. Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev