Croce, Francesco

29 publications

ICLR 2025 Is In-Context Learning Sufficient for Instruction Following in LLMs? Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion
ICLR 2025 Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion
NeurIPS 2025 OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents Thomas Kuntz, Agatha Duzan, Hao Zhao, Francesco Croce, J. Zico Kolter, Nicolas Flammarion, Maksym Andriushchenko
ICLR 2025 Selective Induction Heads: How Transformers Select Causal Structures in Context Francesco D'Angelo, Francesco Croce, Nicolas Flammarion
ICMLW 2024 Adversarially Robust CLIP Models Induce Better (Robust) Perceptual Metrics Francesco Croce, Christian Schlarmann, Naman Deep Singh, Matthias Hein
NeurIPSW 2024 Is In-Context Learning Sufficient for Instruction Following in LLMs? Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion
NeurIPS 2024 JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramèr, Hamed Hassani, Eric Wong
ICMLW 2024 JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramèr, Hamed Hassani, Eric Wong
ICMLW 2024 Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion
ICML 2024 Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion
ICML 2024 Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models Christian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein
ICLRW 2024 Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models Christian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein
ICMLW 2024 Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models Christian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein
ECCV 2024 Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models Francesco Croce, Naman D. Singh, Matthias Hein
ICML 2023 A Modern Look at the Relationship Between Sharpness and Generalization Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion
ICLR 2023 Revisiting Adapters with Adversarial Training Sylvestre-Alvise Rebuffi, Francesco Croce, Sven Gowal
NeurIPS 2023 Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization Across Threat Models Naman Deep Singh, Francesco Croce, Matthias Hein
ICMLW 2023 Robust Semantic Segmentation: Strong Adversarial Attacks and Fast Training of Robust Models Francesco Croce, Naman Deep Singh, Matthias Hein
CVPR 2023 Seasoning Model Soups for Robustness to Adversarial and Natural Distribution Shifts Francesco Croce, Sylvestre-Alvise Rebuffi, Evan Shelhamer, Sven Gowal
ICML 2022 Adversarial Robustness Against Multiple and Single $l_p$-Threat Models via Quick Fine-Tuning of Robust Classifiers Francesco Croce, Matthias Hein
NeurIPS 2022 Diffusion Visual Counterfactual Explanations Maximilian Augustin, Valentyn Boreiko, Francesco Croce, Matthias Hein
ICML 2022 Evaluating the Adversarial Robustness of Adaptive Test-Time Defenses Francesco Croce, Sven Gowal, Thomas Brunner, Evan Shelhamer, Matthias Hein, Taylan Cemgil
AAAI 2022 Sparse-RS: A Versatile Framework for Query-Efficient Sparse Black-Box Adversarial Attacks Francesco Croce, Maksym Andriushchenko, Naman D. Singh, Nicolas Flammarion, Matthias Hein
ICML 2021 Mind the Box: $l_1$-APGD for Sparse Adversarial Attacks on Image Classifiers Francesco Croce, Matthias Hein
ICML 2020 Minimally Distorted Adversarial Examples with a Fast Adaptive Boundary Attack Francesco Croce, Matthias Hein
ICLR 2020 Provable Robustness Against All Adversarial $l_p$-Perturbations for $p\geq 1$ Francesco Croce, Matthias Hein
ICML 2020 Reliable Evaluation of Adversarial Robustness with an Ensemble of Diverse Parameter-Free Attacks Francesco Croce, Matthias Hein
ECCV 2020 Square Attack: A Query-Efficient Black-Box Adversarial Attack via Random Search Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion, Matthias Hein
AISTATS 2019 Provable Robustness of ReLU Networks via Maximization of Linear Regions Francesco Croce, Maksym Andriushchenko, Matthias Hein