Croce, Francesco

29 publications

ICLR 2025 Is In-Context Learning Sufficient for Instruction Following in LLMs? Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

ICLR 2025 Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

NeurIPS 2025 OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents Thomas Kuntz, Agatha Duzan, Hao Zhao, Francesco Croce, J Zico Kolter, Nicolas Flammarion, Maksym Andriushchenko

ICLR 2025 Selective Induction Heads: How Transformers Select Causal Structures in Context Francesco D'Angelo, Francesco Croce, Nicolas Flammarion

ICMLW 2024 Adversarially Robust CLIP Models Induce Better (Robust) Perceptual Metrics Francesco Croce, Christian Schlarmann, Naman Deep Singh, Matthias Hein

NeurIPSW 2024 Is In-Context Learning Sufficient for Instruction Following in LLMs? Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

NeurIPS 2024 JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramèr, Hamed Hassani, Eric Wong

ICMLW 2024 JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramèr, Hamed Hassani, Eric Wong

ICMLW 2024 Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

ICML 2024 Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

ICML 2024 Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models Christian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein

ICLRW 2024 Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models Christian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein

ICMLW 2024 Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models Christian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein

ECCV 2024 Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models Francesco Croce, Naman D. Singh, Matthias Hein

ICML 2023 A Modern Look at the Relationship Between Sharpness and Generalization Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion

ICLR 2023 Revisiting Adapters with Adversarial Training Sylvestre-Alvise Rebuffi, Francesco Croce, Sven Gowal

NeurIPS 2023 Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization Across Threat Models Naman Deep Singh, Francesco Croce, Matthias Hein

ICMLW 2023 Robust Semantic Segmentation: Strong Adversarial Attacks and Fast Training of Robust Models Francesco Croce, Naman Deep Singh, Matthias Hein

CVPR 2023 Seasoning Model Soups for Robustness to Adversarial and Natural Distribution Shifts Francesco Croce, Sylvestre-Alvise Rebuffi, Evan Shelhamer, Sven Gowal

ICML 2022 Adversarial Robustness Against Multiple and Single $l_p$-Threat Models via Quick Fine-Tuning of Robust Classifiers Francesco Croce, Matthias Hein

NeurIPS 2022 Diffusion Visual Counterfactual Explanations Maximilian Augustin, Valentyn Boreiko, Francesco Croce, Matthias Hein

ICML 2022 Evaluating the Adversarial Robustness of Adaptive Test-Time Defenses Francesco Croce, Sven Gowal, Thomas Brunner, Evan Shelhamer, Matthias Hein, Taylan Cemgil

AAAI 2022 Sparse-RS: A Versatile Framework for Query-Efficient Sparse Black-Box Adversarial Attacks Francesco Croce, Maksym Andriushchenko, Naman D. Singh, Nicolas Flammarion, Matthias Hein

ICML 2021 Mind the Box: $l_1$-APGD for Sparse Adversarial Attacks on Image Classifiers Francesco Croce, Matthias Hein

ICML 2020 Minimally Distorted Adversarial Examples with a Fast Adaptive Boundary Attack Francesco Croce, Matthias Hein

ICLR 2020 Provable Robustness Against All Adversarial $l_p$-Perturbations for $p\geq 1$ Francesco Croce, Matthias Hein

ICML 2020 Reliable Evaluation of Adversarial Robustness with an Ensemble of Diverse Parameter-Free Attacks Francesco Croce, Matthias Hein

ECCV 2020 Square Attack: A Query-Efficient Black-Box Adversarial Attack via Random Search Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion, Matthias Hein

AISTATS 2019 Provable Robustness of ReLU Networks via Maximization of Linear Regions Francesco Croce, Maksym Andriushchenko, Matthias Hein