Shumailov, Ilia

19 publications

ICLR 2025 Breach by a Thousand Leaks: Unsafe Information Leakage in 'Safe' AI Responses David Glukhov, Ziwen Han, Ilia Shumailov, Vardan Papyan, Nicolas Papernot

NeurIPS 2025 Exploring the Limits of Strong Membership Inference Attacks on Large Language Models Jamie Hayes, Ilia Shumailov, Christopher A. Choquette-Choo, Matthew Jagielski, Georgios Kaissis, Milad Nasr, Meenatchi Sundaram Muthu Selva Annamalai, Niloofar Mireshghallah, Igor Shilov, Matthieu Meeus, Yves-Alexandre de Montjoye, Katherine Lee, Franziska Boenisch, Adam Dziedzic, A. Feder Cooper

ICML 2025 Hardware and Software Platform Inference Cheng Zhang, Hanna Foerster, Robert D. Mullins, Yiren Zhao, Ilia Shumailov

ICML 2025 Interpreting the Repeated Token Phenomenon in Large Language Models Itay Yona, Ilia Shumailov, Jamie Hayes, Yossi Gandelsman

NeurIPS 2025 Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research A. Feder Cooper, Christopher A. Choquette-Choo, Miranda Bogen, Kevin Klyman, Matthew Jagielski, Katja Filippova, Ken Liu, Alexandra Chouldechova, Jamie Hayes, Yangsibo Huang, Eleni Triantafillou, Peter Kairouz, Nicole Elyse Mitchell, Niloofar Mireshghallah, Abigail Z. Jacobs, James Grimmelmann, Vitaly Shmatikov, Christopher De Sa, Ilia Shumailov, Andreas Terzis, Solon Barocas, Jennifer Wortman Vaughan, Danah Boyd, Yejin Choi, Sanmi Koyejo, Fernando Delgado, Percy Liang, Daniel E. Ho, Pamela Samuelson, Miles Brundage, David Bau, Seth Neel, Hanna Wallach, Amy B. Cyphert, Mark Lemley, Nicolas Papernot, Katherine Lee

ICLR 2025 Measuring Memorization in RLHF for Code Completion Jamie Hayes, Ilia Shumailov, William P. Porter, Aneesh Pappu

ICML 2025 Position: Machine Learning Models Have a Supply Chain Problem Sarah Meiklejohn, Hayden Blauzvern, Mihai Maruseac, Spencer Schrock, Laurent Simon, Ilia Shumailov

TMLR 2025 Privacy Awareness for Information-Sharing Assistants: A Case-Study on Form-Filling with Contextual Integrity Sahra Ghalebikesabi, Eugene Bagdasarian, Ren Yi, Itay Yona, Ilia Shumailov, Aneesh Pappu, Chongyang Shi, Laura Weidinger, Robert Stanforth, Leonard Berrada, Pushmeet Kohli, Po-Sen Huang, Borja Balle

TMLR 2024 Beyond Labeling Oracles - What Does It Mean to Steal ML Models? Avital Shafran, Ilia Shumailov, Murat A Erdogdu, Nicolas Papernot

NeurIPS 2024 Beyond Slow Signs in High-Fidelity Model Extraction Hanna Foerster, Robert Mullins, Ilia Shumailov, Jamie Hayes

NeurIPSW 2024 Buffer Overflow in Mixture of Experts Jamie Hayes, Ilia Shumailov, Itay Yona

TMLR 2024 From Differential Privacy to Bounds on Membership Inference: Less Can Be More Anvith Thudi, Ilia Shumailov, Franziska Boenisch, Nicolas Papernot

ICML 2024 Position: Fundamental Limitations of LLM Censorship Necessitate New Approaches David Glukhov, Ilia Shumailov, Yarin Gal, Nicolas Papernot, Vardan Papyan

CVPR 2023 Architectural Backdoors in Neural Networks Mikel Bober-Irizar, Ilia Shumailov, Yiren Zhao, Robert Mullins, Nicolas Papernot

ICLRW 2023 Augmentation Backdoors Joseph Rance, Yiren Zhao, Ilia Shumailov, Robert D. Mullins

NeurIPSW 2022 DARTFormer: Finding the Best Type of Attention Jason Ross Brown, Yiren Zhao, Ilia Shumailov, Robert D. Mullins

ICML 2022 Rethinking Image-Scaling Attacks: The Interplay Between Vulnerabilities in Machine Learning Systems Yue Gao, Ilia Shumailov, Kassem Fawaz

NeurIPSW 2022 Wide Attention Is the Way Forward for Transformers? Jason Ross Brown, Yiren Zhao, Ilia Shumailov, Robert D. Mullins

ICML 2021 Markpainting: Adversarial Machine Learning Meets Inpainting David Khachaturov, Ilia Shumailov, Yiren Zhao, Nicolas Papernot, Ross Anderson