Zverev, Egor

3 publications

ICLRW 2025 ASIDE: Architectural Separation of Instructions and Data in Language Models Egor Zverev, Evgenii Kortukov, Alexander Panfilov, Soroush Tabesh, Sebastian Lapuschkin, Wojciech Samek, Christoph H. Lampert
ICLR 2025 Can LLMs Separate Instructions from Data? and What Do We Even Mean by That? Egor Zverev, Sahar Abdelnabi, Soroush Tabesh, Mario Fritz, Christoph H. Lampert
ICLRW 2024 Can LLMs Separate Instructions from Data? and What Do We Even Mean by That? Egor Zverev, Sahar Abdelnabi, Mario Fritz, Christoph H. Lampert