ML Anthology
Authors
Search
About
Zverev, Egor
3 publications
ICLRW
2025
ASIDE: Architectural Separation of Instructions and Data in Language Models
Egor Zverev
,
Evgenii Kortukov
,
Alexander Panfilov
,
Soroush Tabesh
,
Sebastian Lapuschkin
,
Wojciech Samek
,
Christoph H. Lampert
ICLR
2025
Can LLMs Separate Instructions from Data? and What Do We Even Mean by That?
Egor Zverev
,
Sahar Abdelnabi
,
Soroush Tabesh
,
Mario Fritz
,
Christoph H. Lampert
ICLRW
2024
Can LLMs Separate Instructions from Data? and What Do We Even Mean by That?
Egor Zverev
,
Sahar Abdelnabi
,
Mario Fritz
,
Christoph H. Lampert