ML Anthology
Authors
Search
About
Kortukov, Evgenii
4 publications
ICLR
2026
ASIDE: Architectural Separation of Instructions and Data in Language Models
Egor Zverev
,
Evgenii Kortukov
,
Alexander Panfilov
,
Alexandra Volkova
,
Rush Tabesh
,
Sebastian Lapuschkin
,
Wojciech Samek
,
Christoph H. Lampert
ICLR
2026
Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLMs
Alexander Panfilov
,
Evgenii Kortukov
,
Kristina Nikolić
,
Matthias Bethge
,
Sebastian Lapuschkin
,
Wojciech Samek
,
Ameya Prabhu
,
Maksym Andriushchenko
,
Jonas Geiping
ICLRW
2025
ASIDE: Architectural Separation of Instructions and Data in Language Models
Egor Zverev
,
Evgenii Kortukov
,
Alexander Panfilov
,
Soroush Tabesh
,
Sebastian Lapuschkin
,
Wojciech Samek
,
Christoph H. Lampert
NeurIPSW
2023
Exploring Practitioner Perspectives on Training Data Attribution Explanations
Elisa Nguyen
,
Evgenii Kortukov
,
Jean Song
,
Seong Joon Oh