ML Anthology
Authors
Search
About
Krzyzanowski, Robert
2 publications
ICLRW
2025
Chain-of-Thought Reasoning in the Wild Is Not Always Faithful
Iván Arcuschin
,
Jett Janiak
,
Robert Krzyzanowski
,
Senthooran Rajamanoharan
,
Neel Nanda
,
Arthur Conmy
ICMLW
2024
Interpreting Attention Layer Outputs with Sparse Autoencoders
Connor Kissane
,
Robert Krzyzanowski
,
Joseph Isaac Bloom
,
Arthur Conmy
,
Neel Nanda