ML Anthology
Authors
Search
About
Kissane, Connor
1 publications
ICMLW
2024
Interpreting Attention Layer Outputs with Sparse Autoencoders
Connor Kissane
,
Robert Krzyzanowski
,
Joseph Isaac Bloom
,
Arthur Conmy
,
Neel Nanda