ML Anthology
Authors
Search
About
Lawson, Tim
4 publications
ICLR
2026
Automated Interpretability Metrics Do Not Distinguish Trained and Random Transformers
Thomas Heap
,
Tim Lawson
,
Lucy Farnik
,
Laurence Aitchison
ICML
2025
Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations
Lucy Farnik
,
Tim Lawson
,
Conor Houghton
,
Laurence Aitchison
ICLR
2025
Residual Stream Analysis with Multi-Layer SAEs
Tim Lawson
,
Lucy Farnik
,
Conor Houghton
,
Laurence Aitchison
NeurIPSW
2024
Residual Stream Analysis with Multi-Layer SAEs
Tim Lawson
,
Lucy Farnik
,
Conor Houghton
,
Laurence Aitchison