ML Anthology
Authors
Search
About
Noubir, Soufiane
2 publications
NeurIPS
2024
Compact Proofs of Model Performance via Mechanistic Interpretability
Jason Gross
,
Rajashree Agrawal
,
Thomas Kwa
,
Euan Ong
,
Chun Hei Yip
,
Alex Gibson
,
Soufiane Noubir
,
Lawrence Chan
ICMLW
2024
Compact Proofs of Model Performance via Mechanistic Interpretability
Jason Gross
,
Rajashree Agrawal
,
Thomas Kwa
,
Euan Ong
,
Chun Hei Yip
,
Alex Gibson
,
Soufiane Noubir
,
Lawrence Chan