Noubir, Soufiane

2 publications

NeurIPS 2024 Compact Proofs of Model Performance via Mechanistic Interpretability Jason Gross, Rajashree Agrawal, Thomas Kwa, Euan Ong, Chun Hei Yip, Alex Gibson, Soufiane Noubir, Lawrence Chan
ICMLW 2024 Compact Proofs of Model Performance via Mechanistic Interpretability Jason Gross, Rajashree Agrawal, Thomas Kwa, Euan Ong, Chun Hei Yip, Alex Gibson, Soufiane Noubir, Lawrence Chan