ML Anthology
Authors
Search
About
Yip, Chun Hei
3 publications
NeurIPS
2024
Compact Proofs of Model Performance via Mechanistic Interpretability
Jason Gross
,
Rajashree Agrawal
,
Thomas Kwa
,
Euan Ong
,
Chun Hei Yip
,
Alex Gibson
,
Soufiane Noubir
,
Lawrence Chan
ICMLW
2024
Compact Proofs of Model Performance via Mechanistic Interpretability
Jason Gross
,
Rajashree Agrawal
,
Thomas Kwa
,
Euan Ong
,
Chun Hei Yip
,
Alex Gibson
,
Soufiane Noubir
,
Lawrence Chan
ICMLW
2024
ReLU MLPs Can Compute Numerical Integration: Mechanistic Interpretation of a Non-Linear Activation
Chun Hei Yip
,
Rajashree Agrawal
,
Jason Gross