Yip, Chun Hei

3 publications

NeurIPS 2024. Compact Proofs of Model Performance via Mechanistic Interpretability. Jason Gross, Rajashree Agrawal, Thomas Kwa, Euan Ong, Chun Hei Yip, Alex Gibson, Soufiane Noubir, Lawrence Chan.

ICMLW 2024. Compact Proofs of Model Performance via Mechanistic Interpretability. Jason Gross, Rajashree Agrawal, Thomas Kwa, Euan Ong, Chun Hei Yip, Alex Gibson, Soufiane Noubir, Lawrence Chan.

ICMLW 2024. ReLU MLPs Can Compute Numerical Integration: Mechanistic Interpretation of a Non-Linear Activation. Chun Hei Yip, Rajashree Agrawal, Jason Gross.