Sankaranarayanan, Aruna

3 publications

ICML 2025 MIB: A Mechanistic Interpretability Benchmark Aaron Mueller, Atticus Geiger, Sarah Wiegreffe, Dana Arad, Iván Arcuschin, Adam Belfki, Yik Siu Chan, Jaden Fried Fiotto-Kaufman, Tal Haklay, Michael Hanna, Jing Huang, Rohan Gupta, Yaniv Nikankin, Hadas Orgad, Nikhil Prakash, Anja Reusch, Aruna Sankaranarayanan, Shun Shao, Alessandro Stolfo, Martin Tutek, Amir Zur, David Bau, Yonatan Belinkov
ICMLW 2024 Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models Aruna Sankaranarayanan, Dylan Hadfield-Menell, Aaron Mueller
NeurIPSW 2023 Is the Facebook Ad Algorithm a Climate Discourse Influencer? Aruna Sankaranarayanan, Erik Hemberg, Piotr Sapiezynski, Una-May O'Reilly