Sharma, Arnab Sen

6 publications

ICLR 2026 LLMs Process Lists with General Filter Heads Arnab Sen Sharma, Giordano Rogers, Natalie Shapira, David Bau
ICLR 2026 Language Models Use Lookbacks to Track Beliefs Nikhil Prakash, Natalie Shapira, Arnab Sen Sharma, Christoph Riedl, Yonatan Belinkov, Tamar Rott Shaham, David Bau, Atticus Geiger
ICLR 2025 NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals Jaden Fried Fiotto-Kaufman, Alexander Russell Loftus, Eric Todd, Jannik Brinkmann, Koyena Pal, Dmitrii Troitskii, Michael Ripa, Adam Belfki, Can Rager, Caden Juang, Aaron Mueller, Samuel Marks, Arnab Sen Sharma, Francesca Lucchetti, Nikhil Prakash, Carla E. Brodley, Arjun Guha, Jonathan Bell, Byron C Wallace, David Bau
ICLR 2024 Function Vectors in Large Language Models Eric Todd, Millicent Li, Arnab Sen Sharma, Aaron Mueller, Byron C Wallace, David Bau
ICLR 2024 Linearity of Relation Decoding in Transformer Language Models Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau
ICLR 2023 Mass-Editing Memory in a Transformer Kevin Meng, Arnab Sen Sharma, Alex J Andonian, Yonatan Belinkov, David Bau