Chauhan, Sonakshi

2 publications

JMLR 2025. Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability. Atticus Geiger, Duligur Ibeling, Amir Zur, Maheep Chaudhary, Sonakshi Chauhan, Jing Huang, Aryaman Arora, Zhengxuan Wu, Noah Goodman, Christopher Potts, Thomas Icard
NeurIPSW 2024. GPT-2 Small Fine-Tuned on Logical Reasoning Summarizes Information on Punctuation Tokens. Sonakshi Chauhan, Atticus Geiger