Patil, Kunal

1 publication

ICLRW 2025. LLM Neurosurgeon: Targeted Knowledge Removal in LLMs Using Sparse Autoencoders. Kunal Patil, Dylan Zhou, Yifan Sun, Karthik Lakshmanan, Senthooran Rajamanoharan, Arthur Conmy