Maliha, Maisha

2 publications

ICLR 2026 Hessian-Enhanced Token Attribution (HETA): Interpreting Autoregressive LLMs Vishal Pramanik, Maisha Maliha, Nathaniel D. Bastian, Sumit Kumar Jha
ICLR 2026 Jailbreaking the Matrix: Nullspace Steering for Controlled Model Subversion Vishal Pramanik, Maisha Maliha, Susmit Jha, Sumit Kumar Jha