Pawar, Urja

2 publications

NeurIPS 2025 Detecting High-Stakes Interactions with Activation Probes Alex McKenzie, Urja Pawar, Phil Blandfort, William Bankes, David Krueger, Ekdeep Singh Lubana, Dmitrii Krasheninnikov
CLeaR 2024 On the Impact of Neighbourhood Sampling to Satisfy Sufficiency and Necessity Criteria in Explainable AI Urja Pawar, Christian Beder, Ruairi O\textsc\char13Reilly, Donna O\textsc\char13Shea