Joseph, Sonia

5 publications

CVPRW 2025 Decoding Vision Transformers: The Diffusion Steering Lens Ryota Takatsuki, Sonia Joseph, Ippei Fujisawa, Ryota Kanai
NeurIPS 2025 From Noise to Narrative: Tracing the Origins of Hallucinations in Transformers Praneet Suresh, Jack Stanley, Sonia Joseph, Luca Scimeca, Danilo Bzdok
NeurIPS 2025 Too Late to Recall: Explaining the Two-Hop Problem in Multimodal Knowledge Retrieval Constantin Venhoff, Ashkan Khakzar, Sonia Joseph, Philip Torr, Neel Nanda
ICMLW 2024 Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent Karolis Jucys, George Adamopoulos, Mehrab Hamidi, Stephanie Milani, Mohammad Reza Samsami, Artem Zholus, Sonia Joseph, Blake Aaron Richards, Irina Rish, Özgür Şimşek
NeurIPSW 2023 On the Information Geometry of Vision Transformers Sonia Joseph, Kumar Krishna Agrawal, Arna Ghosh, Blake Aaron Richards