Lal, Vasudev

16 publications

ICML 2025 A Causal World Model Underlying Next Token Prediction: Exploring GPT in a Controlled Environment Raanan Yehezkel Rohekar, Yaniv Gurwicz, Sungduk Yu, Estelle Aflalo, Vasudev Lal
CVPRW 2025 Analyzing Hierarchical Structure in Vision Models with Sparse Autoencoders Matthew Lyle Olson, Musashi Hinck, Neale Ratzlaff, Changbai Li, Phillip Howard, Vasudev Lal, Shao-Yen Tseng
ICLRW 2025 FastRM: An Efficient and Automatic Explainability Framework for Multimodal Generative Models Gabriela Ben-Melech Stan, Estelle Aflalo, Man Luo, Shachar Rosenman, Tiep Le, Sayak Paul, Shao-Yen Tseng, Vasudev Lal
ICML 2025 SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs Xin Su, Man Luo, Kris W Pan, Tien Pei Chou, Vasudev Lal, Phillip Howard
NeurIPSW 2024 Causal World Representation in the GPT Model Raanan Yehezkel Rohekar, Yaniv Gurwicz, Sungduk Yu, Vasudev Lal
NeurIPSW 2024 Debiasing Large Vision-Language Models by Ablating Protected Attribute Representations Neale Ratzlaff, Matthew Lyle Olson, Musashi Hinck, Shao-Yen Tseng, Vasudev Lal, Phillip Howard
ECCV 2024 Getting It Right: Improving Spatial Consistency in Text-to-Image Models Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Guez Aflalo, Sayak Paul, Dhruba Ghosh, Tejas Gokhale, Ludwig Schmidt, Hanna Hajishirzi, Vasudev Lal, Chitta R Baral, Yezhou Yang
CVPRW 2024 ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models Avinash Madasu, Vasudev Lal
NeurIPSW 2024 Is Your Paper Being Reviewed by an LLM? Investigating AI Text Detectability in Peer Review Sungduk Yu, Man Luo, Avinash Madasu, Vasudev Lal, Phillip Howard
CVPR 2024 SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples Phillip Howard, Avinash Madasu, Tiep Le, Gustavo Lujan Moreno, Anahita Bhiwandiwalla, Vasudev Lal
NeurIPSW 2023 Analyzing Zero-Shot Abilities of Vision-Language Models on Video Understanding Tasks Avinash Madasu, Anahita Bhiwandiwalla, Vasudev Lal
NeurIPS 2023 Brain Encoding Models Based on Multimodal Transformers Can Transfer Across Language and Vision Jerry Tang, Meng Du, Vy Vo, Vasudev Lal, Alexander Huth
AAAI 2023 BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning Xiao Xu, Chenfei Wu, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan
NeurIPS 2023 COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs Tiep Le, Vasudev Lal, Phillip Howard
CVPRW 2023 Is Multimodal Vision Supervision Beneficial to Language? Avinash Madasu, Vasudev Lal
CVPR 2022 VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers Estelle Aflalo, Meng Du, Shao-Yen Tseng, Yongfei Liu, Chenfei Wu, Nan Duan, Vasudev Lal