Khan, Zaid

9 publications

ICCV 2025 DWIM: Towards Tool-Aware Visual Reasoning via Discrepancy-Aware Workflow Generation & Instruct-Masking Tuning Fucai Ke, B G Vijay Kumar, Xingjian Leng, Zhixi Cai, Zaid Khan, Weiqing Wang, Pari Delir Haghighi, Hamid Rezatofighi, Manmohan Chandraker
ICLR 2025 DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback Zaid Khan, Elias Stengel-Eskin, Jaemin Cho, Mohit Bansal
CVPR 2024 Consistency and Uncertainty: Identifying Unreliable Responses from Black-Box Vision-Language Models for Selective Visual Question Answering Zaid Khan, Yun Fu
CVPR 2024 Self-Training Large Language Models for Improved Visual Program Synthesis with Visual Reinforcement Zaid Khan, Vijay Kumar Bg, Samuel Schulter, Yun Fu, Manmohan Chandraker
ICLR 2023 Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning Zaid Khan, Yun Fu
NeurIPS 2023 Exploring Question Decomposition for Zero-Shot VQA Zaid Khan, B G Vijay Kumar, Samuel Schulter, Manmohan Chandraker, Yun Fu
CVPR 2023 Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? a: Self-Train on Unlabeled Images! Zaid Khan, Vijay Kumar Bg, Samuel Schulter, Xiang Yu, Yun Fu, Manmohan Chandraker
NeurIPSW 2023 Selective Prediction for Open-Ended Question Answering in Black-Box Vision-Language Models Zaid Khan, Yun Fu
ECCV 2022 Single-Stream Multi-Level Alignment for Vision-Language Pretraining Zaid Khan, B G Vijay Kumar, Xiang Yu, Samuel Schulter, Manmohan Chandraker, Yun Fu