Garjani, Ali

4 publications

ICLR 2026 How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks Rahul Ramachandran, Ali Garjani, Roman Bachmann, Andrei Atanov, Oğuzhan Fatih Kar, Amir Zamir
ICLR 2026 Multimodality as Supervision: Self-Supervised Specialization to the Test Environment via Multimodality Kunal Pratap Singh, Ali Garjani, Rishubh Singh, Muhammad Uzair Khattak, Efe Tarhan, Jason Toskov, Andrei Atanov, Oğuzhan Fatih Kar, Amir Zamir
NeurIPS 2024 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities Roman Bachmann, Oğuzhan Fatih Kar, David Mizrahi, Ali Garjani, Mingfei Gao, David Griffiths, Jiaming Hu, Afshin Dehghan, Amir Zamir
WACV 2023 Neural Distributed Image Compression with Cross-Attention Feature Alignment Nitish Mital, Ezgi Özyilkan, Ali Garjani, Deniz Gündüz