Kar, Oğuzhan Fatih

10 publications

ICLR 2026 How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks Rahul Ramachandran, Ali Garjani, Roman Bachmann, Andrei Atanov, Oğuzhan Fatih Kar, Amir Zamir
ICLR 2026 Multimodality as Supervision: Self-Supervised Specialization to the Test Environment via Multimodality Kunal Pratap Singh, Ali Garjani, Rishubh Singh, Muhammad Uzair Khattak, Efe Tarhan, Jason Toskov, Andrei Atanov, Oğuzhan Fatih Kar, Amir Zamir
ICML 2025 FlexTok: Resampling Images into 1d Token Sequences of Flexible Length Roman Bachmann, Jesse Allardice, David Mizrahi, Enrico Fini, Oğuzhan Fatih Kar, Elmira Amirloo, Alaaeldin El-Nouby, Amir Zamir, Afshin Dehghan
NeurIPS 2024 4m-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities Roman Bachmann, Oğuzhan Fatih Kar, David Mizrahi, Ali Garjani, Mingfei Gao, David Griffiths, Jiaming Hu, Afshin Dehghan, Amir Zamir
ECCV 2024 BRAVE: Broadening the Visual Encoding of Vision-Language Models Oğuzhan Fatih Kar, Alessio Tonioni, Petra Poklukar, Achin Kulshrestha, Amir Zamir, Federico Tombari
ICLR 2024 Unraveling the Key Components of OOD Generalization via Diversification Harold Luc Benoit, Liangze Jiang, Andrei Atanov, Oguzhan Fatih Kar, Mattia Rigotti, Amir Zamir
ICCV 2023 Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback Teresa Yeo, Oğuzhan Fatih Kar, Zahra Sodagar, Amir Zamir
CVPR 2022 3D Common Corruptions and Data Augmentation Oğuzhan Fatih Kar, Teresa Yeo, Andrei Atanov, Amir Zamir
ICMLW 2022 3D Common Corruptions for Object Recognition Oguzhan Fatih Kar, Teresa Yeo, Amir Zamir
ICCV 2021 Robustness via Cross-Domain Ensembles Teresa Yeo, Oğuzhan Fatih Kar, Amir Zamir