Omar, Mohamed

4 publications

WACV 2024. A Multimodal Benchmark and Improved Architecture for Zero Shot Learning. Keval Doshi, Amanmeet Garg, Burak Uzkent, Xiaolong Wang, Mohamed Omar.
ICCV 2023. Audio-Enhanced Text-to-Video Retrieval Using Text-Conditioned Feature Alignment. Sarah Ibrahimi, Xiaohang Sun, Pichao Wang, Amanmeet Garg, Ashutosh Sanan, Mohamed Omar.
CVPR 2023. Dynamic Inference with Grounding Based Vision and Language Models. Burak Uzkent, Amanmeet Garg, Wentao Zhu, Keval Doshi, Jingru Yi, Xiaolong Wang, Mohamed Omar.
CVPR 2023. Selective Structured State-Spaces for Long-Form Video Understanding. Jue Wang, Wentao Zhu, Pichao Wang, Xiang Yu, Linda Liu, Mohamed Omar, Raffay Hamid.