Dehghan, Afshin

17 publications

CVPR 2025 Cubify Anything: Scaling Indoor 3D Object Detection Justin Lazarow, David Griffiths, Gefen Kohavi, Francisco Crespo, Afshin Dehghan
ICML 2025 FlexTok: Resampling Images into 1D Token Sequences of Flexible Length Roman Bachmann, Jesse Allardice, David Mizrahi, Enrico Fini, Oğuzhan Fatih Kar, Elmira Amirloo, Alaaeldin El-Nouby, Amir Zamir, Afshin Dehghan
ICCV 2025 MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs Erik Daxberger, Nina Wenzel, David Griffiths, Haiming Gang, Justin Lazarow, Gefen Kohavi, Kai Kang, Marcin Eichner, Yinfei Yang, Afshin Dehghan, Peter Grasch
ICLR 2025 MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-Tuning Haotian Zhang, Mingfei Gao, Zhe Gan, Philipp Dufter, Nina Wenzel, Forrest Huang, Dhruti Shah, Xianzhi Du, Bowen Zhang, Yanghao Li, Sam Dodge, Keen You, Zhen Yang, Aleksei Timofeev, Mingze Xu, Hong-You Chen, Jean-Philippe Fauconnier, Zhengfeng Lai, Haoxuan You, Zirui Wang, Afshin Dehghan, Peter Grasch, Yinfei Yang
NeurIPS 2025 Rooms from Motion: Un-Posed Indoor 3D Object Detection as Localization and Mapping Justin Lazarow, Kai Kang, Afshin Dehghan
NeurIPS 2025 StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant Haibo Wang, Bo Feng, Zhengfeng Lai, Mingze Xu, Shiyu Li, Weifeng Ge, Afshin Dehghan, Meng Cao, Ping Huang
NeurIPS 2025 UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation Rui Tian, Mingfei Gao, Mingze Xu, Jiaming Hu, Jiasen Lu, Zuxuan Wu, Yinfei Yang, Afshin Dehghan
NeurIPS 2024 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities Roman Bachmann, Oğuzhan Fatih Kar, David Mizrahi, Ali Garjani, Mingfei Gao, David Griffiths, Jiaming Hu, Afshin Dehghan, Amir Zamir
NeurIPS 2023 4M: Massively Multimodal Masked Modeling David Mizrahi, Roman Bachmann, Oguzhan Kar, Teresa Yeo, Mingfei Gao, Afshin Dehghan, Amir Zamir
NeurIPS 2022 GAUDI: A Neural Architect for Immersive 3D Scene Generation Miguel Angel Bautista, Pengsheng Guo, Samira Abnar, Walter Talbott, Alexander Toshev, Zhuoyuan Chen, Laurent Dinh, Shuangfei Zhai, Hanlin Goh, Daniel Ulbricht, Afshin Dehghan, Joshua Susskind
CVPR 2015 GMMCP Tracker: Globally Optimal Generalized Maximum Multi Clique Problem for Multiple Object Tracking Afshin Dehghan, Shayan Modiri Assari, Mubarak Shah
CVPR 2015 Target Identity-Aware Network Flow for Online Multiple Target Tracking Afshin Dehghan, Yicong Tian, Philip H. S. Torr, Mubarak Shah
CVPR 2014 Improving Semantic Concept Detection Through the Dictionary of Visually-Distinct Elements Afshin Dehghan, Haroon Idrees, Mubarak Shah
CVPR 2014 Who Do I Look Like? Determining Parent-Offspring Resemblance via Gated Autoencoders Afshin Dehghan, Enrique G. Ortiz, Ruben Villegas, Mubarak Shah
CVPR 2013 Improving an Object Detector and Extracting Regions Using Superpixels Guang Shu, Afshin Dehghan, Mubarak Shah
ECCV 2012 GMCP-Tracker: Global Multi-Object Tracking Using Generalized Minimum Clique Graphs Amir Roshan Zamir, Afshin Dehghan, Mubarak Shah
CVPR 2012 Part-Based Multiple-Person Tracking with Partial Occlusion Handling Guang Shu, Afshin Dehghan, Omar Oreifej, Emily M. Hand, Mubarak Shah