Misra, Ishan

49 publications

NeurIPS 2025 CAT: Content-Adaptive Image Tokenization Junhong Shen, Kushal Tirumala, Michihiro Yasunaga, Ishan Misra, Luke Zettlemoyer, Lili Yu, Chunting Zhou
ICCV 2025 Generating Multi-Image Synthetic Data for Text-to-Image Customization Nupur Kumari, Xi Yin, Jun-Yan Zhu, Ishan Misra, Samaneh Azadi
ICML 2025 LLMs Can See and Hear Without Any Training Kumar Ashutosh, Yossi Gandelsman, Xinlei Chen, Ishan Misra, Rohit Girdhar
TMLR 2025 SelfEval: Leveraging Discriminative Nature of Generative Models for Evaluation Sai Saketh Rambhatla, Ishan Misra
TMLR 2024 DINOv2: Learning Robust Visual Features Without Supervision Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V. Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mido Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Herve Jegou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski
ECCV 2024 Factorizing Text-to-Video Generation by Explicit Image Conditioning Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Mian Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra
CVPR 2024 FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis Feng Liang, Bichen Wu, Jialiang Wang, Licheng Yu, Kunpeng Li, Yinan Zhao, Ishan Misra, Jia-Bin Huang, Peizhao Zhang, Peter Vajda, Diana Marculescu
CVPR 2024 Generating Illustrated Instructions Sachit Menon, Ishan Misra, Rohit Girdhar
CVPR 2024 InstanceDiffusion: Instance-Level Control for Image Generation Xudong Wang, Trevor Darrell, Sai Saketh Rambhatla, Rohit Girdhar, Ishan Misra
CVPR 2024 VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation Xudong Wang, Ishan Misra, Ziyun Zeng, Rohit Girdhar, Trevor Darrell
CVPR 2023 Cut and Learn for Unsupervised Object Detection and Instance Segmentation Xudong Wang, Rohit Girdhar, Stella X. Yu, Ishan Misra
CVPR 2023 GeneCIS: A Benchmark for General Conditional Image Similarity Sagar Vaze, Nicolas Carion, Ishan Misra
CVPR 2023 ImageBind: One Embedding Space to Bind Them All Rohit Girdhar, Alaaeldin El-Nouby, Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra
CVPR 2023 Learning Video Representations from Large Language Models Yue Zhao, Ishan Misra, Philipp Krähenbühl, Rohit Girdhar
ICCV 2023 MOST: Multiple Object Localization with Self-Supervised Transformers for Object Discovery Sai Saketh Rambhatla, Ishan Misra, Rama Chellappa, Abhinav Shrivastava
ICML 2023 MonoNeRF: Learning Generalizable NeRFs from Monocular Videos Without Camera Poses Yang Fu, Ishan Misra, Xiaolong Wang
CVPR 2023 OmniMAE: Single Model Masked Pretraining on Images and Videos Rohit Girdhar, Alaaeldin El-Nouby, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra
ICLR 2023 RoPAWS: Robust Semi-Supervised Representation Learning from Uncurated Data Sangwoo Mo, Jong-Chyi Su, Chih-Yao Ma, Mido Assran, Ishan Misra, Licheng Yu, Sean Bell
CVPR 2023 Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture Mahmoud Assran, Quentin Duval, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael Rabbat, Yann LeCun, Nicolas Ballas
ICCV 2023 The Effectiveness of MAE Pre-Pretraining for Billion-Scale Pretraining Mannat Singh, Quentin Duval, Kalyan Vasudev Alwala, Haoqi Fan, Vaibhav Aggarwal, Aaron Adcock, Armand Joulin, Piotr Dollar, Christoph Feichtenhofer, Ross Girshick, Rohit Girdhar, Ishan Misra
ICLR 2023 The Hidden Uniform Cluster Prior in Self-Supervised Learning Mido Assran, Randall Balestriero, Quentin Duval, Florian Bordes, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael Rabbat, Nicolas Ballas
ICCVW 2023 Vision-Language Models Performing Zero-Shot Tasks Exhibit Disparities Between Gender Groups Melissa Hall, Laura Gustafson, Aaron Adcock, Ishan Misra, Candace Ross
NeurIPS 2022 A Data-Augmentation Is Worth a Thousand Samples: Analytical Moments and Sampling-Free Training Randall Balestriero, Ishan Misra, Yann LeCun
ECCV 2022 Detecting Twenty-Thousand Classes Using Image-Level Supervision Xingyi Zhou, Rohit Girdhar, Armand Joulin, Philipp Krähenbühl, Ishan Misra
ICLR 2022 Frame Averaging for Invariant and Equivariant Network Design Omri Puny, Matan Atzmon, Edward J. Smith, Ishan Misra, Aditya Grover, Heli Ben-Hamu, Yaron Lipman
ECCV 2022 Masked Siamese Networks for Label-Efficient Learning Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Florian Bordes, Pascal Vincent, Armand Joulin, Michael Rabbat, Nicolas Ballas
CVPR 2022 Masked-Attention Mask Transformer for Universal Image Segmentation Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar
CVPR 2022 Omnivore: A Single Model for Many Visual Modalities Rohit Girdhar, Mannat Singh, Nikhila Ravi, Laurens van der Maaten, Armand Joulin, Ishan Misra
CVPR 2021 3D Spatial Recognition Without Spatially Labeled 3D Zhongzheng Ren, Ishan Misra, Alexander G. Schwing, Rohit Girdhar
ICCV 2021 An End-to-End Transformer Model for 3D Object Detection Ishan Misra, Rohit Girdhar, Armand Joulin
CVPR 2021 Audio-Visual Instance Discrimination with Cross-Modal Agreement Pedro Morgado, Nuno Vasconcelos, Ishan Misra
ICML 2021 Barlow Twins: Self-Supervised Learning via Redundancy Reduction Jure Zbontar, Li Jing, Ishan Misra, Yann LeCun, Stephane Deny
ICCV 2021 Emerging Properties in Self-Supervised Vision Transformers Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, Armand Joulin
NeurIPS 2021 Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers Mandela Patrick, Dylan Campbell, Yuki Asano, Ishan Misra, Florian Metze, Christoph Feichtenhofer, Andrea Vedaldi, João F. Henriques
ICCV 2021 MDETR - Modulated Detection for End-to-End Multi-Modal Understanding Aishwarya Kamath, Mannat Singh, Yann LeCun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion
CVPR 2021 Robust Audio-Visual Instance Discrimination Pedro Morgado, Ishan Misra, Nuno Vasconcelos
ICCV 2021 Self-Supervised Pretraining of 3D Features on Any Point-Cloud Zaiwei Zhang, Rohit Girdhar, Armand Joulin, Ishan Misra
ICCV 2021 Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Armand Joulin, Nicolas Ballas, Michael Rabbat
ICCV 2021 Space-Time Crop & Attend: Improving Cross-Modal Video Representation Learning Mandela Patrick, Po-Yao Huang, Ishan Misra, Florian Metze, Andrea Vedaldi, Yuki M. Asano, João F. Henriques
NeurIPS 2020 Unsupervised Learning of Visual Features by Contrasting Cluster Assignments Mathilde Caron, Ishan Misra, Julien Mairal, Priya Goyal, Piotr Bojanowski, Armand Joulin
CVPRW 2019 Does Object Recognition Work for Everyone? Terrance DeVries, Ishan Misra, Changhan Wang, Laurens van der Maaten
ICCVW 2019 Evaluating Text-to-Image Matching Using Binary Image Selection (BISON) Hexiang Hu, Ishan Misra, Laurens van der Maaten
ICCV 2017 Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection Debidatta Dwibedi, Ishan Misra, Martial Hebert
CVPR 2017 From Red Wine to Red Tomato: Composition with Context Ishan Misra, Abhinav Gupta, Martial Hebert
CVPR 2016 Cross-Stitch Networks for Multi-Task Learning Ishan Misra, Abhinav Shrivastava, Abhinav Gupta, Martial Hebert
CVPR 2016 Seeing Through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels Ishan Misra, C. Lawrence Zitnick, Margaret Mitchell, Ross Girshick
ECCV 2016 Shuffle and Learn: Unsupervised Learning Using Temporal Order Verification Ishan Misra, C. Lawrence Zitnick, Martial Hebert
CVPR 2015 Watch and Learn: Semi-Supervised Learning for Object Detectors from Video Ishan Misra, Abhinav Shrivastava, Martial Hebert
WACV 2014 Data-Driven Exemplar Model Selection Ishan Misra, Abhinav Shrivastava, Martial Hebert