Parikh, Devi

100 publications

CVPR 2024 Emu Edit: Precise Image Editing via Recognition and Generation Tasks Shelly Sheynin, Adam Polyak, Uriel Singer, Yuval Kirstain, Amit Zohar, Oron Ashual, Devi Parikh, Yaniv Taigman
ECCV 2024 Factorizing Text-to-Video Generation by Explicit Image Conditioning Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Mian Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra
ECCV 2024 Video Editing via Factorized Diffusion Distillation Uriel Singer, Amit Zohar, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Devi Parikh, Yaniv Taigman
ICLR 2023 AudioGen: Textually Guided Audio Generation Felix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi
ICLR 2023 Make-a-Video: Text-to-Video Generation Without Text-Video Data Uriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv Taigman
ICCV 2023 Make-an-Animation: Large-Scale Text-Conditional 3D Human Motion Generation Samaneh Azadi, Akbar Shah, Thomas Hayes, Devi Parikh, Sonal Gupta
CVPR 2023 SpaText: Spatio-Textual Representation for Controllable Image Generation Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin
ICML 2023 Text-to-4D Dynamic Scene Generation Uriel Singer, Shelly Sheynin, Adam Polyak, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv Taigman
CVPR 2022 Episodic Memory Question Answering Samyak Datta, Sameer Dharur, Vincent Cartillier, Ruta Desai, Mukul Khanna, Dhruv Batra, Devi Parikh
ECCV 2022 Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh
ECCV 2022 MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh
ECCV 2022 Make-a-Scene: Scene-Based Text-to-Image Generation with Human Priors Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman
ICCV 2021 Contrast and Classify: Training Robust VQA Models Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal
ICLR 2021 Creative Sketch Generation Songwei Ge, Vedanuj Goswami, Larry Zitnick, Devi Parikh
NeurIPS 2021 Human-Adversarial Visual Question Answering Sasha Sheng, Amanpreet Singh, Vedanuj Goswami, Jose Magana, Tristan Thrush, Wojciech Galuba, Devi Parikh, Douwe Kiela
CVPR 2021 KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach
CVPR 2021 Vx2Text: End-to-End Learning of Video-Based Text Generation from Multimodal Inputs Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani
ICLR 2020 DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra
NeurIPS 2020 Dialog Without Dialog Data: Learning Visual Dialog Agents from VQA Data Michael Cogswell, Jiasen Lu, Rishabh Jain, Stefan Lee, Devi Parikh, Dhruv Batra
IJCAI 2020 Embodied Multimodal Multitask Learning Devendra Singh Chaplot, Lisa Lee, Ruslan Salakhutdinov, Devi Parikh, Dhruv Batra
ICLR 2020 Emergence of Compositional Language with Deep Generational Transmission Michael Cogswell, Jiasen Lu, Stefan Lee, Devi Parikh, Dhruv Batra
ICMLW 2020 Extended Abstract: Improving Vision-and-Language Navigation with Image-Text Pairs from the Web Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra
IJCAI 2020 IR-VIC: Unsupervised Discovery of Sub-Goals for Transfer in RL Nirbhay Modhe, Prithvijit Chattopadhyay, Mohit Sharma, Abhishek Das, Devi Parikh, Dhruv Batra, Ramakrishna Vedantam
ECCV 2020 Improving Vision-and-Language Navigation with Image-Text Pairs from the Web Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra
CoRL 2020 Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents Samyak Datta, Oleksandr Maksymets, Judy Hoffman, Stefan Lee, Dhruv Batra, Devi Parikh
ECCV 2020 Large-Scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das
ECCV 2020 Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh
CoRL 2020 Sim-to-Real Transfer for Vision-and-Language Navigation Peter Anderson, Ayush Shrivastava, Joanne Truong, Arjun Majumdar, Devi Parikh, Dhruv Batra, Stefan Lee
ECCV 2020 Spatially Aware Multimodal Transformers for TextVQA Yash Kant, Dhruv Batra, Peter Anderson, Alexander Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal
NeurIPS 2019 Chasing Ghosts: Instruction Following as Bayesian State Tracking Peter Anderson, Ayush Shrivastava, Devi Parikh, Dhruv Batra, Stefan Lee
ICML 2019 Counterfactual Visual Explanations Yash Goyal, Ziyan Wu, Jan Ernst, Dhruv Batra, Devi Parikh, Stefan Lee
NeurIPS 2019 Cross-Channel Communication Networks Jianwei Yang, Zhile Ren, Chuang Gan, Hongyuan Zhu, Devi Parikh
ICLR 2019 Modeling the Long Term Future in Model-Based Reinforcement Learning Nan Rosemary Ke, Amanpreet Singh, Ahmed Touati, Anirudh Goyal, Yoshua Bengio, Devi Parikh, Dhruv Batra
ICML 2019 Probabilistic Neural Symbolic Models for Interpretable Visual Question Answering Ramakrishna Vedantam, Karan Desai, Stefan Lee, Marcus Rohrbach, Dhruv Batra, Devi Parikh
NeurIPS 2019 RUBi: Reducing Unimodal Biases for Visual Question Answering Remi Cadene, Corentin Dancette, Hedi Ben Younes, Matthieu Cord, Devi Parikh
ICML 2019 TarMAC: Targeted Multi-Agent Communication Abhishek Das, Théophile Gervet, Joshua Romoff, Dhruv Batra, Devi Parikh, Mike Rabbat, Joelle Pineau
NeurIPS 2019 ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee
ECCV 2018 Choose Your Neuron: Incorporating Domain Knowledge Through Neuron-Importance Ramprasaath R. Selvaraju, Prithvijit Chattopadhyay, Mohamed Elhoseiny, Tilak Sharma, Dhruv Batra, Devi Parikh, Stefan Lee
CVPRW 2018 Embodied Question Answering Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra
ECCV 2018 Graph R-CNN for Scene Graph Generation Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh
CoRL 2018 Neural Modular Control for Embodied Question Answering Abhishek Das, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra
ECCV 2018 Visual Coreference Resolution in Visual Dialog Using Neural Module Networks Satwik Kottur, Jose M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach
CoRL 2018 Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh
NeurIPS 2017 Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model Jiasen Lu, Anitha Kannan, Jianwei Yang, Devi Parikh, Dhruv Batra
CVPR 2017 Context-Aware Captions from Context-Agnostic Supervision Ramakrishna Vedantam, Samy Bengio, Kevin Murphy, Devi Parikh, Gal Chechik
CVPR 2017 Counting Everyday Objects in Everyday Scenes Prithvijit Chattopadhyay, Ramakrishna Vedantam, Ramprasaath R. Selvaraju, Dhruv Batra, Devi Parikh
ICCV 2017 Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, Dhruv Batra
CVPR 2017 Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher
ICLR 2017 LR-GAN: Layered Recursive Generative Adversarial Networks for Image Generation Jianwei Yang, Anitha Kannan, Dhruv Batra, Devi Parikh
CVPR 2017 Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering Yash Goyal, Tejas Khot, Douglas Summers-Stay, Dhruv Batra, Devi Parikh
CVPR 2017 Visual Dialog Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, Jose M. F. Moura, Devi Parikh, Dhruv Batra
ECCV 2016 Deep Learning the City: Quantifying Urban Perception at a Global Scale Abhimanyu Dubey, Nikhil Naik, Devi Parikh, Ramesh Raskar, César A. Hidalgo
NeurIPS 2016 Hierarchical Question-Image Co-Attention for Visual Question Answering Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh
CVPR 2016 Joint Unsupervised Learning of Deep Representations and Image Clusters Jianwei Yang, Devi Parikh, Dhruv Batra
ECCV 2016 Leveraging Visual Question Answering for Image-Caption Ranking Xiao Lin, Devi Parikh
CVPR 2016 Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes Satwik Kottur, Ramakrishna Vedantam, Jose M. F. Moura, Devi Parikh
CVPR 2016 We Are Humor Beings: Understanding and Predicting Visual Humor Arjun Chandrasekaran, Ashwin K. Vijayakumar, Stanislaw Antol, Mohit Bansal, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh
CVPR 2016 Yin and Yang: Balancing and Answering Binary Visual Questions Peng Zhang, Yash Goyal, Douglas Summers-Stay, Dhruv Batra, Devi Parikh
CVPR 2015 CIDEr: Consensus-Based Image Description Evaluation Ramakrishna Vedantam, C. Lawrence Zitnick, Devi Parikh
CVPR 2015 Don't Just Listen, Use Your Imagination: Leveraging Visual Common Sense for Non-Visual Tasks Xiao Lin, Devi Parikh
CVPR 2015 Image Specificity Mainak Jas, Devi Parikh
ICCV 2015 Learning Common Sense Through Visual Abstraction Ramakrishna Vedantam, Xiao Lin, Tanmay Batra, C. Lawrence Zitnick, Devi Parikh
CVPR 2015 Understanding Image Virality Arturo Deza, Devi Parikh
ICCV 2015 VQA: Visual Question Answering Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh
ECCV 2014 Interactively Guiding Semi-Supervised Clustering via Attribute-Based Explanations Shrenik Lad, Devi Parikh
CVPR 2014 Predicting Failures of Vision Systems Peng Zhang, Jiuling Wang, Ali Farhadi, Martial Hebert, Devi Parikh
CVPR 2014 Predicting User Annoyance Using Visual Attributes Gordon Christie, Amar Parkash, Ujwal Krothapalli, Devi Parikh
ECCV 2014 Towards Transparent Systems: Semantic Characterization of Failure Modes Aayush Bansal, Ali Farhadi, Devi Parikh
ECCV 2014 Zero-Shot Learning via Visual Abstraction Stanislaw Antol, C. Lawrence Zitnick, Devi Parikh
CVPR 2013 Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs Roozbeh Mottaghi, Sanja Fidler, Jian Yao, Raquel Urtasun, Devi Parikh
ICCV 2013 Attribute Dominance: What Pops Out? Naman Turakhia, Devi Parikh
CVPR 2013 Bringing Semantics into Focus Using Visual Abstraction C. L. Zitnick, Devi Parikh
ICCV 2013 Implied Feedback: Learning Nuances of User Behavior in Image Search Devi Parikh, Kristen Grauman
ICCV 2013 Learning the Visual Interpretation of Sentences C. L. Zitnick, Devi Parikh, Lucy Vanderwende
CVPR 2013 Multi-Attribute Queries: To Merge or Not to Merge? Mohammad Rastegari, Ali Diba, Devi Parikh, Ali Farhadi
CVPR 2013 Simultaneous Active Learning of Classifiers & Attributes via Relative Feedback Arijit Biswas, Devi Parikh
ICCV 2013 Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing Amir Sadovnik, Andrew Gallagher, Devi Parikh, Tsuhan Chen
ICCVW 2013 Which Edges Matter? Aayush Bansal, Adarsh Kowdle, Devi Parikh, Andrew C. Gallagher, Larry Zitnick
ECCV 2012 Attributes for Classifier Feedback Amar Parkash, Devi Parikh
CVPR 2012 Automatic Discovery of Groups of Objects for Scene Understanding Congcong Li, Devi Parikh, Tsuhan Chen
CVPR 2012 Discovering Localized Attributes for Fine-Grained Recognition Kun Duan, Devi Parikh, David J. Crandall, Kristen Grauman
AAAI 2012 Relative Attributes for Enhanced Human-Machine Communication Devi Parikh, Adriana Kovashka, Amar Parkash, Kristen Grauman
CVPR 2012 The Role of Image Understanding in Contour Detection C. Lawrence Zitnick, Devi Parikh
CVPR 2012 WhittleSearch: Image Search with Relative Attribute Feedback Adriana Kovashka, Devi Parikh, Kristen Grauman
ICCV 2011 Extracting Adaptive Contextual Cues from Unlabeled Regions Congcong Li, Devi Parikh, Tsuhan Chen
CVPR 2011 Finding the Weakest Link in Person Detectors Devi Parikh, C. Lawrence Zitnick
CVPR 2011 Inference for Order Reduction in Markov Random Fields Andrew C. Gallagher, Dhruv Batra, Devi Parikh
CVPR 2011 Interactively Building a Discriminative Vocabulary of Nameable Attributes Devi Parikh, Kristen Grauman
ICCV 2011 Recognizing Jumbled Images: The Role of Local and Global Information in Image Classification Devi Parikh
ICCV 2011 Relative Attributes Devi Parikh, Kristen Grauman
NeurIPS 2011 Understanding the Intrinsic Memorability of Images Phillip Isola, Devi Parikh, Antonio Torralba, Aude Oliva
CVPR 2010 Beyond Trees: MRF Inference via Outer-Planar Decomposition Dhruv Batra, Andrew C. Gallagher, Devi Parikh, Tsuhan Chen
CVPR 2010 The Role of Features, Algorithms and Data in Visual Recognition Devi Parikh, C. Lawrence Zitnick
CVPR 2010 iCoseg: Interactive Co-Segmentation with Intelligent Scribble Guidance Dhruv Batra, Adarsh Kowdle, Devi Parikh, Jiebo Luo, Tsuhan Chen
CVPRW 2009 Cutout-Search: Putting a Name to the Picture Dhruv Batra, Adarsh Kowdle, Devi Parikh, Tsuhan Chen
CVPR 2009 Unsupervised Learning of Hierarchical Spatial Structures in Images Devi Parikh, C. Lawrence Zitnick, Tsuhan Chen
ECCV 2008 Determining Patch Saliency Using Low-Level Context Devi Parikh, C. Lawrence Zitnick, Tsuhan Chen
CVPR 2008 From Appearance to Context-Based Recognition: Dense Labeling in Small Images Devi Parikh, C. Lawrence Zitnick, Tsuhan Chen
ICCV 2007 Hierarchical Semantics of Objects (hSOs) Devi Parikh, Tsuhan Chen
CVPR 2007 Unsupervised Learning of Hierarchical Semantics of Objects (hSOs) Devi Parikh, Tsuhan Chen