Agrawal, Harsh

13 publications

ICLR 2025 Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms Zhangheng Li, Keen You, Haotian Zhang, Di Feng, Harsh Agrawal, Xiujun Li, Mohana Prasad Sathya Moorthy, Jeffrey Nichols, Yinfei Yang, Zhe Gan
CVPR 2025 From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons Andrew Szot, Bogdan Mazoure, Omar Attia, Aleksei Timofeev, Harsh Agrawal, Devon Hjelm, Zhe Gan, Zsolt Kira, Alexander Toshev
ICCV 2025 UINavBench: A Framework for Comprehensive Evaluation of Interactive Digital Agents Harsh Agrawal, Eldon Schoop, Xinlei Pan, Anuj Mahajan, Ari Seff, Di Feng, Ruijia Cheng, Andres Romero Mier Y Teran, Esteban Gomez, Abhishek Sundararajan, Forrest Huang, Amanda Swearngin, Mohana Prasad Sathya Moorthy, Jeff Nichols, Alexander Toshev
NeurIPS 2024 Grounding Multimodal Large Language Models in Actions Andrew Szot, Bogdan Mazoure, Harsh Agrawal, Devon Hjelm, Zsolt Kira, Alexander Toshev
ICLR 2024 Large Language Models as Generalizable Policies for Embodied Tasks Andrew Szot, Max Schwarzer, Harsh Agrawal, Bogdan Mazoure, Rin Metcalf, Walter Talbott, Natalie Mackraz, R Devon Hjelm, Alexander T Toshev
AAAI 2023 Simple and Effective Synthesis of Indoor 3D Scenes Jing Yu Koh, Harsh Agrawal, Dhruv Batra, Richard Tucker, Austin Waters, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson
ECCV 2022 Housekeep: Tidying Virtual Households Using Commonsense Reasoning Yash Kant, Arun Ramachandran, Sriram Yenamandra, Igor Gilitschenski, Dhruv Batra, Andrew Szot, Harsh Agrawal
ICCV 2021 Contrast and Classify: Training Robust VQA Models Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal
UAI 2021 Known Unknowns: Learning Novel Concepts Using Reasoning-by-Elimination Harsh Agrawal, Eli A. Meirom, Yuval Atzmon, Shie Mannor, Gal Chechik
NeurIPS 2021 SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation Abhinav Moudgil, Arjun Majumdar, Harsh Agrawal, Stefan Lee, Dhruv Batra
ICCV 2021 The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation Xiaoming Zhao, Harsh Agrawal, Dhruv Batra, Alexander G. Schwing
ECCV 2020 Spatially Aware Multimodal Transformers for TextVQA Yash Kant, Dhruv Batra, Peter Anderson, Alexander Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal
CVPR 2016 Object-Proposal Evaluation Protocol Is 'Gameable' Neelima Chavali, Harsh Agrawal, Aroma Mahendru, Dhruv Batra