Manocha, Dinesh

93 publications

ICCV 2025 AURELIA: Test-Time Reasoning Distillation in Audio-Visual LLMs Sanjoy Chowdhury, Hanan Gani, Nishit Anand, Sayan Nag, Ruohan Gao, Mohamed Elhoseiny, Salman Khan, Dinesh Manocha
ICCV 2025 AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta, Yaoting Wang, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha
ICML 2025 Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Sreyan Ghosh, Zhifeng Kong, Sonal Kumar, S Sakshi, Jaehyeon Kim, Wei Ping, Rafael Valle, Dinesh Manocha, Bryan Catanzaro
NeurIPS 2025 Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models Sreyan Ghosh, Arushi Goel, Jaehyeon Kim, Sonal Kumar, Zhifeng Kong, Sang-gil Lee, Chao-Han Huck Yang, Ramani Duraiswami, Dinesh Manocha, Rafael Valle, Bryan Catanzaro
ECML-PKDD 2025 Better Features, Better Calibration: A Simple Fix for Overconfident Networks Soumya Suvra Ghosal, Ramya Hebbalaguppe, Dinesh Manocha
TMLR 2025 Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning Peihong Yu, Manav Mishra, Alec Koppel, Carl Busart, Priya Narayan, Dinesh Manocha, Amrit Singh Bedi, Pratap Tokekar
ICML 2025 Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time Mohamad Fares El Hajj Chehade, Soumya Suvra Ghosal, Souradip Chakraborty, Avinash Reddy, Dinesh Manocha, Hao Zhu, Amrit Singh Bedi
ICLR 2025 Collab: Controlled Decoding Using Mixture of Agents for LLM Alignment Souradip Chakraborty, Sujay Bhatt, Udari Madhushani Sehwag, Soumya Suvra Ghosal, Jiahao Qiu, Mengdi Wang, Dinesh Manocha, Furong Huang, Alec Koppel, Sumitra Ganesh
NeurIPS 2025 Does Thinking More Always Help? Mirage of Test-Time Scaling in Reasoning Models Soumya Suvra Ghosal, Souradip Chakraborty, Avinash Reddy, Yifu Lu, Mengdi Wang, Dinesh Manocha, Furong Huang, Mohammad Ghavamzadeh, Amrit Singh Bedi
CVPR 2025 EDM: Equirectangular Projection-Oriented Dense Kernelized Feature Matching Dongki Jung, Jaehoon Choi, Yonghan Lee, Somi Jeong, Taejae Lee, Dinesh Manocha, Suyong Yeon
ICCV 2025 EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception Sanjoy Chowdhury, Subrata Biswas, Sayan Nag, Tushar Nagarajan, Calvin Murdock, Ishwarya Ananthabhotla, Yijun Qian, Vamsi Krishna Ithapu, Dinesh Manocha, Ruohan Gao
CoRL 2025 HALO: Human Preference Aligned Offline Reward Learning for Robot Navigation Gershom Seneviratne, Jianyu An, Sahire Ellahy, Kasun Weerakoon, Mohamed Bashir Elnoor, Jonathan Deepak Kannan, Amogha Thalihalla Sunil, Dinesh Manocha
ICLR 2025 How Learnable Grids Recover Fine Detail in Low Dimensions: A Neural Tangent Kernel Analysis of Multigrid Parametric Encodings Samuel Audia, Soheil Feizi, Matthias Zwicker, Dinesh Manocha
ICCV 2025 IM360: Large-Scale Indoor Mapping with 360 Cameras Dongki Jung, Jaehoon Choi, Yonghan Lee, Dinesh Manocha
CVPR 2025 Immune: Improving Safety Against Jailbreaks in Multi-Modal LLMs via Inference-Time Alignment Soumya Suvra Ghosal, Souradip Chakraborty, Vaibhav Singh, Tianrui Guan, Mengdi Wang, Ahmad Beirami, Furong Huang, Alvaro Velasquez, Dinesh Manocha, Amrit Singh Bedi
NeurIPS 2025 MAGNET: A Multi-Agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks Sanjoy Chowdhury, Mohamed Elmoghany, Yohan Abeysinghe, Junjie Fei, Sayan Nag, Salman Khan, Mohamed Elhoseiny, Dinesh Manocha
ICLR 2025 MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark S Sakshi, Utkarsh Tyagi, Sonal Kumar, Ashish Seth, Ramaneswaran Selvakumar, Oriol Nieto, Ramani Duraiswami, Sreyan Ghosh, Dinesh Manocha
NeurIPS 2025 RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization Dongki Jung, Jaehoon Choi, Yonghan Lee, Dinesh Manocha
ICLR 2025 Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data Sreyan Ghosh, Sonal Kumar, Zhifeng Kong, Rafael Valle, Bryan Catanzaro, Dinesh Manocha
ICLR 2025 Towards Optimal Multi-Draft Speculative Decoding Zhengmian Hu, Tong Zheng, Vignesh Viswanathan, Ziyi Chen, Ryan A. Rossi, Yihan Wu, Dinesh Manocha, Heng Huang
NeurIPS 2025 VideoHallu: Evaluating and Mitigating Multi-Modal Hallucinations on Synthetic Video Understanding Zongxia Li, Xiyang Wu, Guangyao Shi, Yubin Qin, Hongyang Du, Tianyi Zhou, Dinesh Manocha, Jordan Lee Boyd-Graber
ICLR 2025 Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha
ICML 2024 A Closer Look at the Limitations of Instruction Tuning Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Ramaneswaran S, Deepali Aneja, Zeyu Jin, Ramani Duraiswami, Dinesh Manocha
CVPR 2024 AV-RIR: Audio-Visual Room Impulse Response Estimation Anton Ratnarajah, Sreyan Ghosh, Sonal Kumar, Purva Chiniya, Dinesh Manocha
ICLR 2024 CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models Sreyan Ghosh, Ashish Seth, Sonal Kumar, Utkarsh Tyagi, Chandra Kiran Reddy Evuru, Ramaneswaran S, S Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha
CVPR 2024 HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models Tianrui Guan, Fuxiao Liu, Xiyang Wu, Ruiqi Xian, Zongxia Li, Xiaoyu Liu, Xijun Wang, Lichang Chen, Furong Huang, Yaser Yacoob, Dinesh Manocha, Tianyi Zhou
CVPR 2024 LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-Time Rendering Jaehoon Choi, Rajvi Shah, Qinbo Li, Yipeng Wang, Ayush Saraf, Changil Kim, Jia-Bin Huang, Dinesh Manocha, Suhib Alsisan, Johannes Kopf
WACV 2024 MITFAS: Mutual Information Based Temporal Feature Alignment and Sampling for Aerial Video Action Recognition Ruiqi Xian, Xijun Wang, Dinesh Manocha
ICML 2024 MaxMin-RLHF: Alignment with Diverse Human Preferences Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Dinesh Manocha, Furong Huang, Amrit Bedi, Mengdi Wang
ICMLW 2024 MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Bedi, Mengdi Wang
CVPR 2024 MeLFusion: Synthesizing Music from Image and Language Cues Using Diffusion Models Sanjoy Chowdhury, Sayan Nag, K J Joseph, Balaji Vasan Srinivasan, Dinesh Manocha
ECCV 2024 Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta, Jun Chen, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha
ICLR 2024 PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback Souradip Chakraborty, Amrit Bedi, Alec Koppel, Huazheng Wang, Dinesh Manocha, Mengdi Wang, Furong Huang
WACV 2024 PMI Sampler: Patch Similarity Guided Frame Selection for Aerial Action Recognition Ruiqi Xian, Xijun Wang, Divya Kothandaraman, Dinesh Manocha
ICML 2024 Position: On the Possibilities of AI-Generated Text Detection Souradip Chakraborty, Amrit Bedi, Sicheng Zhu, Bang An, Dinesh Manocha, Furong Huang
CVPRW 2024 Speech2UnifiedExpressions: Synchronous Synthesis of Co-Speech Affective Face and Body Expressions from Affordable Inputs Uttaran Bhattacharya, Aniket Bera, Dinesh Manocha
ICML 2024 Towards Global Optimality for Practical Average Reward Reinforcement Learning Without Mixing Time Oracles Bhrij Patel, Wesley A Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Dinesh Manocha, Amrit Bedi
NeurIPS 2024 Transfer Q-Star: Principled Decoding for LLM Alignment Souradip Chakraborty, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang
ECCV 2024 V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation Pooja Guhan, Tsung-Wei Huang, Guan-Ming Su, Subhadra Gopalakrishnan, Dinesh Manocha
TMLR 2023 A Survey on the Possibilities & Impossibilities of AI-Generated Text Detection Soumya Suvra Ghosal, Souradip Chakraborty, Jonas Geiping, Furong Huang, Dinesh Manocha, Amrit Bedi
ICCV 2023 AdVerb: Visually Guided Audio Dereverberation Sanjoy Chowdhury, Sreyan Ghosh, Subhrajyoti Dasgupta, Anton Ratnarajah, Utkarsh Tyagi, Dinesh Manocha
ICML 2023 Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic Wesley A Suttle, Amrit Bedi, Bhrij Patel, Brian M. Sadler, Alec Koppel, Dinesh Manocha
ICCV 2023 CrossLoc3D: Aerial-Ground Cross-Source 3D Place Recognition Tianrui Guan, Aswath Muthuselvam, Montana Hoover, Xijun Wang, Jing Liang, Adarsh Jagan Sathyamoorthy, Damon Conover, Dinesh Manocha
AAAI 2023 DocEdit: Language-Guided Document Editing Puneet Mathur, Rajiv Jain, Jiuxiang Gu, Franck Dernoncourt, Dinesh Manocha, Vlad I. Morariu
CoRL 2023 Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning Xiyang Wu, Rohan Chandra, Tianrui Guan, Amrit Bedi, Dinesh Manocha
WACV 2023 LayerDoc: Layer-Wise Extraction of Spatial Hierarchical Structure in Visually-Rich Documents Puneet Mathur, Rajiv Jain, Ashutosh Mehra, Jiuxiang Gu, Franck Dernoncourt, Anandhavelu N., Quan Tran, Verena Kaynig-Fittkau, Ani Nenkova, Dinesh Manocha, Vlad I. Morariu
ICCV 2023 LoLep: Single-View View Synthesis with Locally-Learned Planes and Self-Attention Occlusion Inference Cong Wang, Yu-Ping Wang, Dinesh Manocha
WACV 2023 Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes James F. Mullen, Divya Kothandaraman, Aniket Bera, Dinesh Manocha
AAAI 2023 Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning Souradip Chakraborty, Amrit Singh Bedi, Pratap Tokekar, Alec Koppel, Brian M. Sadler, Furong Huang, Dinesh Manocha
WACV 2023 SALAD: Source-Free Active Label-Agnostic Domain Adaptation for Classification, Segmentation and Detection Divya Kothandaraman, Sumit Shekhar, Abhilasha Sancheti, Manoj Ghuhan, Tripti Shukla, Dinesh Manocha
ICML 2023 STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning Souradip Chakraborty, Amrit Bedi, Alec Koppel, Mengdi Wang, Furong Huang, Dinesh Manocha
CVPR 2023 TMO: Textured Mesh Acquisition of Objects with a Mobile Device by Using Differentiable Rendering Jaehoon Choi, Dongki Jung, Taejae Lee, Sangwook Kim, Youngdong Jung, Dinesh Manocha, Donghwan Lee
CVPR 2022 3MASSIV: Multilingual, Multimodal and Multi-Aspect Dataset of Social Media Short Videos Vikram Gupta, Trisha Mittal, Puneet Mathur, Vaibhav Mishra, Mayank Maheshwari, Aniket Bera, Debdoot Mukherjee, Dinesh Manocha
ECCV 2022 A Repulsive Force Unit for Garment Collision Handling in Neural Networks Qingyang Tan, Yi Zhou, Tuanfeng Wang, Duygu Ceylan, Xin Sun, Dinesh Manocha
ECCV 2022 D2-TPred: Discontinuous Dependency for Trajectory Prediction Under Traffic Lights Yuzhen Zhang, Wentong Wang, Weizhi Guo, Pei Lv, Mingliang Xu, Wei Chen, Dinesh Manocha
ECCV 2022 FAR: Fourier Aerial Video Recognition Divya Kothandaraman, Tianrui Guan, Xijun Wang, Shuowen Hu, Ming Lin, Dinesh Manocha
CoRL 2022 HTRON: Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm Kasun Weerakoon, Souradip Chakraborty, Nare Karapetyan, Adarsh Jagan Sathyamoorthy, Amrit Bedi, Dinesh Manocha
ECCV 2022 Human Trajectory Prediction via Neural Social Physics Jiangbei Yue, Dinesh Manocha, He Wang
WACV 2022 M3DETR: Multi-Representation, Multi-Scale, Mutual-Relation 3D Object Detection with Transformers Tianrui Guan, Jun Wang, Shiyi Lan, Rohan Chandra, Zuxuan Wu, Larry Davis, Dinesh Manocha
ICML 2022 N-Penetrate: Active Learning of Neural Collision Handler for Complex 3D Mesh Deformations Qingyang Tan, Zherong Pan, Breannan Smith, Takaaki Shiratori, Dinesh Manocha
NeurIPSW 2022 Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning Souradip Chakraborty, Amrit Bedi, Alec Koppel, Pratap Tokekar, Furong Huang, Dinesh Manocha
CVPR 2022 STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes Peishan Cong, Xinge Zhu, Feng Qiao, Yiming Ren, Xidong Peng, Yuenan Hou, Lan Xu, Ruigang Yang, Dinesh Manocha, Yuexin Ma
CVPR 2021 Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality Trisha Mittal, Puneet Mathur, Aniket Bera, Dinesh Manocha
ICCVW 2021 BoMuDANet: Unsupervised Adaptation for Visual Scene Understanding in Unstructured Driving Environments Divya Kothandaraman, Rohan Chandra, Dinesh Manocha
ICCV 2021 DnD: Dense Depth Estimation in Crowded Dynamic Indoor Scenes Dongki Jung, Jaehoon Choi, Yonghan Lee, Deokhwa Kim, Changick Kim, Dinesh Manocha, Donghwan Lee
ICCV 2021 HighlightMe: Detecting Highlights from Human-Centric Videos Uttaran Bhattacharya, Gang Wu, Stefano Petrangeli, Viswanathan Swaminathan, Dinesh Manocha
AAAI 2021 LCollision: Fast Generation of Collision-Free Human Poses Using Learned Non-Penetration Constraints Qingyang Tan, Zherong Pan, Dinesh Manocha
IJCAI 2021 Point-Based Acoustic Scattering for Interactive Sound Propagation via Surface Encoding Hsien-Yu Meng, Zhenyu Tang, Dinesh Manocha
ICCV 2021 Robust 2D/3D Vehicle Parsing in Arbitrary Camera Views for CVIS Hui Miao, Feixiang Lu, Zongdai Liu, Liangjun Zhang, Dinesh Manocha, Bin Zhou
ICCVW 2021 SS-SFDA: Self-Supervised Source-Free Domain Adaptation for Road Segmentation in Hazardous Environments Divya Kothandaraman, Rohan Chandra, Dinesh Manocha
ECCV 2020 AutoTrajectory: Label-Free Trajectory Extraction and Prediction from Videos Using Dynamic Points Yuexin Ma, Xinge Zhu, Xinjing Cheng, Ruigang Yang, Jiming Liu, Dinesh Manocha
IJCAI 2020 Crowd-Steer: Realtime Smooth and Collision-Free Robot Navigation in Densely Crowded Scenarios Trained Using High-Fidelity Simulation Jing Liang, Utsav Patel, Adarsh Jagan Sathyamoorthy, Dinesh Manocha
AAAI 2020 M3ER: Multiplicative Multimodal Emotion Recognition Using Facial, Textual, and Speech Cues Trisha Mittal, Uttaran Bhattacharya, Rohan Chandra, Aniket Bera, Dinesh Manocha
AAAI 2020 NeoNav: Improving the Generalization of Visual Navigation via Generating Next Expected Observations Qiaoyun Wu, Dinesh Manocha, Jun Wang, Kai Xu
AAAI 2020 STEP: Spatial Temporal Graph Convolutional Networks for Emotion Perception from Gaits Uttaran Bhattacharya, Trisha Mittal, Rohan Chandra, Tanmay Randhavane, Aniket Bera, Dinesh Manocha
ECCV 2020 Take an Emotion Walk: Perceiving Emotions from Gaits Using Hierarchical Attention Pooling and Affective Mapping Uttaran Bhattacharya, Christian Roncal, Trisha Mittal, Rohan Chandra, Kyra Kapsaskis, Kurt Gray, Aniket Bera, Dinesh Manocha
CVPRW 2019 Improving Socially-Aware Multi-Channel Human Emotion Prediction for Robot Navigation Aniket Bera, Tanmay Randhavane, Dinesh Manocha
CVPRW 2019 Modelling Multi-Channel Emotions Using Facial Expression and Trajectory Cues for Improving Socially-Aware Robot Navigation Aniket Bera, Tanmay Randhavane, Dinesh Manocha
CVPRW 2019 Reflection and Diffraction-Aware Sound Source Localization Inkyu An, Jung-Woo Choi, Dinesh Manocha, Sung-Eui Yoon
CVPRW 2019 The Emotionally Intelligent Robot: Improving Socially-Aware Human Prediction in Crowded Environments Aniket Bera, Tanmay Randhavane, Dinesh Manocha
AAAI 2019 TrafficPredict: Trajectory Prediction for Heterogeneous Traffic-Agents Yuexin Ma, Xinge Zhu, Sibo Zhang, Ruigang Yang, Wenping Wang, Dinesh Manocha
CVPRW 2018 AutonoVi-Sim: Autonomous Vehicle Simulation Platform with Weather, Sensing, and Traffic Control Andrew Best, Sahil Narang, Lucas Pasqualin, Daniel Barber, Dinesh Manocha
CVPRW 2018 Classifying Group Emotions for Socially-Aware Autonomous Vehicle Navigation Aniket Bera, Tanmay Randhavane, Austin Wang, Dinesh Manocha, Emily Kubin, Kurt Gray
CVPRW 2018 Efficient and Safe Vehicle Navigation Based on Driver Behavior Classification Ernest Cheung, Aniket Bera, Dinesh Manocha
AAAI 2018 MixedPeds: Pedestrian Detection in Unannotated Videos Using Synthetically Generated Human-Agents for Training Ernest Cheung, Anson Wong, Aniket Bera, Dinesh Manocha
IJCAI 2017 Aggressive, Tense or Shy? Identifying Personality Traits from Crowd Videos Aniket Bera, Tanmay Randhavane, Dinesh Manocha
ECCV 2016 LCrowdV: Generating Labeled Videos for Simulation-Based Crowd Behavior Learning Ernest Cheung, Tsan Kwong Wong, Aniket Bera, Xiaogang Wang, Dinesh Manocha
ECCVW 2016 LCrowdV: Generating Labeled Videos for Simulation-Based Crowd Behavior Learning Ernest Cheung, Tsan Kwong Wong, Aniket Bera, Xiaogang Wang, Dinesh Manocha
CVPRW 2016 Realtime Anomaly Detection Using Trajectory-Level Crowd Behavior Learning Aniket Bera, Sujeong Kim, Dinesh Manocha
CVPR 2015 3D Reconstruction in the Presence of Glasses by Acoustic and Stereo Fusion Mao Ye, Yu Zhang, Ruigang Yang, Dinesh Manocha
ICCVW 2015 Preface to 3D Reconstruction and Understanding with Video and Sound Dinesh Manocha, Marc Pollefeys, Rif Saurous, Rahul Sukthankar, Ruigang Yang
AAAI 2011 Self-Aware Traffic Route Planning David Wilkie, Jur P. van den Berg, Ming C. Lin, Dinesh Manocha
AAAI 2010 G-Planner: Real-Time Motion Planning and Global Navigation Using GPUs Jia Pan, Christian Lauterbach, Dinesh Manocha