Manocha, Dinesh
93 publications
NeurIPS
2025
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
ICCV
2025
EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception
CVPR
2025
Immune: Improving Safety Against Jailbreaks in Multi-Modal LLMs via Inference-Time Alignment
NeurIPS
2025
MAGNET: A Multi-Agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks
NeurIPS
2025
RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization
NeurIPS
2025
VideoHallu: Evaluating and Mitigating Multi-Modal Hallucinations on Synthetic Video Understanding
ICMLW
2024
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences
ICLR
2024
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
CoRL
2023
Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning
WACV
2023
LayerDoc: Layer-Wise Extraction of Spatial Hierarchical Structure in Visually-Rich Documents
ICCV
2023
LoLep: Single-View View Synthesis with Locally-Learned Planes and Self-Attention Occlusion Inference
WACV
2023
Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes
CVPR
2023
TMO: Textured Mesh Acquisition of Objects with a Mobile Device by Using Differentiable Rendering
WACV
2022
M3DETR: Multi-Representation, Multi-Scale, Mutual-Relation 3D Object Detection with Transformers
NeurIPSW
2022
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning
AAAI
2021
LCollision: Fast Generation of Collision-Free Human Poses Using Learned Non-Penetration Constraints
ECCV
2020
AutoTrajectory: Label-Free Trajectory Extraction and Prediction from Videos Using Dynamic Points
AAAI
2020
M3ER: Multiplicative Multimodal Emotion Recognition Using Facial, Textual, and Speech Cues
AAAI
2020
NeoNav: Improving the Generalization of Visual Navigation via Generating Next Expected Observations
CVPRW
2019
The Emotionally Intelligent Robot: Improving Socially-Aware Human Prediction in Crowded Environments