Nieto, Oriol

6 publications

ICML 2025 FLAM: Frame-Wise Language-Audio Modeling Yusong Wu, Christos Tsirigotis, Ke Chen, Cheng-Zhi Anna Huang, Aaron Courville, Oriol Nieto, Prem Seetharaman, Justin Salamon
ICLR 2025 MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark S Sakshi, Utkarsh Tyagi, Sonal Kumar, Ashish Seth, Ramaneswaran Selvakumar, Oriol Nieto, Ramani Duraiswami, Sreyan Ghosh, Dinesh Manocha
CVPR 2025 Video-Guided Foley Sound Generation with Multimodal Controls Ziyang Chen, Prem Seetharaman, Bryan Russell, Oriol Nieto, David Bourgin, Andrew Owens, Justin Salamon
ICLR 2025 Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha
ICLR 2024 CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models Sreyan Ghosh, Ashish Seth, Sonal Kumar, Utkarsh Tyagi, Chandra Kiran Reddy Evuru, Ramaneswaran S, S Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha
CVPR 2023 Language-Guided Audio-Visual Source Separation via Trimodal Consistency Reuben Tan, Arijit Ray, Andrea Burns, Bryan A. Plummer, Justin Salamon, Oriol Nieto, Bryan Russell, Kate Saenko