Goel, Arushi

10 publications

NeurIPS 2025 Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models Sreyan Ghosh, Arushi Goel, Jaehyeon Kim, Sonal Kumar, Zhifeng Kong, Sang-gil Lee, Chao-Han Huck Yang, Ramani Duraiswami, Dinesh Manocha, Rafael Valle, Bryan Catanzaro
ICML 2025 ETTA: Elucidating the Design Space of Text-to-Audio Models Sang-Gil Lee, Zhifeng Kong, Arushi Goel, Sungwon Kim, Rafael Valle, Bryan Catanzaro
ICLR 2025 Fugatto 1: Foundational Generative Audio Transformer Opus 1 Rafael Valle, Rohan Badlani, Zhifeng Kong, Sang-gil Lee, Arushi Goel, Sungwon Kim, Joao Felipe Santos, Shuqi Dai, Siddharth Gururani, Aya Aljafari, Alexander H. Liu, Kevin J. Shih, Ryan Prenger, Wei Ping, Chao-Han Huck Yang, Bryan Catanzaro
CVPRW 2025 Visually Interpretable Subtask Reasoning for Visual Question Answering Yu Cheng, Arushi Goel, Hakan Bilen
ICML 2024 Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities Zhifeng Kong, Arushi Goel, Rohan Badlani, Wei Ping, Rafael Valle, Bryan Catanzaro
ICCV 2023 Encyclopedic VQA: Visual Questions About Detailed Properties of Fine-Grained Categories Thomas Mensink, Jasper Uijlings, Lluis Castrejon, Arushi Goel, Felipe Cadar, Howard Zhou, Fei Sha, André Araujo, Vittorio Ferrari
CoRL 2023 Language-Guided Robot Grasping: CLIP-Based Referring Grasp Synthesis in Clutter Georgios Tziafas, Yucheng Xu, Arushi Goel, Mohammadreza Kasaei, Zhibin Li, Hamidreza Kasaei
ICCV 2023 Who Are You Referring to? Coreference Resolution in Image Narrations Arushi Goel, Basura Fernando, Frank Keller, Hakan Bilen
CVPR 2022 Not All Relations Are Equal: Mining Informative Labels for Scene Graph Generation Arushi Goel, Basura Fernando, Frank Keller, Hakan Bilen
ECCVW 2020 Injecting Prior Knowledge into Image Caption Generation Arushi Goel, Basura Fernando, Thanh-Son Nguyen, Hakan Bilen