Agrawal, Aishwarya

16 publications

CVPR 2025 Assessing and Learning Alignment of Unimodal Vision and Language Models Le Zhang, Qian Yang, Aishwarya Agrawal
CVPR 2025 CTRL-O: Language-Controllable Object-Centric Visual Representation Learning Aniket Didolkar, Andrii Zadaianchuk, Rabiul Awal, Maximilian Seitzer, Efstratios Gavves, Aishwarya Agrawal
ICCV 2025 Controlling Multimodal LLMs via Reward-Guided Decoding Oscar Mañas, Pierluca D'Oro, Koustuv Sinha, Adriana Romero-Soriano, Michal Drozdzal, Aishwarya Agrawal
NeurIPS 2025 The Promise of RL for Autoregressive Image Editing Saba Ahmadi, Rabiul Awal, Ankur Sikarwar, Amirhossein Kazemnejad, Ge Ya Luo, Juan A. Rodriguez, Sai Rajeswar, Siva Reddy, Christopher Pal, Benno Krojer, Aishwarya Agrawal
ICML 2025 UI-Vision: A Desktop-Centric GUI Benchmark for Visual Perception and Interaction Shravan Nayak, Xiangru Jian, Kevin Qinghong Lin, Juan A. Rodriguez, Montek Kalsi, Nicolas Chapados, M. Tamer Özsu, Aishwarya Agrawal, David Vazquez, Christopher Pal, Perouz Taslakian, Spandana Gella, Sai Rajeswar
ICLRW 2025 WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation Rabiul Awal, Mahsa Massoud, Zichao Li, Aarash Feizi, Suyuchen Wang, Christopher Pal, Aishwarya Agrawal, David Vazquez, Siva Reddy, Juan A. Rodriguez, Perouz Taslakian, Spandana Gella, Sai Rajeswar
NeurIPSW 2024 CTRL-O: Language-Controllable Object-Centric Visual Representation Learning Aniket Rajiv Didolkar, Andrii Zadaianchuk, Rabiul Awal, Maximilian Seitzer, Efstratios Gavves, Aishwarya Agrawal
CVPR 2024 Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding Le Zhang, Rabiul Awal, Aishwarya Agrawal
NeurIPSW 2024 Controlling Multimodal LLMs via Reward-Guided Decoding Oscar Mañas, Pierluca D'Oro, Koustuv Sinha, Adriana Romero-Soriano, Michal Drozdzal, Aishwarya Agrawal
NeurIPSW 2024 Enhancing Multi-Agent Multi-Modal Collaboration with Fine-Grained Reward Modeling Qian Yang, Weixiang Yan, Aishwarya Agrawal
AAAI 2024 Improving Automatic VQA Evaluation Using Large Language Models Oscar Mañas, Benno Krojer, Aishwarya Agrawal
TMLR 2024 Improving Text-to-Image Consistency via Automatic Prompt Optimization Oscar Mañas, Pietro Astolfi, Melissa Hall, Candace Ross, Jack Urbanek, Adina Williams, Aishwarya Agrawal, Adriana Romero-Soriano, Michal Drozdzal
NeurIPS 2024 VisMin: Visual Minimal-Change Understanding Rabiul Awal, Saba Ahmadi, Le Zhang, Aishwarya Agrawal
NeurIPSW 2024 Visual Language Alignment Tuning Le Zhang, Qian Yang, Aishwarya Agrawal
NeurIPS 2018 Overcoming Language Priors in Visual Question Answering with Adversarial Regularization Sainandan Ramakrishnan, Aishwarya Agrawal, Stefan Lee
ICCV 2015 VQA: Visual Question Answering Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh