Manmatha, R.

30 publications

CVPR 2025 Scaling up Image Segmentation Across Data and Tasks Pei Wang, Zhaowei Cai, Hao Yang, Ashwin Swaminathan, R. Manmatha, Stefano Soatto
AAAI 2024 DocFormerv2: Local Features for Document Understanding Srikar Appalaraju, Peng Tang, Qi Dong, Nishant Sankaran, Yichu Zhou, R. Manmatha
AAAI 2024 No Head Left Behind - Multi-Head Alignment Distillation for Transformers Tianyang Zhao, Kunwar Yashraj Singh, Srikar Appalaraju, Peng Tang, Vijay Mahadevan, R. Manmatha, Ying Nian Wu
CVPR 2024 On the Scalability of Diffusion-Based Text-to-Image Generation Hao Li, Yang Zou, Ying Wang, Orchid Majumder, Yusheng Xie, R. Manmatha, Ashwin Swaminathan, Zhuowen Tu, Stefano Ermon, Stefano Soatto
ECCV 2024 VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding Ofir Abramovich, Niv Nayman, Sharon Fogel, Inbal Lavi, Ron Litman, Shahar Tsiper, Royee Tichauer, Srikar Appalaraju, Shai Mazor, R. Manmatha
ICCV 2023 DocTr: Document Transformer for Structured Information Extraction in Documents Haofu Liao, Aruni RoyChowdhury, Weijian Li, Ankan Bansal, Yuting Zhang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan
CVPR 2023 PolyFormer: Referring Image Segmentation as Sequential Polygon Generation Jiang Liu, Hui Ding, Zhaowei Cai, Yuting Zhang, Ravi Kumar Satzoda, Vijay Mahadevan, R. Manmatha
ECCV 2022 GLASS: Global to Local Attention for Scene-Text Spotting Roi Ronen, Shahar Tsiper, Oron Anschel, Inbal Lavi, Amir Markovitz, R. Manmatha
CVPR 2022 LaTr: Layout-Aware Transformer for Scene-Text VQA Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha
ECCVW 2022 On Calibration of Scene-Text Recognition Models Ron Slossberg, Oron Anschel, Amir Markovitz, Ron Litman, Aviad Aberdam, Shahar Tsiper, Shai Mazor, Jon Wu, R. Manmatha
CVPRW 2022 ResNeSt: Split-Attention Networks Hang Zhang, Chongruo Wu, Zhongyue Zhang, Yi Zhu, Haibin Lin, Zhi Zhang, Yue Sun, Tong He, Jonas Mueller, R. Manmatha, Mu Li, Alexander J. Smola
CVPR 2022 Towards Weakly-Supervised Text Spotting Using a Multi-Task Transformer Yair Kittenplon, Inbal Lavi, Sharon Fogel, Yarin Bar, R. Manmatha, Pietro Perona
ECCVW 2022 YORO - Lightweight End to End Visual Grounding Chih-Hui Ho, Srikar Appalaraju, Bhavan Jasani, R. Manmatha, Nuno Vasconcelos
ICCV 2021 DocFormer: End-to-End Transformer for Document Understanding Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha
WACV 2021 Saliency Driven Perceptual Image Compression Yash Patel, Srikar Appalaraju, R. Manmatha
CVPR 2021 Sequence-to-Sequence Contrastive Learning for Text Recognition Aviad Aberdam, Ron Litman, Shahar Tsiper, Oron Anschel, Ron Slossberg, Shai Mazor, R. Manmatha, Pietro Perona
ICCV 2017 Sampling Matters in Deep Embedding Learning Chao-Yuan Wu, R. Manmatha, Alexander J. Smola, Philipp Krahenbuhl
CVPR 2016 Deep Decision Network for Multi-Class Image Classification Venkatesh N. Murthy, Vivek Singh, Terrence Chen, R. Manmatha, Dorin Comaniciu
ECCV 2016 Efficient Exploration of Text Regions in Natural Scene Images Using Adaptive Image Sampling Ismet Zeki Yalniz, Douglas Gray, R. Manmatha
ECCVW 2016 Efficient Exploration of Text Regions in Natural Scene Images Using Adaptive Image Sampling Ismet Zeki Yalniz, Douglas Gray, R. Manmatha
CVPRW 2013 Formulating Action Recognition as a Ranking Problem Ethem F. Can, R. Manmatha
NeurIPS 2003 A Model for Learning the Semantics of Pictures Victor Lavrenko, R. Manmatha, Jiwoon Jeon
CVPR 2003 Word Image Matching Using Dynamic Time Warping Toni M. Rath, R. Manmatha
ICCV 2001 Automatic Segmentation and Indexing in a Database of Bird Images Madirakshi Das, R. Manmatha
ICCV 1998 Retrieving Images by Appearance Srinivas Ravela, R. Manmatha
ECCV 1996 Image Retrieval Using Scale-Space Matching Srinivas Ravela, R. Manmatha, Edward M. Riseman
CVPR 1996 Word Spotting: A New Approach to Indexing Handwriting R. Manmatha, Chengfeng Han, Edward M. Riseman
CVPR 1994 A Framework for Recovering Affine Transforms Using Points, Lines or Image Brightnesses R. Manmatha
ECCV 1994 Measuring the Affine Transform Using Gaussian Filters R. Manmatha
CVPR 1989 A Data Set for Quantitative Motion Analysis Rabindranath Dutta, R. Manmatha, Lance R. Williams, Edward M. Riseman