ML Anthology
Manmatha, R.
30 publications
CVPR
2025
Scaling up Image Segmentation Across Data and Tasks
Pei Wang, Zhaowei Cai, Hao Yang, Ashwin Swaminathan, R. Manmatha, Stefano Soatto
AAAI
2024
DocFormerv2: Local Features for Document Understanding
Srikar Appalaraju, Peng Tang, Qi Dong, Nishant Sankaran, Yichu Zhou, R. Manmatha
AAAI
2024
No Head Left Behind - Multi-Head Alignment Distillation for Transformers
Tianyang Zhao, Kunwar Yashraj Singh, Srikar Appalaraju, Peng Tang, Vijay Mahadevan, R. Manmatha, Ying Nian Wu
CVPR
2024
On the Scalability of Diffusion-Based Text-to-Image Generation
Hao Li, Yang Zou, Ying Wang, Orchid Majumder, Yusheng Xie, R. Manmatha, Ashwin Swaminathan, Zhuowen Tu, Stefano Ermon, Stefano Soatto
ECCV
2024
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding
Ofir Abramovich, Niv Nayman, Sharon Fogel, Inbal Lavi, Ron Litman, Shahar Tsiper, Royee Tichauer, Srikar Appalaraju, Shai Mazor, R. Manmatha
ICCV
2023
DocTr: Document Transformer for Structured Information Extraction in Documents
Haofu Liao, Aruni RoyChowdhury, Weijian Li, Ankan Bansal, Yuting Zhang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan
CVPR
2023
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
Jiang Liu, Hui Ding, Zhaowei Cai, Yuting Zhang, Ravi Kumar Satzoda, Vijay Mahadevan, R. Manmatha
ECCV
2022
GLASS: Global to Local Attention for Scene-Text Spotting
Roi Ronen, Shahar Tsiper, Oron Anschel, Inbal Lavi, Amir Markovitz, R. Manmatha
CVPR
2022
LaTr: Layout-Aware Transformer for Scene-Text VQA
Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha
ECCVW
2022
On Calibration of Scene-Text Recognition Models
Ron Slossberg, Oron Anschel, Amir Markovitz, Ron Litman, Aviad Aberdam, Shahar Tsiper, Shai Mazor, Jon Wu, R. Manmatha
CVPRW
2022
ResNeSt: Split-Attention Networks
Hang Zhang, Chongruo Wu, Zhongyue Zhang, Yi Zhu, Haibin Lin, Zhi Zhang, Yue Sun, Tong He, Jonas Mueller, R. Manmatha, Mu Li, Alexander J. Smola
CVPR
2022
Towards Weakly-Supervised Text Spotting Using a Multi-Task Transformer
Yair Kittenplon, Inbal Lavi, Sharon Fogel, Yarin Bar, R. Manmatha, Pietro Perona
ECCVW
2022
YORO - Lightweight End to End Visual Grounding
Chih-Hui Ho, Srikar Appalaraju, Bhavan Jasani, R. Manmatha, Nuno Vasconcelos
ICCV
2021
DocFormer: End-to-End Transformer for Document Understanding
Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha
WACV
2021
Saliency Driven Perceptual Image Compression
Yash Patel, Srikar Appalaraju, R. Manmatha
CVPR
2021
Sequence-to-Sequence Contrastive Learning for Text Recognition
Aviad Aberdam, Ron Litman, Shahar Tsiper, Oron Anschel, Ron Slossberg, Shai Mazor, R. Manmatha, Pietro Perona
ICCV
2017
Sampling Matters in Deep Embedding Learning
Chao-Yuan Wu, R. Manmatha, Alexander J. Smola, Philipp Krahenbuhl
CVPR
2016
Deep Decision Network for Multi-Class Image Classification
Venkatesh N. Murthy, Vivek Singh, Terrence Chen, R. Manmatha, Dorin Comaniciu
ECCV
2016
Efficient Exploration of Text Regions in Natural Scene Images Using Adaptive Image Sampling
Ismet Zeki Yalniz, Douglas Gray, R. Manmatha
ECCVW
2016
Efficient Exploration of Text Regions in Natural Scene Images Using Adaptive Image Sampling
Ismet Zeki Yalniz, Douglas Gray, R. Manmatha
CVPRW
2013
Formulating Action Recognition as a Ranking Problem
Ethem F. Can, R. Manmatha
NeurIPS
2003
A Model for Learning the Semantics of Pictures
Victor Lavrenko, R. Manmatha, Jiwoon Jeon
CVPR
2003
Word Image Matching Using Dynamic Time Warping
Toni M. Rath, R. Manmatha
ICCV
2001
Automatic Segmentation and Indexing in a Database of Bird Images
Madirakshi Das, R. Manmatha
ICCV
1998
Retrieving Images by Appearance
Srinivas Ravela, R. Manmatha
ECCV
1996
Image Retrieval Using Scale-Space Matching
Srinivas Ravela, R. Manmatha, Edward M. Riseman
CVPR
1996
Word Spotting: A New Approach to Indexing Handwriting
R. Manmatha, Chengfeng Han, Edward M. Riseman
CVPR
1994
A Framework for Recovering Affine Transforms Using Points, Lines or Image Brightnesses
R. Manmatha
ECCV
1994
Measuring the Affine Transform Using Gaussian Filters
R. Manmatha
CVPR
1989
A Data Set for Quantitative Motion Analysis
Rabindranath Dutta, R. Manmatha, Lance R. Williams, Edward M. Riseman