ML Anthology
Pang, Guan
12 publications
ICLR
2025
TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Deqing Fu, Tong Xiao, Rui Wang, Wang Zhu, Pengchuan Zhang, Guan Pang, Robin Jia, Lawrence Chen
ECCV
2024
LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Bolin Lai, Xiaoliang Dai, Lawrence Chen, Guan Pang, James M Rehg, Miao Liu
CVPR
2024
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Qilong Zhangli, Jindong Jiang, Di Liu, Licheng Yu, Xiaoliang Dai, Ankit Ramchandani, Guan Pang, Dimitris N. Metaxas, Praveen Krishnan
ECCV
2022
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh
ECCV
2022
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh
CVPR
2021
A Multiplexed Network for End-to-End, Multilingual OCR
Jing Huang, Guan Pang, Rama Kovvuri, Mandy Toh, Kevin J Liang, Praveen Krishnan, Xi Yin, Tal Hassner
CVPR
2021
Img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation
Vitor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner
CVPR
2021
TextOCR: Towards Large-Scale End-to-End Reasoning for Arbitrary-Shaped Scene Text
Amanpreet Singh, Guan Pang, Mandy Toh, Jing Huang, Wojciech Galuba, Tal Hassner
ECCV
2020
Mask TextSpotter V3: Segmentation Proposal Network for Robust Scene Text Spotting
Minghui Liao, Guan Pang, Jing Huang, Tal Hassner, Xiang Bai
CVPRW
2018
DeepGlobe 2018: A Challenge to Parse the Earth Through Satellite Images
Ilke Demir, Krzysztof Koperski, David Lindenbaum, Guan Pang, Jing Huang, Saikat Basu, Forest Hughes, Devis Tuia, Ramesh Raskar
WACV
2013
Estimation of Camera Pose with Respect to Terrestrial LiDAR Data
Wei Guan, Suya You, Guan Pang
WACV
2013
The Gixel Array Descriptor (GAD) for Multimodal Image Matching
Guan Pang, Ulrich Neumann