ML Anthology
Song, Sibo
5 publications
ICLR
2026
Revisiting Multimodal Positional Encoding in Vision–Language Models
Jie Huang, Xuejing Liu, Sibo Song, RuiBing Hou, Hong Chang, Junyang Lin, Shuai Bai
CVPR
2024
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
Jianqiang Wan, Sibo Song, Wenwen Yu, Yuliang Liu, Wenqing Cheng, Fei Huang, Xiang Bai, Cong Yao, Zhibo Yang
CVPR
2023
Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Zhibo Yang, Rujiao Long, Pengfei Wang, Sibo Song, Humen Zhong, Wenqing Cheng, Xiang Bai, Cong Yao
CVPR
2022
Vision-Language Pre-Training for Boosting Scene Text Detectors
Sibo Song, Jianqiang Wan, Zhibo Yang, Jun Tang, Wenqing Cheng, Xiang Bai, Cong Yao
CVPRW
2016
Multimodal Multi-Stream Deep Learning for Egocentric Activity Recognition
Sibo Song, Vijay Chandrasekhar, Bappaditya Mandal, Liyuan Li, Joo-Hwee Lim, Giduthuri Sateesh Babu, Phyo Phyo San, Ngai-Man Cheung