ML Anthology
Zhu, Sijie
16 publications
CVPRW
2025
Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model
Lu Xu, Sijie Zhu, Chunyuan Li, Chia-Wen Kuo, Fan Chen, Xinyao Wang, Guang Chen, Dawei Du, Ye Yuan, Longyin Wen
ICCV
2025
D-Attn: Decomposed Attention for Large Vision-and-Language Model
Chia-Wen Kuo, Sijie Zhu, Fan Chen, Xiaohui Shen, Longyin Wen
ICLR
2025
Multi-Reward as Condition for Instruction-Based Image Editing
Xin Gu, Ming Li, Libo Zhang, Fan Chen, Longyin Wen, Tiejian Luo, Sijie Zhu
ICCV
2025
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Ming Li, Xin Gu, Fan Chen, Xiaoying Xing, Longyin Wen, Chen Chen, Sijie Zhu
NeurIPS
2024
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Jiachen Li, Xinyao Wang, Sijie Zhu, Chia-Wen Kuo, Lu Xu, Fan Chen, Jitesh Jain, Humphrey Shi, Longyin Wen
CVPR
2023
R2Former: Unified Retrieval and Reranking Transformer for Place Recognition
Sijie Zhu, Linjie Yang, Chen Chen, Mubarak Shah, Xiaohui Shen, Heng Wang
CVPR
2023
TopNet: Transformer-Based Object Placement Network for Image Compositing
Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen
CVPRW
2022
Consistency-Based Active Learning for Object Detection
Weiping Yu, Sijie Zhu, Taojiannan Yang, Chen Chen
ECCV
2022
GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing
Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen
CVPR
2022
TransGeo: Transformer Is All You Need for Cross-View Image Geo-Localization
Sijie Zhu, Mubarak Shah, Chen Chen
ICCV
2021
3D Human Pose Estimation with Spatial and Temporal Transformers
Ce Zheng, Sijie Zhu, Matias Mendieta, Taojiannan Yang, Chen Chen, Zhengming Ding
WACV
2021
Revisiting Street-to-Aerial View Image Geo-Localization and Orientation Estimation
Sijie Zhu, Taojiannan Yang, Chen Chen
CVPR
2021
VIGOR: Cross-View Image Geo-Localization Beyond One-to-One Retrieval
Sijie Zhu, Taojiannan Yang, Chen Chen
CVPRW
2020
Density Map Guided Object Detection in Aerial Images
Changlin Li, Taojiannan Yang, Sijie Zhu, Chen Chen, Shanyue Guan
NeurIPS
2020
GradAug: A New Regularization Method for Deep Neural Networks
Taojiannan Yang, Sijie Zhu, Chen Chen
ECCV
2020
MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution
Taojiannan Yang, Sijie Zhu, Chen Chen, Shen Yan, Mi Zhang, Andrew Willis