Song, Yibing
55 publications
NeurIPS
2025
$\textit{HiMaCon:}$ Discovering Hierarchical Manipulation Concepts from Unlabeled Multi-Modal Data
CVPR
2025
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation
ICCV
2023
Both Diverse and Realism Matter: Physical Attribute and Style Alignment for Rainy Image Generation
ICCV
2023
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
NeurIPS
2022
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
CVPR
2022
Self-Supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection
NeurIPS
2022
VideoMAE: Masked Autoencoders Are Data-Efficient Learners for Self-Supervised Video Pre-Training
CVPR
2021
IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking
NeurIPS
2021
Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning