Shao, Jie
37 publications
CVPR
2025
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
ICCV
2025
CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
NeurIPS
2025
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models Under Data Constraints
AAAI
2024
Abstract and Explore: A Novel Behavioral Metric with Cyclic Dynamics in Reinforcement Learning
ICCV
2021
COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-Training for Vision-Language Representation