Shan, Ying
160 publications
AAAI
2025
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
ICCV
2025
GeometryCrafter: Consistent Geometry Estimation for Open-World Videos with Diffusion Priors
ICCV
2025
Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
CVPR
2025
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images
ICCV
2025
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
CVPR
2024
ConTex-Human: Free-View Rendering of Human from a Single Image with Texture-Consistent Synthesis
CVPR
2024
SmartEdit: Exploring Complex Instruction-Based Image Editing with Multimodal Large Language Models
AAAI
2024
Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views
ICML
2023
$\pi$-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-Task Interpolation
NeurIPS
2023
CL-NeRF: Continual Learning of Neural Radiance Fields for Evolving Scene Representation
CVPR
2023
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
ICCV
2023
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
NeurIPS
2023
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
CVPR
2023
SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes
ECCV
2022
Not All Models Are Equal: Predicting Model Transferability in a Self-Challenging Fisher Space
CVPR
2022
UMT: Unified Multi-Modal Transformers for Joint Video Moment Retrieval and Highlight Detection
CVPR
2008
Discovering Class Specific Composite Features Through Discriminative Sampling with Swendsen-Wang Cut