Fan, Hehe
37 publications
CVPR
2025
Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration
ICML
2025
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
CVPR
2025
EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space
NeurIPS
2024
TOPA: Extending Large Language Models for Video Understanding via Text-Only Pre-Alignment
CVPR
2024
Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
ICCV
2023
Masked Spatio-Temporal Structure Prediction for Self-Supervised Learning on Point Cloud Videos