Zhang, Ji
44 publications
ICML
2025
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-Time Reinforcement Learning
ICLRW
2025
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-Time Reinforcement Learning
CVPR
2025
Skip Tuning: Pre-Trained Vision-Language Models Are Effective and Efficient Adapters Themselves
NeurIPS
2025
VLM-R³: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought
ICLR
2025
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
NeurIPS
2024
MaVEn: An Effective Multi-Granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model
NeurIPS
2024
Mobile-Agent-V2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration
ICLR
2023
Self-Supervised Category-Level Articulated Object Pose Estimation with Part-Level SE(3) Equivariance
ECML-PKDD
2023
Uncovering Multivariate Structural Dependency for Analyzing Irregularly Sampled Time Series