Zhao, Han
105 publications
ICLR
2026
Spatial Forcing: Implicit Spatial Representation Alignment for Vision-Language-Action Model
ICLR
2026
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process
NeurIPS
2025
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning
ICLR
2025
Learning Structured Representations by Embedding Class Hierarchy with Fast Optimal Transport
NeurIPS
2025
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
NeurIPS
2025
Taming Hyperparameter Sensitivity in Data Attribution: Practical Selection Without Costly Retraining
ICLR
2025
VLAS: Vision-Language-Action Model with Speech Instructions for Customized Robot Manipulation
CVPR
2023
Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning
ICML
2021
Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation
NeurIPS
2020
Trade-Offs and Guarantees of Adversarial Representation Learning for Information Obfuscation