Ding, Pengxiang
14 publications
ICLR
2026
Spatial Forcing: Implicit Spatial Representation Alignment for Vision-Language-Action Model
ICLR
2026
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process
CoRL
2025
Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation
NeurIPS
2025
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning