Zhao, Bin
47 publications
ICCV
2025
AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations
CoRL
2025
FastUMI: A Scalable and Hardware-Independent Universal Manipulation Interface with Dataset
ICCV
2025
MoMa-Kitchen: A 100k+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation
NeurIPS
2024
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training
NeurIPS
2024
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Control and Rendering
ICML
2024
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
NeurIPS
2023
Diffusion Model Is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning