Shao, Rui
21 publications
ICCV
2025
Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation
NeurIPS
2025
CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification
IJCAI
2025
Incorporating Legal Logic into Deep Learning: An Intelligent Approach to Probation Prediction
NeurIPS
2025
PUO-Bench: A Panel Understanding and Operation Benchmark with a Privacy-Preserving Framework
ICML
2025
STAR: Learning Diverse Robot Skill Abstractions Through Rotation-Augmented Vector Quantization
CVPR
2025
Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation
ECCV
2024
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios