Shen, Heng Tao
77 publications
AAAI
2025
CDTR: Semantic Alignment for Video Moment Retrieval Using Concept Decomposition Transformer
NeurIPS
2025
SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-Then-Restore Mechanism
CoRL
2025
Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation
CVPR
2025
Skip Tuning: Pre-Trained Vision-Language Models Are Effective and Efficient Adapters Themselves
NeurIPS
2025
Table2LaTeX-RL: High-Fidelity LaTeX Code Generation from Table Images via Reinforced Multimodal Language Models
NeurIPS
2024
Alleviating Hallucinations in Large Vision-Language Models Through Hallucination-Induced Optimization
ICLR
2024
An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization
IJCAI
2021
PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation