Shen, Heng Tao
84 publications
ICLR
2026
GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
AAAI
2025
CDTR: Semantic Alignment for Video Moment Retrieval Using Concept Decomposition Transformer
NeurIPS
2025
SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-Then-Restore Mechanism
CoRL
2025
Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation
CVPR
2025
Skip Tuning: Pre-Trained Vision-Language Models Are Effective and Efficient Adapters Themselves
NeurIPS
2025
Table2LaTeX-RL: High-Fidelity LaTeX Code Generation from Table Images via Reinforced Multimodal Language Models
NeurIPS
2024
Alleviating Hallucinations in Large Vision-Language Models Through Hallucination-Induced Optimization
ICLR
2024
An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization
IJCAI
2021
PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation
CVPR
2019
Exact Adversarial Attack to Image Captioning via Structured Output Learning with Latent Variables