Chen, Long
85 publications
ICML
2025
Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing
CVPR
2025
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
NeurIPS
2025
Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models
CVPR
2025
Embracing Collaboration over Competition: Condensing Multiple Prompts for Visual In-Context Learning
NeurIPS
2025
Interaction-Centric Knowledge Infusion and Transfer for Open Vocabulary Scene Graph Generation
CVPR
2025
Inversion Circle Interpolation: Diffusion-Based Image Augmentation for Data-Scarce Classification
ICLR
2025
Multi-Resolution Decomposable Diffusion Model for Non-Stationary Time Series Anomaly Detection
AAAI
2025
Open-World Multimodal Understanding and Generation with Efficiently Finetuned Foundation Models
NeurIPS
2025
SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models
NeurIPS
2024
$\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation
ECCV
2024
An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding
ECCV
2024
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
ICML
2024
Towards Efficient Deep Spiking Neural Networks Construction with Spiking Activity Based Pruning
CVPR
2024
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
NeurIPS
2023
Two Heads Are Better than One: A Simple Exploration Framework for Efficient Multi-Agent Reinforcement Learning
NeurIPS
2023
Zero-Shot Visual Relation Detection via Composite Visual Cues from Large Language Models
CVPR
2022
Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
NeurIPS
2021
FMMformer: Efficient and Flexible Transformer via Decomposed Near-Field and Far-Field Attention
AAAI
2019
Answer Identification from Product Reviews for User Questions by Multi-Task Attentive Networks
IJCAI
2019
MR-GNN: Multi-Resolution and Dual Graph Neural Network for Predicting Structured Entity Interactions