Jiang, Yu-Gang
125 publications
ICCV
2025
Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
ICCV
2025
CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
NeurIPS
2025
Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection
NeurIPS
2025
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection
ICCV
2025
From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning
NeurIPS
2025
TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making
CVPRW
2025
UniToken: Harmonizing Multimodal Understanding and Generation Through Unified Visual Encoding
AAAI
2024
LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network
NeurIPS
2024
UnSeg: One Universal Unlearnable Example Generator Is Enough Against All Image Segmentation
AAAI
2024
nuScenes-QA: A Multi-Modal Visual Question Answering Benchmark for Autonomous Driving Scenario
CVPR
2023
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
ICML
2023
Open-VCLIP: Transforming CLIP to an Open-Vocabulary Video Model via Interpolated Weight Optimization
CVPR
2021
Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning
ECCV
2020
Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos