Sun, Xiaoshuai
62 publications
ICLR
2026
RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning
NeurIPS
2025
Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings
ICLR
2025
Routing Experts: Learning to Route Dynamic Experts in Existing Multi-Modal Large Language Models
NeurIPS
2024
DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion
AAAI
2024
Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation
NeurIPS
2024
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation
ICML
2024
X-Oscar: A Progressive Framework for High-Quality Text-Guided 3D Animatable Avatar Generation
NeurIPS
2023
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models
NeurIPS
2023
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-Trained Models
CVPR
2020
Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
AAAI
2019
Towards Optimal Fine Grained Retrieval via Decorrelated Centralized Loss with Normalize-Scale Layer
IJCAI
2018
Centralized Ranking Loss with Weakly Supervised Localization for Fine-Grained Object Retrieval
CVPR
2018
GroupCap: Group-Based Image Captioning with Structured Relevance and Diversity Constraints