Wang, Wenjie
24 publications
NeurIPS
2025
Auto-Search and Refinement: An Automated Framework for Gender Bias Mitigation in Large Language Models
AAAI
2025
CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG
AAAI
2025
MMJ-Bench: A Comprehensive Study on Jailbreak Attacks and Defenses for Vision Language Models
CVPR
2025
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-Guided Self-Training