Wang, Zihan
49 publications
ICCV
2025
Auxiliary Prompt Tuning of Vision-Language Models for Few-Shot Out-of-Distribution Detection
CVPR
2025
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-Modal LLMs in Video Analysis
CoRL
2024
I Can Tell What I Am Doing: Toward Real-World Natural Language Grounding of Robot Experiences
ICLR
2024
Implicit Bias of SGD in $l_2$-Regularized Linear DNNs: One-Way Jumps from High to Low Rank
CVPR
2024
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation
CVPR
2024
Multi-Scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition
NeurIPS
2024
SciInstruct: A Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models
AAAI
2023
Learning to Imagine: Distillation-Based Interactive Context Exploitation for Dialogue State Tracking
NeurIPSW
2023
WavSpA: Wavelet Space Attention for Boosting Transformers' Long Sequence Learning Ability