Li, Lu
18 publications
IJCAI
2025
ListenNet: A Lightweight Spatio-Temporal Enhancement Nested Network for Auditory Attention Detection
ICLR
2025
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text
18 publications