Ji, Lei
18 publications
NeurIPS
2025
PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning
NeurIPS
2023
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-Ray Images
CVPR
2023
MIST: Multi-Modal Iterative Spatial-Temporal Transformer for Long-Form Video Question Answering
WACV
2022
Learning Temporal Video Procedure Segmentation from an Automatically Collected Large Dataset