Hu, Jian
10 publications
NeurIPS
2025
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
NeurIPS
2025
Uncertainty-Quantified Rollout Policy Adaptation for Unlabelled Cross-Domain Video Temporal Grounding