Phan, Thinh

2 publications

NeurIPS 2024 HENASY: Learning to Assemble Scene-Entities for Interpretable Egocentric Video-Language Model Khoa Vo, Thinh Phan, Kashu Yamazaki, Minh Tran, Ngan Le
WACV 2024 ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection Thinh Phan, Khoa Vo, Duy Le, Gianfranco Doretto, Donald Adjeroh, Ngan Le