Yamazaki, Kashu

4 publications

NeurIPS 2024 HENASY: Learning to Assemble Scene-Entities for Interpretable Egocentric Video-Language Model Khoa Vo, Thinh Phan, Kashu Yamazaki, Minh Tran, Ngan Le
ECCV 2024 R^2-Bench: Benchmarking the Robustness of Referring Perception Models Under Perturbations Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Kashu Yamazaki, Hao Chen, Rita Singh, Xiaonan Huang, Bhiksha Raj
CVPRW 2023 DNA: Deformable Neural Articulations Network for Template-Free Dynamic 3D Human Reconstruction from Monocular RGB-D Video Khoa Vo, Trong-Thang Pham, Kashu Yamazaki, Minh Q. Tran, Ngan Le
AAAI 2023 VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning Kashu Yamazaki, Khoa Vo, Quang Sang Truong, Bhiksha Raj, Ngan Le