Marks, Tim K.
27 publications
CVPR
2024
RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation
CVPR
2020
LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood
CVPRW
2018
Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description