Glass, James

14 publications

ICCV 2025 Teaching VLMs to Localize Specific Objects from In-Context Examples Sivan Doveh, Nimrod Shabtay, Eli Schwartz, Hilde Kuehne, Raja Giryes, Rogerio Feris, Leonid Karlinsky, James Glass, Assaf Arbelle, Shimon Ullman, M. Jehanzeb Mirza
CVPR 2024 What When and Where? Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Daniel Kondermann, Samuel Thomas, Shih-Fu Chang, Rogerio Feris, James Glass, Hilde Kuehne
CVPR 2022 Everything at Once - Multi-Modal Fusion Transformer for Video Retrieval Nina Shvetsova, Brian Chen, Andrew Rouditchenko, Samuel Thomas, Brian Kingsbury, Rogerio S. Feris, David Harwath, James Glass, Hilde Kuehne
ICCV 2021 Multimodal Clustering Networks for Self-Supervised Learning from Unlabeled Videos Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie Boggust, Rameswar Panda, Brian Kingsbury, Rogerio Feris, David Harwath, James Glass, Michael Picheny, Shih-Fu Chang
CVPR 2021 Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions Mathew Monfort, SouYoung Jin, Alexander Liu, David Harwath, Rogerio Feris, James Glass, Aude Oliva
ICLR 2020 Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech David Harwath, Wei-Ning Hsu, James Glass
ICLR 2019 Detecting Egregious Responses in Neural Sequence-to-Sequence Models Tianxing He, James Glass
ICLR 2019 Identifying and Controlling Important Neurons in Neural Machine Translation Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James Glass
ECCV 2018 Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input David Harwath, Adria Recasens, Didac Suris, Galen Chuang, Antonio Torralba, James Glass
NeurIPS 2018 Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James Glass
NeurIPS 2017 Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems Yonatan Belinkov, James Glass
NeurIPS 2017 Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data Wei-Ning Hsu, Yu Zhang, James Glass
NeurIPS 2016 Unsupervised Learning of Spoken Language with Visual Context David Harwath, Antonio Torralba, James Glass
NeurIPS 1990 From Speech Recognition to Spoken Language Understanding: The Development of the MIT SUMMIT and VOYAGER Systems Victor Zue, James Glass, David Goodine, Lynette Hirschman, Hong Leung, Michael Phillips, Joseph Polifroni, Stephanie Seneff