Glass, James R.

23 publications

CVPR 2025 CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment Edson Araujo, Andrew Rouditchenko, Yuan Gong, Saurabhchand Bhati, Samuel Thomas, Brian Kingsbury, Leonid Karlinsky, Rogerio Feris, James R. Glass, Hilde Kuehne
NeurIPS 2025 Can Diffusion Models Disentangle? A Theoretical Perspective Liming Wang, Muhammad Jehanzeb Mirza, Yishu Gong, Yuan Gong, Jiaqi Zhang, Brian H. Tracey, Katerina Placek, Marco Vilela, James R. Glass
TMLR 2025 GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models Muhammad Jehanzeb Mirza, Mengjie Zhao, Zhuoyuan Mao, Sivan Doveh, Wei Lin, Paul Gavrikov, Michael Dorkenwald, Shiqi Yang, Saurav Jha, Hiromi Wakaki, Yuki Mitsufuji, Horst Possegger, Rogerio Feris, Leonid Karlinsky, James R. Glass
NeurIPS 2025 Meta CLIP 2: A Worldwide Scaling Recipe Yung-Sung Chuang, Yang Li, Dong Wang, Ching-Feng Yeh, Kehan Lyu, Ramya Raghavendra, James R. Glass, Lifei Huang, Jason E. Weston, Luke Zettlemoyer, Xinlei Chen, Zhuang Liu, Saining Xie, Wen-tau Yih, Shang-Wen Li, Hu Xu
ICLR 2025 Quantifying Generalization Complexity for Large Language Models Zhenting Qi, Hongyin Luo, Xuliang Huang, Zhuokai Zhao, Yibo Jiang, Xiangjun Fan, Himabindu Lakkaraju, James R. Glass
NeurIPS 2025 ROVER: Recursive Reasoning over Videos with Vision-Language Models for Embodied Tasks Philip Schroeder, Ondrej Biza, Thomas Weng, Hongyin Luo, James R. Glass
ICLR 2025 Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts Junmo Kang, Leonid Karlinsky, Hongyin Luo, Zhen Wang, Jacob A. Hansen, James R. Glass, David Daniel Cox, Rameswar Panda, Rogerio Feris, Alan Ritter
ICML 2025 SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Yung-Sung Chuang, Benjamin Cohen-Wang, Zejiang Shen, Zhaofeng Wu, Hu Xu, Xi Victoria Lin, James R. Glass, Shang-Wen Li, Wen-tau Yih
ICLR 2025 UniWav: Towards Unified Pre-Training for Speech Representation Learning and Generation Alexander H. Liu, Sang-gil Lee, Chao-Han Huck Yang, Yuan Gong, Yu-Chiang Frank Wang, James R. Glass, Rafael Valle, Bryan Catanzaro
NeurIPSW 2024 A Closer Look at Neural Codec Resynthesis: Bridging the Gap Between Codec and Waveform Generation Alexander H. Liu, Qirui Wang, Yuan Gong, James R. Glass
NeurIPSW 2024 Curiosity-Driven Red Teaming for Large Language Models Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James R. Glass, Akash Srivastava, Pulkit Agrawal
ICLR 2024 Curiosity-Driven Red-Teaming for Large Language Models Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James R. Glass, Akash Srivastava, Pulkit Agrawal
ICLR 2024 DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models Yung-Sung Chuang, Yujia Xie, Hongyin Luo, Yoon Kim, James R. Glass, Pengcheng He
ICLR 2024 Listen, Think, and Understand Yuan Gong, Hongyin Luo, Alexander H. Liu, Leonid Karlinsky, James R. Glass
ICLR 2023 Contrastive Audio-Visual Masked Autoencoder Yuan Gong, Andrew Rouditchenko, Alexander H. Liu, David Harwath, Leonid Karlinsky, Hilde Kuehne, James R. Glass
ICMLW 2022 Growing ObjectNet: Adding Speech, VQA, Occlusion, and Measuring Dataset Difficulty David Mayo, David Lu, Chris Zhang, Jesse Cummings, Xinyu Lin, Boris Katz, James R. Glass, Andrei Barbu
AAAI 2022 SSAST: Self-Supervised Audio Spectrogram Transformer Yuan Gong, Cheng-I Lai, Yu-An Chung, James R. Glass
CVPRW 2019 Grounding Spoken Words in Unlabeled Video Angie W. Boggust, Kartik Audhkhasi, Dhiraj Joshi, David Harwath, Samuel Thomas, Rogério Schmidt Feris, Danny Gutfreund, Yang Zhang, Antonio Torralba, Michael Picheny, James R. Glass
AAAI 2019 NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks Fahim Dalvi, Avery Nortonsmith, Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, James R. Glass
AAAI 2019 What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Yonatan Belinkov, Anthony Bau, James R. Glass
AAAI 2018 Fact Checking in Community Forums Tsvetomila Mihaylova, Preslav Nakov, Lluís Màrquez, Alberto Barrón-Cedeño, Mitra Mohtarami, Georgi Karadzhov, James R. Glass
ICCV 2005 Visual Speech Recognition with Loosely Synchronized Feature Streams Kate Saenko, Karen Livescu, Michael Siracusa, Kevin W. Wilson, James R. Glass, Trevor Darrell
NeurIPS 1990 Phonetic Classification and Recognition Using the Multi-Layer Perceptron Hong C. Leung, James R. Glass, Michael S. Phillips, Victor W. Zue