Metze, Florian

12 publications

ICLR 2026 WearVox: An Egocentric Multichannel Voice Assistant Benchmark for Wearables Zhaojiang Lin, Yong Xu, Kai Sun, Jing Zheng, Yin Huang, Surya Teja Appini, Krish Narang, Renjie Tao, Ishan Kapil Jain, Siddhant Arora, Ruizhi Li, Yiteng Huang, Kaushik Patnaik, Wenfang Xu, Suwon Shon, Yue Liu, Ahmed A Aly, Anuj Kumar, Florian Metze, Xin Luna Dong
NeurIPSW 2024 FSD: Acoustic Echo Cancellation with Fewer Step Diffusion Yang Liu, Li Wan, Yiteng Huang, Ming Sun, Changsheng Zhao, Zhaoheng Ni, Xinhao Mei, Yangyang Shi, Florian Metze
ICMLW 2023 Audio-Journey: Efficient Visual+LLM-Aided Audio Encodec Diffusion Juncheng B Li, Jackson Sam Michaels, Laura Yao, Lijun Yu, Zach Wood-Doughty, Florian Metze
ICMLW 2023 Dissecting Efficient Architectures for Wake-Word Detection Cody Berger, Juncheng B Li, Yiyuan Li, Aaron Berger, Dmitri Berger, Karthik Ganesan, Emma Strubell, Florian Metze
NeurIPS 2022 Masked Autoencoders That Listen Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer
CVPR 2022 Self-Supervised Object Detection from Audio-Visual Correspondence Triantafyllos Afouras, Yuki M. Asano, Francois Fagan, Andrea Vedaldi, Florian Metze
CVPR 2021 How2Sign: A Large-Scale Multimodal Dataset for Continuous American Sign Language Amanda Duarte, Shruti Palaskar, Lucas Ventura, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giro-i-Nieto
NeurIPS 2021 Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers Mandela Patrick, Dylan Campbell, Yuki Asano, Ishan Misra, Florian Metze, Christoph Feichtenhofer, Andrea Vedaldi, João F. Henriques
ICCV 2021 Space-Time Crop & Attend: Improving Cross-Modal Video Representation Learning Mandela Patrick, Po-Yao Huang, Ishan Misra, Florian Metze, Andrea Vedaldi, Yuki M. Asano, João F. Henriques
ICLR 2021 Support-Set Bottlenecks for Video-Text Representation Learning Mandela Patrick, Po-Yao Huang, Yuki Asano, Florian Metze, Alexander G Hauptmann, Joao F. Henriques, Andrea Vedaldi
AAAI 2020 Towards Zero-Shot Learning for Automatic Phonemic Transcription Xinjian Li, Siddharth Dalmia, David R. Mortensen, Juncheng Li, Alan W. Black, Florian Metze
NeurIPS 2019 Adversarial Music: Real World Audio Adversary Against Wake-Word Detection System Juncheng Li, Shuhui Qu, Xinjian Li, Joseph Szurley, J. Zico Kolter, Florian Metze