Adi, Yossi

27 publications

TMLR 2025 Auto-Regressive vs Flow-Matching: A Comparative Study of Modeling Paradigms for Text-to-Music Generation Or Tal, Felix Kreuk, Yossi Adi
ICCV 2025 CAFA: A Controllable Automatic Foley Artist Roi Benita, Michael Finkelson, Tavi Halperin, Gleb Sterkin, Yossi Adi
TMLR 2025 Discrete Audio Tokens: More than a Survey! Pooneh Mousavi, Gallil Maimon, Adel Moumen, Darius Petermann, Jiatong Shi, Haibin Wu, Haici Yang, Anastasia Kuznetsova, Artem Ploujnikov, Ricard Marxer, Bhuvana Ramabhadran, Benjamin Elizalde, Loren Lugosch, Jinyu Li, Cem Subakan, Phil Woodland, Minje Kim, Hung-yi Lee, Shinji Watanabe, Yossi Adi, Mirco Ravanelli
TMLR 2025 On the Landscape of Spoken Language Models: A Comprehensive Survey Siddhant Arora, Kai-Wei Chang, Chung-Ming Chien, Yifan Peng, Haibin Wu, Yossi Adi, Emmanuel Dupoux, Hung-yi Lee, Karen Livescu, Shinji Watanabe
CVPR 2025 Through-the-Mask: Mask-Based Motion Trajectories for Image-to-Video Generation Guy Yariv, Yuval Kirstain, Amit Zohar, Shelly Sheynin, Yaniv Taigman, Yossi Adi, Sagie Benaim, Adam Polyak
ICML 2024 An Independence-Promoting Loss for Music Generation with Language Models Jean-Marie Lemercier, Simon Rouard, Jade Copet, Yossi Adi, Alexandre Défossez
NeurIPS 2024 Discrete Flow Matching Itai Gat, Tal Remez, Neta Shaul, Felix Kreuk, Ricky T. Q. Chen, Gabriel Synnaeve, Yossi Adi, Yaron Lipman
AAAI 2024 Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation Guy Yariv, Itai Gat, Sagie Benaim, Lior Wolf, Idan Schwartz, Yossi Adi
ICMLW 2024 Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation Idan Schwartz, Guy Yariv, Itai Gat, Yossi Adi, Sagie Benaim, Lior Wolf
AAAI 2024 Layer Collaboration in the Forward-Forward Algorithm Guy Lorberbom, Itai Gat, Yossi Adi, Alexander G. Schwing, Tamir Hazan
ICLR 2024 Masked Audio Generation Using a Single Non-Autoregressive Transformer Alon Ziv, Itai Gat, Gael Le Lan, Tal Remez, Felix Kreuk, Jade Copet, Alexandre Défossez, Gabriel Synnaeve, Yossi Adi
JMLR 2024 Scaling Speech Technology to 1,000+ Languages Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli
ICLR 2023 AudioGen: Textually Guided Audio Generation Felix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi
NeurIPS 2023 From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion Robin San Roman, Yossi Adi, Antoine Deleforge, Romain Serizel, Gabriel Synnaeve, Alexandre Defossez
TMLR 2023 High Fidelity Neural Audio Compression Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi
CVPR 2023 ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Regeneration Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi
NeurIPS 2023 Simple and Controllable Music Generation Jade Copet, Felix Kreuk, Itai Gat, Tal Remez, David Kant, Gabriel Synnaeve, Yossi Adi, Alexandre Defossez
NeurIPS 2023 Textually Pretrained Speech Language Models Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Defossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi
NeurIPS 2023 Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu
TMLR 2022 Differentiable Model Compression via Pseudo Quantization Noise Alexandre Défossez, Yossi Adi, Gabriel Synnaeve
ICLR 2022 Learning Discrete Structured Variational Auto-Encoder Using Natural Evolution Strategies Alon Berliner, Guy Rotman, Yossi Adi, Roi Reichart, Tamir Hazan
NeurIPS 2022 On the Importance of Gradient Norm in PAC-Bayesian Bounds Itai Gat, Yossi Adi, Alex Schwing, Tamir Hazan
ICML 2020 Voice Separation with an Unknown Number of Multiple Speakers Eliya Nachmani, Yossi Adi, Lior Wolf
NeurIPS 2018 Out-of-Distribution Detection Using Multiple Semantic Label Representations Gabi Shalev, Yossi Adi, Joseph Keshet
ICLR 2017 Fine-Grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks Yossi Adi, Einat Kermany, Yonatan Belinkov, Ofer Lavi, Yoav Goldberg
NeurIPS 2017 Houdini: Fooling Deep Structured Visual and Speech Recognition Models with Adversarial Examples Moustapha M Cisse, Yossi Adi, Natalia Neverova, Joseph Keshet
JMLR 2016 StructED: Risk Minimization in Structured Prediction Yossi Adi, Joseph Keshet