Jansen, Aren

7 publications

ICML 2025. Long-Form Speech Generation with Spoken Language Models. Se Jin Park, Julian Salazar, Aren Jansen, Keisuke Kinoshita, Yong Man Ro, Rj Skerry-Ryan
ICLR 2025. MELODI: Exploring Memory Compression for Long Contexts. Yinpeng Chen, DeLesley Hutchins, Aren Jansen, Andrey Zhmoginov, David Racz, Jesper Sparre Andersen
NeurIPS 2024. A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation. Gwanghyun Kim, Alonso Martinez, Yu-Chuan Su, Brendan Jou, José Lezama, Agrim Gupta, Lijun Yu, Lu Jiang, Aren Jansen, Jacob Walker, Krishna Somandepalli
AAAI 2024. V2Meow: Meowing to the Visual Beat via Video-to-Music Generation. Kun Su, Judith Yue Li, Qingqing Huang, Dima Kuzmin, Joonseok Lee, Chris Donahue, Fei Sha, Aren Jansen, Yu Wang, Mauro Verzetti, Timo I. Denk
NeurIPSW 2022. MAQA: A Multimodal QA Benchmark for Negation. Judith Yue Li, Aren Jansen, Qingqing Huang, Ravi Ganti, Joonseok Lee, Dima Kuzmin
NeurIPS 2021. Attention Bottlenecks for Multimodal Fusion. Arsha Nagrani, Shan Yang, Anurag Arnab, Aren Jansen, Cordelia Schmid, Chen Sun
ICLR 2021. Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds. Efthymios Tzinis, Scott Wisdom, Aren Jansen, Shawn Hershey, Tal Remez, Dan Ellis, John R. Hershey