Assran, Mido

12 publications

TMLR 2025 A Shortcut-Aware Video-QA Benchmark for Physical Understanding via Minimal Video Pairs Benno Krojer, Mojtaba Komeili, Candace Ross, Quentin Garrido, Koustuv Sinha, Nicolas Ballas, Mido Assran
ICLR 2025 An Image Is Worth More than 16x16 Patches: Exploring Transformers on Individual Pixels Duy Kien Nguyen, Mido Assran, Unnat Jain, Martin R. Oswald, Cees G. M. Snoek, Xinlei Chen
ICML 2025 LOCATE 3D: Real-World Object Localization via Self-Supervised Learning in 3D Paul McVay, Sergio Arnaud, Ada Martin, Arjun Majumdar, Krishna Murthy Jatavallabhula, Phillip Thomas, Ruslan Partsey, Daniel Dugas, Abha Gejji, Alexander Sax, Vincent-Pierre Berges, Mikael Henaff, Ayush Jain, Ang Cao, Ishita Prasad, Mrinal Kalakrishnan, Michael Rabbat, Nicolas Ballas, Mido Assran, Oleksandr Maksymets, Aravind Rajeswaran, Franziska Meier
TMLR 2025 SpidR: Learning Fast and Stable Linguistic Units for Spoken Language Models Without Supervision Maxime Poli, Mahi Luthra, Youssef Benchekroun, Yosuke Higuchi, Martin Gleize, Jiayi Shen, Robin Algayres, Yu-An Chung, Mido Assran, Juan Pino, Emmanuel Dupoux
ICLR 2025 VEDIT: Latent Prediction Architecture for Procedural Video Representation Learning Han Lin, Tushar Nagarajan, Nicolas Ballas, Mido Assran, Mojtaba Komeili, Mohit Bansal, Koustuv Sinha
TMLR 2024 DINOv2: Learning Robust Visual Features Without Supervision Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V. Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mido Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Herve Jegou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski
ICML 2024 Modeling Caption Diversity in Contrastive Vision-Language Pretraining Samuel Lavoie, Polina Kirichenko, Mark Ibrahim, Mido Assran, Andrew Gordon Wilson, Aaron Courville, Nicolas Ballas
TMLR 2024 Revisiting Feature Prediction for Learning Visual Representations from Video Adrien Bardes, Quentin Garrido, Jean Ponce, Xinlei Chen, Michael Rabbat, Yann LeCun, Mido Assran, Nicolas Ballas
ICML 2024 Stochastic Positional Embeddings Improve Masked Image Modeling Amir Bar, Florian Bordes, Assaf Shocher, Mido Assran, Pascal Vincent, Nicolas Ballas, Trevor Darrell, Amir Globerson, Yann LeCun
ICLR 2023 RoPAWS: Robust Semi-Supervised Representation Learning from Uncurated Data Sangwoo Mo, Jong-Chyi Su, Chih-Yao Ma, Mido Assran, Ishan Misra, Licheng Yu, Sean Bell
ICLR 2023 The Hidden Uniform Cluster Prior in Self-Supervised Learning Mido Assran, Randall Balestriero, Quentin Duval, Florian Bordes, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael Rabbat, Nicolas Ballas
ICLR 2022 Memory Augmented Optimizers for Deep Learning Paul-Aymeric Martin McRae, Prasanna Parthasarathi, Mido Assran, Sarath Chandar