Bengio, Samy

78 publications

ICLR 2025 GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models Seyed Iman Mirzadeh, Keivan Alizadeh, Hooman Shahrokhi, Oncel Tuzel, Samy Bengio, Mehrdad Farajtabar
NeurIPS 2025 The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Parshin Shojaee, Seyed Iman Mirzadeh, Keivan Alizadeh, Maxwell Horton, Samy Bengio, Mehrdad Farajtabar
JMLR 2024 Generalization on the Unseen, Logic Reasoning and Degree Curriculum Emmanuel Abbe, Samy Bengio, Aryo Lotfi, Kevin Rizk
NeurIPS 2024 How Far Can Transformers Reason? the Globality Barrier and Inductive Scratchpad Emmanuel Abbe, Samy Bengio, Aryo Lotfi, Colin Sandon, Omid Saremi
ICLR 2024 What Algorithms Can Transformers Learn? a Study in Length Generalization Hattie Zhou, Arwen Bradley, Etai Littwin, Noam Razin, Omid Saremi, Joshua M. Susskind, Samy Bengio, Preetum Nakkiran
ICLR 2024 When Can Transformers Reason with Abstract Symbols? Enric Boix-Adserà, Omid Saremi, Emmanuel Abbe, Samy Bengio, Etai Littwin, Joshua M. Susskind
ICLR 2023 Continuous Pseudo-Labeling from the Start Dan Berrebbi, Ronan Collobert, Samy Bengio, Navdeep Jaitly, Tatiana Likhomanenko
ICML 2023 Generalization on the Unseen, Logic Reasoning and Degree Curriculum Emmanuel Abbe, Samy Bengio, Aryo Lotfi, Kevin Rizk
NeurIPS 2023 Transformers Learn Through Gradual Rank Increase Enric Boix-Adsera, Etai Littwin, Emmanuel Abbe, Samy Bengio, Joshua Susskind
NeurIPSW 2023 What Algorithms Can Transformers Learn? a Study in Length Generalization Hattie Zhou, Arwen Bradley, Etai Littwin, Noam Razin, Omid Saremi, Joshua Susskind, Samy Bengio, Preetum Nakkiran
JMLR 2022 Are All Layers Created Equal? Chiyuan Zhang, Samy Bengio, Yoram Singer
NeurIPSW 2022 Continuous Soft Pseudo-Labeling in ASR Tatiana Likhomanenko, Ronan Collobert, Navdeep Jaitly, Samy Bengio
NeurIPS 2022 Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures Emmanuel Abbe, Samy Bengio, Elisabetta Cornacchia, Jon M. Kleinberg, Aryo Lotfi, Maithra Raghu, Chiyuan Zhang
NeurIPS 2021 Improving Anytime Prediction with Parallel Cascaded Networks and a Temporal-Difference Loss Michael Iuzzolino, Michael Mozer, Samy Bengio
NeurIPS 2021 Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding Yang Li, Si Si, Gang Li, Cho-Jui Hsieh, Samy Bengio
ICLR 2020 Fantastic Generalization Measures and Where to Find Them Yiding Jiang, Behnam Neyshabur, Hossein Mobahi, Dilip Krishnan, Samy Bengio
ICLR 2020 Identity Crisis: Memorization and Generalization Under Extreme Overparameterization Chiyuan Zhang, Samy Bengio, Moritz Hardt, Michael C. Mozer, Yoram Singer
NeurIPS 2020 Memory Based Trajectory-Conditioned Policies for Learning from Sparse Rewards Yijie Guo, Jongwook Choi, Marcin Moczulski, Shengyu Feng, Samy Bengio, Mohammad Norouzi, Honglak Lee
ICLR 2020 Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML Aniruddh Raghu, Maithra Raghu, Samy Bengio, Oriol Vinyals
ICMLW 2019 Are All Layers Created Equal? Chiyuan Zhang, Samy Bengio, Yoram Singer
ICML 2019 Area Attention Yang Li, Lukasz Kaiser, Samy Bengio, Si Si
ICMLW 2019 Identity Crisis: Memorization and Generalization Under Extreme Overparameterization Chiyuan Zhang, Samy Bengio, Moritz Hardt, Michael C. Mozer, Yoram Singer
ICLR 2019 Predicting the Generalization Gap in Deep Networks with Margin Distributions Yiding Jiang, Dilip Krishnan, Hossein Mobahi, Samy Bengio
NeurIPS 2019 Transfusion: Understanding Transfer Learning for Medical Imaging Maithra Raghu, Chiyuan Zhang, Jon Kleinberg, Samy Bengio
NeurIPS 2018 Content Preserving Text Generation with Attribute Controls Lajanugen Logeswaran, Honglak Lee, Samy Bengio
ICML 2018 Fast Decoding in Sequence Models Using Discrete Latent Variables Lukasz Kaiser, Samy Bengio, Aurko Roy, Ashish Vaswani, Niki Parmar, Jakob Uszkoreit, Noam Shazeer
NeurIPS 2018 Insights on Representational Similarity in Neural Networks with Canonical Correlation Ari Morcos, Maithra Raghu, Samy Bengio
NeurIPS 2018 Large Margin Deep Networks for Classification Gamaleldin Elsayed, Dilip Krishnan, Hossein Mobahi, Kevin Regan, Samy Bengio
ICLR 2017 Adversarial Examples in the Physical World Alexey Kurakin, Ian J. Goodfellow, Samy Bengio
ICLR 2017 Adversarial Machine Learning at Scale Alexey Kurakin, Ian J. Goodfellow, Samy Bengio
CVPR 2017 Context-Aware Captions from Context-Agnostic Supervision Ramakrishna Vedantam, Samy Bengio, Kevin Murphy, Devi Parikh, Gal Chechik
ICLR 2017 Density Estimation Using Real NVP Laurent Dinh, Jascha Sohl-Dickstein, Samy Bengio
ICML 2017 Device Placement Optimization with Reinforcement Learning Azalia Mirhoseini, Hieu Pham, Quoc V. Le, Benoit Steiner, Rasmus Larsen, Yuefeng Zhou, Naveen Kumar, Mohammad Norouzi, Samy Bengio, Jeff Dean
ICLR 2017 Learning to Remember Rare Events Lukasz Kaiser, Ofir Nachum, Aurko Roy, Samy Bengio
ICLR 2017 Neural Combinatorial Optimization with Reinforcement Learning Irwan Bello, Hieu Pham, Quoc V. Le, Mohammad Norouzi, Samy Bengio
ICML 2017 Sharp Minima Can Generalize for Deep Nets Laurent Dinh, Razvan Pascanu, Samy Bengio, Yoshua Bengio
ICLR 2017 Understanding Deep Learning Requires Rethinking Generalization Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, Oriol Vinyals
ICML 2016 ADIOS: Architectures Deep in Output Space Moustapha Cisse, Maruan Al-Shedivat, Samy Bengio
NeurIPS 2016 An Online Sequence-to-Sequence Model Using Partial Conditioning Navdeep Jaitly, Quoc V Le, Oriol Vinyals, Ilya Sutskever, David Sussillo, Samy Bengio
NeurIPS 2016 Can Active Memory Replace Attention? Łukasz Kaiser, Samy Bengio
JMLR 2016 LLORMA: Local Low-Rank Matrix Approximation Joonseok Lee, Seungyeon Kim, Guy Lebanon, Yoram Singer, Samy Bengio
ICLR 2016 Order Matters: Sequence to Sequence for Sets Oriol Vinyals, Samy Bengio, Manjunath Kudlur
NeurIPS 2016 Reward Augmented Maximum Likelihood for Neural Structured Prediction Mohammad Norouzi, Samy Bengio, Zhifeng Chen, Navdeep Jaitly, Mike Schuster, Yonghui Wu, Dale Schuurmans
CVPR 2015 Learning Semantic Relationships for Better Action Retrieval in Images Vignesh Ramanathan, Congcong Li, Jia Deng, Wei Han, Zhen Li, Kunlong Gu, Yang Song, Samy Bengio, Charles Rosenberg, Li Fei-Fei
NeurIPS 2015 Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks Samy Bengio, Oriol Vinyals, Navdeep Jaitly, Noam Shazeer
CVPR 2015 Show and Tell: A Neural Image Caption Generator Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan
ECCV 2014 Large-Scale Object Classification Using Label Relation Graphs Jia Deng, Nan Ding, Yangqing Jia, Andrea Frome, Kevin Murphy, Samy Bengio, Yuan Li, Hartmut Neven, Hartwig Adam
JMLR 2014 Training Highly Multiclass Classifiers Maya R. Gupta, Samy Bengio, Jason Weston
ICLR 2014 Zero-Shot Learning by Convex Combination of Semantic Embeddings Mohammad Norouzi, Tomás Mikolov, Samy Bengio, Yoram Singer, Jonathon Shlens, Andrea Frome, Greg Corrado, Jeffrey Dean
NeurIPS 2013 DeViSE: A Deep Visual-Semantic Embedding Model Andrea Frome, Greg S Corrado, Jon Shlens, Samy Bengio, Jeff Dean, Marc'Aurelio Ranzato, Tomas Mikolov
IJCAI 2011 WSABIE: Scaling up to Large Vocabulary Image Annotation Jason Weston, Samy Bengio, Nicolas Usunier
NeurIPS 2010 Label Embedding Trees for Large Multi-Class Tasks Samy Bengio, Jason Weston, David Grangier
MLJ 2010 Large Scale Image Annotation: Learning to Rank with Joint Word-Image Embeddings Jason Weston, Samy Bengio, Nicolas Usunier
JMLR 2010 Large Scale Online Learning of Image Similarity Through Ranking Gal Chechik, Varun Sharma, Uri Shalit, Samy Bengio
JMLR 2010 Why Does Unsupervised Pre-Training Help Deep Learning? Dumitru Erhan, Yoshua Bengio, Aaron Courville, Pierre-Antoine Manzagol, Pascal Vincent, Samy Bengio
NeurIPS 2009 An Online Algorithm for Large Scale Image Similarity Learning Gal Chechik, Uri Shalit, Varun Sharma, Samy Bengio
NeurIPS 2009 Group Sparse Coding Samy Bengio, Fernando Pereira, Yoram Singer, Dennis Strelow
AISTATS 2009 The Difficulty of Training Deep Architectures and the Effect of Unsupervised Pre-Training Dumitru Erhan, Pierre-Antoine Manzagol, Yoshua Bengio, Samy Bengio, Pascal Vincent
ICML 2008 A Distance Model for Rhythms Jean-François Paiement, Yves Grandvalet, Samy Bengio, Douglas Eck
JMLR 2007 The Need for Open Source Software in Machine Learning Sören Sonnenburg, Mikio L. Braun, Cheng Soon Ong, Samy Bengio, Leon Bottou, Geoffrey Holmes, Yann LeCun, Klaus-Robert Müller, Fernando Pereira, Carl Edward Rasmussen, Gunnar Rätsch, Bernhard Schölkopf, Alexander Smola, Pascal Vincent, Jason Weston, Robert Williamson
ECML-PKDD 2006 A Discriminative Approach for the Retrieval of Images from Text Queries David Grangier, Florent Monay, Samy Bengio
ICML 2005 A Graphical Model for Chord Progressions Embedded in a Psychoacoustic Space Jean-François Paiement, Douglas Eck, Samy Bengio, David Barber
NeurIPS 2005 A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification Yves Grandvalet, Johnny Mariethoz, Samy Bengio
NeurIPS 2005 Benchmarking Non-Parametric Statistical Tests Mikaela Keller, Samy Bengio, Siew Y. Wong
NeurIPS 2005 Learning Influence Among Interacting Markov Chains Dong Zhang, Daniel Gatica-perez, Samy Bengio, Deb Roy
CVPR 2005 Semi-Supervised Adapted HMMs for Unusual Event Detection Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan
ICML 2004 Links Between Perceptrons, MLPs and SVMs Ronan Collobert, Samy Bengio
CVPR 2004 Modeling Individual and Group Actions in Meetings: A Two-Layer HMM Framework Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan, Guillaume Lathoud
CVPRW 2004 Modeling Individual and Group Actions in Meetings: A Two-Layer HMM Framework Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan, Guillaume Lathoud
NeCo 2002 A Parallel Mixture of SVMs for Very Large Scale Problems Ronan Collobert, Samy Bengio, Yoshua Bengio
NeurIPS 2002 An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition Samy Bengio
NeurIPS 2001 A Parallel Mixture of SVMs for Very Large Scale Problems Ronan Collobert, Samy Bengio, Yoshua Bengio
JMLR 2001 SVMTorch: Support Vector Machines for Large-Scale Regression Problems (Kernel Machines Section) Ronan Collobert, Samy Bengio
NeurIPS 2000 New Approaches Towards Robust and Adaptive Speech Recognition Hervé Bourlard, Samy Bengio, Katrin Weber
NeurIPS 1999 Modeling High-Dimensional Discrete Data with Multi-Layer Neural Networks Yoshua Bengio, Samy Bengio
NeCo 1999 Stochastic Learning of Strategic Equilibria for Auctions Samy Bengio, Yoshua Bengio, Jacques Robert, Gilles Bélanger
NeurIPS 1997 Shared Context Probabilistic Transducers Yoshua Bengio, Samy Bengio, Jean-Franc Isabelle, Yoram Singer
NeurIPS 1989 A Neural Network to Detect Homologies in Proteins Yoshua Bengio, Samy Bengio, Yannick Pouliot, Patrick Agin