ML Anthology
Authors
Search
About
Szlam, Arthur
42 publications
NeurIPS
2025
Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo
Zachary Charles
,
Gabriel Teston
,
Lucio M. Dery
,
J Keith Rush
,
Nova Fallen
,
Zachary Garrett
,
Arthur Szlam
,
Arthur Douillard
ICML
2025
Deliberation in Latent Space via Differentiable Cache Augmentation
Luyang Liu
,
Jonas Pfeiffer
,
Jiaxing Wu
,
Jun Xie
,
Arthur Szlam
ICMLW
2024
Asynchronous Local-SGD Training for Language Modeling
Bo Liu
,
Rachita Chhaparia
,
Arthur Douillard
,
Satyen Kale
,
Andrei Alex Rusu
,
Jiajun Shen
,
Arthur Szlam
,
MarcAurelio Ranzato
CoLLAs
2024
Compositional Interfaces for Compositional Generalization
Jelena Luketina
,
Jack Lanchantin
,
Sainbayar Sukhbaatar
,
Arthur Szlam
ICMLW
2024
DiLoCo: Distributed Low-Communication Training of Language Models
Arthur Douillard
,
Qixuan Feng
,
Andrei Alex Rusu
,
Rachita Chhaparia
,
Yani Donchev
,
Adhiguna Kuncoro
,
MarcAurelio Ranzato
,
Arthur Szlam
,
Jiajun Shen
AAAI
2023
A Data Source for Reasoning Embodied Agents
Jack Lanchantin
,
Sainbayar Sukhbaatar
,
Gabriel Synnaeve
,
Yuxuan Sun
,
Kavya Srinet
,
Arthur Szlam
ICMLW
2023
Compositional Interfaces for Compositional Generalization
Jelena Luketina
,
Jack Lanchantin
,
Sainbayar Sukhbaatar
,
Arthur Szlam
NeurIPS
2023
Learning to Reason and Memorize with Self-Notes
Jack Lanchantin
,
Shubham Toshniwal
,
Jason Weston
,
Arthur Szlam
,
Sainbayar Sukhbaatar
ICML
2021
CURI: A Benchmark for Productive Concept Learning Under Uncertainty
Ramakrishna Vedantam
,
Arthur Szlam
,
Maximillian Nickel
,
Ari Morcos
,
Brenden M Lake
NeurIPS
2021
Hash Layers for Large Sparse Models
Stephen Roller
,
Sainbayar Sukhbaatar
,
Arthur Szlam
,
Jason Weston
ICML
2021
Not All Memories Are Created Equal: Learning to Forget by Expiring
Sainbayar Sukhbaatar
,
Da Ju
,
Spencer Poff
,
Stephen Roller
,
Arthur Szlam
,
Jason Weston
,
Angela Fan
JMLR
2021
Residual Energy-Based Models for Text
Anton Bakhtin
,
Yuntian Deng
,
Sam Gross
,
Myle Ott
,
Marc'Aurelio Ranzato
,
Arthur Szlam
ICML
2020
Fast Adaptation to New Environments via Policy-Dynamics Value Functions
Roberta Raileanu
,
Max Goldstein
,
Arthur Szlam
,
Rob Fergus
AAAI
2020
Generating Interactive Worlds with Text
Angela Fan
,
Jack Urbanek
,
Pratik Ringshia
,
Emily Dinan
,
Emma Qian
,
Siddharth Karamcheti
,
Shrimai Prabhumoye
,
Douwe Kiela
,
Tim Rocktäschel
,
Arthur Szlam
,
Jason Weston
ICLR
2020
Residual Energy-Based Models for Text Generation
Yuntian Deng
,
Anton Bakhtin
,
Myle Ott
,
Arthur Szlam
,
Marc'Aurelio Ranzato
ICLR
2019
Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies
Kenneth Marino
,
Abhinav Gupta
,
Rob Fergus
,
Arthur Szlam
ICML
2018
Composable Planning with Attributes
Amy Zhang
,
Sainbayar Sukhbaatar
,
Adam Lerer
,
Arthur Szlam
,
Rob Fergus
ICLR
2018
Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play
Sainbayar Sukhbaatar
,
Zeming Lin
,
Ilya Kostrikov
,
Gabriel Synnaeve
,
Arthur Szlam
,
Rob Fergus
ICLR
2018
Mastering the Dungeon: Grounded Language Learning by Mechanical Turker Descent
Zhilin Yang
,
Saizheng Zhang
,
Jack Urbanek
,
Will Feng
,
Alexander Miller
,
Arthur Szlam
,
Douwe Kiela
,
Jason Weston
ICML
2018
Modeling Others Using Oneself in Multi-Agent Reinforcement Learning
Roberta Raileanu
,
Emily Denton
,
Arthur Szlam
,
Rob Fergus
ICML
2018
Optimizing the Latent Space of Generative Networks
Piotr Bojanowski
,
Armand Joulin
,
David Lopez-Pas
,
Arthur Szlam
ICLR
2017
Automatic Rule Extraction from Long Short Term Memory Networks
W. James Murdoch
,
Arthur Szlam
CVPR
2017
Hard Mixtures of Experts for Large Scale Weakly Supervised Vision
Sam Gross
,
Marc'Aurelio Ranzato
,
Arthur Szlam
ICLR
2017
Tracking the World State with Recurrent Entity Networks
Mikael Henaff
,
Jason Weston
,
Arthur Szlam
,
Antoine Bordes
,
Yann LeCun
ICLR
2016
Evaluating Prerequisite Qualities for Learning End-to-End Dialog Systems
Jesse Dodge
,
Andreea Gane
,
Xiang Zhang
,
Antoine Bordes
,
Sumit Chopra
,
Alexander H. Miller
,
Arthur Szlam
,
Jason Weston
NeurIPS
2016
Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar
,
Arthur Szlam
,
Rob Fergus
ICML
2016
Recurrent Orthogonal Networks and Long-Memory Tasks
Mikael Henaff
,
Arthur Szlam
,
Yann LeCun
NeurIPS
2016
The Product Cut
Thomas Laurent
,
James von Brecht
,
Xavier Bresson
,
Arthur Szlam
NeurIPS
2015
Deep Generative Image Models Using a Laplacian Pyramid of Adversarial Networks
Emily L Denton
,
Soumith Chintala
,
Arthur Szlam
,
Rob Fergus
NeurIPS
2015
End-to-End Memory Networks
Sainbayar Sukhbaatar
,
Arthur Szlam
,
Jason Weston
,
Rob Fergus
CVPR
2014
Better Feature Tracking Through Subspace Constraints
Bryan Poling
,
Gilad Lerman
,
Arthur Szlam
ICML
2014
Signal Recovery from Pooling Representations
Joan Bruna Estrach
,
Arthur Szlam
,
Yann LeCun
ICLR
2014
Spectral Networks and Locally Connected Networks on Graphs
Joan Bruna
,
Wojciech Zaremba
,
Arthur Szlam
,
Yann LeCun
ICLR
2014
Unsupervised Feature Learning by Deep Sparse Coding
Yunlong He
,
Koray Kavukcuoglu
,
Yun Wang
,
Arthur Szlam
,
Yanjun Qi
ICLR
2013
Learning Stable Group Invariant Representations with Convolutional Networks
Joan Bruna
,
Arthur Szlam
,
Yann LeCun
ICLR
2013
Tree Structured Sparse Coding on Cubes
Arthur Szlam
ECCV
2012
Fast Approximations to Structured Sparse Coding and Applications to Object Classification
Arthur Szlam
,
Karol Gregor
,
Yann LeCun
CVPR
2012
Incremental Gradient on the Grassmannian for Online Foreground and Background Separation in Subsampled Video
Jun He
,
Laura Balzano
,
Arthur Szlam
CVPR
2010
Randomized Hybrid Linear Modeling by Local Best-Fit Flats
Teng Zhang
,
Arthur Szlam
,
Yi Wang
,
Gilad Lerman
ICML
2010
Total Variation, Cheeger Cuts
Arthur Szlam
,
Xavier Bresson
ICML
2009
Discriminative K-Metrics
Arthur Szlam
,
Guillermo Sapiro
ICCVW
2009
Median K-Flats for Hybrid Linear Modeling with Many Outliers
Teng Zhang
,
Arthur Szlam
,
Gilad Lerman