Bondaschi, Marco

12 publications

ICLR 2025 Attention with Markov: A Curious Case of Single-Layer Transformers Ashok Vardhan Makkuva, Marco Bondaschi, Adway Girish, Alliot Nagle, Martin Jaggi, Hyeji Kim, Michael Gastpar
ICLRW 2025 From Markov to Laplace: How Mamba In-Context Learns Markov Chains Marco Bondaschi, Nived Rajaraman, Xiuying Wei, Kannan Ramchandran, Razvan Pascanu, Caglar Gulcehre, Michael Gastpar, Ashok Vardhan Makkuva
NeurIPS 2025 What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains Chanakya Ekbote, Ashok Vardhan Makkuva, Marco Bondaschi, Nived Rajaraman, Michael Gastpar, Jason D. Lee, Paul Pu Liang
ICMLW 2024 Attention with Markov: A Curious Case of Single-Layer Transformers Ashok Vardhan Makkuva, Marco Bondaschi, Alliot Nagle, Adway Girish, Hyeji Kim, Martin Jaggi, Michael Gastpar
NeurIPS 2024 Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models Alliot Nagle, Adway Girish, Marco Bondaschi, Michael Gastpar, Ashok Vardhan Makkuva, Hyeji Kim
ICMLW 2024 Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models Adway Girish, Alliot Nagle, Ashok Vardhan Makkuva, Marco Bondaschi, Michael Gastpar, Hyeji Kim
ICML 2024 LASER: Linear Compression in Wireless Distributed Optimization Ashok Vardhan Makkuva, Marco Bondaschi, Thijs Vogels, Martin Jaggi, Hyeji Kim, Michael Gastpar
NeurIPS 2024 Local to Global: Learning Dynamics and Effect of Initialization for Transformers Ashok Vardhan Makkuva, Marco Bondaschi, Chanakya Ekbote, Adway Girish, Alliot Nagle, Hyeji Kim, Michael Gastpar
ICMLW 2024 Local to Global: Learning Dynamics and Effect of Initialization for Transformers Ashok Vardhan Makkuva, Marco Bondaschi, Chanakya Ekbote, Adway Girish, Alliot Nagle, Hyeji Kim, Michael Gastpar
NeurIPS 2024 Transformers on Markov Data: Constant Depth Suffices Nived Rajaraman, Marco Bondaschi, Kannan Ramchandran, Michael Gastpar, Ashok Vardhan Makkuva
ICMLW 2024 Transformers on Markov Data: Constant Depth Suffices Nived Rajaraman, Marco Bondaschi, Ashok Vardhan Makkuva, Kannan Ramchandran, Michael Gastpar
NeurIPSW 2023 LASER: Linear Compression in Wireless Distributed Optimization Ashok Vardhan Makkuva, Marco Bondaschi, Thijs Vogels, Martin Jaggi, Hyeji Kim, Michael Gastpar