Ragan-Kelley, Jonathan

12 publications

ICML 2025 Ladder-Residual: Parallelism-Aware Architecture for Accelerating Large Model Inference with Communication Overlapping Muru Zhang, Mayank Mishra, Zhongzhu Zhou, William Brandon, Jue Wang, Yoon Kim, Jonathan Ragan-Kelley, Shuaiwen Leon Song, Ben Athiwaratkun, Tri Dao

ICML 2025 Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding Tian Jin, Ellie Y Cheng, Zachary Ankner, Nikunj Saunshi, Blake M Elias, Amir Yazdanbakhsh, Jonathan Ragan-Kelley, Suvinay Subramanian, Michael Carbin

NeurIPS 2024 Reducing Transformer Key-Value Cache Size with Cross-Layer Attention William Brandon, Mayank Mishra, Aniruddha Nrusimha, Rameswar Panda, Jonathan Ragan-Kelley

ICLR 2024 The Cost of Scaling Down Large Language Models: Reducing Model Size Affects Memory Before In-Context Learning Tian Jin, Nolan Clement, Xin Dong, Vaishnavh Nagarajan, Michael Carbin, Jonathan Ragan-Kelley, Gintare Karolina Dziugaite

ICMLW 2023 Differentiating Metropolis-Hastings to Optimize Intractable Densities Gaurav Arya, Ruben Seyer, Frank Schäfer, Kartik Chandra, Alexander K. Lew, Mathieu Huot, Vikash Mansinghka, Jonathan Ragan-Kelley, Christopher Vincent Rackauckas, Moritz Schauer

ICMLW 2023 Distributions for Compositionally Differentiating Parametric Discontinuities Jesse Michel, Kevin Mu, Xuanda Yang, Sai Praveen Bangaru, Elias Rojas Collins, Gilbert Bernstein, Jonathan Ragan-Kelley, Michael Carbin, Tzu-Mao Li

NeurIPSW 2023 How to Guess a Gradient Utkarsh Singhal, Brian Cheung, Kartik Chandra, Jonathan Ragan-Kelley, Joshua B. Tenenbaum, Tomaso A Poggio, Stella X. Yu

NeurIPS 2023 Inferring the Future by Imagining the past Kartik Chandra, Tony Chen, Tzu-Mao Li, Jonathan Ragan-Kelley, Josh Tenenbaum

ICMLW 2023 Inferring the Future by Imagining the past Kartik Chandra, Tony Chen, Tzu-Mao Li, Jonathan Ragan-Kelley, Joshua B. Tenenbaum

NeurIPS 2022 Gradient Descent: The Ultimate Optimizer Kartik Chandra, Audrey Xie, Jonathan Ragan-Kelley, Erik Meijer

ICLR 2020 DiffTaichi: Differentiable Programming for Physical Simulation Yuanming Hu, Luke Anderson, Tzu-Mao Li, Qi Sun, Nathan Carr, Jonathan Ragan-Kelley, Frédo Durand

ICML 2020 Neural Kernels Without Tangents Vaishaal Shankar, Alex Fang, Wenshuo Guo, Sara Fridovich-Keil, Jonathan Ragan-Kelley, Ludwig Schmidt, Benjamin Recht