ML Anthology
Authors
Search
About
Ragan-Kelley, Jonathan
12 publications
ICML
2025
Ladder-Residual: Parallelism-Aware Architecture for Accelerating Large Model Inference with Communication Overlapping
Muru Zhang
,
Mayank Mishra
,
Zhongzhu Zhou
,
William Brandon
,
Jue Wang
,
Yoon Kim
,
Jonathan Ragan-Kelley
,
Shuaiwen Leon Song
,
Ben Athiwaratkun
,
Tri Dao
ICML
2025
Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding
Tian Jin
,
Ellie Y Cheng
,
Zachary Ankner
,
Nikunj Saunshi
,
Blake M Elias
,
Amir Yazdanbakhsh
,
Jonathan Ragan-Kelley
,
Suvinay Subramanian
,
Michael Carbin
NeurIPS
2024
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention
William Brandon
,
Mayank Mishra
,
Aniruddha Nrusimha
,
Rameswar Panda
,
Jonathan Ragan-Kelley
ICLR
2024
The Cost of Scaling Down Large Language Models: Reducing Model Size Affects Memory Before In-Context Learning
Tian Jin
,
Nolan Clement
,
Xin Dong
,
Vaishnavh Nagarajan
,
Michael Carbin
,
Jonathan Ragan-Kelley
,
Gintare Karolina Dziugaite
ICMLW
2023
Differentiating Metropolis-Hastings to Optimize Intractable Densities
Gaurav Arya
,
Ruben Seyer
,
Frank Schäfer
,
Kartik Chandra
,
Alexander K. Lew
,
Mathieu Huot
,
Vikash Mansinghka
,
Jonathan Ragan-Kelley
,
Christopher Vincent Rackauckas
,
Moritz Schauer
ICMLW
2023
Distributions for Compositionally Differentiating Parametric Discontinuities
Jesse Michel
,
Kevin Mu
,
Xuanda Yang
,
Sai Praveen Bangaru
,
Elias Rojas Collins
,
Gilbert Bernstein
,
Jonathan Ragan-Kelley
,
Michael Carbin
,
Tzu-Mao Li
NeurIPSW
2023
How to Guess a Gradient
Utkarsh Singhal
,
Brian Cheung
,
Kartik Chandra
,
Jonathan Ragan-Kelley
,
Joshua B. Tenenbaum
,
Tomaso A Poggio
,
Stella X. Yu
NeurIPS
2023
Inferring the Future by Imagining the past
Kartik Chandra
,
Tony Chen
,
Tzu-Mao Li
,
Jonathan Ragan-Kelley
,
Josh Tenenbaum
ICMLW
2023
Inferring the Future by Imagining the past
Kartik Chandra
,
Tony Chen
,
Tzu-Mao Li
,
Jonathan Ragan-Kelley
,
Joshua B. Tenenbaum
NeurIPS
2022
Gradient Descent: The Ultimate Optimizer
Kartik Chandra
,
Audrey Xie
,
Jonathan Ragan-Kelley
,
Erik Meijer
ICLR
2020
DiffTaichi: Differentiable Programming for Physical Simulation
Yuanming Hu
,
Luke Anderson
,
Tzu-Mao Li
,
Qi Sun
,
Nathan Carr
,
Jonathan Ragan-Kelley
,
Frédo Durand
ICML
2020
Neural Kernels Without Tangents
Vaishaal Shankar
,
Alex Fang
,
Wenshuo Guo
,
Sara Fridovich-Keil
,
Jonathan Ragan-Kelley
,
Ludwig Schmidt
,
Benjamin Recht