ML Anthology
Authors
Search
About
Jones, Dalton
2 publications
NeurIPS
2025
KeyDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments
Junyoung Park
,
Dalton Jones
,
Matthew J Morse
,
Raghavv Goel
,
Mingu Lee
,
Christopher Lott
ICLR
2025
PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer
Pierre-David Letourneau
,
Manish Kumar Singh
,
Hsin-Pai Cheng
,
Shizhong Han
,
Yunxiao Shi
,
Dalton Jones
,
Matthew Harper Langston
,
Hong Cai
,
Fatih Porikli