Desai, Aditya

8 publications

ICML 2025. HashAttention: Semantic Sparsity for Faster Inference. Aditya Desai, Shuo Yang, Alejandro Cuadron, Matei Zaharia, Joseph E. Gonzalez, Ion Stoica
ICML 2025. Sketch to Adapt: Fine-Tunable Sketches for Efficient LLM Adaptation. Tianyi Zhang, Junda Su, Aditya Desai, Oscar Wu, Zhaozhuo Xu, Anshumali Shrivastava
ICLR 2024. In Defense of Parameter Sharing for Model-Compression. Aditya Desai, Anshumali Shrivastava
NeurIPS 2024. SS1: Accelerating Inference with Fast and Expressive Sketch Structured Transform. Kimia Saedi, Aditya Desai, Apoorv Walia, Jihyeong Lee, Keren Zhou, Anshumali Shrivastava
ICML 2023. Hardware-Aware Compression with Random Operation Access Specific Tile (ROAST) Hashing. Aditya Desai, Keren Zhou, Anshumali Shrivastava
NeurIPS 2023. Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time. Zichang Liu, Aditya Desai, Fangshuo Liao, Weitao Wang, Victor Xie, Zhaozhuo Xu, Anastasios Kyrillidis, Anshumali Shrivastava
NeurIPS 2022. The Trade-Offs of Model Size in Large Recommendation Models: 100GB to 10MB Criteo-Tb DLRM Model. Aditya Desai, Anshumali Shrivastava
NeurIPS 2021. Raw Nav-Merge Seismic Data to Subsurface Properties with MLP Based Multi-Modal Information Unscrambler. Aditya Desai, Zhaozhuo Xu, Menal Gupta, Anu Chandran, Antoine Vial-Aussavy, Anshumali Shrivastava