Sharify, Sayeh

4 publications

ICML 2025 ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals Utkarsh Saxena, Sayeh Sharify, Kaushik Roy, Xin Wang
ICLRW 2025 ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals Utkarsh Saxena, Sayeh Sharify, Kaushik Roy, Xin Wang
ICLRW 2025 Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs Zifei Xu, Sayeh Sharify, Wanzin Yazar, Tristan J Webb, Xin Wang
ICLR 2017 Bit-Pragmatic Deep Neural Network Computing Jorge Albericio, Patrick Judd, Alberto Delmas, Sayeh Sharify, Andreas Moshovos