Kong, Jason

1 publication

ICMLW 2024. TinyAgent: Quantization-Aware Model Compression and Adaptation for On-Device LLM Agent Deployment. Jason Kong, Lanxiang Hu, Flavio Ponzina, Tajana Rosing