Ding, Dujian

5 publications

ICML 2025. BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute. Dujian Ding, Ankur Mallick, Shaokun Zhang, Chi Wang, Daniel Madrigal, Mirian Del Carmen Hipolito Garcia, Menglin Xia, Laks V. S. Lakshmanan, Qingyun Wu, Victor Rühle
ICLRW 2025. EcoAct: Economic Agent Determines When to Register What Action. Shaokun Zhang, Jieyu Zhang, Dujian Ding, Jiale Liu, Mirian Del Carmen Hipolito Garcia, Ankur Mallick, Daniel Madrigal, Menglin Xia, Victor Rühle, Qingyun Wu, Chi Wang
ICLR 2025. OCCAM: Towards Cost-Efficient and Accuracy-Aware Classification Inference. Dujian Ding, Bicheng Xu, Laks V. S. Lakshmanan
ICLR 2024. Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing. Dujian Ding, Ankur Mallick, Chi Wang, Robert Sim, Subhabrata Mukherjee, Victor Rühle, Laks V. S. Lakshmanan, Ahmed Hassan Awadallah
TMLR 2024. PASS: Pruning Attention Heads with Almost-Sure Sparsity Targets. Dujian Ding, Ganesh Jawahar, Laks V. S. Lakshmanan