Robust Function-Calling for On-Device Language Model via Function Masking
Abstract
Large language models have demonstrated impressive value in performing as autonomous agents when equipped with external tools and API calls. Nonetheless, effectively harnessing their potential for executing complex tasks crucially relies on enhancements in their function-calling capabilities. This paper identifies a critical gap in existing function-calling models, where performance varies significantly across benchmarks, often due to over-fitting to specific naming conventions. To address such an issue, we introduce Hammer, a novel family of foundation models specifically engineered for on-device function calling. Hammer employs an augmented dataset that enhances models’ sensitivity to irrelevant functions and incorporates function masking techniques to minimize over-fitting. Our empirical evaluations reveal that Hammer not only outperforms larger models but also demonstrates robust generalization across diverse benchmarks, achieving state-of-the-art results. Our open-source contributions include a specialized dataset for irrelevance detection, a tuning framework for enhanced generalization, and the Hammer models, establishing a new standard for function-calling performance.
Cite
Text
Lin et al. "Robust Function-Calling for On-Device Language Model via Function Masking." International Conference on Learning Representations, 2025.Markdown
[Lin et al. "Robust Function-Calling for On-Device Language Model via Function Masking." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/lin2025iclr-robust/)BibTeX
@inproceedings{lin2025iclr-robust,
title = {{Robust Function-Calling for On-Device Language Model via Function Masking}},
author = {Lin, Qiqiang and Wen, Muning and Peng, Qiuying and Nie, Guanyu and Liao, Junwei and Wang, Jun and Mo, Xiaoyun and Zhou, Jiamu and Cheng, Cheng and Zhao, Yin and Wang, Jun and Zhang, Weinan},
booktitle = {International Conference on Learning Representations},
year = {2025},
url = {https://mlanthology.org/iclr/2025/lin2025iclr-robust/}
}