Suitable Is the Best: Task-Oriented Knowledge Fusion in Vulnerability Detection

Abstract

Deep learning technologies have demonstrated remarkable performance in vulnerability detection. Existing works primarily adopt a uniform and consistent feature learning pattern across the entire target set. While designed for general-purpose detection tasks, they lack sensitivity towards target code comprising multiple functional modules or diverse vulnerability subtypes. In this paper, we present a knowledge fusion-based vulnerability detection method (KF-GVD) that integrates specific vulnerability knowledge into the Graph Neural Network feature learning process. KF-GVD achieves accurate vulnerability detection across different functional modules of the Linux kernel and vulnerability subtypes without compromising general task performance. Extensive experiments demonstrate that KF-GVD outperforms SOTAs on function-level and statement-level vulnerability detection across various target tasks, with an average increase of 40.9% in precision and 26.1% in recall. Notably, KF-GVD discovered 9 undisclosed vulnerabilities when employing on C/C++ open-source projects without ground truth.

Cite

Text

Wang et al. "Suitable Is the Best: Task-Oriented Knowledge Fusion in Vulnerability Detection." Neural Information Processing Systems, 2024. doi:10.52202/079017-3849

Markdown

[Wang et al. "Suitable Is the Best: Task-Oriented Knowledge Fusion in Vulnerability Detection." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/wang2024neurips-suitable/) doi:10.52202/079017-3849

BibTeX

@inproceedings{wang2024neurips-suitable,
  title     = {{Suitable Is the Best: Task-Oriented Knowledge Fusion in Vulnerability Detection}},
  author    = {Wang, Jingjing and Huang, Minhuan and Nie, Yuanpin and Li, Xiang and Du, Qianjin and Kong, Wei and Deng, Huan and Kuang, Xiaohui},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-3849},
  url       = {https://mlanthology.org/neurips/2024/wang2024neurips-suitable/}
}