Multimodal information fusion for software vulnerability detection based on both source and binary codes

•The study detects software vulnerability based on both source and binary codes.•A novel fusion strategy is designed to accommodate the characteristics of two modalities.•ChatGPT and two pre-trained models are used for the process of feature analysis process. Context: Many researchers have proposed...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Science of computer programming Ročník 250; s. 103411
Hlavní autoři:	Liu, Yuzhou, Wang, Qi, Jiang, Shuang, Wu, Runze, Tian, Hongxu, Zhang, Peng
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	Elsevier B.V 01.05.2026
Témata:	Binary codes feature Multimodal fusion Software vulnerability detection Source codes feature Source codes feature Multimodal fusion Software vulnerability detection Binary codes feature
ISSN:	0167-6423
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	•The study detects software vulnerability based on both source and binary codes.•A novel fusion strategy is designed to accommodate the characteristics of two modalities.•ChatGPT and two pre-trained models are used for the process of feature analysis process. Context: Many researchers have proposed vulnerability detection methods to enhance software reliability by analyzing the program. However, some vulnerabilities are difficult to be identified only from the source codes, especially the ones related to the execution. Objectives: To solve this problem, this paper introduces extra binary codes and proposes a novel solution for software vulnerability detection based on the multimodal information fusion. Methods: The approach treats the source and binary codes as different modalities, and uses two pre-trained models as feature extractors to analyze them separately. Then, we design an attention-based information fusion strategy that taking the information from source codes as the main body while the one from binary codes as the supplement. It could not only capture the correlations among features across different modalities, but also filter the redundancy from the binary codes in the fusion process. In this way, a more comprehensive representation of software is gained and finally taken as the basis for the vulnerability detection. Results: Our method was comprehensively evaluated on three widely-used datasets in different languages, that is Reveal in C, Devign in C++, and Code_vulnerability_java in Java: (1) For vulnerability detection performance, the Accuracy reached 86.09 %, 84.58 %, and 80.43 % across the three datasets, with F1-scores of 82.87 %, 84.62 %, and 79.58 % respectively; (2) Compared with seven state-of-the-art baseline methods, our approach achieved Accuracy improvements of 2.38 %-3.01 % and F1-score enhancements of 2.32 %-8.47 % across the datasets; (3) Moreover, the ablation experiment shows when combining binary codes with source codes (versus using source codes alone), the Accuracy improved by 6.83 %-13.76 % and F1-score increased by 5.36 %-9.86 %, demonstrating the significant performance gains from multimodal data integration. Conclusion: The results show that our approach can achieve good performance for the task of software vulnerability detection. Meanwhile, ablation experiments confirm the contributions of binary codes to the detection and indicate the effectiveness of our fusion strategy. We have released the codes and datasets (https://github.com/Wangqxn/Vul-detection) to facilitate follow-up research.
ISSN:	0167-6423
DOI:	10.1016/j.scico.2025.103411