一种Linux安全漏洞修复补丁自动识别方法

周鹏; 武延军; 赵琛

doi:10.7544/issn1000-1239.20200492

一种Linux安全漏洞修复补丁自动识别方法

Identify Linux Security Vulnerability Fix Patches Automatically

摘要

摘要: 及时获取并应用安全漏洞修复补丁对保障服务器用户的安全至关重要.但是，学者和机构研究发现开源软件维护者经常悄无声息地修复安全漏洞，比如维护者88%的情况在发布软件新版本时才在发行说明中告知用户修复了安全漏洞，并且只有9%的漏洞修复补丁明确给出对应的CVE(common vulnerabilities and exposures)标号，只有3%的修复会及时主动通知安全监控服务提供者.这导致在很多情况下，安全工程师不能通过补丁的代码和描述信息直接区分漏洞修复、Bug修复、功能性补丁.造成漏洞修复补丁不能被用户及时识别和应用，同时用户从大量的补丁提交中识别漏洞修复补丁代价很高.以代表性Linux内核为例，给出一种自动识别漏洞修复补丁的方法，该方法为补丁的代码和描述部分分别定义特征，构建机器学习模型，训练学习可区分安全漏洞补丁的分类器.实验表明，该方法可以取得91.3%的精确率、92%的准确率、87.53%的召回率，并将误报率降低到5.2%，性能提升明显.

Abstract: It is critical to catch and apply the vulnerability fix patches in time to ensure the security of information system. However, it is found that open source software maintainers often silently fix security vulnerabilities. For example, 88% of maintainers delay informing users to fix vulnerabilities in the release notes of new software version, and only 9% of the bug fixes clearly give the corresponding CVE ID, and only 3% of the fixes will actively notify the security service provider in time. In many cases, security engineers can’t directly distinguish vulnerability fixes, bug fixes, and feature patches from the code and log message of patches. As a result, vulnerability fixes can’t be identified and applied by users timely. At the same time, it is costly for users to identify vulnerability fixes from a large number of patch submissions. Taking Linux as an example, this paper presents a method of identifying vulnerability patches automatically. This method defines features for the code and log message from patches, builds machine learning model, and trains to learn classifiers that can distinguish vulnerability patches. Experiments indicate that our approach is effective, which can get 91.3% precision, 92% accuracy, 87.53% recall rate, and reduce the false positive rate to 5.2%.

HTML全文

参考文献(0)

施引文献

资源附件(0)