Abstract:
Artificial intelligence has been widely used in network intrusion detection systems. Owing to the concept drift of traffic samples, the models used for malicious traffic identification must be updated frequently to adapt to new feature distributions. The effectiveness of the updated model depends on the quality of the new training samples, so preventing data contamination is essential. However, contamination filtering of traffic samples still relies on expert experience, which leads to problems such as an immense sample-screening workload, unstable model accuracy, and vulnerability to poisoning attacks during model updates. Existing works cannot achieve contamination filtering or model repair while maintaining model performance. To solve these problems, we design a general model update method for intelligent network intrusion detection systems. In this paper, we first design EdgeGAN, an algorithm that uses fuzzing to make a generative adversarial network fit the distribution of the model's edge examples. A subset of contaminated examples is then identified by examining the MSE values of the new training samples against the original model and by checking the F-β scores of the updated model on the old edge examples. The influence of poisoned examples is suppressed by having the model learn malicious edge examples, which ensures that the model recovers quickly after poisoning. Finally, the effectiveness of the update method for contamination filtering and model restoration is verified experimentally on five typical intelligent network intrusion detection systems. Compared with state-of-the-art methods, the new method improves the detection rate of poisoned examples by 12.50% and the restoration effect of poisoned models by 6.38%. The method can protect the update process of any common intelligent network intrusion detection system, reducing manual sample screening, lowering the cost of poisoning detection and model repair, and providing guarantees for model performance and robustness. It can also protect similar intelligent threat detection models.
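To make the two screening checks summarized above concrete, the following is a minimal Python sketch of one possible reading: an MSE screen of new training samples against the original model (interpreted here as a reconstruction-style check) and an F-β validation of the updated model on edge examples retained from the old model. The function names, the `predict()` interface, and all thresholds are hypothetical illustrations, not the paper's actual implementation.

```python
import numpy as np
from sklearn.metrics import fbeta_score

def mse_screen(original_model, X_new, threshold):
    """Flag candidate contaminated samples: compute each new sample's MSE
    against the original model's output (assumed reconstruction-style
    check) and mark those above `threshold`, a hypothetical parameter
    tuned per deployment."""
    recon = original_model.predict(X_new)        # assumed predict() API
    mse = np.mean((recon - X_new) ** 2, axis=1)  # per-sample MSE
    return mse > threshold                       # boolean mask of suspects

def edge_fbeta_check(updated_model, X_edge, y_edge, beta=0.5, min_score=0.9):
    """Validate the updated model on edge examples kept from the original
    model; an F-beta score falling below `min_score` suggests the update
    was poisoned. The beta and min_score values are illustrative only."""
    y_pred = updated_model.predict(X_edge)
    score = fbeta_score(y_edge, y_pred, beta=beta)
    return score, bool(score >= min_score)
```

In a pipeline along these lines, samples flagged by `mse_screen` would be withheld from retraining, and a failed `edge_fbeta_check` would trigger the repair step in which the model relearns malicious edge examples.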