高级检索
    孙骞, 薛雷琦, 高岭, 王海, 王宇翔. 基于随机博弈与禁忌搜索的网络防御策略选取[J]. 计算机研究与发展, 2020, 57(4): 767-777. DOI: 10.7544/issn1000-1239.2020.20190870
    引用本文: 孙骞, 薛雷琦, 高岭, 王海, 王宇翔. 基于随机博弈与禁忌搜索的网络防御策略选取[J]. 计算机研究与发展, 2020, 57(4): 767-777. DOI: 10.7544/issn1000-1239.2020.20190870
    Sun Qian, Xue Leiqi, Gao Ling, Wang Hai, Wang Yuxiang. Selection of Network Defense Strategies Based on Stochastic Game and Tabu Search[J]. Journal of Computer Research and Development, 2020, 57(4): 767-777. DOI: 10.7544/issn1000-1239.2020.20190870
    Citation: Sun Qian, Xue Leiqi, Gao Ling, Wang Hai, Wang Yuxiang. Selection of Network Defense Strategies Based on Stochastic Game and Tabu Search[J]. Journal of Computer Research and Development, 2020, 57(4): 767-777. DOI: 10.7544/issn1000-1239.2020.20190870

    基于随机博弈与禁忌搜索的网络防御策略选取

    Selection of Network Defense Strategies Based on Stochastic Game and Tabu Search

    • 摘要: 网络防御策略是决定网络安全防护效果的关键因素,现有的网络防御决策研究的是完全理性前提条件以及攻防效益函数参数选择等方面,对实际网络攻防中信息不对称、法律惩戒等因素存在模型偏差,降低了策略的实用性与可靠性.结合实际问题,在有限理性的前置条件基础上构建禁忌随机博弈模型,引入了禁忌搜索方法对随机博弈进行有限理性的分析,并设计具有记忆功能的搜索方法,通过禁忌表数据结构实现记忆功能,并利用数据驱动的记忆结合博弈模型得出最优防御策略.实验结果表明:该方法在攻防收益量化方面提高了精准度,防御效益相对于现有典型的方法提高了准确度,方法空间复杂度优于强化学习等典型方法.

       

      Abstract: The network defence strategy is the key factor to determine the effect of network security protection. In terms of the rational precondition of the existing network defence decision-making research and the parameter selection of the attack and defence benefit function, there are model deviations for the factors such as information asymmetry and legal punishment in the actual network attack and defence, which reduces the practicability and reliability of the strategy. In this paper, the Tabu random game model is constructed on the basis of the preconditions of bounded rationality, the Tabu search algorithm is introduced to analyze the bounded rationality of random game, and a search algorithm with memory function is designed. The data structure of the Tabu table is used to realize the memory function, and the data-driven memory combined with the game model is used to get the optimal defence strategy. The experimental results show that this method improves the accuracy in the quantification of attack and defence benefits, improves the accuracy of defence benefits compared with the existing typical methods, and the algorithm space complexity is better than the reinforcement learning and other typical algorithms.

       

    /

    返回文章
    返回