ISSN 1000-1239 CN 11-1777/TP

• 网络技术 •

### 基于随机博弈与改进WoLF-PHC的网络防御决策方法

1. (中国人民解放军战略支援部队信息工程大学 郑州 450001) (624519905@qq.com)
• 出版日期: 2019-05-01
• 基金资助:
国家“八六三”高技术研究发展计划基金项目(2014AA7116082,2015AA7116040)

### Network Defense Decision-Making Method Based on Stochastic Game and Improved WoLF-PHC

Yang Junnan, Zhang Hongqi, Zhang Chuanfu

1. (Zhengzhou Information Science and Technology Institute, Zhengzhou 450001)
• Online: 2019-05-01

Abstract: At present, the method of network attack and defense analysis based on stochastic game adopts the assumption of complete rationality, but in the actual network attack-defense confrontation, it is difficult for both sides of attack and defense to meet the high requirement of complete rationality, which reduces the accuracy and guiding value of the existing methods. Based on the reality of network attack-defense confrontation, the influence of bounded rationality on attack-defense stochastic game is analyzed. Under the constraints of bounded rationality, a stochastic game model is constructed. Aiming at the problem of network state explosion, a method of extracting network state and attack-defense action based on attack-defense graph is proposed, which the game state space is effectively reduced. On this basis, WoLF-PHC algorithm in reinforcement learning is introduced to carry out bounded rational stochastic game analysis and design a defensive decision-making algorithm with online learning ability. By learning, the algorithm can obtain the optimal defense strategy for the current attacker. The obtained strategy is superior to the Nash equilibrium strategy of the existing attack-defense stochastic game model under bounded rationality. By introducing eligibility trace to improve WoLF-PHC, the learning speed of defenders is further improved. The experimental results verify the effectiveness and advancement of the proposed method.