
Pinning Control-Based Routing Policy Generation Using Deep Reinforcement Learning

Sun Penghao, Lan Julong, Shen Juan, Hu Yuxiang

Citation: Sun Penghao, Lan Julong, Shen Juan, Hu Yuxiang. Pinning Control-Based Routing Policy Generation Using Deep Reinforcement Learning[J]. Journal of Computer Research and Development, 2021, 58(7): 1563-1572. DOI: 10.7544/issn1000-1239.2021.20200018. CSTR: 32373.14.issn1000-1239.2021.20200018

Funds: This work was supported by the National Key Research and Development Program of China (2020YFB1804803), the National Natural Science Foundation of China (62002382, 61702547, 61872382), and the Key Research and Development Project of Guangdong Province (2018B010113001).
Details
  • CLC number: TP393
  • Abstract: Computer networks play an important role in modern society. The rapid growth of network scale makes network traffic increasingly complex and hard to model accurately, which makes finding the optimal routing policy in communication networks an NP-hard problem. Traditional methods for routing and traffic engineering mainly rely on hand-crafted algorithms, which cannot ensure both accuracy and efficiency. In recent years, deep reinforcement learning (DRL)-based network routing strategies have been proposed, which to some extent overcome the shortcomings of manual traffic analysis and modelling by human experts. However, current DRL-based routing strategies generally suffer from poor scalability, which prevents their use in large-scale networks. To address this, this paper proposes Hierar-DRL, a DRL-based routing policy generation technique that employs pinning control theory. Pinning control lets Hierar-DRL select a subset of network nodes as the target control nodes of DRL. By combining pinning control with the automatic policy search ability of DRL, Hierar-DRL scales better to large networks. Simulation results show that the proposed scheme reduces the average end-to-end transmission delay in the test network topologies by up to 28.5% compared with the state-of-the-art scheme, which demonstrates its effectiveness.

Publication history
  • Published: 2021-06-30
