引用本文: | 孙鹏浩, 兰巨龙, 申涓, 胡宇翔. 基于牵引控制的深度强化学习路由策略生成[J]. 计算机研究与发展, 2021, 58(7): 1563-1572. doi: 10.7544/issn1000-1239.2021.20200018 |
Citation: | Sun Penghao, Lan Julong, Shen Juan, Hu Yuxiang. Pinning Control-Based Routing Policy Generation Using Deep Reinforcement Learning[J]. Journal of Computer Research and Development, 2021, 58(7): 1563-1572. doi: 10.7544/issn1000-1239.2021.20200018 |