ISSN 1000-1239 CN 11-1777/TP

计算机研究与发展 ›› 2021, Vol. 58 ›› Issue (2): 319-337.doi: 10.7544/issn1000-1239.2021.20200384

所属专题: 2021大数据时代的存储系统与智能存储技术专题

• 系统结构 • 上一篇    下一篇



  1. 1(福州大学物理与信息工程学院 福州 350116);2(区域光纤通信网与新型光通信系统国家重点实验室(上海交通大学) 上海 200240) (
  • 出版日期: 2021-02-01
  • 基金资助: 

Node-Constraint Store-and-Forward Scheduling Method for Inter-Datacenter Networks

Lin Xiao1, Ji Shuo1, Yue Shengnan2, Sun Weiqiang2, Hu Weisheng2   

  1. 1(College of Physics and Information Engineering, Fuzhou University, Fuzhou 350116);2(State Key Laboratory of Advanced Optical Communication Systems and Networks (Shanghai Jiao Tong University), Shanghai 200240)
  • Online: 2021-02-01
  • Supported by: 
    This work was supported by the National Natural Science Foundation of China for Young Scientists (61901118), the Key Program of the National Natural Science Foundation of China (61433009), and the Open Foundation of the State Key Laboratory of Advanced Optical Communication Systems and Networks (2019GZKF03003).

摘要: 借助海量数据中心存储,通过存储转发(store-and-forward, SnF)调度大数据传输,已被证明能有效解决跨数据中心间大数据传输难题.然而,多数现有调度方法将数据途经的所有网络节点(例如数据中心)均纳入SnF调度决策,导致其计算复杂度过高,难以为大规模网络提供实时调度服务.针对跨数据中心光网络场景,给出SnF模型,量化分析存储节点数量对调度问题性能与复杂度的影响.研究表明:在一定条件下,无需将所有节点都纳入调度决策也可获得良好的调度性能.由此,提出了节点约束SnF调度方法.该方法的特点在于:1)仅将部分数据途经节点纳入调度决策,降低调度问题求解难度;2)引入拓扑抽象,将被选节点间链路状态压缩,缩小调度问题规模、提高算法求解效率.仿真结果表明:在阻塞率和算法计算时间方面,该方法优于现有调度方法.

关键词: 大数据传输, 跨数据中心网络, 波长路由, 存储, 调度方法

Abstract: Performing store-and-forward (SnF) using abundant storage resources inside datacenters has been proven to be effective in overcoming the challenges faced by inter-datacenter bulk transfers. Most prior studies attempt to fully leverage the network infrastructure and maximize the flexibility of the SnF scheme. Their proposed scheduling methods hence aim at a full storage placement where all network nodes (e.g., datacenters) are SnF-enabled and every node is taken into account in the scheduling process. However, the computational complexity of the prior methods exponentially increases with the network scale. As a result, the prior methods may become too complicated to implement for large-scale networks and online scheduling. In this work, based on the inter-datacenter optical network, SnF models are presented to quantify the impact of the number of SnF-enabled nodes on the performance and the complexity of the SnF scheduling problem. Our key findings show that taking a few SnF-enabled nodes into account in the scheduling process can provide high performance while maintaining low complexity under certain circumstances. It is unnecessary to take every node into account in the scheduling process. Therefore, a node-constraint SnF scheduling method is proposed, whose features are twofold: 1) by taking a portion of nodes into account, it reduces the complexity of the SnF scheduling problem; 2) by introducing a topology abstraction, it condenses the link states between the considered nodes and hence reduces the problem size, which improves its efficiency in solving the SnF scheduling problem. Simulations demonstrate that the proposed method outperforms the prior method in terms of blocking probability and computation time.

Key words: big data transfers, inter-datacenter networks, wavelength routing, storage, scheduling method