Review of Coded Computing

Zheng Tengfei, Zhou Tongqing, Cai Zhiping, Wu Hongjia   

  1. (College of Computer, National University of Defense Technology, Changsha 410073)
  • Online: 2021-10-01
    This work was supported by the National Key Research and Development Program of China (2020YFC2003400, 2018YFB0204301), the National Natural Science Foundation of China (62072465, 62102425, 62172155), and the National University of Defense Technology Research (ZK19-38).

Abstract: Coded computing integrates coding theory into distributed computing. Through flexible coding schemes, it reduces the heavy communication load caused by data shuffling and mitigates the computation delay caused by straggler nodes, thereby improving the overall performance of distributed computing systems. It also provides security guarantees for such systems through techniques such as error correction and data masking. Owing to its advantages in communication, storage, and computational complexity, coded computing has attracted wide attention from academia and has become an active direction in distributed computing. This survey first introduces the research background of coded computing and clarifies its definition and core ideas. It then reviews existing coded computing schemes, organized by the core challenge each addresses: communication bottlenecks, computation delay, and security and privacy. The schemes in each category are introduced, summarized, and compared. Finally, it identifies possible future research directions and technical challenges, as a reference for researchers in related fields.

Key words: coded computing, distributed computing, distributed machine learning, network coding, performance optimization, system security, data privacy