Survey on In-Network Storage Systems

Wang Qing; Li Junru; Shu Jiwu

doi:10.7544/issn1000-1239.202220865

Abstract: Programmable network devices, represented by programmable switches and SmartNICs, are increasingly deployed in data centers. They can execute customized data processing logic directly on the network data path, which opens new opportunities for building high-performance in-network storage systems. However, these devices have tight hardware resource constraints (e.g., limited expressive power and small memory), so fully exploiting their advantages to accelerate storage systems remains challenging. This paper systematically reviews research progress on in-network storage systems. First, we describe the hardware architecture and performance characteristics of programmable network devices and, on that basis, summarize two major challenges in building high-performance in-network storage systems: 1) the division of labor between hardware and software, and 2) system fault tolerance. Then, we classify and describe existing in-network storage systems according to the task performed by the programmable network device (data caching, distributed coordination, request scheduling, and data aggregation), and use several systems as examples to analyze the corresponding design difficulties and software techniques. Finally, we identify open problems for further research on in-network storage systems, including switch-NIC collaboration, data security, multi-tenancy, and automatic function offloading.
