基于可重构架构的数据中心异构加速软硬件系统级平台

王彦伟; 李仁刚; 徐冉; 刘钧锴

doi:10.7544/issn1000-1239.202440041

基于可重构架构的数据中心异构加速软硬件系统级平台

Data Center Heterogeneous Acceleration Software-Hardware System-Level Platform Based on Reconfigurable Architecture

摘要

摘要: 构建数据中心加速服务的软硬件系统级原型平台，需要考虑高计算能力、扩展性、灵活性和低成本等因素. 为了提高数据中心的能力，从软硬件协同的角度研究数据中心异构计算在云平台架构、硬件实现、高速互连和应用等方面的创新，研究设计并构建了一个可重构组合的软硬件加速原型系统，简化了现有以处理器为中心的系统级计算平台构建方法，实现目标软硬件设计的快速部署与系统级原型验证. 针对以上目标，通过解耦的可重构架构设备虚拟化和远程映射等方法，发掘独立计算单元的潜力，构建了一套ISOF（independent system of FPGA（field programmable gate arrays））软硬件计算平台系统，可使其超越普通服务器设计所能提供的能力，实现计算单元低成本高效扩展，使客户端可灵活使用外设资源，并且为满足系统级通信挑战，设计了一套计算单元之间的通信硬件平台和交互机制. 此外，为提升软硬件系统级平台的敏捷性，ISOF提供了灵活统一的调用接口. 最后，通过对平台目标系统级的分析评估，验证了该平台在满足了当下计算与加速需求下，保证了高速、低延时的通信，以及良好的吞吐率和弹性扩容效率，另外在高速通信的基础上改进的拥塞避免和丢包恢复机制，满足了数据中心规模通信的稳定性需求.

Abstract: Constructing a software and hardware system-level prototype platform for accelerating data center services requires the consideration of factors such as high computing power, scalability, flexibility, and low cost. To enhance data center capabilities, research from the perspective of software-hardware synergy has been conducted on the innovation of heterogeneous computing in cloud platform architecture, hardware implementation, high-speed interconnection, and applications. A reconfigurable and combinable software-hardware acceleration prototype system is designed and built to simplify existing processor-centric system-level computing platform construction methods, enabling rapid deployment and system-level prototype validation of target software-hardware designs. To achieve these objectives, methods such as decoupled reconfigurable architecture device virtualization and remote mapping are utilized to uncover the potential of independent computing units. An ISOF (independent system of FPGA) software-hardware computing platform system is constructed to surpass the capabilities of conventional server designs, enabling low-cost and efficient expansion of computing units while allowing clients to flexibly utilize peripheral resources. To address system-level communication challenges, a communication hardware platform and interaction mechanism between computing units are designed. Additionally, to enhance the agility of the software-hardware system-level platform, ISOF provides a flexible and unified invocation interface. Finally, through the analysis and evaluation of the system-level objectives of the platform, it has been verified that the platform meets the current computing and acceleration requirements, ensuring high-speed, low-latency communication, as well as good throughput and efficient elastic scalability. In addition, improvements have been made in congestion avoidance and packet recovery mechanisms based on high-speed communication, meeting the stability requirements of communication at data center scale.

HTML全文

参考文献(31)

施引文献

资源附件(0)