
    Storage Arrays Global Cache Technology Based on CXL

    Abstract: Global cache is a core technology in enterprise storage arrays. Its primary function is to speed up writes: a user I/O returns as soon as its data is written to the in-memory cache of a storage controller, without waiting for the data to be persisted to slower HDDs or SSDs. The design of a global cache in enterprise storage products, however, must jointly account for the number of replicas, system high availability, and failure scenarios. Because these factors constrain one another, a storage cluster typically cannot exceed 4 controllers, and the system cannot survive the concurrent failure of 3 out of 4 controllers. In addition, transferring cache replica data between controllers over RDMA (remote direct memory access) degrades storage performance. To address these issues, a memory-disaggregated global cache system prototype is designed based on the latest CXL (Compute Express Link) technology. Moving the memory cache out of the controllers offers three advantages: first, controllers and memory cache can scale out independently, and multiple cache replicas can be supported; second, a controller failure does not cause cache data loss, so n−1 out of n controllers can fail concurrently; third, CXL significantly improves storage performance while also improving system scalability and availability. Experimental results show that the CXL-based global cache achieves a 38% performance improvement over the RDMA-based global cache.
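    The availability claim can be illustrated with a toy model. The sketch below is not the authors' prototype. It assumes a hypothetical ring placement of cache replicas across controllers and a replica count of 2 (the abstract does not state one), and it ignores failures of the external memory devices themselves. It brute-forces the largest number of concurrent controller failures that every failure combination survives, for controller-resident cache versus disaggregated cache.

```python
from itertools import combinations

def ring_placement(n_controllers, replicas):
    """Hypothetical layout: cache partition p is held by controllers
    p, p+1, ..., p+replicas-1 (mod n_controllers)."""
    return [
        {(p + k) % n_controllers for k in range(replicas)}
        for p in range(n_controllers)
    ]

def coupled_survives(failed, placement):
    """Controller-resident cache: every partition needs a live replica holder."""
    return all(holders - failed for holders in placement)

def disaggregated_survives(failed, n_controllers):
    """External cache (assumed reliable here): controller failures do not
    touch cached data, so one live controller is enough to keep serving I/O."""
    return len(failed) < n_controllers

def max_tolerated_failures(n_controllers, survives):
    """Largest f such that every set of f concurrent controller failures is survived."""
    tolerated = 0
    for f in range(1, n_controllers + 1):
        if all(survives(set(c)) for c in combinations(range(n_controllers), f)):
            tolerated = f
        else:
            break
    return tolerated

n = 4
placement = ring_placement(n, replicas=2)  # replica count is an assumption
coupled = max_tolerated_failures(n, lambda failed: coupled_survives(failed, placement))
disagg = max_tolerated_failures(n, lambda failed: disaggregated_survives(failed, n))
print(coupled, disagg)  # prints "1 3"
```

    In this model, the controller-resident design tolerates one fewer failure than its replica count, while the disaggregated design tolerates n−1 controller failures regardless of replica count, which matches the second advantage listed in the abstract.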

       
