高级检索

    DPZB+tree:基于ZNS SSD与持久化内存的高效B+树索引设计

    DPZB+tree: An Efficient B+tree Index for ZNS SSD with Persistent Memory

    • 摘要: 新型分区命名空间固态硬盘(zoned namespace solid state drive,ZNS SSD)有望解决传统块设备固态盘写放大率高、存储密度低、I/O路径复杂等问题,为存储技术的发展创造机遇。B+树作为一种高效的树形索引结构,被广泛应用于各类数据库和文件系统中,以支撑大模型高效数据加载、外部知识库构建及结构化元数据管理,从而显著提升训练效率与知识调用性能。然而,由于ZNS SSD的硬件特性不同于传统块设备,直接将B+树部署到ZNS SSD中不仅会导致较高的写放大率,还会引起级联更新,严重影响存储系统的性能。针对以上问题,结合新型持久化内存(persistent memory,PM)提出了一种基于ZNS SSD的B+树索引结构DPZB+tree。首先,DPZB+tree采用DRAM-PM-ZNS SSD混合存储架构,实现冷热数据分离存储;其次,DPZB+tree设计了冷热节点识别策略,以提高存储系统的读写效率;然后,针对PM容量有限的问题,提出了冷热节点动态放置策略,实现冷热数据的自适应迁移;最后,结合硬件特性和局部性原理设计了叶节点分裂及合并操作。DPZB+tree索引方案基于ZNS SSD模拟器和英特尔傲腾PM实现。实验结果表明,在多种工作负载下,相较于LSM-tree,SSDB+tree,DZB+tree,Baseline,DPZB+tree均取得了优异的读写性能以及更低的恢复耗时。

       

      Abstract: Emerging Zoned Namespace Solid-State Drive (ZNS SSD) addresses several critical issues inherent in traditional SSDs, including high write amplification, low storage density, and complex I/O paths. These advancements create new opportunities for the progress of storage technologies. The B+tree, as an efficient tree index structure, is widely used in various databases and file systems to support the efficient loading of large models, the construction of external knowledge bases, and the management of structured metadata, thereby significantly improving training efficiency and knowledge invocation performance. However, due to the hardware characteristics of ZNS SSDs being different from traditional block devices, directly deploying B+tree to ZNS SSDs not only results in higher write amplification rates but also causes cascading updates, seriously affecting the performance of the storage system. For the above problems, this paper proposes the DPZB+tree, a B+tree indexing structure optimized for ZNS SSDs and incorporating Persistent Memory (PM). First, the DPZB+tree adopts a hybrid DRAM-PM-ZNS SSD storage architecture, thereby effectively separating hot and cold data. Second, the DPZB+tree introduces a lightweight hot/cold node identification strategy, which in turn improves I/O efficiency. Third, to address the limited capacity of PM, this paper presents a dynamic node placement strategy that adaptively migrates data between PM and ZNS SSD. Finally, this paper combines hardware characteristics and spatiotemporal locality principles to design leaf node splitting and merging operations. DPZB+tree was implemented using a ZNS SSD simulator and Intel Optane Persistent Memory. Experimental results show that under various workloads, DPZB+tree outperforms LSM tree, SSDB+tree, DZB+tree, and Baseline in terms of performance and recovery time.

       

    /

    返回文章
    返回