ISSN 1000-1239 CN 11-1777/TP

Journal of Computer Research and Development, 2020, Vol. 57, Issue (2): 281-290. doi: 10.7544/issn1000-1239.2020.20190574

Special topic: 2020 Special Topic on Frontier Technologies for Big Data and Intelligent Storage Systems

• System Architecture •

A Hybrid Approach for Managing Data Pages in Persistent Memory File Systems

Chen Youmin1, Zhu Bohong1, Han Yinjun2, Tu Yaofeng2, and Shu Jiwu1

  1. 1(Department of Computer Science and Technology, Tsinghua University, Beijing 100084); 2(ZTE Corporation, Nanjing 210012) (chenym16@mails.tsinghua.edu.cn)
  • Published: 2020-02-01
  • Online: 2020-02-01
  • Supported by: 
    This work was supported by the National Key Research and Development Program of China (2018YFB1003301), the Project of ZTE (20182002008), and the Science and Technology Innovation Project of Guangdong Province (2018B010109002).

Abstract: In April 2019, Intel officially released Optane DC Persistent Memory, which is based on 3D XPoint technology and provides new opportunities for building efficient persistent memory storage systems. However, existing storage software does not exploit the byte-addressability of persistent memory well, so it falls far short of the hardware's performance. This paper proposes a hybrid data page management (HDPM) mechanism for file systems. HDPM manages file data by selectively using copy-on-write and a log structure, which takes full advantage of byte-addressability and avoids the write amplification that single-mode approaches incur on unaligned or small writes. To keep read performance unaffected, HDPM introduces a reverse-scanning mechanism that rebuilds data pages from the log without additional data copies. HDPM also introduces a multi-stage garbage collection mechanism for log cleaning: when a single log grows too large, it is reclaimed proactively in the read/write path; when persistent memory space runs low, a background thread reclaims log space asynchronously with a lock-free approach, without affecting normal read/write performance. Experiments show that, compared with NOVA, a state-of-the-art persistent memory file system, HDPM reduces single-threaded write latency by up to 58% under small, write-intensive workloads while leaving read latency unaffected. In multi-threaded Filebench tests with 10 concurrent threads, HDPM improves throughput over NOVA by up to 33%.

Key words: persistent memory, file system, copy-on-write, log-structure, garbage collection

CLC number:
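The abstract names two ideas, selective use of copy-on-write versus a log, and reverse scanning of the log when rebuilding a page, but it does not give HDPM's actual policy or data structures. The following is therefore only a toy in-memory sketch under stated assumptions: a 4096-byte page, a hypothetical `HybridFile` class, and a simple rule that aligned full-page writes install a new page while all other writes are appended to a log. None of these names or thresholds come from the paper.

```python
PAGE_SIZE = 4096  # assumed page size; not specified in the abstract


class HybridFile:
    """Toy model: whole pages plus a log of small writes (hypothetical)."""

    def __init__(self):
        self.pages = {}  # page number -> bytes of length PAGE_SIZE
        self.log = []    # (page_no, offset_in_page, data), oldest first

    def write(self, offset, data):
        pos = 0
        while pos < len(data):
            page_no, in_page = divmod(offset + pos, PAGE_SIZE)
            chunk = data[pos:pos + PAGE_SIZE - in_page]
            if in_page == 0 and len(chunk) == PAGE_SIZE:
                # Aligned full-page write: install a fresh page
                # (copy-on-write style) instead of logging it.
                self.pages[page_no] = bytes(chunk)
                # Simplification: older log entries for this page are
                # now superseded, so drop them immediately.
                self.log = [e for e in self.log if e[0] != page_no]
            else:
                # Unaligned or small write: append only the bytes that
                # were written, avoiding a full-page copy.
                self.log.append((page_no, in_page, bytes(chunk)))
            pos += len(chunk)

    def read_page(self, page_no):
        out = bytearray(PAGE_SIZE)
        filled = bytearray(PAGE_SIZE)  # 1 where out[] is already final
        remaining = PAGE_SIZE
        # Reverse scan: visit the newest entries first, so each byte is
        # copied into the output at most once.
        for p, off, data in reversed(self.log):
            if p != page_no:
                continue
            for i, b in enumerate(data):
                if not filled[off + i]:
                    out[off + i] = b
                    filled[off + i] = 1
                    remaining -= 1
            if remaining == 0:
                break
        if remaining:
            # Bytes not covered by the log come from the base page
            # (zeros if the page was never written as a whole).
            base = self.pages.get(page_no, bytes(PAGE_SIZE))
            for i in range(PAGE_SIZE):
                if not filled[i]:
                    out[i] = base[i]
        return bytes(out)
```

In this sketch a 3-byte write at offset 10 costs one short log entry rather than a 4096-byte page copy, and `read_page` assembles the result directly in the output buffer without first materializing an intermediate merged page. A real persistent memory file system would additionally need persistence ordering, crash consistency, and the garbage collection stages the abstract describes, none of which are modeled here.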