ISSN 1000-1239 CN 11-1777/TP

计算机研究与发展 ›› 2019, Vol. 56 ›› Issue (4): 755-766.doi: 10.7544/issn1000-1239.2019.20170832

• 系统结构 • 上一篇    下一篇

基于工作负载感知的固态硬盘阵列系统的架构设计与研究

张强1,梁杰1,许胤龙1,2,李永坤1,2   

  1. 1(中国科学技术大学计算机科学与技术学院 合肥 230026); 2(安徽省高性能计算重点实验室(中国科学技术大学) 合肥 230026) (zhgqiang@mail.ustc.edu.cn)
  • 出版日期: 2019-04-01
  • 基金资助: 
    国家自然科学基金项目(61520106005,61761136014);国家重点研发计划项目(2017YFB1010000)

Research of SSD Array Architecture Based on Workload Awareness

Zhang Qiang1, Liang Jie1, Xu Yinlong1,2, Li Yongkun1,2   

  1. 1(School of Computer Science and Technology, University of Science and Technology of China, Hefei 230026); 2(Anhui Province Key Laboratory of High Performance Computing (University of Science and Technology of China), Hefei 230026)
  • Online: 2019-04-01

摘要: 在RAID-0阵列中设计了一种基于冷热数据分离存储的固态硬盘阵列系统架构HA-RAID,并结合滑动窗口技术进行优化.其主要思想是,利用固定长度的滑动窗口将阵列系统中的盘划分为普通盘和热点盘,热点盘专门存放热数据,普通盘存放普通数据,且阵列中各盘的角色随着滑动窗口的移动而不断变化.在8块商用固态硬盘组成的RAID-0阵列系统上对HA-RAID予以实验分析.实验结果表明,相比于未引入冷热数据分离机制的原始RAID-0做法,HA-RAID可以将热数据相对均匀地存储到各个盘上,很好地实现了阵列系统级的负载和磨损均衡,从而将阵列中热点盘出现的比例降低到几乎为0.在真实的企业级工作负载下,相比原始RAID-0,HA-RAID减少12.01%~41.06%的平均响应时间,很好地实现了阵列系统级的I/O性能提升.

关键词: SSD阵列架构设计, 冷热数据区分, RAID-0, 负载均衡, 响应时间

Abstract: The fixed data layout of traditional array system and the locality of workloads cause the partial disks of the array system to become hot disks, which affects the reliability and the overall concurrency performance of the array system. This paper proposes a new RAID architecture for SSD array systems, HA-RAID, which leverages hot/cold data separation and sliding window techniques. The main idea is that HA-RAID divides the disk array into hot disks and ordinary disks, which stores hot data on hot disks and cold data on ordinary disks, and it changes the role of each disk dynamically by moving a fixed-length sliding window. So, each disk has the opportunity to become a hot disk and stores hot data which achieves the purpose of storing hot data evenly on each disk. Experiments under real-world workloads on a RAID-0 array system composed of eight commercial SSDs show that HA-RAID can achieve an even distribution of hot data across all disks and reduce the percentage of hot disks appearing in the array to almost zero. This implies that HA-RAID achieves load balance and wear balance at the device level. In terms of performance, HA-RAID reduces the average response time by 12.01%~41.06% which achieves the I/O performance enhancement, compared with traditional RAID-0 array.

Key words: SSD array architecture design, cold and hot data identification, RAID-0, load balance, response time

中图分类号: