ISSN 1000-1239 CN 11-1777/TP

• System Architecture •

### Research on a Double-Layer Index Architecture for Cloud Data Processing Based on Concurrent Skiplists

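1. (School of Software, Yunnan University, Kunming 650091) (zwei@ynu.edu.cn)
• Funding:
National Natural Science Foundation of China (61363021, 61363084); Open Foundation of the Key Laboratory of Software Engineering of Yunnan Province (2011SE01, 2012SE304); Yunnan Province Youth Foundation (2012FD004); Scientific Research Foundation of the Education Department of Yunnan Province (2014Y013)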

### Concurrent Skiplist Based Double-Layer Index Framework for Cloud Data Processing

Zhou Wei, Lu Jin, Zhou Keren, Wang Shipu, Yao Shaowen

1. (School of Software, Yunnan University, Kunming 650091)
• Online: 2015-07-01

Abstract: Cloud data processing is an essential piece of infrastructure in cloud systems. Without efficient structures, cloud systems cannot sustain the high throughput needed to serve millions of users. However, most existing cloud storage systems adopt a distributed hash table (DHT) approach to index data, which lacks support for range queries and for dynamic, real-time updates. A scalable, dynamic index structure that supports multiple query types is therefore needed in cloud environments. Based on a summary and analysis of double-layer index systems for cloud storage, this paper proposes a novel concurrent skiplist based double-layer index (referred to as CSD-index) for cloud data processing. A two-layer architecture, which can break through the memory and hard drive limitations of a single machine, is used to extend the indexing scope. An online migration algorithm for skiplist nodes between local servers is used to achieve dynamic load balancing. The design and implementation of the concurrent skiplist are discussed in detail. The optimistic concurrency control (OCC) technique is introduced to enhance concurrency. Through the concurrent skiplist, CSD-index improves the load-bearing capacity of the global index and enhances the overall throughput of the index. Experimental results show the efficiency of the concurrent skiplist based double-layer index and its viability as an alternative approach for cloud-suitable data structures.
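To illustrate why an ordered structure such as a skiplist can answer the range queries that a hash-based index cannot, the following is a minimal single-threaded sketch. It is not the CSD-index implementation: it omits the concurrency control, the two-layer layout, and node migration, and the names `SkipList`, `Node`, `MAX_LEVEL`, and `range_query` are assumptions made for this example only.

```python
import random


class Node:
    def __init__(self, key, value, level):
        self.key = key
        self.value = value
        # forward[i] is the next node at level i (level 0 is the full list)
        self.forward = [None] * level


class SkipList:
    MAX_LEVEL = 16  # illustrative cap on tower height (an assumption)

    def __init__(self, seed=None):
        self._rng = random.Random(seed)
        self.head = Node(None, None, self.MAX_LEVEL)  # sentinel, holds no key
        self.level = 1  # number of levels currently in use

    def _random_level(self):
        # Each extra level is kept with probability 1/2
        level = 1
        while level < self.MAX_LEVEL and self._rng.random() < 0.5:
            level += 1
        return level

    def insert(self, key, value):
        # update[i] will be the rightmost node at level i with key < new key
        update = [self.head] * self.MAX_LEVEL
        node = self.head
        for i in range(self.level - 1, -1, -1):
            while node.forward[i] is not None and node.forward[i].key < key:
                node = node.forward[i]
            update[i] = node
        nxt = node.forward[0]
        if nxt is not None and nxt.key == key:
            nxt.value = value  # key already present: overwrite the value
            return
        new_level = self._random_level()
        self.level = max(self.level, new_level)
        new_node = Node(key, value, new_level)
        for i in range(new_level):
            new_node.forward[i] = update[i].forward[i]
            update[i].forward[i] = new_node

    def range_query(self, low, high):
        # Descend to the last node with key < low, then walk level 0
        node = self.head
        for i in range(self.level - 1, -1, -1):
            while node.forward[i] is not None and node.forward[i].key < low:
                node = node.forward[i]
        node = node.forward[0]
        result = []
        while node is not None and node.key <= high:
            result.append((node.key, node.value))
            node = node.forward[0]
        return result
```

Because keys stay sorted along level 0, `range_query(low, high)` locates the first key at or above `low` through the upper levels and then scans forward until it passes `high`. A hash table has no such ordering, so it would have to examine every entry.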