ISSN 1000-1239 CN 11-1777/TP

Journal of Computer Research and Development ›› 2015, Vol. 52 ›› Issue (9): 1965-1975. doi: 10.7544/issn1000-1239.2015.20140832


A Nash-Pareto Strategy Based Automatic Data Distribution Method and Its Supporting Tool

Wang Xiaoyan1,2,3, Chen Jinchuan1, Guo Xiaoyan4, Du Xiaoyong1,3,5   

1(Key Laboratory of Data Engineering and Knowledge Engineering (Renmin University of China), Ministry of Education, Beijing 100872); 2(Information Center, The Supreme People's Court, Beijing 100745); 3(School of Information, Renmin University of China, Beijing 100872); 4(EMC Labs China, Beijing 100084); 5(State Key Laboratory of Software Development Environment (Beihang University), Beijing 100191)
Online: 2015-09-01

Abstract: The era of big data brings new challenges to data storage and management. As data volumes grow dramatically, automatic data distribution has become both a key technique and an intractable problem for distributed systems. Drawing on existing studies of data, workload, and nodes in this field, this work abstracts the data distribution problem as a triangle model called DaWN (data, workload, node) and summarizes the pairwise relationships among the three as data fragmentation, data allocation, and workload processing. Based on DaWN, it proposes an automatic data distribution solution for large-scale on-line transaction processing (OLTP) applications and discusses each module of this integrated architecture and the interactions among them. Building on our earlier research, it puts the optimal equilibrium behavior of the Nash-Pareto strategy into practice. A series of experiments shows that the proposed approach achieves good overall performance and effectiveness. This work also implements a prototype tool called ADDvisor that supports automatic data distribution, with the aim of helping more research results move smoothly into real-world practice and of effectively coordinating automatic data distribution in large-scale distributed OLTP applications.

Key words: data distribution, triangle model, automatic solution, optimal equilibrium, on-line transaction processing (OLTP)
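
The abstract does not spell out how the Nash-Pareto strategy is computed, so the following is only a toy sketch of the general idea of an "optimal equilibrium": among the pure Nash equilibria of a small game, keep those that no other equilibrium Pareto-dominates. Everything in it is an assumption made for illustration and not the authors' method: the two "nodes" as players, the fragment labels `A` and `B`, the cost table `COST`, and the function names are all hypothetical.

```python
from itertools import product

# Hypothetical setup: two nodes each pick which fragment ("A" or "B") to host.
STRATEGIES = ("A", "B")

# Hypothetical cost table: COST[(choice_node0, choice_node1)] = (cost0, cost1).
# Lower cost is better. The numbers are invented for illustration only.
COST = {
    ("A", "A"): (4, 4),
    ("A", "B"): (2, 3),
    ("B", "A"): (1, 1),
    ("B", "B"): (5, 5),
}


def is_nash(profile):
    """True if no single player can lower its own cost by deviating alone."""
    for player in range(len(profile)):
        for alt in STRATEGIES:
            deviated = list(profile)
            deviated[player] = alt
            if COST[tuple(deviated)][player] < COST[profile][player]:
                return False
    return True


def dominates(a, b):
    """True if profile a is no worse than b for everyone and better for someone."""
    ca, cb = COST[a], COST[b]
    return all(x <= y for x, y in zip(ca, cb)) and any(x < y for x, y in zip(ca, cb))


def nash_pareto_profiles():
    """Pure Nash equilibria that are not Pareto-dominated by another equilibrium."""
    equilibria = [p for p in product(STRATEGIES, repeat=2) if is_nash(p)]
    return [p for p in equilibria if not any(dominates(q, p) for q in equilibria)]


print(nash_pareto_profiles())  # [('B', 'A')]
```

In this invented table both `("A", "B")` and `("B", "A")` are equilibria, but the second is cheaper for both nodes, so it is the only one kept. Brute-force enumeration is used purely for readability and would not scale to a realistic OLTP deployment.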
