• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Zhou Jiang, Wang Weiping, Meng Dan, Ma Can, Gu Xiaoyan, Jiang Jie. Key Technology in Distributed File System Towards Big Data Analysis[J]. Journal of Computer Research and Development, 2014, 51(2): 382-394.
Citation: Zhou Jiang, Wang Weiping, Meng Dan, Ma Can, Gu Xiaoyan, Jiang Jie. Key Technology in Distributed File System Towards Big Data Analysis[J]. Journal of Computer Research and Development, 2014, 51(2): 382-394.

Key Technology in Distributed File System Towards Big Data Analysis

More Information
  • Published Date: February 14, 2014
  • With the arrival of big data period, data analysis and processing are becoming a more important technology which the data center and Internet companies depend on. Mass data storage is a hotspot topic in big data analysis with the expansion of information and variety of data structure. Traditional distributed file systems are lack of the new demands in scalability, reliability and performance. In this paper, a cluster file system towards big data analysis is designed, which is named Clover. Clover uses the namespace management based on directory sharding and consistent hashing to solve the problem of metadata extension. It provides metadata consistency for distributed transactions through a modified two-phase commit protocol. Moreover, Clover presents a highly available mechanism based on the shared storage pool. It achieves metadata reliability with hot standby and global state recovery mechanism. The evaluation results reveal that Clover could improve metadata performance linearly with the average value from 5.13% to 159.32% by adding one metadata server. Namespace management and distributed transactions would cause the degradation of performance on multiple metadata servers, but the influence is negligible (less than 10%). Comparing with HDFS, Clover could keep the similar throughput and quickly recover from metadata server failures. Practical application tests show that Clover is suitable for building high scalable and high available storage system.
  • Related Articles

    [1]Xiang Chaocan, Cheng Wenhui, Zhang Zhao, Jiao Xianlong, Qu Yuben, Chen Chao, Dai Haipeng. Intelligent Edge Computing-Empowered Adaptive Urban Traffic Sensing Data Recovery[J]. Journal of Computer Research and Development, 2023, 60(3): 619-634. DOI: 10.7544/issn1000-1239.202110962
    [2]Shen Yijie, Zeng Dan, Xiong Jin. A Benefit Model Based Data Reuse Mechanism for Spark SQL[J]. Journal of Computer Research and Development, 2020, 57(2): 318-332. DOI: 10.7544/issn1000-1239.2020.20190563
    [3]Shi Weisong, Zhang Xingzhou, Wang Yifan, Zhang Qingyang. Edge Computing: State-of-the-Art and Future Directions[J]. Journal of Computer Research and Development, 2019, 56(1): 69-89. DOI: 10.7544/issn1000-1239.2019.20180760
    [4]Zhang Qi, Hu Yupeng, Ji Cun, Zhan Peng, Li Xueqing. Edge Computing Application: Real-Time Anomaly Detection Algorithm for Sensing Data[J]. Journal of Computer Research and Development, 2018, 55(3): 524-536. DOI: 10.7544/issn1000-1239.2018.20170804
    [5]Xie Heng, Wang Mei, Le Jiajin, Sun Li. Calculation Results Characteristics Extract and Reuse Strategy Based on Hive[J]. Journal of Computer Research and Development, 2015, 52(9): 2014-2024. DOI: 10.7544/issn1000-1239.2015.20140548
    [6]Zhang Qi, Wang Mei, Le Jiajin, Liu Guohua. Scheduling Algorithm for the Reuse Buffers in Column-Store Data Warehouse Query Execution[J]. Journal of Computer Research and Development, 2011, 48(10): 1942-1950.
    [7]Zhong Liang, Hu Chunming, Wo Tianyu, Li Jianxin, and Kang Junbing. Prefetching Mechanism for On-demand Software Streaming[J]. Journal of Computer Research and Development, 2011, 48(7): 1178-1189.
    [8]Hu Xiao and Chen Shuming. Code Layout for Phase Prefetch on Instruction Cache[J]. Journal of Computer Research and Development, 2009, 46(5): 747-755.
    [9]Ban Zhijie, Gu Zhimin, Jin Yu. A Survey of Web Prefetching[J]. Journal of Computer Research and Development, 2009, 46(2): 202-210.
    [10]Wu Jiajun, Feng Xiaobing, Zhang Zhaoqing. Data Prefetching Technique of Nonlinear Memory Access[J]. Journal of Computer Research and Development, 2007, 44(2): 355-360.

Catalog

    Article views (1932) PDF downloads (2379) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return