• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Zheng Yasong, Wang Da, Ye Xiaochun, Cui Huimin, Xu Yuanchao, Fan Dongrui. MALK: A MapReduce Framework for High-Efficiently Processing Large Amount of Keys[J]. Journal of Computer Research and Development, 2014, 51(12): 2711-2723. DOI: 10.7544/issn1000-1239.2014.20131333
Citation: Zheng Yasong, Wang Da, Ye Xiaochun, Cui Huimin, Xu Yuanchao, Fan Dongrui. MALK: A MapReduce Framework for High-Efficiently Processing Large Amount of Keys[J]. Journal of Computer Research and Development, 2014, 51(12): 2711-2723. DOI: 10.7544/issn1000-1239.2014.20131333

MALK: A MapReduce Framework for High-Efficiently Processing Large Amount of Keys

More Information
  • Published Date: November 30, 2014
  • The overhead of memory allocation is one of the major bottlenecks for shared-memory MapReduce, especially for the applications that have large amount of keys. In order to solve this problem, this paper presents a less memory consumption MapReduce, namely MALK, which can high-efficiently process applications with a large number of keys. Firstly, MALK succeeds in avoiding the constant allocations of massive small memory blocks by managing the discrete keys using contiguous area of storage. Secondly, MALK pipelines the process of Map-tasks and Reduce-tasks to decrease the active data in the system at the same time, and proposes a reusable mechanism of Hash table to reuse the memory space so as to avoid the memory reallocation of Hash table. What is more, MALK determines the suitable number of Reduce tasks, by evaluating the effect of task quantity and granularity on performance, to get optimal performance. The experiments show that, compared with Phoenix++, MALK achieves up to 3.8X higher speedup (average of 2.8X), and saves up to 95.2% memory in Map phase and 87.8% memory in Reduce phase. In addition, MALK reduces 30% waiting time with better load balance in Reduce phase, and cuts down more than 35% cache miss rate on average.
  • Related Articles

    [1]Wang Qing, Zhu Bohong, Shu Jiwu. A Multicore-Friendly Persistent Memory Key-Value Store[J]. Journal of Computer Research and Development, 2021, 58(2): 397-405. DOI: 10.7544/issn1000-1239.2021.20200381
    [2]Chen Bo, Lu Youyou, Cai Tao, Chen Youmin, Tu Yaofeng, Shu Jiwu. A Consistency Mechanism for Distributed Persistent Memory File System[J]. Journal of Computer Research and Development, 2020, 57(3): 660-667. DOI: 10.7544/issn1000-1239.2020.20190074
    [3]Zhang Liqing, Guo Dong, Wu Shaoling, Cui Haibo, Wang Wei. An Ultra Lightweight Container that Maximizes Memory Sharing and Minimizes the Runtime Environment[J]. Journal of Computer Research and Development, 2019, 56(7): 1545-1555. DOI: 10.7544/issn1000-1239.2019.20180511
    [4]You Litong, Wang Zhenjie, Huang Linpeng. A Log-Structured Key-Value Store Based on Non-Volatile Memory[J]. Journal of Computer Research and Development, 2018, 55(9): 2038-2049. DOI: 10.7544/issn1000-1239.2018.20180258
    [5]Chen Juan, Hu Qingda, Chen Youmin, Lu Youyou, Shu Jiwu, Yang Xiaohui. A Tiny-Log Based Persistent Transactional Memory System[J]. Journal of Computer Research and Development, 2018, 55(9): 2029-2037. DOI: 10.7544/issn1000-1239.2018.20180294
    [6]Zhu Pengfei, Lu Tianyue, Chen Mingyu. A Trace-Driven Simulation of Memory System in Multithread Applications[J]. Journal of Computer Research and Development, 2015, 52(6): 1266-1277. DOI: 10.7544/issn1000-1239.2015.20150160
    [7]Zhong Qi, Wang Jing, Guan Xuetao, Huang Tao, Wang Keyi. Data Object Scale Aware Rank-Level Memory Allocation[J]. Journal of Computer Research and Development, 2014, 51(3): 672-680.
    [8]Zhao Hui, Yang Shuqiang, Chen Zhikun, Yin Hong, and Jin Songchang. Optimization of Range Queries and Analysis for MapReduce Systems[J]. Journal of Computer Research and Development, 2014, 51(3): 606-617.
    [9]Yuan Qingbo, Zhao Jianbo, Chen Mingyu, Sun Ninghui. Performance Bottleneck Analysis and Solution of Shared Memory Operating System on a Multi-Core Platform[J]. Journal of Computer Research and Development, 2011, 48(12): 2268-2276.
    [10]Liang Yi, Wang Lei, Fan Jianping, Fang Juan. Research on the Shared Memory-Based Checkpointing for Cluster Services[J]. Journal of Computer Research and Development, 2010, 47(4): 571-580.

Catalog

    Article views (1402) PDF downloads (543) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return