Yan Hongfei, Zhang Xudong, Shan Dongdong, Mao Xianling, Zhao Xin. SIMD-Based Inverted Index Compression Algorithms[J]. Journal of Computer Research and Development, 2015, 52(5): 995-1004. DOI: 10.7544/issn1000-1239.2015.20131548

SIMD-Based Inverted Index Compression Algorithms

More Information
  • Published Date: April 30, 2015
  • The rapid growth of text information poses new challenges for traditional information retrieval. Large search engines rely on indexes to help users find the data they need, and the inverted index technique strongly affects query-processing efficiency in such systems. Inverted index data is stored as arrays of integers, and compression is needed to reduce the cost of storing that data on disk and in memory, to raise the CPU cache hit rate, and to speed up data transfer. Choosing a highly efficient compression algorithm is therefore necessary for effective query processing. In this paper, we propose two instruction-level-parallel algorithms, SIMD-PB and SIMD-PFD, which improve on two competitive compression algorithms, PackedBinary and PForDelta, respectively, by using SIMD instructions to accelerate their Pack and Unpack procedures. Experiments on the public GOV2 and ClueWeb09B datasets show that the new algorithms encode and decode quickly without impairing the compression ratio, and that they outperform the previously fastest inverted list compression algorithms by up to 17% in decompression speed. The experiments further indicate that the new algorithms decode faster on longer posting lists and with larger block sizes.
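
To make the Pack and Unpack procedures named in the abstract concrete, the following is a minimal scalar sketch of fixed-bit-width binary packing, the general idea behind PackedBinary-style codecs. It is not the authors' SIMD-PB implementation: the function names `pack` and `unpack`, the 32-bit word size, and the little-endian bit order are assumptions chosen for illustration. A SIMD variant would apply the same shift-and-mask steps to several integers per instruction.

```python
def pack(values, b):
    """Pack non-negative integers into 32-bit words, b bits per value.

    Hypothetical helper for illustration; the bit layout is an assumption.
    """
    assert 0 < b <= 32
    words, acc, filled = [], 0, 0
    for v in values:
        assert 0 <= v < (1 << b), "value does not fit in b bits"
        acc |= v << filled          # append the value above the bits already held
        filled += b
        while filled >= 32:         # flush every completed 32-bit word
            words.append(acc & 0xFFFFFFFF)
            acc >>= 32
            filled -= 32
    if filled:                      # flush the final, partially filled word
        words.append(acc & 0xFFFFFFFF)
    return words


def unpack(words, b, n):
    """Recover n integers of b bits each from the packed 32-bit words."""
    mask = (1 << b) - 1
    out, acc, filled = [], 0, 0
    it = iter(words)
    for _ in range(n):
        while filled < b:           # refill the bit buffer from the next word
            acc |= next(it) << filled
            filled += 32
        out.append(acc & mask)      # extract the low b bits
        acc >>= b
        filled -= b
    return out


block = [3, 0, 7, 5, 1, 6, 2, 4]    # example block; every value fits in 3 bits
packed = pack(block, 3)
restored = unpack(packed, 3, len(block))
```

Here eight 3-bit values occupy 24 bits, so the block fits in a single 32-bit word, and `unpack` restores the original list. Choosing one bit width per block is what makes the inner loop regular enough to vectorize.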
