• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Wang Chunlei, Zhang Yanfeng, Bao Yubin, Zhao Changkuan, Yu Ge, Gao Lixin. Asyn-SimRank: An Asynchronous Large-Scale SimRank Algorithm[J]. Journal of Computer Research and Development, 2015, 52(7): 1567-1579. DOI: 10.7544/issn1000-1239.2015.20140313
Citation: Wang Chunlei, Zhang Yanfeng, Bao Yubin, Zhao Changkuan, Yu Ge, Gao Lixin. Asyn-SimRank: An Asynchronous Large-Scale SimRank Algorithm[J]. Journal of Computer Research and Development, 2015, 52(7): 1567-1579. DOI: 10.7544/issn1000-1239.2015.20140313

Asyn-SimRank: An Asynchronous Large-Scale SimRank Algorithm

More Information
  • Published Date: June 30, 2015
  • The SimRank algorithm, which exploits network structure to measure the similarity between node pairs, has been widely used in many areas, such as online social networks and link prediction. In recent years, with the development of big data, the input data set of the SimRank algorithm is constantly increasing. People are utilizing distributed computing models, such as MapReduce, to design large-scale SimRank algorithm for solving the big data problems. However, since SimRank algorithm contains a high-cost iterative process with synchronization barriers between iterations and the computational complexity is high in each iteration, the large-scale SimRank computation does not result in the satisfactory performance. In this paper, 1)We propose Asyn-SimRank by employing the iterate-cumulate approach,which asynchronously executes the core iterative computation to avoid the high-cost synchronization barriers in large-scale distributed environments, and effectively reduces the amount of computation and communication.2) We further propose the keypoint priority scheduling mechanism to accelerate convergence. 3)We prove the accuracy and convergence property of Asyn-SimRank as well as the efficiency of the keypoint priority scheduling. 4)We then implement Asyn-SimRank on Maiter, which is a distributed framework supporting asynchronous iteration. Expremental results show that, compared with the SimRank and Delta-SimRank implementing on Hadoop and Spark, the large-scale Asyn-SimRank significantly promotes the computational efficiency and accelerates the convergence.

Catalog

    Article views (1725) PDF downloads (682) Cited by()
    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return