ISSN 1000-1239 CN 11-1777/TP

Journal of Computer Research and Development ›› 2019, Vol. 56 ›› Issue (3): 623-634.doi: 10.7544/issn1000-1239.2019.20170961

Previous Articles     Next Articles

A Link Prediction Model Based on Hierarchical Information Granular Representation for Attributed Graphs

Luo Sheng1,2, Miao Duoqian1,2, Zhang Zhifei1,3, Zhang Yuanjian1,2, Hu Shengdan1,2   

  1. 1(Department of Computer Science and Technology, Tongji University, Shanghai 201804); 2(Key Laboratory of Embedded System and Service Computing (Tongji University), Ministry of Education, Shanghai 201804); 3(State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023)
  • Online:2019-03-01

Abstract: With the accumulation of the network graph data coupled with node attributes, the relations between node attributes and node linkages become more and more complex, which brings a lot of challenges to the task of the link prediction in complex network. The main reason is the inconsistency existing in the different source data, that is, the relations between the latent linkages which are implied by the node attributes and the observed linkages from network topological structure, respectively. This phenomenon directly affects the correctness and accuracy of link predictions. In order to effectively deal with multi-source data inconsistency and fuse the heterogeneous data, with the idea of granular computing and data multi-layer granular representation, we model the original data at different levels of granular representation. According to the data granular representation, we ultimately eliminate data inherent inconsistencies by finding the optimal granular structure. In this paper, we firstly define the data granular representation and the relation between different level granular; Then, we construct a log-likelihood model of the data, and place a lot of constraints decided by the granular relations to regularize the model; At last, we use the trained model to perform the link probability between nodes. Experiments show that, multi-source data can ultimately reduce the inconsistency by granular representation, and the statistic model regulated by these granular relations outperforms the state-of-the-art methods, and effectively improves the accuracy of the link prediction in the attributed graph.

Key words: granular representation learning, granular computing, attributed graph, link prediction, data fusion

CLC Number: