ISSN 1000-1239 CN 11-1777/TP

• Software Technology •

### Multi-Source Fusion Link Prediction Algorithm Based on Low-Rank and Sparse Matrix Decomposition

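1. 1(School of Information Science and Technology, Sun Yat-sen University, Guangzhou 510006); 2(College of Information Science and Technology, Jinan University, Guangzhou 510632); 3(School of Software, Sun Yat-sen University, Guangzhou 510006) (jourkliu@163.com)
• Publication date: 2015-02-01
• Funding:
National Natural Science Foundation of China (61033010, 61272065, 61370021, 61472453, U1401256); Natural Science Foundation of Guangdong Province (S2011020001182, S2012010009311); Science and Technology Planning Project of Guangdong Province (2011B040200007, 2012A010701013)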

### Multiple Sources Fusion for Link Prediction via Low-Rank and Sparse Matrix Decomposition

Liu Ye1, Zhu Weiheng2, Pan Yan3, Yin Jian1

1. 1(School of Information Science and Technology, Sun Yat-sen University, Guangzhou 510006); 2(College of Information Science Technology, Jinan University, Guangzhou 510632); 3(School of Software, Sun Yat-sen University, Guangzhou 510006)
• Online: 2015-02-01

Abstract: Link prediction has recently become a popular research topic in link mining for social networks and other complex networks. Link prediction problems usually come with multiple additional sources of information that can be used to improve the predicted probability of links in the network. Among these sources, the major source usually plays the most significant role in prediction. It is therefore important to design a robust algorithm that makes full use of all the sources and balances the major source against the additional ones to obtain better link prediction results. Meanwhile, traditional unsupervised algorithms based on topological calculation remain among the most useful methods for computing link prediction scores. For these methods, the most important step is constructing a precise input seed matrix. However, many real-world network datasets are noisy, which decreases the accuracy of most link prediction methods. In this paper, we propose a novel method that uses multiple additional sources together with the leading seed source matrix. The seed source matrix is combined with the other sources to construct a new matrix with lower noise and a more precise structure than the seed matrix. The new matrix is then used as the input to a traditional unsupervised topological algorithm. Experimental results show that the proposed method achieves better link prediction performance on different kinds of real-world multi-source datasets.
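The pipeline described in the abstract (fuse the seed matrix with additional sources, separate a low-rank part from sparse noise, then score links with an unsupervised topological method) can be illustrated with a rough sketch. The abstract does not give the paper's actual formulation, so everything below is an assumption made for illustration: the sources are fused by simple horizontal stacking, the decomposition uses a generic ADMM loop for principal component pursuit, and the topological scorer is a weighted common-neighbors count. The function names `low_rank_sparse_split` and `fused_common_neighbor_scores` are hypothetical.

```python
import numpy as np


def soft_threshold(X, tau):
    """Elementwise shrinkage (proximal operator of the L1 norm)."""
    return np.sign(X) * np.maximum(np.abs(X) - tau, 0.0)


def svd_threshold(X, tau):
    """Singular value shrinkage (proximal operator of the nuclear norm)."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return (U * np.maximum(s - tau, 0.0)) @ Vt


def low_rank_sparse_split(M, lam=None, mu=None, n_iter=200, tol=1e-7):
    """Split M into L (low rank) + S (sparse) with a basic ADMM loop.

    Generic principal component pursuit; NOT the solver from the paper.
    """
    M = np.asarray(M, dtype=float)
    m, n = M.shape
    lam = 1.0 / np.sqrt(max(m, n)) if lam is None else lam
    mu = 0.25 * m * n / (np.abs(M).sum() + 1e-12) if mu is None else mu
    L = np.zeros_like(M)
    S = np.zeros_like(M)
    Y = np.zeros_like(M)  # scaled dual variable for the constraint M = L + S
    for _ in range(n_iter):
        L = svd_threshold(M - S + Y / mu, 1.0 / mu)
        S = soft_threshold(M - L + Y / mu, lam / mu)
        R = M - L - S
        Y = Y + mu * R
        if np.linalg.norm(R) <= tol * max(np.linalg.norm(M), 1.0):
            break
    return L, S


def fused_common_neighbor_scores(seed, extras):
    """Denoise the seed adjacency matrix using extra sources, then score links.

    Assumption: sources are fused by stacking them side by side, and the
    seed-sized block of the low-rank part is taken as the cleaned matrix.
    """
    seed = np.asarray(seed, dtype=float)
    n = seed.shape[0]
    stacked = np.hstack([seed] + [np.asarray(E, dtype=float) for E in extras])
    L, _ = low_rank_sparse_split(stacked)
    block = L[:, :n]
    # Symmetrize and clip so the result can be read as edge weights in [0, 1].
    cleaned = np.clip((block + block.T) / 2.0, 0.0, 1.0)
    np.fill_diagonal(cleaned, 0.0)
    # Weighted common neighbors: score(x, y) = sum_z w(x, z) * w(z, y).
    scores = cleaned @ cleaned
    np.fill_diagonal(scores, 0.0)
    return cleaned, scores
```

Higher entries in `scores` mark node pairs that the sketch considers more likely to be linked. Any other unsupervised topological index could replace the common-neighbors step, and the stacking-based fusion is only one of several plausible ways to combine a major source with additional ones.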