Dong Kunjie, Zhou Lihua, Zhu Yueying, Du Guowang, Huang Tong. Heterogeneous Attribute Network Embedding Based on the PPMI[J]. Journal of Computer Research and Development, 2022, 59(12): 2781-2793. DOI: 10.7544/issn1000-1239.20210763
Citation:
Dong Kunjie, Zhou Lihua, Zhu Yueying, Du Guowang, Huang Tong. Heterogeneous Attribute Network Embedding Based on the PPMI[J]. Journal of Computer Research and Development, 2022, 59(12): 2781-2793. DOI: 10.7544/issn1000-1239.20210763
Dong Kunjie, Zhou Lihua, Zhu Yueying, Du Guowang, Huang Tong. Heterogeneous Attribute Network Embedding Based on the PPMI[J]. Journal of Computer Research and Development, 2022, 59(12): 2781-2793. DOI: 10.7544/issn1000-1239.20210763
Citation:
Dong Kunjie, Zhou Lihua, Zhu Yueying, Du Guowang, Huang Tong. Heterogeneous Attribute Network Embedding Based on the PPMI[J]. Journal of Computer Research and Development, 2022, 59(12): 2781-2793. DOI: 10.7544/issn1000-1239.20210763
1(School of Information Science and Engineering, Yunnan University, Kunming 650504)
2(School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116086)
Funds: This work was supported by the National Natural Science Foundation of China (62062066, 61762090, 61966036, 62276227); the National Social Science Foundation of China (18XZZ005); the Key Project of Basic Research Program of Yunnan Province (202201AS070015); and the Science Research Foundation of Education Department of Yunnan Province (2021Y026).
Attribute network embedding aims to map nodes and link relationships in a network into a latent low-dimensional space, while preserving the intrinsic essence of node attribute and network topology. Heterogeneous attribute network contains the multiple-typed nodes and link relationships, which provide the rich auxiliary information and bring the new challenges for the network embedding. A novel model named HANEP (heterogeneous attribute network embedding based on the PPMI) is proposed for mapping multiple-typed nodes and link relationship in a heterogeneous attribute network into a latent low-dimensional space, while preserving the attribute features of nodes as well as complex, diverse and rich semantic information of different-typed heterogeneous links. Specifically, HANEP first transforms attribute features into an attribute graph and extracts network topology graphs based on the different meta-paths. Next, it constructs the probabilistic co-occurrence (PCO) matrixes with respect to nodes attribute and multiple topology graphs by the random surfing respectively, calculates the positive point-wise mutual information (PPMI), and then learns representations of nodes by the multiple auto-encoders. Meta-paths can capture the link relationships between the multiple types of nodes in a heterogeneous network, the attribute graph clearly describes the non-linear manifolds structure of node attributes, pairwise constraint is helpful to integrate the consistency and complementary relationships, and PPMI representations can capture the high-order proximity and potentially nonlinear relationships of attribute and topology. Experimental results on three datasets verify the effectiveness of the HANEP.