基于2阶段集成的多层网络社区发现算法

赵兴旺; 张珧溥; 梁吉业

doi:10.7544/issn1000-1239.202220214

基于2阶段集成的多层网络社区发现算法

Two-Stage Ensemble-Based Community Discovery Algorithm in Multilayer Networks

摘要

摘要: 社区发现旨在挖掘复杂网络蕴含的社区结构，是复杂网络分析的重要任务之一. 然而,现有的社区发现方法大多针对单层网络数据，对现实世界中广泛存在的多层网络数据的研究较少. 针对多层网络的社区发现问题，提出了一个基于2阶段集成的社区发现算法，以提高社区发现结果的准确性和可解释性. 首先，在各层分别得到基社区划分；其次以各层社区划分结构信息为主并结合其他各层网络得到的基社区划分中最优的社区划分信息进行局部集成；再次，基于信息熵对各层局部社区划分中各个社区的稳定性进行度量，并通过其他层社区划分结果来对各个局部社区划分的准确性进行评价；最后，基于各个社区以及社区划分的重要性进行全局加权集成得到最终的社区划分结果. 在人造多层网络和真实多层网络数据上与已有的多层网络社区发现算法进行了比较分析. 实验结果表明，提出的算法在多层模块度、标准化互信息等评价指标上优于已有算法.

Abstract: Community discovery aims to uncover the community structure embedded in complex networks, which is one of the important tasks in complex network analysis. However, most of the existing community discovery methods are aimed at single-layer network data, and less research has been done on the widespread multi-layer networks in the real world. In order to solve the problem of community discovery in multi-layer networks, we propose a two-stage ensemble-based community discovery algorithm. The algorithm can improve the accuracy and interpretability of community discovery results. Firstly, after the base communities are obtained at each layer, the local ensemble is performed based on the base community structure information of each layer and the optimal base community structure of each other layer. Secondly, the stability of each community in the local community division of each layer is measured based on information entropy, and the accuracy of each local community division is evaluated through the results of other layer community division. Finally, a global weighted ensemble is carried out based on the importance of each community and the community structure to obtain the final community structure. A comparative analysis is carried out on artificial multi-layer networks and real multi-layer networks with existing multi-layer network community discovery algorithms. The experimental results show that our proposed algorithm is superior to the existing algorithms in terms of multi-layer modularity and normalized mutual information.

HTML全文

参考文献(39)

施引文献

资源附件(0)