高级检索

    基于随机游走路径的自监督图拓扑不平衡学习

    Self-Supervised Graph Topology-Imbalance Learning Based on Random Walk Paths

    • 摘要: 图拓扑不平衡问题,由于节点在拓扑空间中的不均匀和不对称分布,对图神经网络性能产生了严重的负面影响. 当前的研究主要侧重于标记节点,而对无标记节点的关注较少. 为应对这一挑战,提出了一种基于随机游走路径的自监督学习方法,旨在解决拓扑不平衡问题带来的同质性假设限制、拓扑距离衰减以及注释衰减等难题. 该方法引入了多跳路径的子图邻域概念,以更全面地捕捉节点之间的关系和局部特征. 首先,通过路径间聚合策略,学习多跳路径中的同质和异质特征,从而不仅保留了节点的原始属性,还维护了它们在随机游走序列中的初始结构连接. 此外,结合了基于多条路径的子图采样和子图生成策略以及结构化的对比损失,最大化了同一节点局部子图的内在特征,从而增强了图表示的表达能力. 经过实验验证,该方法在多种不平衡场景下都表现出了出色的有效性和泛化性能. 这一研究为解决图拓扑不平衡问题提供了新的方法和视角.

       

      Abstract: The problem of topological imbalance in graphs, arising from the non-uniform and asymmetric distribution of nodes in the topological space, significantly hampers the performance of graph neural networks. Current research predominantly focuses on labeled nodes, with relatively less attention given to unlabeled nodes. To address this challenge, we propose a self-supervised learning method based on random walk paths aimed at tackling the issues posed by topological imbalance, including the constraints imposed by homogeneity assumptions, topological distance decay, and annotation attenuation. Our method introduces the concept of multi-hop paths within the subgraph neighborhood, aiming to comprehensively capture relationships and local features among nodes. Firstly, through a strategy of aggregating between paths, we can learn both homogeneous and heterogeneous features within multi-hop paths, thereby preserving not only the nodes' original attributes but also maintaining their initial structural connections in the random walk sequences. Additionally, by combining a strategy of aggregating subgraph samples based on multiple paths with structured contrastive loss, we maximize the intrinsic features of local subgraphs for the same node, enhancing the expressive power of graph representations. Experimental results validate the effectiveness and generalization performance of our method across various imbalanced scenarios. This research provides a novel approach and perspective for addressing topological imbalance issues.

       

    /

    返回文章
    返回