高级检索

    非独立同分布数据下层次化Sinkhorn距离聚类联邦学习算法

    Hierarchical Sinkhorn Distance Clustering Federated Learning for Non-IID Data

    • 摘要: 联邦学习作为一种边缘计算中的新兴分布式神经网络训练方法面临着客户端数据异构性挑战,其中聚类联邦学习被认为是一种颇具潜力的解决方案,然而现有聚类联邦学习算法未深入探究量化客户端数据分布差异. 针对该问题提出了一种新颖的层次化聚类联邦学习算法(hierarchical Sinkhorn distance-based clustering federated algorithm,HS-CFA),采用最优传输代价衡量局部训练时客户端数据分布特性,提出层次化聚类策略动态调整全局模型聚合时的参与权重. 具体而言,HS-CFA引入Sinkhorn距离量化客户端间的分布差异,提出使用基于密度的聚类算法在服务器端进行动态层次聚类. 在多个基准数据集上的实验结果表明,相比传统算法在高度数据分布异构性的场景中显著提升了全局模型的精度和鲁棒性.

       

      Abstract: Federated learning, as an emerging distributed neural network training paradigm in edge computing, faces a key challenge of data heterogeneity over clients, which significantly impacts global model performance. While clustering federated learning, serving as a promising solution, has shown improvements in model accuracy, its effectiveness is limited by the insufficient utilization of local data distribution statistics. To address this issue, this paper proposes a novel federated learning framework named HS-CFA (Hierarchical Sinkhorn Distance-Based Clustering Federated Algorithm), which is designed to optimize the aggregation weights of local models under the heterogeneous data environments, thereby enhancing the performance of the global model. It utilizes the entropy-regularized optimal transport cost to capture the characteristics of local data distribution and dynamically adjusts the aggregation weights of local models with a hierarchical clustering strategy. Specially, HS-CFA employs the Sinkhorn distance as a lightweight optimal transport cost metric to measure distributional dissimilarities across clients. Furthermore, it adopts a hierarchical two-layer clustering mechanism, combining density-based spatial clustering and average aggregation during the server training phase, facilitating the dynamic and adaptive adjustment of local model aggregation weights. Experimental results on multiple benchmark datasets demonstrate that HS-CFA significantly enhances the accuracy and robustness of the global model in scenarios with highly heterogeneous distribution settings.

       

    /

    返回文章
    返回