高级检索

    一种稳定的加速自适应粒球生成方法

    A Stable, Accelerated and Adaptive Granular-Ball Generation Method

    • 摘要: 粒球生成是粒球计算中进行分类、聚类等学习任务的基础.然而,现有自适应粒球生成方法因随机选择粒球划分中心,引入了较大不确定性,导致粒球分类结果极不稳定.此外,基于k-division的自适应粒球生成方法需反复执行粒球重叠检测及去重叠操作,大幅增加了粒球迭代过程的计算复杂度,降低了粒球生成效率.针对上述问题,本文基于k-division自适应粒球划分思想,提出了一种稳定、加速的自适应粒球生成方法.一方面,该方法通过选取与数据簇中值向量差异最小的样本点作为划分中心,降低了对异常值和离群点的敏感性,能够提供更加稳健的集中趋势描述,并有效避免了因随机选择中心而导致的不一致性与不稳定性.另一方面,该方法引入了粒球覆盖率的概念并基于此提出新的自适应划分条件,替代了原始k-division自适应粒球生成方法中的最低质量下限以及重叠检测和去重叠机制,显著提升了粒球生成过程的效率.在标准数据集及包含不同比例噪声的数据集上的实验结果表明,与主流自适应粒球生成方法相比,所提方法在保持分类精度的同时,展现了更强的稳定性和更高的效率.

       

      Abstract: Granular-ball generation (GBG) serves as the foundation for various learning tasks in granular-ball computing (GBC), such as classification and clustering. However, existing adaptive GBG methods introduce significant uncertainty due to the strategy of randomly selecting the division center, resulting in a highly unstable granular-ball classification result. Moreover, the k-division based adaptive GBG method requires repeated overlap detection and de-overlapping of granular balls, which significantly increases the computational overhead during the iterative process and impairs the efficiency of GBG. To address the above issues, a new method for stable, accelerated and adaptive GBG (SAAGBG) is proposed in this paper, building upon the existing k-division strategy. On the one hand, SAAGBG selects the sample point with the smallest difference with the median vector of the data cluster as the division center, which effectively enhances the robustness to outliers and avoids the inconsistency and instability caused random center selection. On the other hand, a new adaptive division condition based on granular-ball coverage is introduced into SAAGBG, which replaces the minimum quality threshold and overlap detection steps in the original k-division based GBG method, significantly improving the efficiency of the GBG process. Experimental results on benchmark datasets and datasets with varying noise ratios demonstrate that SAAGBG achieves greater stability and efficiency without loss of classification accuracy compared to the mainstream adaptive GBG methods.

       

    /

    返回文章
    返回