基于双端联合学习的多视图聚类

杜亮; 李晓东; 陈艳; 周芃; 钱宇华

doi:10.7544/issn1000-1239.202440175

摘要: 在应对大规模多视图聚类这一挑战时都面临多个问题. 其中，一致性锚点图学习方法难以处理锚点图不对齐问题，并且过度依赖一致性图，限制了其聚类结果的准确性和可靠性；锚点图集成聚类方法则是在基聚类器的生成与融合过程中割裂了不同锚点图之间的联系，影响了其聚类效果的有效性和稳定性. 为解决这些问题，提出了一种基于双端联合学习的新型多视图聚类方法. 该方法充分考虑了多锚点图信息和锚点端聚类对样本端聚类的联合作用，实现了锚点端聚类和样本端聚类同步进行，并通过对多锚点图信息的综合实现了样本端聚类与多个锚点端聚类的集成对齐. 与现有方法不同，该方法无需直接学习一致性锚点图，可以处理任意类型的锚点不对齐问题，并且规避了图学习与图划分分步处理对聚类性能的不利影响. 此外，其在一个完整的优化框架中同时利用多个锚点图进行锚点端聚类和样本端聚类，有效解决了基聚类器生成阶段无法利用除自身外的其他锚点图和集成阶段无法充分利用所有锚点图的问题. 实验结果表明，所提出的方法在聚类性能和时间消耗方面均优于多个对比方法，有效增强了多视图数据的聚类性能. 所提出方法以及所采用对比方法的相关代码附可在http://github.com/lxd1204/DLMC中查询.

Abstract: Multi-anchor graph approaches have attracted more and more attention for their potential in addressing the challenges of large-scale multi-view clustering. However, existing methods leveraging multi-anchor graphs encounter several hurdles when they come to tackling this challenge. The consistency-anchored graph learning methods struggle with handling misaligned anchor graphs and necessitates additional post-processing with consistency graph, thereby constraining the accuracy and reliability of clustering outcomes. And the anchor graph ensemble clustering method fails to harness the complementary information from different views during the independent generation of candidate base clustering and overlooks the original anchor graphs during fusion, thus impacting the effectiveness and stability of clustering results. To address these challenges, we propose a novel approach based on double-ended joint learning for multi-view clustering. The method fully considers the duality between multi-anchor information and samples in multi-anchor graphs, achieving synchronized clustering between anchor-end and sample-end. Moreover, under the guidance of multi-anchor information, it achieves joint alignment between sample-end clustering and multiple anchor-end clustering. Unlike existing methods, the approach does not require direct learning of consistent anchor graph, thus can handle any type of anchor misalignment issues, and mitigate the negative impact of separate graph learning and partition on clustering performance. Additionally, it utilizes multiple anchor graphs for anchor-end clustering and sample-end clustering within a unified optimization framework, effectively addressing the limitations of base clustering and the ensemble stage in leveraging multiple anchor graphs. Experimental results demonstrate that the proposed method outperforms several comparative methods in terms of clustering performance and time consumption, effectively enhancing the clustering performance of multi-view data. The relevant code for the proposed method and comparative methods is provided in the supplementary material: http://github.com/lxd1204/DLMC.

基于双端联合学习的多视图聚类

Double-Ended Joint Learning for Multi-View Clustering