消除互补性争议的多视图聚类算法

赵玉涵; 陈松灿

doi:10.7544/issn1000-1239.202440426

消除互补性争议的多视图聚类算法

Multi-View Clustering Algorithm via Eliminating the Complementarity Controversy

摘要

摘要: 多视图聚类旨在利用来自不同视图的异构信息发现底层数据结构，并划分样本所属类别. 一致性和互补性是影响多视图聚类性能的2个关键要素. 一致性强调不同视图间的语义相似性，互补性则强调每个视图内特有信息的相互补充. 目前对一致性研究已相对深入，但对互补性研究存在争议，其中一些方法认为一致性和互补性能互助，但仅将二者约束至同一特征空间中实际上造成了二者的冲突. 而另一些方法则据此认为应丢弃互补信息，但这又造成信息浪费. 直觉上互补性应该存在，贡献在于发现了现有方法没有足够洞悉并触及到互补性的本质，即一致性和互补性并非独立而是相互耦合，结果导致冲突. 受此启发，通过解耦实现了2种信息的分离，具体使它们位于不同的特征子空间而非现在的同一特征空间，从而发展出了一种兼顾一致性和互补性的多视图聚类算法，在有效提取出互补信息的同时避免二者冲突. 在标准数据集上的对比实验验证了所提算法的有效性.

Abstract: Multi-view clustering aims to use heterogeneous information from different views to discover the underlying data structure and divide the samples into clusters. Consistency and complementarity are two key elements that affect the performance of multi-view clustering. Consistency emphasizes the semantic similarity between different views. Complementarity emphasizes the mutual supplementation of specific information within each view. At present, the study of consistency has been relatively in-depth, but the study of complementarity is controversial, in which some methods believe that consistency and complementarity can assist each other, but constraining them to the same feature space actually causes a conflict between them. Other approaches accordingly argue that complementary information should be discarded, but this would result in a waste of information. Intuitively, complementarity should exist. The contribution of this paper is to find that existing methods do not have enough insight into the essence of complementarity, i.e., consistency and complementarity are not independent but entangled with each other, which results in conflict. Motivated by this finding, we realize the separation of the two kinds of information through disentangling, specifically making them located in different feature subspaces instead of the same feature space, thus developing a multi-view clustering algorithm that takes into account both consistency and complementarity, effectively extracting the complementary information while avoiding the conflict between consistency and complementarity. Comparative experiments on standard datasets demonstrate the effectiveness of the proposed algorithm.

HTML全文

参考文献(42)

施引文献

资源附件(0)