高级检索

    一种基于模态逻辑的聚类结果评价方法

    An Index of Cluster Validity Based on Modal Logic

    • 摘要: 聚类评价指标对衡量一个聚类的优劣有着重要作用.现有的聚类评价指标通常都基于统计理论或模糊理论.收到基础理论的限制,在一些特殊场合,这些指标不能对聚类进行正确的评估.提出了一种基于模态逻辑的新的聚类评价指标.通过把相似性定义成数据集上的二元关系聚类被描述成Kripke结构.用原子公式表示每个簇后,聚类的结果可以用一组逻辑公式来表示.根据最小描述长度原则,聚类评价指标由这种表示方式的准确性和复杂性构成.由于这种新的评价指标对相似性没有任何附加的限制,它较之现有的评价指标更为通用,而那些指标往往都默认了某种相似性度量方式.列举了用于对比新旧指标的实验.实验结果表明,这种新的评价指标在一般情况下与大多数评价指标一致,而在一些类似“双环”的特殊情况下比现有评价方式更有效.

       

      Abstract: Clustering validity index plays an important role to show whether a clustering is good enough. Most of current indexes are based on statistical theory and fuzzy theory. Limited by the basic theories, these indexes would give some incorrect indication in some special cases. In this paper, a new index of clustering validity index which is based on the theory of modal logic is presented. The clustering is described by Kripke structures, where the similarity is defined as a binary relation on the data set. Each cluster is represented by a propositional sentence so that the result of clustering can be represented by logical formulas. According to minimum description length principle, the clustering validity index is built by veracity and complexity of the representation. Since this new index imposes no additional restrictive conditions of the similarity measurement for clustering, it is therefore more universal than current ones which usually contain default measurement of similarity. The experiments to compare the new index with the common indexes are also shown in this paper. The experimental results show that this new index is consistent with others in the normal case as well as more effective in some special cases such as the two rings data set.

       

    /

    返回文章
    返回