• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Shi Qianyu, Liang Jiye, Zhao Xingwang. A Clustering Ensemble Algorithm for Incomplete Mixed Data[J]. Journal of Computer Research and Development, 2016, 53(9): 1979-1989. DOI: 10.7544/issn1000-1239.2016.20150592
Citation: Shi Qianyu, Liang Jiye, Zhao Xingwang. A Clustering Ensemble Algorithm for Incomplete Mixed Data[J]. Journal of Computer Research and Development, 2016, 53(9): 1979-1989. DOI: 10.7544/issn1000-1239.2016.20150592

A Clustering Ensemble Algorithm for Incomplete Mixed Data

More Information
  • Published Date: August 31, 2016
  • Cluster ensembles have recently emerged a powerful clustering analysis technology and caught high attention of researchers due to their good generalization ability. From the existing work, these techniques held great promise, most of which generate the final results for complete data sets with numerical attributes. However, real life data sets are usually incomplete mixed data described by numerical and categorical attributes at the same time. And these existing algorithms are not very effective for an incomplete mixed data set. To overcome this deficiency, this paper proposes a new clustering ensemble algorithm which can be used to ensemble final clustering results for mixed numerical and categorical incomplete data. Firstly, the algorithm conducts completion of incomplete mixed data using three different missing value filling methods. Then, a set of clustering solutions are produced by executing K-Prototypes clustering algorithm on three different kinds of complete data sets multiple times, respectively. Next, a similarity matrix is constructed by considering all the clustering solutions. After that, the final clustering result is obtained by hierarchical clustering algorithms based on the similarity matrix. The effectiveness of the proposed algorithm is empirically demonstrated over some UCI real data sets and three benchmark evaluation measures. The experimental results show that the proposed algorithm is able to generate higher clustering quality in comparison with several traditional clustering algorithms.
  • Related Articles

    [1]Kong Hao, Lu Wenyan, Chen Yan, Yan Guihai, Li Xiaowei. Survey of Sort Acceleration Methods on FPGA[J]. Journal of Computer Research and Development, 2024, 61(3): 780-798. DOI: 10.7544/issn1000-1239.202220789
    [2]Qi Le, Chang Yisong, Chen Yuxiao, Zhang Xu, Chen Mingyu, Bao Yungang, Zhang Ke. A System-Level Platform with SoC-FPGA for RISC-V Hardware-Software Integration[J]. Journal of Computer Research and Development, 2023, 60(6): 1204-1215. DOI: 10.7544/issn1000-1239.202330060
    [3]Li Xiaobo, Tang Zhimin, Li Wen. FPGA Verification for Heterogeneous Multi-Core Processor[J]. Journal of Computer Research and Development, 2021, 58(12): 2684-2695. DOI: 10.7544/issn1000-1239.2021.20200289
    [4]Chen Ji, Liu Haikun, Wang Xiaoyuan, Zhang Yu, Liao Xiaofei, Jin Hai. Largepages Supported Hierarchical DRAMNVM Hybrid Memory Systems[J]. Journal of Computer Research and Development, 2018, 55(9): 2050-2065. DOI: 10.7544/issn1000-1239.2018.20180269
    [5]Li Junnan, Yang Xiangrui, Sun Zhigang. DrawerPipe: A Reconfigurable Packet Processing Pipeline for FPGA[J]. Journal of Computer Research and Development, 2018, 55(4): 717-728. DOI: 10.7544/issn1000-1239.2018.20170927
    [6]Zhu Ying, Chen Cheng, Xu Xiaohong, and Li Yanzhe. Design and Implementation of FPGA Verification Platform for Multi-core Processor[J]. Journal of Computer Research and Development, 2014, 51(6): 1295-1303.
    [7]Xia Fei, Dou Yong, Xu Jiaqing, Zhang Yang. Fine-Grained Parallel Zuker Algorithm Accelerator with Storage Optimization on FPGA[J]. Journal of Computer Research and Development, 2011, 48(4): 709-719.
    [8]Wang Jiandong, Zhu Chao, Xie Yingke, Han Chengde, Zhao Zili. FPGA-Based Parallel Real-Time System for 10Gbps Traffic Processing[J]. Journal of Computer Research and Development, 2009, 46(2): 177-185.
    [9]Hao Zhiquan, Wang Zhensong, Liu Bo. Research on Real-Time Realizing PGA Algorithm in FPGA[J]. Journal of Computer Research and Development, 2008, 45(2): 342-347.
    [10]Guo Meng, Jian Fangjun, Zhang Qin, Xu Bin, Wang Zhensong, Han Chengde. FPGA-Based Real-Time Imaging System for Spaceborne SAR[J]. Journal of Computer Research and Development, 2007, 44(3).
  • Cited by

    Periodical cited type(2)

    1. 李翔宇,李瑞兴,曾燕清. 基于改进核函数的支持向量机时间序列数据分类. 信阳农林学院学报. 2021(01): 121-126 .
    2. 宋奎勇,王念滨,王红滨. 基于Shapelets的多变量D-S证据加权集成分类. 吉林大学学报(信息科学版). 2021(02): 205-214 .

    Other cited types(6)

Catalog

    Article views (1456) PDF downloads (651) Cited by(8)

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return