• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Ji Yimu, Zhang Yongpan, Lang Xianbo, Zhang Dianchao, Wang Ruchuan. Parallel of Decision Tree Classification Algorithm for Stream Data[J]. Journal of Computer Research and Development, 2017, 54(9): 1945-1957. DOI: 10.7544/issn1000-1239.2017.20160554
Citation: Ji Yimu, Zhang Yongpan, Lang Xianbo, Zhang Dianchao, Wang Ruchuan. Parallel of Decision Tree Classification Algorithm for Stream Data[J]. Journal of Computer Research and Development, 2017, 54(9): 1945-1957. DOI: 10.7544/issn1000-1239.2017.20160554

Parallel of Decision Tree Classification Algorithm for Stream Data

More Information
  • Published Date: August 31, 2017
  • With the rise of cloud computing, Internet of things and other technologies, streaming data exists widely in telecommunications, Internet, finance and other fields as a new form of big data. Compared with the traditional static data, stream data in big data has the characters of rapidness, continuity and changing with time. At the same time, the implicit distribution of the data stream will bring about the concept drift problem. In order to satisfy the requirements of stream data classification algorithms in big data, we must improve the traditional static offline data classification algorithms, and propose P-HT parallel algorithm based on distributed computing platform Storm. To meet the requirements of Storm stream processing platform, we improve the flexibility and versatility of the algorithm through sliding window mechanism, alternative tree mechanism and parallel processing mechanism, and the algorithm can adapt to the concept-drift of data stream very well. Finally, we experimentally verify the validity and high efficiency of the algorithm. The results show that the improved P-HT algorithm has better throughput and faster processing speed than the traditional C45 algorithm in the case of no reduction in accuracy.
  • Related Articles

    [1]Han Songshen, Guo Songhui, Xu Kaiyong, Yang Bo, Yu Miao. Perturbation Analysis of the Vital Region in Speech Adversarial Example Based on Frame Structure[J]. Journal of Computer Research and Development, 2024, 61(3): 685-700. DOI: 10.7544/issn1000-1239.202221034
    [2]Li Ru, Wang Zhiqiang, Li Shuanghong, Liang Jiye, Collin Baker. Chinese Sentence Similarity Computing Based on Frame Semantic Parsing[J]. Journal of Computer Research and Development, 2013, 50(8): 1728-1736.
    [3]Zhou Jingang, Zhao Dazhe, Xu Li, Liu Jiren. Frame Refinement: Combining Frame-Based Software Development with Stepwise Refinement[J]. Journal of Computer Research and Development, 2013, 50(4): 711-721.
    [4]Zhang Yan, Yu Shengyang, Zhang Chongyang, Yang Jingyu. Extraction and Removal of Frame Line in Form Bill[J]. Journal of Computer Research and Development, 2008, 45(5): 909-914.
    [5]Mi Congjie, Liu Yang, and Xue Xiangyang. Video Texts Tracking and Segmentation Based on Multiple Frames[J]. Journal of Computer Research and Development, 2006, 43(9): 1523-1529.
    [6]Zhang Dongming, Shen Yanfei, Lin Shouxun, Zhang Yongdong. Low Complexity Mode Decision for H.264 Inter Frame Encoding[J]. Journal of Computer Research and Development, 2006, 43(9): 1516-1522.
    [7]Tang Yunting, Cheng Xianyi. The Studying of Frame APRF of Pattern-Recognition Based on Agent[J]. Journal of Computer Research and Development, 2006, 43(5): 867-873.
    [8]Wang Fangshi, Xu De, and Wu Weixin. A Cluster Algorithm of Automatic Key Frame Extraction Based on Adaptive Threshold[J]. Journal of Computer Research and Development, 2005, 42(10): 1752-1757.
    [9]Wang Rongrong, Jin Wanjun, and Wu Lide. A Novel Video Caption Detection Approach Using Multi-Frame Integration[J]. Journal of Computer Research and Development, 2005, 42(7): 1191-1197.
    [10]Zhang Chongyang, Chen Qiang, Lou Zhen, Yang Jingyu. A Form Frame Line Removal Algorithm Based on Gray-Level Image[J]. Journal of Computer Research and Development, 2005, 42(4): 635-639.
  • Cited by

    Periodical cited type(2)

    1. 谢景明,胡伟方,韩林,赵荣彩,荆丽娜. 基于“嵩山”超级计算机系统的量子傅里叶变换模拟. 计算机科学. 2021(12): 36-42 .
    2. Ze-yao MO. 超大规模并行计算:瓶颈与对策(英文). Frontiers of Information Technology & Electronic Engineering. 2018(10): 1251-1261 .

    Other cited types(1)

Catalog

    Article views (1690) PDF downloads (978) Cited by(3)

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return