• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Liang Bin, Li Guanghui, Dai Chenglong. G-mean Weighted Classification Method for Imbalanced Data Stream with Concept Drift[J]. Journal of Computer Research and Development, 2022, 59(12): 2844-2857. DOI: 10.7544/issn1000-1239.20210471
Citation: Liang Bin, Li Guanghui, Dai Chenglong. G-mean Weighted Classification Method for Imbalanced Data Stream with Concept Drift[J]. Journal of Computer Research and Development, 2022, 59(12): 2844-2857. DOI: 10.7544/issn1000-1239.20210471

G-mean Weighted Classification Method for Imbalanced Data Stream with Concept Drift

Funds: This work was supported by the National Natural Science Foundation of China (62072216).
More Information
  • Published Date: November 30, 2022
  • Concept drift and class imbalance in data stream seriously degrade the performance and stability of the traditional data stream classification algorithms. To solve this issue in binary classification of data stream, an online G-mean weighted ensemble classification method for imbalanced data stream with concept drift termed OGUEIL is proposed. It exploits the online update mechanism of component classifiers’ weights to modify block-based ensemble algorithms, combining the hybrid resampling and adaptive sliding window algorithm. OGUEIL is based on the ensemble learning framework that once a new instance reaches, each component classifier in the ensemble and its weight are correspondingly updated online, and the minority class instance is randomly oversampled at the same time. Particularly, each component classifier determines its weight according to the G-mean performance on several recently incoming instances, where G-mean of each component classifier is calculated based on the time decay factor increment. At the same time, OGUEIL periodically constructs a balanced dataset according to the data in the current sliding window and trains a new candidate classifier, then adds it to the ensemble based on specific conditions. The experimental results on both real-world and synthesized datasets show that the comprehensive performance of the proposed method outperforms other baseline algorithms.
  • Cited by

    Periodical cited type(7)

    1. 朱诗能,韩萌,杨书蓉,代震龙,杨文艳,丁剑. 不平衡数据流的集成分类方法综述. 计算机工程与应用. 2025(02): 59-72 .
    2. 蔡博,张海清,李代伟,向筱铭,于曦,邓钧予. 基于增量加权的不平衡漂移数据流分类算法. 计算机应用研究. 2024(03): 854-860 .
    3. 郭虎升,刘艳杰,王文剑. 基于混合特征提取的流数据概念漂移处理方法. 计算机研究与发展. 2024(06): 1497-1510 . 本站查看
    4. 王婧,郭虎升,王文剑. 基于弱监督集成的概念演化自适应检测方法. 吉林大学学报(信息科学版). 2024(03): 406-420 .
    5. 郭虎升,张洋,王文剑. 面向不同类型概念漂移的两阶段自适应集成学习方法. 计算机研究与发展. 2024(07): 1799-1811 . 本站查看
    6. 马乾骏,郭虎升,王文剑. 在线深度神经网络的弱监督概念漂移检测方法. 小型微型计算机系统. 2024(09): 2094-2101 .
    7. 穆栋梁,韩萌,李昂,刘淑娟,高智慧. 概念漂移复杂数据流分类方法综述. 计算机应用. 2023(06): 1664-1675 .

    Other cited types(3)

Catalog

    Article views (114) PDF downloads (72) Cited by(10)
    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return