• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Fu Junjie, Liu Gongshen. A GEV-Based Classification Algorithm for Imbalanced Data[J]. Journal of Computer Research and Development, 2018, 55(11): 2361-2371. DOI: 10.7544/issn1000-1239.2018.20170514
Citation: Fu Junjie, Liu Gongshen. A GEV-Based Classification Algorithm for Imbalanced Data[J]. Journal of Computer Research and Development, 2018, 55(11): 2361-2371. DOI: 10.7544/issn1000-1239.2018.20170514

A GEV-Based Classification Algorithm for Imbalanced Data

More Information
  • Published Date: October 31, 2018
  • The problem of binary classification with imbalanced data appears in many fields and is still not completely solved. In addition to predicting the classification label directly, many applications also care about the probability that data belongs to a certain class. However, much of the existing research is mainly focused on the classification performance but neglects the probability estimation. The aim of this paper is to improve the performance of class probability estimation (CPE) and ensure the classification performance. A new approach of regression is proposed by adopting the generalized linear model as the basic framework and using the calibration loss function as the objective optimization function. Considering the asymmetry and the flexibility of the generalized extreme value (GEV) distribution, we use it to formulate the link function, which contributes to binary classification with imbalanced data. As to the model estimation, because of the significant influence of the shape parameter on modeling precision, two methods to estimate the shape parameter in GEV distribution are proposed. Experiments on synthetic datasets prove the accuracy of the shape parameter estimation. Besides, experimental results on real data suggest that our proposed approach, compared with other three commonly used regression algorithms, performs well on the classification performance as well as CPE. In addition, the proposed algorithm also outperforms other optimization algorithms in terms of the computational efficiency.
  • Cited by

    Periodical cited type(9)

    1. 陈城,裴慧坤,刘丙财,林国安,魏恩伟,温启良. 基于公共边缘节点的输电物联网网关异构协议适配方法研究. 电测与仪表. 2024(11): 142-147 .
    2. 许明宇,王宜怀. 异构物联网中关联数据一致性规则挖掘模型. 计算机仿真. 2023(02): 425-428+442 .
    3. 常伟鹏,袁泉. 融合多模式匹配的网络信息实体关联研究仿真. 计算机仿真. 2021(01): 331-335 .
    4. 马早霞,李磊,刘心. 基于LoRaWAN协议的双向认证接入机制的研究. 河北工程大学学报(自然科学版). 2021(01): 92-98 .
    5. 汪滢,熊璐,刘晓. 基于大数据处理的模式匹配算法效率分析. 现代电子技术. 2021(09): 124-128 .
    6. 屈春一. 非均质性海量复杂异构数据的混合云存储技术. 单片机与嵌入式系统应用. 2021(08): 26-30 .
    7. 吴进伟,苏恺,董文斌. 基于混沌反馈控制的物联网配网物资数据选择算法研究. 电子设计工程. 2020(12): 105-108+113 .
    8. 韩高峰. 智能网络系统低匹配度数据深度挖掘算法研究. 宁夏师范学院学报. 2020(04): 82-88 .
    9. 张瑾. 电能计量在生活中的重要性研究. 电子元器件与信息技术. 2019(05): 99-102 .

    Other cited types(1)

Catalog

    Article views (974) PDF downloads (485) Cited by(10)
    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return