Liu Ying, Yang Ke. Credit Fraud Detection for Extremely Imbalanced Data Based on Ensembled Deep Learning[J]. Journal of Computer Research and Development, 2021, 58(3): 539-547. DOI: 10.7544/issn1000-1239.2021.20200324

Credit Fraud Detection for Extremely Imbalanced Data Based on Ensembled Deep Learning

Funds: This work was supported by the National Social Science Foundation of China (20BTJ062).
  • Published Date: February 28, 2021
  • Abstract: Class imbalance in credit fraud data significantly undermines model performance. When the sample distribution is extremely imbalanced, noise caused by information distortion, statistical discrepancy, and reporting bias severely disrupts model training and can lead to problems such as overfitting. To address this, the paper proposes an algorithm based on an ensembled deep belief network for credit fraud data with extreme class imbalance. First, a joint sampling strategy that combines under-sampling and over-sampling is used to draw the training subsets. Next, classifier clusters are constructed in two stages: support vector classifiers and random forest classifiers are combined with a Boosting algorithm to overcome the deviation of the support vector machine's classification interface (decision boundary). Finally, a deep belief network aggregates the classifiers' predictions and outputs the final classification result. In addition, traditional evaluation methods place too much emphasis on the majority samples and ignore the fact that the minority class matters more in practice, so the paper introduces a revenue cost index that accounts for the identification of both positive and negative samples. An empirical study on European credit card data finds that the proposed algorithm scores 3% higher on the revenue cost index than the average of the compared algorithms. The experiments also evaluate how the imbalance ratio affects performance and find that the proposed algorithm outperforms the others in this respect as well.
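
The joint sampling step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the sampling ratios or the exact resampling method, so everything here is an assumption: the helper name `joint_sample`, its parameters, and the use of plain random under-sampling (without replacement) and random over-sampling (with replacement). It shows one plausible way to draw a single training subset, not the authors' procedure.

```python
import random
from collections import Counter


def joint_sample(X, y, n_majority, n_minority, minority_label=1, seed=0):
    """Draw one training subset by combining random under-sampling of the
    majority class with random over-sampling of the minority class.

    Hypothetical helper for illustration; the paper's actual strategy and
    ratios are not specified in the abstract.
    """
    rng = random.Random(seed)
    minority_idx = [i for i, label in enumerate(y) if label == minority_label]
    majority_idx = [i for i, label in enumerate(y) if label != minority_label]
    if not minority_idx or not majority_idx:
        raise ValueError("both classes must be present")

    # Under-sample: keep at most n_majority distinct majority rows.
    kept_majority = rng.sample(majority_idx, min(n_majority, len(majority_idx)))
    # Over-sample: draw n_minority minority rows with replacement.
    drawn_minority = [rng.choice(minority_idx) for _ in range(n_minority)]

    idx = kept_majority + drawn_minority
    rng.shuffle(idx)
    return [X[i] for i in idx], [y[i] for i in idx]


# Toy data: 1000 legitimate (0) and 5 fraudulent (1) transactions.
X = [[float(i)] for i in range(1005)]
y = [0] * 1000 + [1] * 5

X_sub, y_sub = joint_sample(X, y, n_majority=50, n_minority=25)
counts = Counter(y_sub)
print(counts[0], counts[1])  # prints "50 25"
```

Calling the helper with different `seed` values would yield different subsets, which is one way a cluster of base classifiers could each be trained on its own resampled data.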