• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
高级检索

基于簇和阈值区间的高效关联规则隐藏算法

牛新征, 王崇屹, 叶志佳, 佘堃

牛新征, 王崇屹, 叶志佳, 佘堃. 基于簇和阈值区间的高效关联规则隐藏算法[J]. 计算机研究与发展, 2017, 54(12): 2785-2796. DOI: 10.7544/issn1000-1239.2017.20160612
引用本文: 牛新征, 王崇屹, 叶志佳, 佘堃. 基于簇和阈值区间的高效关联规则隐藏算法[J]. 计算机研究与发展, 2017, 54(12): 2785-2796. DOI: 10.7544/issn1000-1239.2017.20160612
Niu Xinzheng, Wang Chongyi, Ye Zhijia, She Kun. An Efficient Association Rule Hiding Algorithm Based on Cluster and Threshold Interval[J]. Journal of Computer Research and Development, 2017, 54(12): 2785-2796. DOI: 10.7544/issn1000-1239.2017.20160612
Citation: Niu Xinzheng, Wang Chongyi, Ye Zhijia, She Kun. An Efficient Association Rule Hiding Algorithm Based on Cluster and Threshold Interval[J]. Journal of Computer Research and Development, 2017, 54(12): 2785-2796. DOI: 10.7544/issn1000-1239.2017.20160612
牛新征, 王崇屹, 叶志佳, 佘堃. 基于簇和阈值区间的高效关联规则隐藏算法[J]. 计算机研究与发展, 2017, 54(12): 2785-2796. CSTR: 32373.14.issn1000-1239.2017.20160612
引用本文: 牛新征, 王崇屹, 叶志佳, 佘堃. 基于簇和阈值区间的高效关联规则隐藏算法[J]. 计算机研究与发展, 2017, 54(12): 2785-2796. CSTR: 32373.14.issn1000-1239.2017.20160612
Niu Xinzheng, Wang Chongyi, Ye Zhijia, She Kun. An Efficient Association Rule Hiding Algorithm Based on Cluster and Threshold Interval[J]. Journal of Computer Research and Development, 2017, 54(12): 2785-2796. CSTR: 32373.14.issn1000-1239.2017.20160612
Citation: Niu Xinzheng, Wang Chongyi, Ye Zhijia, She Kun. An Efficient Association Rule Hiding Algorithm Based on Cluster and Threshold Interval[J]. Journal of Computer Research and Development, 2017, 54(12): 2785-2796. CSTR: 32373.14.issn1000-1239.2017.20160612

基于簇和阈值区间的高效关联规则隐藏算法

基金项目: 国家自然科学基金项目(61300192);国家科技支撑计划基金项目(2013BAH33F02);中央高校基本科研业务费专项资金项目(ZYGX2014J052);四川省科技支撑计划基金项目(2015GZ0096);成都市科学技术局软科学研究项目(2015-RK00-00046-ZF);四川省公安厅科研项目(2015SCYYCX06);四川省自贡市公安局项目
详细信息
  • 中图分类号: TP301.6

An Efficient Association Rule Hiding Algorithm Based on Cluster and Threshold Interval

  • 摘要: 关联规则隐藏是隐私保护数据挖掘(privacy-preserving data mining, PPDM)的一种重要方法.针对当前的关联规则隐藏算法直接操作事务数据、I/O开销较大的缺陷,提出一种基于FP-tree快速关联规则隐藏的算法FP-DSRRC.算法首先对FP-tree的结构进行改进,增设事务编号索引并建立双向遍历结构,进而利用改进的FP-tree对事务信息进行快速处理,避免了遍历原始数据集产生的大量I/O时间;然后通过建立和维护事务索引表实现对敏感项的快速查找,并基于分簇策略对关联规则处理,以簇为单位进行敏感规则消除,同时采用规则支持度和置信度阈值区间的思想,减少了关联规则隐藏处理对原始数据集的影响;最后通过实验测试证明:相较于传统关联规则隐藏算法,FP-DSRRC算法在保证生成的数据集质量的同时,减少了50%~70%的算法执行时间,并在大规模真实数据集上有较好的可用性.
    Abstract: Association rules hiding is a very important method of privacy-preserving data mining (PPDM). Because the current association rules hiding algorithm operates the transaction database directly, it leads to a lot of I/O overhead. To solve this problem, we put forward a quick association rules hiding algorithm based on FT-tree, called FP-DSRRC. Firstly, the algorithm improves the structure of FP-tree by adding an index to the transaction number and establishing the bidirectional traverse structure. Then FP-DSRRC uses the improved FP-tree to quickly handle transaction data set, avoiding a large number of I/O overhead caused by traversal the raw transaction data set. Furthermore, FP-DSRRC finds the sensitive items quickly by building and maintaining a transaction index table, and then handles the association rules based on the clustering strategy. We eliminate the sensitive rules by clusters, and reduce the negative influence caused by association rules hiding progress to the original data set by adopting the idea of rule support and confidence degree interval at the same time. Finally, the experiment shows that compared with traditional association rules hiding algorithm, the executive time of FP-DSRRC has been decreased by 50%~70% while guaranteeing the quality of general data, moreover, FP-DSRRC has better availability on a large-scale real data set.
  • 期刊类型引用(3)

    1. 商涛,程瑶,陈禄明,邓立宗,蒋太交. 呼吸病学标准医学术语在电子病历中的使用情况调研. 中国科技术语. 2021(04): 53-59 . 百度学术
    2. 郑光敏,易天源,唐东昕,贺松. 基于BERT-BiLSTM-CRF模型的中国民族药知识抽取. 武汉大学学报(理学版). 2021(05): 393-402 . 百度学术
    3. 戴志宏,郝晓玲. 上下位关系抽取方法及其在金融市场的应用. 数据分析与知识发现. 2021(10): 60-70 . 百度学术

    其他类型引用(4)

计量
  • 文章访问数:  1078
  • HTML全文浏览量:  3
  • PDF下载量:  393
  • 被引次数: 7
出版历程
  • 发布日期:  2017-11-30

目录

    /

    返回文章
    返回