• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Lu Guoqing, Zhang Xiaojian, Ding Liping, Li Yanfeng, Liao Xin. Frequent Sequential Pattern Mining under Differential Privacy[J]. Journal of Computer Research and Development, 2015, 52(12): 2789-2801. DOI: 10.7544/issn1000-1239.2015.20140516
Citation: Lu Guoqing, Zhang Xiaojian, Ding Liping, Li Yanfeng, Liao Xin. Frequent Sequential Pattern Mining under Differential Privacy[J]. Journal of Computer Research and Development, 2015, 52(12): 2789-2801. DOI: 10.7544/issn1000-1239.2015.20140516

Frequent Sequential Pattern Mining under Differential Privacy

More Information
  • Published Date: November 30, 2015
  • Frequent sequential pattern mining is an exploratory problem in the field of data mining. However, directly releasing the discovered frequent patterns and the corresponding true supports may reveal the individuals privacy. The state-of-the-art solution for this problem is differential privacy, which offers a strong degree of privacy protection by adding noise. Due to the inherent sequentiality and high-dimensionality in sequences, it is challenging to apply differential privacy to frequent sequential pattern mining. To address those problems, this paper presents a differentially private method called Diff-FSPM that uses an interactive way to find frequent patterns. To reduce the impact of dimensionality, Diff-FSPM first employs the exponential mechanism to obtain the optimal sequential length for truncating each sequence in the original dataset. After that, Diff-FSPM relies on a prefix-tree to compress all the frequent patterns, and then utilizes the Laplace mechanism to perturb the true support of the frequent patterns. To efficiently allocate the privacy budget, Diff-FSPM uses the closet frequent pattern and Markov assumption to guide the mining process. Finally, Diff-FSPM adopts a post-processing technique with consistency constraint to boost the accuracy of the returned noisy support counts. We theoretically prove that Diff-FSPM satisfies ε-differential privacy, and the experimental results show that it outperforms the existing methods in terms of utility.
  • Related Articles

    [1]Liu Song, Wu Weiguo, Zhao Bo, Jiang Qing. Loop Tiling for Optimization of Locality and Parallelism[J]. Journal of Computer Research and Development, 2015, 52(5): 1160-1176. DOI: 10.7544/issn1000-1239.2015.20131387
    [2]Zhou Xu, Zhou Yantao, Ouyang Aijia, Li Kenli. An Efficient Tile Assembly Model for Maximum Clique Problem[J]. Journal of Computer Research and Development, 2014, 51(6): 1253-1262.
    [3]Li Kenli, Luo Xing, Wu Fan, Zhou Xu, and Huang Xin. An Algorithm in Tile Assembly Model for Maximum Clique Problem[J]. Journal of Computer Research and Development, 2013, 50(3): 666-675.
    [4]Zhou Xu, Li Kenli, Yue Guangxue, Yang Zhibang. A Volume Molecular Solution for the Maximum Matching Problem on DNA-Based Computing[J]. Journal of Computer Research and Development, 2011, 48(11): 2147-2154.
    [5]Li Kenli, Guo Li, Tang Zhuo, Jiang Yong, and Li Renfa. A Molecular Solution for the Ramsey Number on DNA-Based Supercomputing[J]. Journal of Computer Research and Development, 2011, 48(3): 447-454.
    [6]Xu Guang, An Hong, Xu Mu, Liu Gu, Yao Ping, Ren Yongqing, and Wang Fang. The Architecture and the Programming Model of a Data-Flow-Like Driven Tiled Stream Processor[J]. Journal of Computer Research and Development, 2010, 47(9): 1643-1653.
    [7]Li Kenli, Yao Fengjuan, Li Renfa, Xu Jin. Improved Molecular Solutions for the Knapsack Problem on DNA-Based Supercomputing[J]. Journal of Computer Research and Development, 2007, 44(6): 1063-1070.
    [8]Han Aili, Zhu Daming. DNA Computing Model Based on a New Scheme of Encoding Weight for Chinese Postman Problem[J]. Journal of Computer Research and Development, 2007, 44(6): 1053-1062.
    [9]Wang Lei, Lin Yaping, and Li Zhiyong. DNA Computation for a Category of Special Integer Planning Problem[J]. Journal of Computer Research and Development, 2005, 42(8): 1431-1437.
    [10]Chen Zhiping, Li Xiaolong, Wang Lei, Lin Yaping, and Cai Lijun. A Surface-Based DNA Algorithm for the Perfect Matching Problem[J]. Journal of Computer Research and Development, 2005, 42(7): 1241-1246.

Catalog

    Article views PDF downloads Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return