• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Hu Xinping, He Yuzhi, Ni Weiwei, and Zhang Yong. A Privacy-Preserving Data Publishing Method Based on Genetic Algorithm with Roulette Wheel[J]. Journal of Computer Research and Development, 2012, 49(11): 2432-2439.
Citation: Hu Xinping, He Yuzhi, Ni Weiwei, and Zhang Yong. A Privacy-Preserving Data Publishing Method Based on Genetic Algorithm with Roulette Wheel[J]. Journal of Computer Research and Development, 2012, 49(11): 2432-2439.

A Privacy-Preserving Data Publishing Method Based on Genetic Algorithm with Roulette Wheel

More Information
  • Published Date: November 14, 2012
  • Privacy-preserving micro-data publishing for clustering is an important issue in data mining research, which aims at protecting privacy of individual data meanwhile accommodating enough clustering usability of the published data. Different from traditional distance-preserving and distribution-preserving solutions, a data perturbation method RWSGA (roulette wheel selection genetic algorithm) is proposed from the view of maintaining neighboring relation stability of the dataset during the obfuscation process in this paper. Roulette-wheel-selection-based genetic methods are adopted to make data obfuscation by building imitating relations between crossing, mutating and data perturbation. Firstly, the solution randomly chooses a pair of data points from the k neighborhood of a data point using roulette wheel strategy. Subsequently, tailored crossing or mutating operations are applied to the selected pair of data points to protect micro-data values from leakage, meanwhile guaranteeing stability of the corresponding k neighborhood. Furthermore,to avoid too large changes originated by mutating operations, an optimization is applied to improve the choice of mutating domain leveraging specifying centers of k nearest neighborhood from data space with higher density. Theoretical analysis and experimental results testify that RWSGA can modify published micro-data values greatly from their original correspondences and keep the clustering difference between the original dataset and the published dataset small.

Catalog

    Article views (1023) PDF downloads (438) Cited by()
    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return