高级检索

    基于环概化的半同构泛化算法研究

    Study on Semi-Homogenous Algorithm Based on Ring Generalization

    • 摘要: 为了防止个人隐私的泄漏,通常在数据共享前需要对其在准标识符上的属性值作概化处理,以消除链接攻击,从而实现在共享中对敏感属性的匿名保护.数据的概化处理增加了属性值的不确定性,也不可避免地造成一定的信息损失.基于环概化(ring generalization)的异构处理算法能够在减少匿名化所导致的数据信息损失的同时,提供更强的隐私保护.提出生成所有基于环概化置换的算法,同时研究置换计数问题,证明了其基数满足O(α\+n),α>1.在此基础上,提出了一种半同构泛化算法,能在数据共享中实现匿名数据保护,同时降低概化所带来的数据信息损失.

       

      Abstract: Data privacy has been a hot research topic in the database theory and cryptography communities in the past few decades. To prevent the disclosure of privacy, it requires preserving the anonymity of sensitive attributes in data sharing. The attribute values on quasi-identifiers often have to be generalized before data sharing to avoid linking attack, and thus to achieve the anonymity in data sharing. However, without careful treatment, it’s of high risk of privacy leakage for data anonymity. Among these solutions , data generalization is an important technique for privacy preserving in data publication and attracts considerable attention in the literature, which increases the uncertainty of attribute values, and leads to the loss of information to some extent. The non-homogenous algorithm which is based on ring generalization, can reduce the information loss, and in the meanwhile, offering strong privacy preservation. This paper presents an algorithm to generate all the permutations, and studies the cardinality of the permutations based on the ring generalization. In addition, we prove that its cardinality is O(α\+n), α>1. Furthermore, we propose a semi-generalization algorithm which can meet the requirement of preserving anonymity of sensitive attributes in data sharing, and greatly reduce the amount of information loss resulting from data generalization for implementing data anonymization.

       

    /

    返回文章
    返回