高级检索
    张战成, 王士同, 钟富礼. 具有隐私保护功能的协作式分类机制[J]. 计算机研究与发展, 2011, 48(6): 1018-1028.
    引用本文: 张战成, 王士同, 钟富礼. 具有隐私保护功能的协作式分类机制[J]. 计算机研究与发展, 2011, 48(6): 1018-1028.
    Zhang Zhancheng, Wang Shitong, Fu-Lai Chung. Collaborative Classification Mechanism for Privacy-Preserving[J]. Journal of Computer Research and Development, 2011, 48(6): 1018-1028.
    Citation: Zhang Zhancheng, Wang Shitong, Fu-Lai Chung. Collaborative Classification Mechanism for Privacy-Preserving[J]. Journal of Computer Research and Development, 2011, 48(6): 1018-1028.

    具有隐私保护功能的协作式分类机制

    Collaborative Classification Mechanism for Privacy-Preserving

    • 摘要: 提出了一种能够保护数据隐私的协作式分类机制,即C\+2MP\+2(collaborative classification mechanism for privacy-preserving),该算法利用2类样本各自的均值和协方差作为整体信息,将整体信息共享给对方,参与分类的双方,分别使用各自的隐私数据和对方的整体信息训练获得2个可以保护隐私的分类器,并由2个分类器协作得到最终的分类器.其线性模型的训练过程不仅可以保护双方数据元的隐私,还可以保护数据元的数量信息不泄露.针对测试过程的隐私保护,设计了可以保护待测样本的隐私和分类规则不泄露的安全算法.在C\+2MP\+2线性模型的基础上,分析了C\+2MP\+2和MPM(minimax probability machine),SVM(support vector machine)以及M\+4(maxi-min margin machine)在处理隐私数据方面的区别和联系.进一步使用核方法通过内积矩阵实现隐私保护的同时提高C\+2MP\+2的非线性识别能力,并通过模拟数据和标准数据集上实验检验了C\+2MP\+2线性模型和核化模型的有效性.

       

      Abstract: Privacy-preserving is becoming an increasingly important task in the Web-enabled world. Specifically we propose a novel two-party privacy-preserving classification solution called collaborative classification mechanism for Privacy-preserving(C\+2MP\+2) that is inspired from mean value and covariance matrix globally stating data location and direction, and the fact that sharing those global information with others will not disclose ones own privacy. This model collaboratively trains the decision boundary from two hyper-planes individually constructed by ones own privacy information and counter-partys global information. As a major contribution, we show that C\+2MP\+2 can protect both data-entries and number of entries. We describe the C\+2MP\+2 model definition, provide the geometrical interpretation, and present theoretical justifications. To guarantee the security of testing procedure, we then develop a testing algorithm based on homomorphic encryption scheme. Moreover, we show that C\+2MP\+2 can be transformed into existing minimax probability machine (MPM), support vector machine (SVM) and maxi-min margin machine (M\+4) model when privacy data satisfies certain conditions. We also extend C\+2MP\+2 to a nonlinear classifier by exploiting kernel trick without privacy disclosure. Furthermore, we perform a series of evaluations on both toy data sets and real-world benchmark data sets. Comparison with MPM and SVM demonstrates the advantages of our new model in protecting privacy.

       

    /

    返回文章
    返回