高级检索

    基于区块链辅助的半中心化联邦学习框架

    Blockchain-Assisted Semi-Centralized Federated Learning Framework

    • 摘要: 随着网络技术的发展,如何构建可信任的新一代信息管理系统成为了必要需求,区块链技术提供了去中心化、透明、不可篡改的可信分布式底座. 随着人工智能技术的发展,网络数据计算领域出现了数据孤岛问题,各开发者之间的不信任导致难以联合利用各方数据进行协同训练,联邦学习虽然提供了数据隐私性保障,但是服务器端安全性仍存在隐患. 传统方法通过将联邦学习框架中的服务器端替换为区块链系统以提供不可篡改的全局模型数据库,但是这种方式并未利用物联网场景中所有可用网络连接,并缺少了针对联邦学习任务的区块结构设计. 提出了基于区块链辅助的半中心化联邦学习框架,从物联网场景需求出发,构建了半中心化的物联网场景,利用了所有可信的网络连接以支撑联邦学习任务,同时通过区块链技术为不可信、距离远的客户端之间构建了不可篡改的模型库,相比传统区块链联邦学习框架有更小的通信开销和更好的普适性. 所提框架包含两大设计,半中心化的联邦学习框架通过客户端之间的可信连接减少聚合所带来的通信开销,并通过区块链存储客户端模型以便于距离较远或者相互不可信的客户端进行聚合;设计了针对联邦学习任务的区块链区块,使区块链能够支持底层联邦学习训练的需求. 实验证明所提框架在多个数据集上相比传统联邦学习算法有至少8%的准确率提升,并大幅度减少了客户端之间相互等待带来的通信开销,为实际场景下的区块链联邦学习系统部署提供了指导.

       

      Abstract: With the development of network technology, building a trusted new-generation information management system is necessary. Blockchain technology provides a decentralized, transparent, and tamper-proof distributed base. On the other hand, with the development of artificial intelligence technology, data islands have been a common issue in the field of network data computing. The distrust among developers has made it difficult to jointly utilize all parties’ data for collaborative training. Although federated learning provides data privacy protection, there are still hidden dangers in server-side security. The traditional methods replace the server in the federated learning framework with a blockchain system to provide a tamperproof global model database. However, this approach does not utilize all available network connections in the Internet of things scenario and lacks a block structure design for federated learning tasks. We propose a blockchain-assisted semi-centralized federated learning framework. Starting from the requirements of the Internet of things scenario, our approach constructs a semi-centralized Internet of things structure and utilizes all trusted network connections to support federated learning tasks. At the same time, our approach constructs a tamper-proof model database for untrusted and remote clients through blockchain technology. Compared with traditional blockchain federated learning frameworks, our approach has a smaller communication overhead and better universality. The framework includes two major designs. The semi-centralized federated learning framework reduces the communication overhead brought by aggregation through trusted connections between clients, and stores client models through blockchain for aggregation on remote or untrusted clients to improve the universality and performance of local models. The design of blockchain blocks for federated learning tasks can support the needs of underlying federated learning training. Experiments have shown that this framework has an accuracy improvement at least 8% compared with traditional federated learning algorithms on multiple datasets, and significantly reduces the communication overhead caused by the waiting aggregation process between clients, providing guidance for the deployment of blockchain federated learning systems in practical scenarios.

       

    /

    返回文章
    返回