高级检索

    基于强化学习的服务功能链部署研究综述

    A Survey on Reinforcement Learning-Based Service Function Chain Deployment

    • 摘要: 随着B5G/6G、物联网等新型网络技术的快速发展,网络服务呈现出业务类型多样化、服务质量要求差异化和资源需求动态化等特征.在软件定义网络和网络功能虚拟化范式下,服务功能链技术通过灵活编排虚拟网络功能实现了网络服务的定制化部署.然而,动态网络环境下SFC部署面临着决策空间大、环境复杂多变等挑战.强化学习因其自适应学习复杂环境特征并动态决策的能力,在解决服务功能链部署问题上展现出显著优势.尽管当前强化学习在服务功能链部署中的应用研究已取得一定进展,但尚缺乏系统性和全面性的综述分析.为此,本文首先阐述了服务功能链部署的基本概念与技术架构,并具体介绍了基于强化学习的服务功能链部署框架.随后,从算法设计、应用场景和优化策略等角度,梳理并深入分析了强化学习在服务功能链放置、调度和重配置三个关键阶段中的研究进展与创新应用.最后,本文总结了现有研究在算法设计、性能优化和实际部署方面的优势与局限,并分析了该领域的技术挑战与未来发展趋势.

       

      Abstract: With the rapid development of emerging network technologies such as B5G/6G and Internet of Things, network services present characteristics of diverse business types, differentiated quality requirements, and dynamic resource demands. Under the Software Defined Networking (SDN) and Network Function Virtualization (NFV) paradigm, Service Function Chaining (SFC) technology enables customized network service deployment through flexible orchestration of Virtual Network Functions (VNF). However, SFC deployment in dynamic network environments faces challenges such as large decision space and complex, changing environments. Reinforcement learning (RL) demonstrates significant advantages in solving Service Function Chain deployment problems due to its capability to adaptively learn complex environmental characteristics and make dynamic decisions. Although research on applying RL to SFC deployment has made certain progress, a systematic and comprehensive review analysis remains lacking. To address this gap, the fundamental concepts and technical architecture of SFC deployment are elaborated, along with a specific introduction to the RL-based SFC deployment framework. Subsequently, from the perspectives of algorithm design, application scenarios, and optimization strategies, the research progress and innovative applications of reinforcement learning in three key phases—SFC placement, scheduling, and reconfiguration—are systematically examined and thoroughly analyzed. Finally, the advantages and limitations of existing research in terms of algorithm design, performance optimization, and practical deployment are summarized, while technical challenges and future development trends in this field are analyzed.

       

    /

    返回文章
    返回