Abstract:
With the rapid development of emerging network technologies such as B5G/6G and the Internet of Things, network services exhibit diverse business types, differentiated quality-of-service requirements, and dynamic resource demands. Under the Software Defined Networking (SDN) and Network Function Virtualization (NFV) paradigm, Service Function Chaining (SFC) enables customized network service deployment through the flexible orchestration of Virtual Network Functions (VNFs). However, SFC deployment in dynamic network environments faces challenges such as a large decision space and complex, constantly changing conditions. Reinforcement learning (RL) offers significant advantages for solving SFC deployment problems owing to its ability to adaptively learn complex environmental characteristics and make dynamic decisions. Although research on applying RL to SFC deployment has made notable progress, a systematic and comprehensive review remains lacking. To address this gap, this survey first elaborates the fundamental concepts and technical architecture of SFC deployment and introduces the RL-based SFC deployment framework in detail. Subsequently, from the perspectives of algorithm design, application scenarios, and optimization strategies, the research progress and innovative applications of RL in three key phases — SFC placement, scheduling, and reconfiguration — are systematically examined and thoroughly analyzed. Finally, the advantages and limitations of existing research in terms of algorithm design, performance optimization, and practical deployment are summarized, and technical challenges and future development trends in this field are discussed.